Visit the Hugging Face website to obtain the required models:
https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/tree/main
https://huggingface.co/deepseek-ai/DeepSeek-R1/tree/main
https://huggingface.co/deepseek-ai/DeepSeek-V3/tree/main
Taking deepseek-ai/DeepSeek-R1-Distill-Qwen-7B as an example, download all files listed under "Files and versions".
In config.json, change torch_dtype to float16; otherwise the model service may fail to start.
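The torch_dtype change can be scripted instead of edited by hand. The following is a minimal sketch, assuming GNU sed and that config.json stores the value as a quoted string on one line; the function name `set_torch_dtype_float16` is illustrative and not part of any tool.

```shell
# Rewrite the "torch_dtype" value in a model config.json to "float16".
# Assumes GNU sed (-i without a suffix) and a quoted string value.
set_torch_dtype_float16() {
    config_file="$1"
    [ -f "$config_file" ] || { echo "not found: $config_file" >&2; return 1; }
    cp "$config_file" "$config_file.bak"   # keep a backup of the original
    sed -i -E 's/("torch_dtype"[[:space:]]*:[[:space:]]*)"[^"]*"/\1"float16"/' "$config_file"
}

# Example (the path is the model directory used elsewhere in this guide):
# set_torch_dtype_float16 /root/DeepSeek-R1-Distill-Qwen-7B/config.json
```

The backup copy makes it easy to restore the original dtype if the model is later served on hardware that supports it.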
https://support.huawei.com/enterprise/zh/ascend-computing/ascend-hdk-pid-252764743/software
Using Ascend HDK 24.1.0.3 as an example:
Select the software type, then choose the matching driver package.
| Operating system | Required dependency | Check command | Notes |
|---|---|---|---|
| openEuler | dkms | rpm -qa \| grep dkms | |
| | gcc | rpm -qa \| grep gcc | |
| | kernel-headers | rpm -qa \| grep kernel-headers-$(uname -r) | |
| | kernel-devel | rpm -qa \| grep kernel-devel-$(uname -r) | |
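The dependency checks can be run in one pass. This is a small sketch, assuming an rpm-based system such as openEuler; `missing_deps` is a hypothetical helper name, and it uses `rpm -q` (an exact package query) rather than the `rpm -qa | grep` form from the table.

```shell
# Print each package from the argument list that rpm does not report as installed.
missing_deps() {
    for pkg in "$@"; do
        rpm -q "$pkg" >/dev/null 2>&1 || echo "$pkg"
    done
}

# Example for the openEuler dependencies:
# missing_deps dkms gcc "kernel-headers-$(uname -r)" "kernel-devel-$(uname -r)"
```

An empty output means every listed package is installed; anything printed still needs to be installed before running the driver package.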
cd /root
chmod +x Ascend-hdk-310p-npu-driver_x.x.x_linux-{arch}.run
./Ascend-hdk-310p-npu-driver_x.x.x_linux-{arch}.run --check
./Ascend-hdk-310p-npu-driver_x.x.x_linux-{arch}.run --full
The following error may appear during installation:
npu-smi info
If output like the following appears, the driver loaded successfully.
+----------------------------------------------------------------------------------------------------+
| npu-smi 24.1.0                   Version: 24.1.0                                                   |
+-------------------+-----------------+--------------------------------------------------------------+
| NPU     Name      | Health          | Power(W)     Temp(C)           Hugepages-Usage(page)         |
| Chip    Device    | Bus-Id          | AICore(%)    Memory-Usage(MB)                                |
+===================+=================+==============================================================+
| 4       310P3     | OK              | NA           43                0 / 970                       |
| 0       0         | 0000:81:00.0    | 0            861 / 21534                                     |
+===================+=================+==============================================================+
| No running processes found in NPU 4                                                                |
+===================+=================+==============================================================+
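For scripted checks, the Health column can be read from the output instead of by eye. This is a rough sketch, assuming the column layout of the sample output above (Health is the second `|`-separated column); `count_healthy_npus` is an illustrative name, and other npu-smi versions may format the table differently.

```shell
# Count NPU rows whose Health column reads OK in `npu-smi info` output (read from stdin).
count_healthy_npus() {
    awk -F'|' '$3 ~ /^ *OK *$/ { n++ } END { print n + 0 }'
}

# Example:
# npu-smi info | count_healthy_npus
```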
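For a first-time installation, install in the order "driver -> firmware"; for an overwrite installation or an upgrade, use the order "firmware -> driver".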
cd /root
chmod +x Ascend-hdk-310p-npu-firmware_x.x.x.x.X.run
./Ascend-hdk-310p-npu-firmware_x.x.x.x.X.run --check
./Ascend-hdk-310p-npu-firmware_x.x.x.x.X.run --full
/usr/local/Ascend/driver/tools/upgrade-tool --device_index -1 --component -1 --version
If the reported version matches the version of the firmware package, the installation succeeded.
Visit the link and download the following software: https://www.hiascend.com/developer/ascendhub/detail/af85b724a7e5469ebd7ea13c3439d48f
# The argument before "bash" is the image; the image ID can be used in its place.
docker run -it -d --net=host --shm-size=1000g \
    --privileged \
    --name=qwen-deepseek \
    --device=/dev/davinci_manager \
    --device=/dev/hisi_hdc \
    --device=/dev/devmm_svm \
    -v /usr/local/Ascend/driver:/usr/local/Ascend/driver:ro \
    -v /usr/local/sbin:/usr/local/sbin:ro \
    -v /root/DeepSeek-R1-Distill-Qwen-7B:/root/DeepSeek-R1-Distill-Qwen-7B \
    llm_models_infer:24.0.0.T0.B092-300I-Duo bash
Here, "--name=qwen-deepseek", "/root/DeepSeek-R1-Distill-Qwen-7B:/root/DeepSeek-R1-Distill-Qwen-7B", and "llm_models_infer:24.0.0.T0.B092-300I-Duo" can be adjusted to match your environment.
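Since only three values vary between deployments, the command can be generated from them. The sketch below prints the command rather than running it; `print_docker_run` is a hypothetical helper, and it assumes the model directory is mounted at the same path inside the container, as in the command above.

```shell
# Print a docker run command for the given container name, model directory, and image.
# Nothing is executed; review the output, then run it yourself.
print_docker_run() {
    name="$1"; model_dir="$2"; image="$3"
    cat <<EOF
docker run -it -d --net=host --shm-size=1000g \\
    --privileged \\
    --name=$name \\
    --device=/dev/davinci_manager \\
    --device=/dev/hisi_hdc \\
    --device=/dev/devmm_svm \\
    -v /usr/local/Ascend/driver:/usr/local/Ascend/driver:ro \\
    -v /usr/local/sbin:/usr/local/sbin:ro \\
    -v $model_dir:$model_dir \\
    $image bash
EOF
}

# Example:
# print_docker_run qwen-deepseek /root/DeepSeek-R1-Distill-Qwen-7B llm_models_infer:24.0.0.T0.B092-300I-Duo
```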
docker exec -it qwen-deepseek bash
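If the config.json file below does not exist, go to the "/opt/package" directory and run: source install_and_enable_cann.sh (if some libraries fail to install, run the command again; otherwise the service cannot start and reports "Failed to init endpoint!").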
vim /usr/local/Ascend/mindie/latest/mindie-service/conf/config.json
cd /usr/local/Ascend/mindie/latest/mindie-service
source set_env.sh
cd bin
./mindieservice_daemon
curl -H "Accept: application/json" -H "Content-type: application/json" -X POST -d '{"model": "qwen","messages": [{"role": "user", "content": "介绍一下杭州"}],"max_tokens": 128}' http://ip:port/v1/chat/completions
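When the prompt changes often, building the request body in a helper avoids quoting mistakes. This is a minimal sketch; `chat_payload` is an illustrative name, the model name and ip:port are the placeholders from the command above, and the escaping handles only backslashes and double quotes.

```shell
# Build the JSON body for /v1/chat/completions from a model name, a prompt, and a token limit.
# Only backslashes and double quotes in the prompt are escaped; prompts containing
# newlines or control characters need a real JSON encoder.
chat_payload() {
    model="$1"; max_tokens="$3"
    prompt=$(printf '%s' "$2" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
    printf '{"model": "%s", "messages": [{"role": "user", "content": "%s"}], "max_tokens": %s}' \
        "$model" "$prompt" "$max_tokens"
}

# Example (replace ip:port with the address the service listens on):
# curl -H "Accept: application/json" -H "Content-type: application/json" \
#      -X POST -d "$(chat_payload qwen 'Introduce Hangzhou' 128)" \
#      http://ip:port/v1/chat/completions
```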