环境:
Xinference
问题描述:
Xinference如何注册自定义模型
解决方案:
1.写个model_config.json,内容如下
{
"version": 1,
"context_length": 2048,
"model_name": "custom-llama-3",
"model_lang": [
"en",
"ch"
],
"model_ability": [
"generate",
"chat"
],
"model_family": "other",
"model_specs": [
{
"model_format": "ggufv2",
"model_size_in_billions": 8,
"quantizations": [
"4-bit",
"8-bit",
"none"
],
"model_id": "Llama3-8B-Chinese-Chat.Q6_K",
"model_uri": "/mnt/e/7B/koboldcpp1.63/koboldcpp1.63",
"model_file_name_template": "llama-3-8b-ggmlv3.{quantization}.bin"
}
]
}
2.运行注册命令
xinference register -f model_config.json
3.查看自定义模型,出现了就成功
4.最后运行模型













![[Meachines] [Easy] Postman redis未授权访问-SSH公钥注入+RSA私钥解密+Webmin-RCE权限提升](https://img-blog.csdnimg.cn/img_convert/2f3e67ab9d315ea98434a574c9fe6cb7.jpeg)





![二十天刷leetcode【hot100】算法- day1[后端golang]](https://i-blog.csdnimg.cn/direct/247599f2ca1040459a629782695463fb.jpeg)
