Instructions to use koboldcpp/music with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use koboldcpp/music with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="koboldcpp/music", filename="Ace-Qwen3-Embedding-0.6B-BF16.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use koboldcpp/music with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf koboldcpp/music:Q4_K_M # Run inference directly in the terminal: llama-cli -hf koboldcpp/music:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf koboldcpp/music:Q4_K_M # Run inference directly in the terminal: llama-cli -hf koboldcpp/music:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf koboldcpp/music:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf koboldcpp/music:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf koboldcpp/music:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf koboldcpp/music:Q4_K_M
Use Docker
docker model run hf.co/koboldcpp/music:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use koboldcpp/music with Ollama:
ollama run hf.co/koboldcpp/music:Q4_K_M
- Unsloth Studio
How to use koboldcpp/music with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for koboldcpp/music to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for koboldcpp/music to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for koboldcpp/music to start chatting
- Docker Model Runner
How to use koboldcpp/music with Docker Model Runner:
docker model run hf.co/koboldcpp/music:Q4_K_M
- Lemonade
How to use koboldcpp/music with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull koboldcpp/music:Q4_K_M
Run and chat with the model
lemonade run user.music-Q4_K_M
List all available models
lemonade list
| { | |
| "model": [], | |
| "model_param": null, | |
| "port": 5001, | |
| "port_param": 5001, | |
| "host": "", | |
| "launch": true, | |
| "config": null, | |
| "threads": -1, | |
| "usecuda": null, | |
| "usevulkan": null, | |
| "usecpu": null, | |
| "contextsize": 8192, | |
| "gpulayers": -1, | |
| "tensor_split": null, | |
| "autofit": false, | |
| "version": false, | |
| "analyze": "", | |
| "maingpu": -1, | |
| "batchsize": 512, | |
| "blasthreads": null, | |
| "lora": null, | |
| "loramult": 1.0, | |
| "noshift": false, | |
| "nofastforward": false, | |
| "useswa": false, | |
| "smartcache": 0, | |
| "ropeconfig": [ | |
| 0.0, | |
| 10000.0 | |
| ], | |
| "overridenativecontext": 0, | |
| "usemmap": false, | |
| "usemlock": false, | |
| "noavx2": false, | |
| "failsafe": false, | |
| "debugmode": 0, | |
| "onready": "", | |
| "benchmark": null, | |
| "prompt": "", | |
| "cli": false, | |
| "genlimit": 0, | |
| "multiuser": 1, | |
| "multiplayer": false, | |
| "websearch": false, | |
| "remotetunnel": false, | |
| "highpriority": false, | |
| "foreground": false, | |
| "preloadstory": null, | |
| "savedatafile": null, | |
| "quiet": false, | |
| "ssl": null, | |
| "nocertify": false, | |
| "mmproj": null, | |
| "mmprojcpu": false, | |
| "visionmaxres": 1024, | |
| "draftmodel": null, | |
| "draftamount": 8, | |
| "draftgpulayers": 999, | |
| "draftgpusplit": null, | |
| "password": null, | |
| "ratelimit": 0, | |
| "ignoremissing": false, | |
| "chatcompletionsadapter": "AutoGuess", | |
| "jinja": false, | |
| "jinja_tools": false, | |
| "noflashattention": false, | |
| "lowvram": false, | |
| "quantkv": 0, | |
| "smartcontext": false, | |
| "unpack": "", | |
| "exportconfig": "", | |
| "exporttemplate": "", | |
| "nomodel": false, | |
| "moeexperts": -1, | |
| "moecpu": 0, | |
| "defaultgenamt": 1024, | |
| "nobostoken": false, | |
| "enableguidance": false, | |
| "maxrequestsize": 32, | |
| "overridekv": null, | |
| "overridetensors": null, | |
| "showgui": false, | |
| "skiplauncher": false, | |
| "singleinstance": true, | |
| "nopipelineparallel": false, | |
| "gendefaults": "", | |
| "gendefaultsoverwrite": false, | |
| "mcpfile": null, | |
| "device": "", | |
| "downloaddir": "", | |
| "autofitpadding": 1024, | |
| "hordemodelname": "", | |
| "hordeworkername": "", | |
| "hordekey": "", | |
| "hordemaxctx": 0, | |
| "hordegenlen": 0, | |
| "sdmodel": "", | |
| "sdthreads": 0, | |
| "sdclamped": 0, | |
| "sdclampedsoft": 0, | |
| "sdt5xxl": "", | |
| "sdclip1": "", | |
| "sdclip2": "", | |
| "sdphotomaker": "", | |
| "sdupscaler": "", | |
| "sdflashattention": false, | |
| "sdoffloadcpu": false, | |
| "sdvaecpu": false, | |
| "sdclipgpu": false, | |
| "sdconvdirect": "off", | |
| "sdvae": "", | |
| "sdvaeauto": false, | |
| "sdquant": 0, | |
| "sdlora": null, | |
| "sdloramult": 1.0, | |
| "sdtiledvae": 768, | |
| "whispermodel": "", | |
| "ttsmodel": "", | |
| "ttswavtokenizer": "", | |
| "ttsgpu": false, | |
| "ttsmaxlen": 4096, | |
| "ttsthreads": 0, | |
| "ttsdir": "", | |
| "musicllm": "https://huggingface.co/koboldcpp/music/resolve/main/acestep-5Hz-lm-1.7B-Q8_0.gguf?download=true", | |
| "musicembeddings": "https://huggingface.co/koboldcpp/music/resolve/main/Ace-Qwen3-Embedding-0.6B-BF16.gguf?download=true", | |
| "musicdiffusion": "https://huggingface.co/koboldcpp/music/resolve/main/acestep-v15-sftturbo50-Q8_0.gguf?download=true", | |
| "musicvae": "https://huggingface.co/koboldcpp/music/resolve/main/ace-vae-BF16.gguf?download=true", | |
| "musiclowvram": true, | |
| "embeddingsmodel": "", | |
| "embeddingsmaxctx": 0, | |
| "embeddingsgpu": false, | |
| "admin": false, | |
| "adminpassword": null, | |
| "admindir": "", | |
| "hordeconfig": null, | |
| "sdconfig": null, | |
| "noblas": false, | |
| "nommap": false, | |
| "pipelineparallel": false, | |
| "sdnotile": false, | |
| "forceversion": false, | |
| "sdgendefaults": false, | |
| "flashattention": false, | |
| "testmemory": false, | |
| "istemplate": true | |
| } |