starlight trinity nano

Very early WIP!!

extremely minimal 300 steps / 60 data. this is a training test, not a functional finetune (yet!)

proved training will work if i scale data and compute. currently filtering more data - next run should have way fewer bugs/confabulations.

This model was converted to GGUF format from bleepybloops/trinity-nano-starlight-v0.5 using llama.cpp via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.
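Since this repo only holds the converted GGUF file, a quick sanity check after downloading can be useful. The sketch below reads the fixed-size header at the start of a GGUF file, assuming the v2+ layout (4-byte `GGUF` magic, little-endian uint32 version, then uint64 tensor and metadata key/value counts). The names `read_gguf_header` and `GGUFHeader` are made up for this example and are not part of llama.cpp or any library.

```python
import struct
from typing import NamedTuple


class GGUFHeader(NamedTuple):
    # Hypothetical container for this example, not a llama.cpp type.
    version: int
    tensor_count: int
    metadata_kv_count: int


def read_gguf_header(path: str) -> GGUFHeader:
    """Read the fixed-size header of a GGUF file (assumes the v2+ layout)."""
    with open(path, "rb") as f:
        # 4 bytes magic + 4 bytes version + 8 bytes tensor count + 8 bytes KV count
        raw = f.read(24)
    if len(raw) < 24 or raw[:4] != b"GGUF":
        raise ValueError(f"{path} does not look like a GGUF file")
    # "<" means little-endian with no padding: uint32, uint64, uint64
    version, tensor_count, kv_count = struct.unpack("<IQQ", raw[4:24])
    return GGUFHeader(version, tensor_count, kv_count)
```

Calling `read_gguf_header` on the path of the downloaded `.gguf` file returns the format version and counts, or raises `ValueError` if the download is truncated or is not a GGUF file at all. It does not parse the metadata itself, so it will not confirm the architecture or quantization.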

Format: GGUF
Model size: 6B params
Architecture: afmoe
Quantization: 8-bit (Q8_0)

Repo: bleepybloops/trinity-nano-starlight-v0.5-Q8_0-GGUF