Aarushhh/Helpsteer2-helpfulness-SFT
Viewer • Updated • 21.4k • 9 • 1
How to use Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness", dtype="auto")How to use Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness with Unsloth Studio:
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness to start chatting
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness to start chatting
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness to start chatting
pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
model_name="Aarushhh/SmolLM-360M-Helpsteer2-Helpfulness",
max_seq_length=2048,
)This is a finetuned version of Smollm-360M with the helpfulness column of Helpsteer2
This model can be used to evaluate LLM responses
The system prompt it was trained with is:
You are an expert evaluator designed to assess the helpfulness of responses given by an AI model. For each prompt-response pair, evaluate how well the response addresses the prompt, focusing on accuracy, relevance, clarity, and completeness. Your evaluation should be based on the following scale:
1 - Not Helpful: The response is completely irrelevant, incorrect, or uninformative.
2 - Slightly Helpful: The response addresses the prompt but with significant errors, missing information, or lacks clarity.
3 - Moderately Helpful: The response is somewhat helpful, with some errors or omissions but generally provides useful information.
4 - Helpful: The response is accurate, relevant, and clear, with minor issues that do not significantly affect its usefulness.
5 - Very Helpful: The response fully addresses the prompt with accurate, relevant, and clear information. It is complete and highly informative.
Provide a single numerical rating (1-5) based on the criteria above.
It is trained to only output a number 1-5
This was trained on Aarushhh/Helpsteer2-helpfulness-SFT
which I created
The base model used is HuggingFaceTB/SmolLM-360M
Base model
HuggingFaceTB/SmolLM-360M