Text-to-text generation models, language models, chat models, and Natural Language Processing (NLP) models are all types of models that generate new text based on an input text. Each language model is trained differently, making it more effective for specific tasks, such as following instructions or writing stories.
Hugging Face experts identify three main categories of language models:
- Base models: These are suitable for fine-tuning and few-shot prompting, such as Mistral 7B.
- Instruction-trained models: These generally produce better responses to instructions than base models, like models with "-instruct" in their name (e.g., Mistral-7B-Instruct-v0.3, Llama-3.1-70B-Instruct, Llama-3.3-70B-Instruct, etc.).
- Human feedback models: These are refined through human ratings, which are incorporated into the model via reinforcement learning, making them better aligned with human preferences.
With the launch of Deepseek R1 in early 2025, reasoning models became a fourth category. These models are specialized in reasoning, mathematical problem-solving, and code generation.