Nous Hermes 2
Nous Research · March 2024
● activeOpen Sourcemixture of expertstext
Parameters46.7B (12.9B active)
Context Window32K tokens
Why It Matters
Demonstrated that community-built models could match or exceed commercial offerings for specific use cases like code generation and tool use.
Description
Advanced instruction-tuned model from Nous Research, built on Mixtral's Mixture-of-Experts architecture (which uses multiple specialized sub-networks, routing each input to the most relevant ones). Known for strong function calling (the ability to use external tools and APIs) and structured output generation without excessive safety filtering.
Key Innovations
Instruction Tuning
Instruction TuningFine-tuning a model on instruction-response pairs so it follows user commands more reliably.
Tool Use
Tool UseAbility to call external tools, APIs, and functions — enabling web browsing, code execution, and real-world actions.
MoE
MoEArchitecture where only a fraction of the model's parameters are active for each input, allowing massive scale with lower compute.
External Links
More from Community / Uncensored
WizardLM2023-06 · 13B
Dolphin (Eric Hartford)2023-07 · —
Hermes (Nous Research)2023-09 · —
LLaMA 4 Scout Abliterated2025-05 · 81B (17B active × 16 experts)
DeepSeek R1 Uncensored2025-03 · 671B (37B active)
Bark2023-04 · —
Mistral 7B Uncensored2024-01 · 7B
Qwen 2.5 72B Abliterated2025-01 · 72B
SOLAR 10.7B Uncensored2024-02 · 10.7B
LLaMA 3.1 405B Abliterated2024-08 · 405B