A Reasoning Model LLM.

Training Methodology

  • Instead of training R1 on the entire internet, it taught it a series of questions and answers for how language works