PRIME-RL

Researching scalable reinforcement learning (RL) methods for language models.

Pinned

  1. P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    68 stars · 2 forks

  2. SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python · 1.2k stars · 65 forks

  3. Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python · 405 stars · 13 forks

  4. RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python · 46 stars · 3 forks

  5. TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python · 939 stars · 65 forks

  6. PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python · 1.8k stars · 100 forks

Repositories

Showing 7 of 7 repositories
  • P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    68 stars · 2 forks · 1 open issue · 0 open pull requests · Updated Nov 18, 2025
  • RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python · 46 stars · Apache-2.0 · 3 forks · 2 open issues · 0 open pull requests · Updated Nov 7, 2025
  • SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python · 1,155 stars · MIT · 65 forks · 42 open issues · 1 open pull request · Updated Oct 13, 2025
  • TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python · 939 stars · MIT · 65 forks · 13 open issues · 0 open pull requests · Updated Sep 26, 2025
  • Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python · 405 stars · 13 forks · 2 open issues · 0 open pull requests · Updated Jul 11, 2025
  • PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python · 1,787 stars · Apache-2.0 · 100 forks · 8 open issues · 1 open pull request · Updated Mar 18, 2025
  • ImplicitPRM Public

    Repository for the paper "Free Process Rewards without Process Labels"

    Python · 168 stars · Apache-2.0 · 11 forks · 12 open issues · 0 open pull requests · Updated Mar 14, 2025
