Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost

DeepSeek developed DeepSeek-R1 by applying pure reinforcement learning on top of DeepSeek-V3-Base; the model matched or beat o1 on some benchmarks.

Jan 20, 2025 - 19:17
VentureBeat/Midjourney