wtf is deepseek r1? v3?
R1 as a reasoning powerhouse || what's a reasoning model? || wtf is the difference?
Last updated
R1 as a reasoning powerhouse || what's a reasoning model? || wtf is the difference?
Last updated
"The study of reasoning model draws its inspiration from the observation that reasoning is one of the main intelligent activities of human beings, and of the ways by which human thinking comes from one idea to a related idea." –
ty, smart person.
Deepseek-R1 is a reasoning model (unlike GPT-4o, Sonnet3.5, JLLM, etc. When it thinks about your prompt and instruction, this is its thought process (relative):
First, I should set the scene...
Next, sensory details...
I need to show their internal state…
Ending with a poignant image...
Let me check if I'm covering all elements...
[More information in this article: ]
The result:
thinking output (because of CoT= Chain of Thinking, aka: "Okay, let me think about this...") ⇒ slower response time
stricter adherence to API— you need to be specific about exactly WHAT and HOW you want it to act.
very fussy about how you structure your prompt, what words you use etc.
however, when it understands you, it cooks a full-course meal
i'll leave you with this from datacamp.com,
DeepSeek-V3 doesn’t have reasoning ability. As we mentioned, it works as a next-word predictor. This means that it can answer questions whose answers are encoded in the training data.
Because the amount of data used to train these models is so huge, it is capable of answering questions on almost any topic. Like other LLMs, it excels in natural-sounding conversation and creativity. It's the model we want for creating writing, content creation, or answering generic questions that are likely to have been solved already numerous times.
DeepSeek-R1, on the other hand, shines when it comes to complex problem-solving, logic, and step-by-step reasoning tasks. It's designed to tackle challenging queries that require thorough analysis and structured solutions. When faced with complex coding challenges or detailed logical puzzles, R1 is the tool to rely on.
so now you know!
Official DeepSeek API (paid)
OpenRouter method (free****‼️)
COPY THE KEY INTO YOUR NOTES || looks something like sk-v1-xxxxxxx
For more troubleshooting and setup tips (also how to set up Chutes),
‼️ With this method, TEMP doesn't affect R1.
FOR V3: this is how temp works for this model
if this doesn't work for you, refer to (last edit: 14 Apr 25)