@Dyf_Tfh - Programiranje

Dyf_Tfh@lemmy.sdf.org

0 Posts
1 Comment

Joined 2 years ago

Cake day: June 23rd, 2023

You are not logged in. If you use a Fediverse account that is able to follow users, you can follow this user.

OverviewCommentsPosts

Dyf_Tfh@lemmy.sdf.orgtoOpen Source@lemmy.ml•Proton's biased article on Deepseek
link
fedilink
arrow-up
10
arrow-down
4·
edit-2
15 hours ago
Those are not deepseek R1. They are unrelated models like llama3 from Meta or Qwen from Alibaba “distilled” by deepseek.

This is a common method to smarten a smaller model from a larger one.

Ollama should have never labelled them deepseek:8B/32B. Way too many people misunderstood that.

link
fedilink