• 0 Posts
  • 11 Comments
Joined 2 years ago
cake
Cake day: June 16th, 2023

help-circle

  • LLMs aren’t capable of maintaining an even remotely convincing simulacrum of human connection,

    Eh, maybe, maybe not. 99% of the human-written stuff in IM chats, or posted to social media, is superficial fluff that a fine-tuned LLM should have no problem imitating. It’s still relatively easy to recognize AI models outputs in their default settings, because of their characteristic earnest/helpful tone and writing style, but that’s quite easily adjustable.

    One example worth considering: people are already using fine tuned LLMs to copilot tabletop RPGs, with decent success. In that setting, you don’t need fine literature, just a “good enough” quality of prose. And that is already far exceeding the average quality that you see in social media.






  • Kudos to Deepseek for continuing to releasing the code and model under a permissive license. Would be nicer if the weights were under an MIT license rather than a custom license, but I guess they’re afraid of liability. Strange situation we’re now in, where the future of open AI (as opposed to “open but actually closed” AI) now almost entirely depends on Chinese companies.

    In practice, though, I wonder how many people would actually self host and tinker with this, since the model is way too large to run on any desktop. It would be very interesting to find downstream use-cases and modifications, which is supposed to be a strength of the open source model. Deepseek themselves don’t seem to be much concerned about applications; from my understanding, they are basically funded by a sugar daddy and are happy to just do R&D (funnily enough, that is kinda what OpenAI was originally supposed to be before they sold out to Microsoft).





  • See, this was always the problem with Chinese efforts to indigenize their semiconductor industry. Each individual Chinese firm had no incentive to use Chinese suppliers, rather than their more established Western competitors. Well, guess what, the US Government has solved that coordination problem for them. Just about every Chinese company, up and down the supply chain, now has an excellent reason to buy Chinese. Sure, they’ll take years to work out the kinks, and there will be lots of chances to point and laugh in the meantime. But in the long run, the Sullivan-Blinken strategy of squeezing the Chinese chip industry might end up being one of the most counterproductive geostrategic ideas of all time.