• sunzu2@thebrainbin.org
    link
    fedilink
    arrow-up
    2
    ·
    2 hours ago

    But the new DeepSeek model comes with a catch if run in the cloud-hosted version—being Chinese in origin, R1 will not generate responses about certain topics like Tiananmen Square or Taiwan’s autonomy, as it must “embody core socialist values,” according to Chinese Internet regulations. This filtering comes from an additional moderation layer that isn’t an issue if the model is run locally outside of China.

  • gaiussabinus@lemmy.world
    link
    fedilink
    arrow-up
    5
    arrow-down
    1
    ·
    4 hours ago

    It is very censored but is very fast and very good for normal use. Can code simple games on request and work as a one shot as well as make and follow design documents to make more sophisticated projects. Smaller models are super fast even on consumer hardware. It post its “thinking” so you can follow its pattern and address issues that would not be apparent in the output. I would recommend.

  • Aria@lemmygrad.ml
    link
    fedilink
    arrow-up
    2
    ·
    4 hours ago

    It’s the 671B model that’s competitive with o1. So you need 16 80GB cards. The comments seem very happy with the smaller versions, and I’m going to try one now, but it doesn’t seem like anything you can run on a home computer with 4 4090s is going to be in the ballpark comparable to ChatGPT.