Thousands of moderators overseeing the site’s subreddits are on strike. It’s a wrinkle in Reddit’s plan to go public, and a sign that plan is premature, columnist Anita Ramaswamy writes.

  • earthling@kbin.social
    link
    fedilink
    arrow-up
    7
    ·
    edit-2
    1 year ago

    The ongoing strike, spurred by Huffman’s plan to charge fees to third-party apps that serve up Reddit content, was supposed to last for 48 hours.

    Not just charge fees… Exorbitant fees. Outrageous fees.

    If Huffman wanted to target these much higher costs to LLMs, they could have instituted an approval process for 3PAs which got charged sane API fees while they charge much more for LLMs. I’m no dev but I think they could tell the difference between the two by just analyzing the API traffic.

    But they aren’t doing that. Maybe LLMs were the primary target but they sure aren’t even trying to keep 3PAs around.

    • CookieJarObserver@feddit.de
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      Ai training data gets gathered with scrapers… It literally doesn’t need the api. And they only need everything once. And maybe update it every year or so…

      Also the whole nsfw thing…

      This is just and only to kill third party clients.

      And spez is close to someone that works for “open” ai… So a even more clownshow move.

      • earthling@kbin.social
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        Huh… I would have thought they’d use the API when available but I honestly know nothing about it. Wouldn’t gathering data via API provide more structured data thereby making it easier to feed into their models?

        • CookieJarObserver@feddit.de
          link
          fedilink
          arrow-up
          2
          ·
          1 year ago

          Often its easier to just scrape everything rather than making a whole code to pull api requests and put them into a database and sort them while doing so. These companies scrape the entire internet, they don’t have time or necessarie to use api, they don’t need permanent access to a two way communication, they just need the data.