• ArrogantAnalyst@feddit.de · ↑ 28 · 5 months ago

    Also, the more the internet is flooded with AI-generated content, the more future datasets will be trained on old AI output rather than on new human input.

        • FractalsInfinite@sh.itjust.works · ↑ 2 · 5 months ago

          Let's see, since the goal is to prevent web scraping, all of these should work: paywalls, account-only access, text obfuscation (e.g. using a custom font that maps letters randomly to other ones, so the page looks fine to a reader but like gibberish to a web scraper), HTML obfuscation (inserting random characters into the HTML, then hiding them with CSS), and many more.
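
To make the last two ideas concrete, here is a rough Python sketch of the server-side half of each. Every name in it (`make_glyph_map`, `scramble`, `add_decoys`, the CSS class `x`) is made up for illustration, and it does not build the actual font file: it assumes you would separately ship a custom font that draws each scrambled letter with the original letter's shape, plus a stylesheet rule like `.x { display: none; }`.

```python
import html
import random
import re
import string


def make_glyph_map(seed):
    """Build a random one-to-one mapping over lowercase letters.

    Hypothetical helper: a matching custom font (not generated here)
    would have to draw each mapped letter with the original's shape.
    """
    rng = random.Random(seed)
    letters = list(string.ascii_lowercase)
    shuffled = letters[:]
    rng.shuffle(shuffled)
    return dict(zip(letters, shuffled))


def scramble(text, glyph_map):
    """Text obfuscation: swap letters so the raw page text is gibberish."""
    return "".join(glyph_map.get(ch, ch) for ch in text)


def unscramble(text, glyph_map):
    """Invert scramble(); this is what the custom font does visually."""
    inverse = {v: k for k, v in glyph_map.items()}
    return "".join(inverse.get(ch, ch) for ch in text)


def add_decoys(text, seed, every=3):
    """HTML obfuscation: insert a random decoy letter after every
    `every` real characters, wrapped in a span that CSS is assumed
    to hide (.x { display: none; })."""
    rng = random.Random(seed)
    parts = []
    for i, ch in enumerate(text):
        parts.append(html.escape(ch))
        if i % every == every - 1:
            decoy = rng.choice(string.ascii_lowercase)
            parts.append(f'<span class="x">{decoy}</span>')
    return "".join(parts)


def strip_decoys(markup):
    """What a browser applying the CSS effectively shows the reader."""
    visible = re.sub(r'<span class="x">.</span>', "", markup)
    return html.unescape(visible)


if __name__ == "__main__":
    glyphs = make_glyph_map(seed=42)
    print(scramble("hello world", glyphs))   # gibberish in the page source
    print(add_decoys("hello world", seed=7))  # markup with hidden decoys
```

Both tricks are only speed bumps: a scraper that renders the page and applies the CSS, or runs OCR on a screenshot, still gets the real text, and the font trick also breaks copy/paste and screen readers for real users.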