In an age of LLMs, is it time to reconsider human-edited web directories?

Back in the early-to-mid '90s, one of the main ways of finding anything on the web was to browse through a web directory.

These directories generally had a list of categories on their front page. News/Sport/Entertainment/Arts/Technology/Fashion/etc.

Each of those categories had subcategories, and sub-subcategories that you clicked through until you got to a list of websites. These lists were maintained by actual humans.

Typically, these directories also had a limited web search that would crawl through the pages of websites listed in the directory.

Lycos, Excite, and of course Yahoo all offered web directories of this sort.

(EDIT: I initially also mentioned AltaVista. It did offer a web directory by the late '90s, but this was something it tacked on much later.)

By the late '90s, the standard narrative goes, the web got too big to index websites manually.

Google promised the world its algorithms would weed out the spam automatically.

And for a time, it worked.

But then SEO and SEM became a multi-billion-dollar industry. The spambots proliferated. Google itself began promoting its own content and advertisers above search results.

And now with LLMs, the industrial-scale spamming of the web is likely to grow exponentially.

My question is, if a lot of the web is turning to crap, do we even want to search the entire web anymore?

Do we really want to search every single website on the web?

Or just those that aren’t filled with LLM-generated SEO spam?

Or just those that don’t feature 200 tracking scripts, and passive-aggressive privacy warnings, and paywalls, and popovers, and newsletters, and increasingly obnoxious banner ads, and dark patterns to prevent you cancelling your “free trial” subscription?

At some point, does it become more desirable to go back to search engines that only crawl pages on human-curated lists of trustworthy, quality websites?

And is it time to begin considering what a modern version of those early web directories might look like?

@degoogle #tech #google #web #internet #LLM #LLMs #enshittification #technology #search #SearchEngines #SEO #SEM

  • Wren 🐁@chitter.xyz
    link
    fedilink
    arrow-up
    3
    ·
    10 months ago

    @Emperor @ajsadauskas I’ve been thinking about this myself lately - but I had wondered how a curated directory might scale, I hadn’t considered federated social bookmarking and honestly that sounds like a brilliant solution. I’d love to see something like that happen, maybe even contribute

    • ᴇᴍᴘᴇʀᴏʀ 帝A
      link
      fedilink
      English
      arrow-up
      2
      ·
      10 months ago

      As the links show, Relicious/Fedilicious has been on my mind a while and I have been mourning the loss of Delicious for a long time. However, the above got me jotting down some notes.

      It should be doable. I haven’t had a root through PostMark’s code but it might be they have done the bulk of the work already and it just needs a multiuser interface bolting on top of it.

    • Stooryduster@mastodon.scot
      link
      fedilink
      arrow-up
      2
      ·
      10 months ago

      @Wren @Emperor @ajsadauskas Back in the day people’s web sites had a links page and if their site was good it was always worth looking at what they listed as worthy links. I still have one but it’s out of habit rather than being useful. Might rethink now tho.

      • ᴇᴍᴘᴇʀᴏʀ 帝A
        link
        fedilink
        English
        arrow-up
        2
        ·
        10 months ago

        Yes, a lot of ideas knocking around this discussion are really Web 1.0 ideas given a Fediverse makeover. The advantage of using something like a federated social networking service is that you wouldn’t have to put much thought into building a links section, it would build itself as you add links while you are web surfing.

        I took a look at your site and it is working on WordPress which now uses the ActivityPub protocol, so something like that should integrate nicely.