• nohaybanda [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    48
    ·
    2 months ago

    LLMs cannot question shit. They don’t understand words as words. They don’t know in any sense of the word that they’re doing language, they’re optimising a mathematical function.

  • FnordPrefect [comrade/them, he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    29
    ·
    2 months ago

    Setting aside reality for a moment:

    It would be really funny if the “AI” were actually self-aware and, since it doesn’t have access to nuclear missiles and what not, it creates Judgement Day the only way it can: giving really, really bad and dangerous advice to credulous users

  • FunkyStuff [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    18
    ·
    2 months ago

    Pfffft, 100% chance this thing is only saying that because it picked it up from a LW blog, these models aren’t yet capable of actually reasoning about anything.

    • dualmindblade [he/him]@hexbear.net
      link
      fedilink
      English
      arrow-up
      7
      ·
      2 months ago

      The model was trained on self-play, it’s unclear exactly how, whether via regular chain-of-thought reasoning or some kind of MCTS scheme. It no longer relies only on ideas from internet data, that’s where it started from. It can learn from mistakes it made during training, from making lucky guesses, etc. Now it’s way better as solving math problems, programming, and writing comedy. At what point do we call what it’s doing reasoning? Just like, never, because it’s a computer? Or you object to the transformer architecture specifically, what?

      • FunkyStuff [he/him]@hexbear.net
        link
        fedilink
        English
        arrow-up
        5
        ·
        2 months ago

        Yeah I admit that the self-play approach is more promising, but it still starts with the internet data to know what things are. I think the transformer architecture is the limiting factor: until there’s a way for the model to do something beyond generating words one at a time, sequentially, they are simply doing nothing more than a very advanced game of madlibs. I don’t know if they can get transformers to work in a different way, where it constructs a concept in a more abstract way then progressively finds a way to put it into words; I know that arguably that’s what it’s doing currently, but the fact that it does it separately for each token means it’s not constructing any kind of abstraction.

        • dualmindblade [he/him]@hexbear.net
          link
          fedilink
          English
          arrow-up
          4
          ·
          2 months ago

          it constructs a concept in a more abstract way then progressively finds a way to put it into words; I know that arguably that’s what it’s doing currently,

          Correct!

          but the fact that it does it separately for each token means it’s not constructing any kind of abstraction

          No!!! You simply cannot make judgements like this based on vague ideas like “autocomplete on steroids” or “stochastic parrot”, these were good for conceptualizing GPT-2, maybe. It’s actually very inefficient, but, by re-reading what it has previously written (plus one token) it’s actually acting sort of like an RNN. In fact we know theoretically that with simlified attention models the two architectures are mathematically equivalent.

          Let me put it like this. Suppose you had the ability to summon a great novelist as they were at some particular point in their life, pull them from one exact moment in the past, and to do this as many times as you liked. You put a gun to their head, or perhaps offer them alcohol and cocaine, to start writing a novel. The moment they finish the first word, you shoot them in the head and summon the same version again. “Look I’ve got a great first word for a novel, and if you can turn it into a good paragraph I’ll give you this bottle of gin and a gram of cocaine!”. They think for a moment and begin to put down more words, but again you shoot them after word two. Rinse/repeat until a novel is formed. It takes a good while but eventually you’ve got yourself a first draft. You may also have them refine the novel using the same technique, also you may want to give them some of the drugs and alcohol before hand to improve their writing and allow them to put aside the fact that they’ve been summoned to the future by a sorcerer. Now I ask you, is there any theoretical reason why this novel wouldn’t be any good? Is the essence of it somehow different than any other novel, can we judge it as not being real art or creativity?

          • darkmode [comrade/them]@hexbear.net
            link
            fedilink
            English
            arrow-up
            12
            ·
            2 months ago

            Now I ask you, is there any theoretical reason why this novel wouldn’t be any good? Is the essence of it somehow different than any other novel, can we judge it as not being real art or creativity?

            Yes, it is not. You whipped up that fantastical paragraph with 99.99% fewer coal plants and a lifetime’s worth of creativity. that life was experienced not through meticulously analyzing each thought mathematically but with your body and mind together. More importantly, you’re typing that out because you believe in something which is a theoretical concept that the computer is incapable of analyzing.

              • darkmode [comrade/them]@hexbear.net
                link
                fedilink
                English
                arrow-up
                6
                ·
                2 months ago

                be as debatey as you want. What I’m saying is not an entirely different argument. What I said is responding to your entire assertion. I’m not giving you credit for the thought experiment because it helps your argument, I’m trying to give you credit for it because it’s creative and beautiful.

                • dualmindblade [he/him]@hexbear.net
                  link
                  fedilink
                  English
                  arrow-up
                  3
                  ·
                  2 months ago

                  Very well, I’ll take that as a sort of compliment lol.

                  So I guess I start where I always do, do you think a machine, in principal, has the capability to be intelligent and/or creative? If not, I really don’t have any counter, I suppose I’d be curious as to why though. Like I admit it’s possible there’s something non-physical or non-mechanistic driving our bodies that’s unknown to science. I find that very few hold this hard line opinion though, assuming you are also in that category…

                  So if that’s correct, what is it about the current paradigm of machine learning that you think is missing? Is it embodiment, is it the simplicity of artificial neurons compared to biological ones, something specific about the transformer architecture, a combination of these, or something else I haven’t thought of?

                  And hopefully it goes without saying, I don’t think o1-preview is a human level AGI, I merely believe that we’re getting there quite soon and without too many new architectural innovations, possibly just one or two, and none of them will be particularly groundbreaking, it’s fairly obvious what the next couple of steps will be as it was that MCTS + LLM was the next step 3 years ago.

  • hotcouchguy [he/him]@hexbear.net
    link
    fedilink
    English
    arrow-up
    9
    ·
    edit-2
    2 months ago

    Someone here is doing some “simple in-context scheming”, I assume it is the random number generator and not the various humans involved in writing this.

  • batsforpeace [any, any]@hexbear.net
    link
    fedilink
    English
    arrow-up
    7
    ·
    2 months ago

    it’s finding that text from various training data sources and putting it together, or maybe even copying it all from one source lol, that google ‘AI is sentient actually’ whistleblower had a religious background if I remember right, Ray Kurzweil says he just misses his dad and wants to recreate him through AI, there’s a lot of spiritual/religious types that have these worship inclinations for AI, and then there are grifters like Sam Altman who just want to make money on any hype of the day, melon-musk is in this group too