Screenshot of this question was making the rounds last week. But this article covers testing against all the well-known models out there.

Also includes outtakes on the ‘reasoning’ models.

  • ryannathans@aussie.zone
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    7
    ·
    4 days ago

    Opus 4.6 has been excellent at problem solving in software development, no surprises it nails it

    It’s no surprise public opinion is these tools are trash when the free models are unable to answer simple questions

    • NaibofTabr@infosec.pub
      link
      fedilink
      English
      arrow-up
      31
      arrow-down
      7
      ·
      4 days ago

      It’s no surprise public opinion is these tools are trash when the free models are unable to answer simple questions

      The tools are trash not because they are unreliable but because they are actively destroying human society and culture. They are destroying art, science, journalism, open source software, the internet at large, and the environment we all live in. It wouldn’t matter if the generative models were accurate, they would still be garbage.

      The fact that they are unreliable just serves to highlight what a colossally destructive waste of time and resources this entire exercise has been.

        • NaibofTabr@infosec.pub
          link
          fedilink
          English
          arrow-up
          11
          arrow-down
          2
          ·
          4 days ago

          The fact is AI can make as-good or better art than most “artists” because most “art” is just cookie-cutter shit for morons.

          This is an obvious misstatement. If you actually believe this then you’re not qualified to have opinions on art in general.

          “AI” (in this context meaning generative algorithms, because there is no intelligence) can no more “make art” than it can think, or care.

          • Iconoclast@feddit.uk
            link
            fedilink
            English
            arrow-up
            2
            ·
            3 days ago

            In computer science Artificial Intelligence refers to any system designed to perform tasks that would typically require human intelligence. That includes everything from playing chess to recognizing patterns, translating languages, or generating text.

            The first ever AI system was Logic Theorist written by Allen Newell in 1956.

            Trying to redefine terms is not helpful. GenAI is AI. It’s not misuse of the term.

            • tortina_original@lemmy.world
              link
              fedilink
              English
              arrow-up
              8
              arrow-down
              3
              ·
              4 days ago

              Not sure at what point will you realize that what you quoted/said has absolutely nothing to do with the actual topic.

              Probably never.

                • atomicorange@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  4
                  ·
                  4 days ago

                  Could you define what you mean when you say the word “art”? I think this may be a semantic disagreement. I think the people you’re arguing with are using a definition similar to “human creative expression” while you seem to mean something different.

    • Fizz@lemmy.nz
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      5
      ·
      4 days ago

      The free models feel years behind so people constantly underestimate what its capable of. I still hear people say ai can’t generate fingers.

      • KeenFlame@feddit.nu
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 days ago

        No that is what the megacorps wishes. Open weight models are exactly as good but there are no commercial gpus for that so the point is only and only a class war issue

        • Fizz@lemmy.nz
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          3 days ago

          I am not able to test the open weight ones since I dont have 200gb+ of vram. So for now im gonna stay on my statement that the bleeding edge mega corp models are the best.