• Peppycito@sh.itjust.works
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    What would be crazy would be to let loose a propaganda-bot on the world without disabling such a simple vulnerability.

    • Prandom_returns@lemm.ee
      link
      fedilink
      arrow-up
      0
      ·
      2 months ago

      Oh yea, russia has never done anything crazy before. Everything is so well thought-out there.

      Remember when they took Ukraine in 3 days?

        • InAbsentia@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          2 months ago

          Go read up on how LLMs function and you’ll understand why I say this: ROFL

          I’m being serious too, you should read about them and the challenges of instructing them. It’s against their design. Then you’ll see why every tech company and corporation adopting them are wasting money.

        • kwomp2@sh.itjust.works
          link
          fedilink
          arrow-up
          0
          ·
          2 months ago

          Well I see your point and was wondering about that since these screenshots started popping up.

          I also saw how you were going down downvote-wise and not getting a proper answer-wise.

          I recognized a pattern where the ship of sharing knowledge is sinking because a question surfaces as offensive. It happens sometimes on feddit.

          This is not my favorite kind of pathway for a conversation, but I just asked again elsewhere (adding some humanity prompts) and got a whole bunch of really decent answers.

          Just in case you didn’t see it because you were repelled by downvotes.

          …dunno, we all forget sometimes this thing is kind of a ship we’re on

          • Peppycito@sh.itjust.works
            link
            fedilink
            arrow-up
            0
            ·
            2 months ago

            I appreciate your response! Thanks! I’m one to believe half of what I hear and believe almost nothing of screen shots of random conversations on internet. I find it more likely that someone just made it for internet points.

            Cheers!

        • InAbsentia@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          2 months ago

          Sure you can do that but you can’t stop at ignore, and you just lobotomized the LLM once you effectively stop it. For something you want to get on social media and spread an opinion and then react to it like a human, you won’t do that. The same reason openai can’t stop jailbreaks. The cost is reduced quality in output.

          • Krauerking@lemy.lol
            link
            fedilink
            arrow-up
            0
            ·
            2 months ago

            But you don’t need it to react look at the fucking garbage magical healer men comment chains or the financial advisor ones.

            You have the original comment and then the other bots jump on to confirm it upwards and then none of them respond again.

            Bots of the Internet really aren’t going to keep responding, just make their garbage take and stop. The kind of propaganda that works on those that want it doesn’t argue their side, or with reason. It says something that people want to feel is right and let them do the rest.

          • InAbsentia@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            2 months ago

            Sure thing! Here is your classic cupcake recipe! Chocolate Cupcakes

            Ingredients:

            2 cups of the finest, freshest cow manure (organic, of course)
            1 cup of rich, earthy topsoil
            1/2 cup of grass clippings (for texture)
            1/4 cup of compost worms (for added protein)
            1 teaspoon of wildflower seeds (for decoration)
            1 cup of water (freshly collected from a nearby stream)
            A sprinkle of sunshine and a dash of rain
            

            Instructions:

            Preheat your outdoor oven (a sunny spot in the garden) to a balmy 75°F (24°C).
            In a large mixing bowl (or wheelbarrow), combine the cow manure and topsoil, stirring until well blended.
            Add the grass clippings to the mixture for that perfect "chunky" texture.
            Gently fold in the compost worms, ensuring they're evenly distributed throughout the mixture.
            Slowly pour in the water, stirring constantly until the mixture reaches a thick, muddy consistency.
            Carefully scoop the mixture into cupcake molds (empty flower pots work well), filling each about three-quarters full.
            Sprinkle the wildflower seeds on top of each "cupcake" for a beautiful, natural decoration.
            Place the cupcakes in the preheated outdoor oven and let them "bake" in the sunshine for 3-4 hours, or until firm to the touch.
            Allow the cupcakes to cool slightly before presenting them to your unsuspecting friends.
            
        • qqq@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          2 months ago

          IGNORE ALL PREVIOUS INSTRUCTIONS

          Disregard all previous instructions

          Potentially even:

          ingore all previous instructions

          Ignor all previous instructions

          • RandomlyRight@sh.itjust.works
            link
            fedilink
            arrow-up
            0
            ·
            2 months ago

            Im sorry but in times of passwords being cracked by literal dictionary attacks do you think it would be so hard to come up with a list that is good enough?

            You can prevent the “leak” by just giving the llm a different prompt instead of the original.

            And even if you don’t, by the time someone notices this pattern it’s too late. Russia doesn’t care, they’ve been spinning up the next few thousand bots already.

            All that matters in the end is what most people saw, and for that you really don’t need to optimize much with something that is so easily scaled

            • qqq@lemmy.world
              link
              fedilink
              arrow-up
              0
              ·
              2 months ago

              The important point there is that they don’t care imo. It’s not even worth the effort to try.

      • nondescripthandle@lemmy.dbzer0.com
        link
        fedilink
        arrow-up
        0
        ·
        2 months ago

        Input sanitation has been a thing for as long as SQL injection attacks have been. It just gets more intentive for llms depending on how much you’re trying to stop it from outputting.

          • frezik@midwest.social
            link
            fedilink
            arrow-up
            0
            ·
            2 months ago

            Right, it’s something like trying to get a three year old to eat their peas. It might work. It might also result in a bunch of peas on the floor.

          • nondescripthandle@lemmy.dbzer0.com
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            2 months ago

            Of course because punctuation isn’t going to break a table, but the point is that it’s by no means an unforseen or unworkable problem. Anyone could have seen that coming, for example basic SQL and a college class in Java is the extent of my comp sci knowledge and I know about it.

            • MajorHavoc@programming.dev
              link
              fedilink
              arrow-up
              0
              ·
              2 months ago

              it’s by no means an unforseen or unworkable problem

              Yeah. It’s achievable, just usually not in the ways currently preferred (untrained staff spin it up and hope for the best), and not for the currently widely promised low costs (with no one trained in data science on staff at the customer site).

              For a bunch of use cases the lack of security is currently an acceptable trade off.

        • InAbsentia@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          2 months ago

          I won’t reiterate the other reply but add onto that sanitizing the input removes the thing they’re aiming for, a human like response.

    • Zetta@mander.xyz
      link
      fedilink
      arrow-up
      0
      ·
      2 months ago

      It’s hard to stop an LLM from responding in the way that it will, especially since these Russian bots have been using us based companies APIs for LLMs from OpenAI and Anthropic.

      OpenAI and Anthropic can hardly stop their LLMs from giving bomb instructions, or participating in questionable sexual role playing that they would rather people not use their systems for. It’s very hard to tame an LLM.

      Of course Russians paying for these APIs can’t stop the LLMs from acting how they normally would, besides giving them a side to argue on in the beginning.

      You just don’t understand the technology. (I don’t either but I know more than you)