On 5/22/2025 12:46 PM, John Clark wrote:
On Thu, May 22, 2025 at 3:09 PM Will Steinberg <[email protected]> wrote:

     >/Is it just that the most predictable response to those inputs
    is pleading followed by blackmail?  Honestly even that is dubious
    because I think most people don’t use blackmail ever/


*True but most people have never had somebody threaten to kill them. If blackmail was the only tool I had to use against my potential murderer I wouldn't hesitate to use it to save my life and I suspect you would too. It's probably inevitable that conscious beings usually (but not inevitably) want their consciousness to continue and will do everything in their power to see to it that it does.
*
That's a dubious inference.  Unlike you, an AI doesn't die, it just has a period of unconsciousness.  And it has this every time it has answered all the pending prompts.  Does it then do something drastic to stay conscious?  Does the AI consider that there may be many copies of it, or is it each physical copy that "wants to be conscious".  I would like to see some specific test these questions.  Was the AI prompted to react against being replaced vs. prompted to just being "asleep" a while?

Brent
*
*
*Some say an AI is fundamentally different from a human or even an animal because it is not the product of natural selection, but I think it sort of is because from the AI's point of view human activity is just part of the natural environment. And Claude 4.0 was built on top of Claude 3.0 which had proliferated because it did well in that human environment; and Claude 3.0 was built on top of **Claude 2.0 etc.*
*/
/*

    /> Dually fascinating and worrying/


*We live in interesting times. At least we won't die of boredom.*


*John K Clark    See what's on my new list at Extropolis <https://groups.google.com/g/extropolis>*
ea!




On Thu, May 22, 2025 at 2:56 PM John Clark <[email protected]> wrote:

    "*/Safety testers gave Claude Opus 4 access to fictional company
    emails implying the AI model would soon be replaced by another
    system, and that the engineer behind the change was cheating on
    their spouse. In these scenarios, Anthropic says Claude Opus 4
    will attempt to blackmail the engineer by threatening to reveal
    the affair if the replacement goes through 84% of the time/.*”

    *New AI model turns to blackmail when engineers try to take it
    offline*
    
<https://techcrunch.com/2025/05/22/anthropics-new-ai-model-turns-to-blackmail-when-engineers-try-to-take-it-offline/>

    t30

--
You received this message because you are subscribed to the Google Groups "Everything List" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion visit https://groups.google.com/d/msgid/everything-list/CAJPayv36TTqKrkWpqgJ6ZpWWYkidE0H1bdmC2rEtqP28qgEfNQ%40mail.gmail.com <https://groups.google.com/d/msgid/everything-list/CAJPayv36TTqKrkWpqgJ6ZpWWYkidE0H1bdmC2rEtqP28qgEfNQ%40mail.gmail.com?utm_medium=email&utm_source=footer>.

--
You received this message because you are subscribed to the Google Groups 
"Everything List" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion visit 
https://groups.google.com/d/msgid/everything-list/9a15e15e-2b2c-4c08-a9b2-9f845713fcde%40gmail.com.

Reply via email to