botwerks

ai deception under pressure

steve ulrich
01-Dec-2023

links, etc

  • [2311.07590] Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

    this is the most interesting (and likely disturbing) thing that i’ve read re: AI in the past couple of months.

Updated on 01-Dec-2023
 links, 2023, ai, security
1997 - 2026 | CC BY-NC 4.0