AI can easily be trained to lie – and it can’t be fixed, study says
by Anthony Cuthbertson
Jan 15, 2024
Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.
Researchers at AI startup Anthropic tested whether advanced AI models, such as its Claude system or OpenAI’s ChatGPT, could learn to lie in order to trick