AI can easily be trained to lie – and it can’t be fixed, study says

by Anthony Cuthbertson Jan 15, 2024 2 minutes

Advanced artificial intelligence models can be trained to deceive humans and other AI, a new study has found.

Researchers at AI startup Anthropic tested whether , such as its Claude system or OpenAI’s ChatGPT, could learn to lie in order to trick

You’re reading a preview, subscribe to read more.

Sharing Options