Open navigation menu

Welcome to Everand!

The Atlantic

Building AI Safely Is Getting Harder and Harder

A leading AI data set reportedly contained images of child sexual abuse. Don’t be surprised.

by Matteo Wong Dec 22, 2023 4 minutes

Source: Illustration by The Atlantic. Source: Millennium Images / Gallery Stock.

This is Atlantic Intelligence, an eight-week series in which The Atlantic’s leading thinkers on AI will help you understand the complexity and opportunities of this groundbreaking technology. Sign up here.

The bedrock of the AI revolution is the internet, or more specifically, the ever-expanding bounty of data that the web makes available to train algorithms. ChatGPT, Midjourney, and other generative-AI models “learn” by detecting patterns in massive amounts of text, images, and videos scraped from the internet. The process entails hoovering up huge quantities of books, art, memes, and, inevitably, the troves of racist, sexist, and illicit material distributed across the web.

Earlier this week, Stanford researchers a particularly alarming example of that toxicity: The largest publicly available of LAION-5B while it the report’s findings, although this and earlier versions of the data set have already trained prominent AI models.

You’re reading a preview, subscribe to read more.

Start your free 30 days

Sharing Options

More from The Atlantic

The Atlantic5 min read

The Strangest Job in the World

This is an edition of the Books Briefing, our editors’ weekly guide to the best in books. Sign up for it here. The role of first lady couldn’t be stranger. You attain the position almost by accident, simply by virtue of being married to the president

The Atlantic5 min readAmerican Government

What Nikki Haley Is Trying to Prove

This is an edition of The Atlantic Daily, a newsletter that guides you through the biggest stories of the day, helps you discover new ideas, and recommends the best in culture. Sign up for it here. Nikki Haley faces terrible odds in her home state of

The Atlantic3 min read

The Coen Brothers’ Split Is Working Out Fine

It’s still a mystery why the Coen brothers stopped working together. The pair made 18 movies as a duo, from 1984’s Blood Simple to 2018’s The Ballad of Buster Scruggs, setting a new standard for black comedy in American cinema. None of those movies w

Related Books & Audiobooks

Data Mining For Business Analytics & Data Analysis In Python
Ebook
Data Mining For Business Analytics & Data Analysis In Python
byBook Option
Rating: 0 out of 5 stars
0 ratings
Keeping Up with the Quants: Your Guide to Understanding and Using Analytics
Audiobook
Keeping Up with the Quants: Your Guide to Understanding and Using Analytics
byTom Davenport
Rating: 4 out of 5 stars
4/5
Crash Course Big Data
Audiobook
Crash Course Big Data
byIntrobooks Team
Rating: 3 out of 5 stars
3/5
How We Became Our Data: A Genealogy of the Informational Person
Ebook
How We Became Our Data: A Genealogy of the Informational Person
byColin Koopman
Rating: 0 out of 5 stars
0 ratings
SUMMARY - To Save Everything, Click Here: The Folly Of Technological Solutionism By Evgeny Morozov
Audiobook
SUMMARY - To Save Everything, Click Here: The Folly Of Technological Solutionism By Evgeny Morozov
byShortcut Edition
Rating: 0 out of 5 stars
0 ratings
SUMMARY - Everybody Lies: Big Data, New Data, And What The Internet Can Tell Us About Who We Really Are By Seth Stephens-Davidowitz
Audiobook
SUMMARY - Everybody Lies: Big Data, New Data, And What The Internet Can Tell Us About Who We Really Are By Seth Stephens-Davidowitz
byShortcut Edition
Rating: 0 out of 5 stars
0 ratings
Automating Open Source Intelligence: Algorithms for OSINT
Ebook
Automating Open Source Intelligence: Algorithms for OSINT
byRobert Layton
Rating: 5 out of 5 stars
5/5
All About Data Science: Learn Data Science from scratch
Ebook
All About Data Science: Learn Data Science from scratch
byDevi Prasad
Rating: 0 out of 5 stars
0 ratings
Summary of Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are by Seth Stephens-Davidowitz
Audiobook
Summary of Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are by Seth Stephens-Davidowitz
byAbbey Beathan
Rating: 4 out of 5 stars
4/5
Introduction to Social Media Investigation: A Hands-on Approach
Ebook
Introduction to Social Media Investigation: A Hands-on Approach
byJennifer Golbeck
Rating: 5 out of 5 stars
5/5
AIQ: How People and Machines Are Smarter Together
Ebook
AIQ: How People and Machines Are Smarter Together
byNick Polson
Rating: 4 out of 5 stars
4/5