
Data Privacy: A runbook for engineers
Ebook · 830 pages · 9 hours


About this ebook

Engineer privacy into your systems with these hands-on techniques for data governance, legal compliance, and surviving security audits.

In Data Privacy you will learn how to:

Classify data based on privacy risk
Build technical tools to catalog and discover data in your systems
Share data with technical privacy controls to measure reidentification risk
Implement technical privacy architectures to delete data
Set up technical capabilities for data export to meet legal requirements like Data Subject Access Requests (DSARs)
Establish a technical privacy review process to help accelerate the legal Privacy Impact Assessment (PIA)
Design a Consent Management Platform (CMP) to capture user consent
Implement security tooling to help optimize privacy
Build a holistic program that will get support and funding from the C-Level and board
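To make the first item above concrete, here is a minimal sketch of classifying fields by privacy risk. The tier names and field names are hypothetical illustrations, not the book's own scheme (which it develops in its data classification chapter):

```python
# Hypothetical risk tiers and field names, for illustration only.
RISK_TIERS = {
    "ssn": "restricted",
    "email": "confidential",
    "city": "internal",
    "app_version": "public",
}

def classify(field_name: str) -> str:
    """Return the privacy-risk tier for a field, failing closed:
    unknown fields get the most restrictive tier."""
    return RISK_TIERS.get(field_name, "restricted")
```

Failing closed for unknown fields is a deliberate choice: a new, unclassified column is treated as sensitive until someone reviews it.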

Data Privacy teaches you to design, develop, and measure the effectiveness of privacy programs. You’ll learn from author Nishant Bhajaria, an industry-renowned expert who has overseen privacy at Google, Netflix, and Uber. The terminology and legal requirements of privacy are all explained in clear, jargon-free language. The book’s constant awareness of business requirements will help you balance trade-offs and ensure your users’ privacy can be improved without spiraling time and resource costs.

About the technology
Data privacy is essential for any business. Data breaches, vague policies, and poor communication all erode a user’s trust in your applications. You may also face substantial legal consequences for failing to protect user data. Fortunately, there are clear practices and guidelines to keep your data secure and your users happy.

About the book
Data Privacy: A runbook for engineers teaches you how to navigate the trade-offs between strict data security and real world business needs. In this practical book, you’ll learn how to design and implement privacy programs that are easy to scale and automate. There’s no bureaucratic process—just workable solutions and smart repurposing of existing security tools to help set and achieve your privacy goals.

What's inside

Classify data based on privacy risk
Set up capabilities for data export that meet legal requirements
Establish a review process to accelerate privacy impact assessment
Design a consent management platform to capture user consent

About the reader
For engineers and business leaders looking to deliver better privacy.

About the author
Nishant Bhajaria leads the Technical Privacy and Strategy teams for Uber. His previous roles include head of privacy engineering at Netflix, and data security and privacy at Google.

Table of Contents
PART 1 PRIVACY, DATA, AND YOUR BUSINESS
1 Privacy engineering: Why it’s needed, how to scale it
2 Understanding data and privacy
PART 2 A PROACTIVE PRIVACY PROGRAM: DATA GOVERNANCE
3 Data classification
4 Data inventory
5 Data sharing
PART 3 BUILDING TOOLS AND PROCESSES
6 The technical privacy review
7 Data deletion
8 Exporting user data: Data Subject Access Requests
PART 4 SECURITY, SCALING, AND STAFFING
9 Building a consent management platform
10 Closing security vulnerabilities
11 Scaling, hiring, and considering regulations
Language: English
Publisher: Manning
Release date: Mar 22, 2022
ISBN: 9781638357186
    Book preview

    Data Privacy - Nishant Bhajaria

    inside front cover

    The four key privacy expectations that companies face and their associated privacy solutions

    Data Privacy

    A runbook for engineers

    Nishant Bhajaria

    Foreword by Neil Hunt

    To comment go to liveBook

    Manning

    Shelter Island

    For more information on this and other Manning titles go to

    www.manning.com

    Copyright

    For online information and ordering of these and other Manning books, please visit www.manning.com. The publisher offers discounts on these books when ordered in quantity.

    For more information, please contact

    Special Sales Department

    Manning Publications Co.

    20 Baldwin Road

    PO Box 761

    Shelter Island, NY 11964

    Email: orders@manning.com

    ©2022 by Manning Publications Co. All rights reserved.

    No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by means electronic, mechanical, photocopying, or otherwise, without prior written permission of the publisher.

    Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in the book, and Manning Publications was aware of a trademark claim, the designations have been printed in initial caps or all caps.

    ♾ Recognizing the importance of preserving what has been written, it is Manning’s policy to have the books we publish printed on acid-free paper, and we exert our best efforts to that end. Recognizing also our responsibility to conserve the resources of our planet, Manning books are printed on paper that is at least 15 percent recycled and processed without the use of elemental chlorine.

    ISBN: 9781617298998

    brief contents

    Part 1. Privacy, Data, and Your Business

      1 Privacy engineering: Why it’s needed, how to scale it

      2 Understanding data and privacy

    Part 2. A proactive privacy program: Data governance

      3 Data classification

      4 Data inventory

      5 Data sharing

    Part 3. Building tools and processes

      6 The technical privacy review

      7 Data deletion

      8 Exporting user data: Data Subject Access Requests

    Part 4. Security, scaling, and staffing

      9 Building a consent management platform

    10 Closing security vulnerabilities

    11 Scaling, hiring, and considering regulations

    contents

    Front matter

    foreword

    preface

    acknowledgments

    about this book

    about the author

    about the cover illustration

    Part 1. Privacy, Data, and Your Business

      1 Privacy engineering: Why it’s needed, how to scale it

    1.1  What is privacy?

    1.2  How data flows into and within your company

    1.3  Why privacy matters

    The fines are real

    Early-stage efficiency wins can cause late-stage privacy headaches

    Privacy investigations could be more than a speed bump

    Privacy process can unlock business opportunities: A real-life example

    1.4  Privacy: A mental model

    1.5  How privacy affects your business at a macro level

    Privacy and safety: The COVID edition

    Privacy and regulations: A cyclical process

    1.6  Privacy tech and tooling: Your options and your choices

    The build vs. buy question

    Third-party privacy tools: Do they really work and scale?

    The risks in buying third-party privacy tools

    1.7  What this book will not do

    1.8  How the role of engineers has changed, and how that has affected privacy

      2 Understanding data and privacy

    2.1  Privacy and what it entails

    Why privacy is hard

    Privacy engineering on the ground: What you have to accomplish

    Privacy, data systems, and policy enforcement

    2.2  This could be your company

    2.3  Data, your business growth strategy, and privacy

    2.4  Examples: When privacy is violated

    Equifax

    The Office of Personnel Management (OPM) breach

    LabCorp and Quest Diagnostics

    2.5  Privacy and the regulatory landscape

    How regulations impact your product and their users

    How your program should help prepare for changing privacy law

    2.6  Privacy and the user

    Becoming an American, and privacy

    Today’s users and their privacy concerns

    2.7  After building the tools comes the hard part: Building a program

    2.8  As you build a program, build a privacy-first culture

    Part 2. A proactive privacy program: Data governance

      3 Data classification

    3.1  Data classification and customer context

    3.2  Why data classification is necessary

    Data classification as part of data governance

    Data classification: How it helps align priorities

    Industry benchmarking around data classification

    Unstructured data and governance

    Data classification as part of your maturity journey

    3.3  How you can implement data classification to improve privacy

    Data classification and access options

    Data classification, access management, and privacy: Example 1

    Data classification, access management, and privacy: Example 2

    3.4  How to classify data with a focus on privacy laws

    Data classification as an abstraction of privacy laws

    Data classification to resolve tension between interpretations of privacy laws

    3.5  The data classification process

    Working with cross-functional stakeholders on your data classification

    Formalizing and refactoring your data classification

    The data classification process: A Microsoft template

    3.6  Data classification: An example

      4 Data inventory

    4.1  Data inventory: What it is and why you need it

    4.2  Machine-readable tags

    What are data inventory tags?

    Data inventory tags: A specific example

    4.3  Creating a baseline

    4.4  The technical architecture

    Structured and unstructured data

    Data inventory architectural capabilities

    Data inventory workflow

    4.5  Understanding the data

    The metadata definition process

    The metadata discovery process

    4.6  When should you start the data inventory process?

    Why is the data inventory process so hard?

    Data inventory: Sooner is better than later

    4.7  A data inventory is not a binary process

    Data inventory level 1

    Data inventory level 2

    Data inventory level 3

    4.8  What does a successful data inventory process look like?

    Data inventory objective success metrics

    Data inventory subjective success metrics

      5 Data sharing

    5.1  Data sharing: Why companies need to share data

    Data sharing: Taxicab companies

    Data sharing: Online advertising

    Privacy in advertising

    5.2  How to share data safely: Security as an ally of privacy

    Tracking President Trump

    Protecting data in motion

    Protecting data at rest

    5.3  Obfuscation techniques for privacy-safe data sharing

    Data sharing and US national security

    Data anonymization: The relationship between precision and retention

    Data anonymization: The relationship between precision and access

    Data anonymization: Mapping universal IDs to internal IDs

    5.4  Sharing internal IDs with third parties

    Use case 1: Minimal session (no linking of user activity is needed)

    Use case 2: Single session per dataset (linking of the same user’s activity within a dataset)

    Use case 3: Session spanning datasets (linking across datasets)

    Recovering pseudonymized values

    5.5  Measuring privacy impact

    K-anonymity

    L-diversity

    5.6  Privacy harms: This is not a drill

    Facebook and Cambridge Analytica

    Sharing data and weaknesses

    Part 3. Building tools and processes

      6 The technical privacy review

    6.1  What are privacy reviews?

    The privacy impact assessment (PIA)

    The data protection impact assessment (DPIA)

    6.2  Implementing the legal privacy review process

    6.3  Making the case for a technical privacy review

    Timing and scope

    What the technical review covers that the legal review does not

    6.4  Integrating technical privacy reviews into the innovation pipeline

    Where does the technical privacy review belong?

    How to implement a technical privacy intake?

    6.5  Scaling the technical privacy review process

    Data sharing

    Machine-learning models

    6.6  Sample technical privacy reviews

    Messaging apps and engagement apps: Do they connect?

    Masks and contact tracing

      7 Data deletion

    7.1  Why must a company delete data?

    7.2  What does a modern data collection architecture look like?

    Distributed architecture and microservices: How companies collect data

    How real-time data is stored and accessed

    Archival data storage

    Other data storage locations

    How data storage grows from collection to archival

    7.3  How the data collection architecture works

    7.4  Deleting account-level data: A starting point

    Account deletion: Building the tooling and process

    Scaling account deletion

    7.5  Deleting account-level data: Automation and scaling for distributed services

    Registering services and data fields for deletion

    Scheduling data deletion

    7.6  Sensitive data deletion

    7.7  Who should own data deletion?

      8 Exporting user data: Data Subject Access Requests

    8.1  What are DSARs?

    What rights do DSAR regulations give to users?

    An overview of the DSAR request fulfillment process

    8.2  Setting up the DSAR process

    The key steps in creating a DSAR system

    Building a DSAR status dashboard

    8.3  DSAR automation, data structures, and data flows

    DSAR components

    Cuboids: A subset of DSAR data

    DSAR templates

    Data sources for DSAR templates

    8.4  Internal-facing screens and dashboards

    Part 4. Security, scaling, and staffing

      9 Building a consent management platform

    9.1  Why consent management is important

    Consent management and privacy-related regulation

    Consent management and tech industry changes

    Consent management and your business

    9.2  A consent management platform

    9.3  A data schema model for consent management

    The entity relationships that help structure a CMP

    Entity relationship schemas: A CMP database

    9.4  Consent code: Objects

    API to check consent status

    API to retrieve disclosures

    API to update the consent status for a disclosure

    API to process multiple disclosures

    API to register with the consents service

    Useful definitions for the consents service

    9.5  Other useful capabilities in a CMP

    9.6  Integrating consent management into product workflow

    10 Closing security vulnerabilities

    10.1  Protecting privacy by reducing the attack surface

    Managing the attack surface

    How testing can cause security and privacy risks

    An enterprise risk model for security and privacy

    10.2  Protecting privacy by managing perimeter access

    The Target breach

    MongoDB security weaknesses

    Authorization best practices

    Why continuous monitoring of accounts and credentials is important

    Remote work and privacy risk

    10.3  Protecting privacy by closing access-control gaps

    How an IDOR vulnerability works

    IDOR testing and mitigation

    11 Scaling, hiring, and considering regulations

    11.1  A maturity model for privacy engineering

    Identification

    Protection

    Detection

    Remediation

    11.2  The privacy engineering domain and skills

    11.3  Privacy and the regulatory climate

    index

    front matter

    foreword

    I met Nishant while I was leading the product and engineering team at Netflix, where I had been since the beginning of the company. The team was about 500 strong, and while we had had early brushes with security challenges, we had not tackled privacy in a significant way until we faced blowback from the Netflix Prize, and then GDPR and CCPA in quick succession. We were building out the team, the philosophy, and the deliverables at the same time, and Nishant was a key part of that team—someone who spoke both engineering and privacy, who understood the pragmatics, the needs of the business, the limits on engineering effort, and the commitments we had made (and needed to make) to our customers and how to fulfill them.

    For the Netflix Prize, 2006–2009, we wanted to publish a large dataset of 100M ratings from 500k users (e.g. user N liked title T with 4 stars) and offer a $1M prize for the team who could best build a prediction engine to predict ratings on a test set held back from the competitors. Obviously we needed to anonymize the dataset, but James Bennett, who ran the prize effort for me, also took a sophisticated approach of randomizing a percentage of the ratings so they could not be matched to other public sources. However, Arvind Narayanan and Vitaly Shmatikov at the University of Texas at Austin wrote a paper showing that statistical re-identification techniques could match ratings to IMDB and expose the identities of several individuals—a possibility we hadn’t sufficiently thought through. This was a wakeup call for me.

    Around this time, there was an escalating series of breaches at various other companies, disclosing personal information including names, addresses, SSNs, and credit cards. It was easy to view these as security problems, but in many cases the breach was less a penetration of defenses than a leak by or through an insider, or an accident. As we studied how to avoid being hit ourselves, it became clearer and clearer that while we needed to have strong security measures, it would also be necessary to design our IT systems to limit and segregate personal information so that accidents were unlikely, insiders had less opportunity to leak data (and more incentive not to), and hackers would have to work much harder to put the pieces together.

    Then came the GDPR regulation, as a harbinger of many new privacy regulations that are still rolling out as I write this in 2021. GDPR (and later CCPA) added the new consideration that individuals should have the right to know what data was collected, to be able to see that data, to fix it if incorrect, and to delete it if they wished. This further reinforced the need to design our systems with privacy in mind, to make all these things easier to accomplish.

    For Netflix, this meant segregating our personally identifying information in tokenized data stores, ensuring that all references were indirect, and adding policies, controls, and auditing around access to those stores. Accomplishing this on a system running at scale, without impacting performance, was a significant challenge, and one in which Nishant was a key leader. It made me wish that we had planned more for this when starting out, and that we didn’t have to build it after the fact—and I started to think and communicate about principles of design for privacy with my team.

    This book takes that thinking further and deeper. It is written for professionals in technology companies facing the same challenges that we faced then, but in an ever more stringent and demanding environment, in which privacy matters more to individuals and thus to regulators, more data is stored and more breaches and disclosures happen all the time, technology platforms are less monolithic and more bolted together from various partnerships and services, and public opinion about technology companies has turned increasingly negative on their use and abuse of private data.

    Your digital exhaust can be incredibly valuable, and can be used to pay for services and products which are offered for free (or to boost revenue for products sold for a fee). Free (or reduced cost) has always been an attractive model to consumers, but now people are becoming more savvy and demanding about what is done with their data, and companies are being more aggressive about deriving maximum value from that data to pay for ever richer and more interesting products.

    But your private information linked to behavior is increasingly used in services or systems that are unavoidable: from government services, health, banking, travel infrastructure, to third party infrastructure like ratings agencies. These systems, being non-optional, have an even bigger responsibility to use your personal information safely, since you can’t vote with your feet and avoid companies that abuse your trust.

    This requires that companies think clearly about what they will do with the data, are clear and up-front about it with their users, and do it in a safe way that restores some of the lost trust.

    Executing on those requirements starts with the people: inculcating a privacy sensitivity, a state of mind, that makes privacy a first-order topic throughout an organization.

    Then it requires thoughtful product and service design, thinking about what is needed, and for how long, and what to do with it afterwards, and the ability to clearly communicate that with users.

    And then it requires technology design and implementation that makes it possible to comply with the promises made and the regulations that need to be followed, without becoming a burden on the organization preventing productivity, agility, and the ability to deliver value. The design needs to anticipate future privacy needs that will come in evolving public expectations and future privacy regulations that will inevitably arise as the public concerns develop.

    This book will give you a better appreciation for what privacy is and why it matters; with frequent examples of breaches and leaks, it provokes you to ask: What if that were my organization? How can I take steps to lower the risk?

    Nishant describes methodologies for classifying and talking about data with differing privacy sensitivities, where that data goes and what it is used for, and prompts you to ask the questions: Is it necessary for the purpose? Is it what I would want as a customer? Is it ethical? Is it compliant with our policies and with regulations? Then he considers sharing with other parts of the organization, and (increasingly important) with partners, suppliers, and vendors, and how to ask the right questions of those other organizations before you trust them with your users’ data.

    A big part of the book is about technical design to make it easy to keep private information private. Techniques include encryption, hashing, tokenization, and ways to segregate data to secure the private data. Another aspect is avoiding informal data collection (such as logging or debug streams) that inadvertently capture PII in an insecure way. This requires tooling to support collecting useful data without PII, and educational programs that ensure that engineers are mindful of the need to take care.
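As a taste of the tokenization idea, here is a sketch of pseudonymizing a PII value with a keyed hash. This is an illustration under stated assumptions, not the book's implementation; the key name is hypothetical, and a real system would store and rotate the key in a secrets manager rather than hard-coding it:

```python
import hashlib
import hmac

# Hypothetical key, for illustration only; in production it would
# live in a secrets manager and be rotated, never hard-coded.
PSEUDONYM_KEY = b"example-only-key"

def pseudonymize(value: str) -> str:
    """Replace a PII value (e.g., an email address) with a keyed hash,
    so logs and shared datasets carry a stable opaque token instead of
    the raw value; the same input always maps to the same token."""
    return hmac.new(PSEUDONYM_KEY, value.encode("utf-8"), hashlib.sha256).hexdigest()
```

Because the token is stable, records about the same user can still be joined for analysis, yet the raw value never appears in logs or debug streams.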

    So much of privacy depends upon what data is collected. Thus an important part of design for privacy is ensuring that there is justification for collecting and for keeping data, and making sure that it is not collected if not needed, or removed when no longer necessary. Defining need matters too—there’s the data you find that you need in the future that you wish you had collected when you had the chance, and there is the data that you need yesterday that doesn’t really add much value, and probably wasn’t that important in the first place.

    The new privacy regulations introduce user rights to know, to view, to correct, and to delete their data; this can very quickly become an impossible task unless data collection is designed from the start with an ability to find everything about an individual, and an ability to selectively delete individual records without leaving inconsistencies in the data (such as audit trails for transactions that point to deleted customer records).
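The indirection described above can be sketched minimally: if PII lives only in a token vault and business records reference an opaque token, a deletion request empties the vault entry while audit trails remain internally consistent. All store and field names here are hypothetical illustrations:

```python
# Hypothetical stores, for illustration: raw PII lives only in a
# token vault, while business records reference an opaque token.
pii_vault = {"tok_42": {"name": "Alice", "email": "alice@example.com"}}
audit_log = [{"user_token": "tok_42", "event": "purchase", "amount": 9.99}]

def delete_user(token: str) -> None:
    """Honor a deletion request by removing PII from the vault only.
    Audit records keep their token reference, so transaction history
    stays internally consistent but no longer identifies anyone."""
    pii_vault.pop(token, None)

delete_user("tok_42")
```

After deletion, the audit trail still records that a purchase happened, but the token it points to no longer resolves to a person.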

    Privacy is joined at the hip with security. Without strong identity/authentication and (appropriately fine grained) authorization, it becomes impossible to keep control of, or audit access to private information, and without good controls around unauthorized access, it becomes easier for an intruder to compromise privacy.

    The book closes with thoughts on scaling—that is, matching the resources and team focused on privacy to the size and maturity of the organization and the task it is facing. It is easy to undersize the effort and fail to achieve the goals; it is also easy to oversize the effort, waste resources, slow things down, and kill the value that the organization seeks to deliver. Finding the right effort level is a challenge!

    I wish I had had this text in 2015 or 2016 at Netflix when we started working on GDPR readiness. It would have been helpful in 2008–2012 in a time of significant architectural evolution of our technology, when we could have implemented some of the ideas much more easily. I would have benefited from the text as far back as 2006 thinking about the Netflix Prize, or even before as we laid the foundations for Netflix in the late 1990s. And now, I find the text valuable as I work on AI in healthcare, where the opportunities for data-driven medicine are so huge but regulatory and public scrutiny are especially prominent, if dated and hard to interpret in the modern era of privacy in technology.

    I frequently encounter teams who have ignored or dismissed privacy as something for later, and this text is both a good antidote to that kind of thinking, and also a good primer on how to make progress getting where they need to be, in a balanced and cost-effective, value-enhancing way.

    Enjoy your read!

    Neil Hunt

    Chief Product Officer, Netflix 1999–2017

    preface

    There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don’t know. But there are also unknown unknowns. There are things we don’t know we don’t know.

    —Donald Rumsfeld, Former United States Secretary of Defense

    The above quote by Donald Rumsfeld often came to mind during my early days as a security and privacy engineer. Seemingly trivial problems—locating data, verifying user acceptance, and deleting data—often revealed themselves as unimaginably more complicated than they should have been. The same instincts of data collection and dissemination that served me well in my previous incarnation as an engineer and product manager boomeranged on me in my role as a privacy leader.

    I remember looking for resources online and coming up empty. Most frustrating were the moments when those of us working on privacy were deemed by the business to be blockers. The lack of any data hygiene among engineering teams made it hard for me to offer clear and verifiable answers to attorneys representing us in court.

    The onset of privacy regulation and scrutiny has led to improvements at companies that use customer data. Even so, existing privacy laws are too segmented and often too confusing. Unsurprisingly, the ambiguity hurts businesses that lack the resources that the bigger companies have. The relationship between businesses and privacy regulators ranges from distrust to disgust, and the consumer is poorer for it.

    My favorite example comes from my Netflix days: The Video Privacy Protection Act (VPPA) was passed by the United States Congress in 1988. It was the outcome of the contentious Supreme Court nomination of Judge Robert Bork. Judge Bork had stated that Americans enjoy only those privacy protections conferred by legislation. In response, Michael Dolan, a freelance writer for the Washington City Paper, talked a video store clerk into giving him Bork’s rental history.

    Congress passed the VPPA to regulate data around our viewing history decades before streaming platforms like Netflix and Amazon Prime existed. These platforms, nonetheless, are impacted by the VPPA’s stipulations. Newer privacy laws suffer from flaws, as well, in that they often do not account for the complexity of building technical privacy solutions.

    Also, engineering teams increasingly operate in silos with bespoke processes. That has made it increasingly difficult to execute privacy controls in a way that is scalable and measurable. Companies and governments have feasted on too much data for far too long, with too little restraint.

    In 2019, I decided to help other engineers and leaders who were trying to solve problems similar to the ones I had wrestled with over the years. I started teaching courses on this topic on LinkedIn Learning, and those were well received. My insights and experience were soon sought after by startup founders, mature companies, venture capitalists, and members of the cybersecurity community at large.

    I found that my esoteric skills—a mix of engineering, data protection, regulatory policy—enabled me to run massive and impactful privacy programs. If I could aggregate all my learnings, victories, and missteps as a reference for companies, they could start building privacy into their products from the beginning rather than bolting it on at the end.

    There is a need in the market, and in government, for a framework that combines business and policy context with hands-on technical skills. I decided to write a book to offer just that.

    Over the span of one year, when data was spreading worldwide while most of us were locked in place at home, I wrote this book to increase the number of known knowns and decrease the number of known unknowns and unknown unknowns.

    acknowledgments

    This book would not have been possible were it not for the growing cybersecurity community, of which privacy and data protection are a key part. Engineers who constantly strive to protect customer data made for an inspiring target audience as well as north star. The many industry experts who have offered solutions and commentary are too numerous to list, but that does not diminish their contribution.

    I want to thank the people at Manning who made this book possible: publisher Marjan Bace; editor Ian Hough; acquisitions editor Michael Stephens; Candace Gillhoolley and Beth Faris from marketing; and others on the editorial and production teams who worked behind the scenes. A heartfelt thanks also to Michael Jensen for technical reviews that made the book more focused on helping its core engineering constituency.

    Sincere gratitude is also in order to mentors and experts in industry who helped my career grow in this space and whose contributions have enriched this book: Anthony Dupre, Larry Drebes, Anne Bradley, Jason Chan, Benjamin Malley, Patrick Mueller, Neil Hunt, Naresh Gopalani, Russell Lewis, Charles Smith, Vikram Khare, Vinay Goel, John Four Flynn, Yong Qiao, Ruby Zefo, Derek Care, Uttara Sivaram, Michelle Dennedy, Melanie Ensign, Simon Hania, Mohammad Islam, Catherine Nelson, Peter Dickman, Kim Lucy, Bryan Casper, Ben Feinstein, Engin Bozdag, Calvin Seto, Matt Olsen, Ayana Miller, Ahmed Ibrahim, Avni Verma, Latha Maripuri, Nicolas Lidzborski, Zhengquin Luo, and others.

    To all the reviewers: Benjamin Lampert, Brian Liceaga, Des Horsley, Diego Casella, Doniyor Ulmasov, Floris Bouchot, Håvard Wall, Jean-François Beauchef, Jens Gheerardyn, Joe Ivans, John Tyler, Jon Riddle, Jonathan Bourbonnais, Marc Roulleau, Marcin Sęk, Matthew Todd, Maytham Fahmi, Michael Langdon, Nadia Noori, Osama Khan, Paul Love, Peter White, Pietro Alberto Rossi, Tim Wooldridge, and Willem van Ketwich, your suggestions helped make this a better book.

    about this book

    This book is intended to serve two purposes. First, it is intended to be a stepping stone for engineers looking to solve privacy problems using tools, automation, and process. I have provided not just hands-on implementation techniques, but also the business context that is critical in fast-moving companies. Second, the book is supposed to help decision-makers in companies, governments, and media provide the right guidance to help businesses thrive as well as protect customer data.

    Who should read this book

    This book’s primary audience is engineers who work with data, especially in highly distributed architectures. They have to solve complex problems but have lacked a framework for embedding privacy engineering into their system designs and implementations. This is the first book in the era of cloud computing and identity graphs to help engineers implement complex privacy goals like data governance, technical privacy reviews, data deletion, and consent management.

    This book will help engineers regardless of whether they choose to build these solutions in-house or onboard third-party solutions. Engineers can also use this book to find overlaps between privacy and security risks, a key consideration given our present threats of ransomware, breaches, and email fraud.

    Executives would also benefit from reading this book. While some of the technical details may be out of scope for them, these readers will be able to partner more effectively with engineers to solve privacy problems and make informed decisions.

    I also hope that members of the media, regulators, and attorneys use this book to build a baseline of knowledge, enabling them to offer commentary and analysis rooted in context and expertise.

    How this book is organized: A roadmap

    This book is organized into four parts and 11 chapters. The bookends, i.e., the first and fourth parts, offer contextual guidance that will help engineers develop a scalable privacy program. The second and third parts offer hands-on skills, focusing on data governance and tooling respectively.

    Part 1 focuses on how privacy engineering fits as part of a company’s overall innovation ecosystem:

    Chapter 1 explains how privacy is impacted by the flow of data through the tech stack and storage, and how a company can develop programmatic controls accordingly.

    Chapter 2 explains how data can create privacy risk because of breaches, misuse, and regulations.

    Part 2 focuses on data governance, enabling engineers to better manage the data they collect and its attendant risk:

    Chapter 3 focuses on classifying data with cross-functional partners so as to align with privacy risk.

    Chapter 4 is a deep-dive on data inventory, which entails categorizing data using a mixture of manual and intelligence-powered classification.

    Chapter 5 offers techniques to anonymize datasets and measure privacy impact, using data sharing as a use-case.

    Part 3 will help engineers develop mission-critical privacy tooling aimed at improving privacy compliance as well as building customer trust:

    Chapter 6 will help engineers set up a technical privacy review and consulting process to front-load privacy guidance and reduce the strain on the privacy legal team.

    Chapter 7 will walk through a sample architecture for data deletion, a core requirement for data risk minimization as well as for several compliance regimes.

    Chapter 8 will help readers design a data export capability to fulfill Data Subject Access Requests (DSARs).

    Chapter 9 offers a sample design for a Consent Management Platform (CMP) so that businesses can meet this new requirement that is being enforced by regulators and corporations.

    Part 4 will help build on the earlier portions of the book and help engineers scale their privacy program:

    Chapter 10 aligns privacy risks to security risks, and offers best practices to mitigate those risks.

    Chapter 11 helps engineers plan maturity models for their privacy offering and their staffing models.

    If you are a hands-on engineer, parts 2 and 3 are most directly in line with your imminent needs. More senior engineers will benefit from a fuller reading of the book, given that their responsibilities often cover the full span of the organization. For executives, members of the media, and regulators, I’d recommend a deep dive into parts 1 and 4, while a more self-paced reading of the more technical middle parts could suffice.

    About the code

    This book contains examples of source code both in numbered listings and in line with normal text. In both cases, source code is formatted in a fixed-width font like this to separate it from ordinary text.

    In many cases, the original source code has been reformatted; we’ve added line breaks and reworked indentation to accommodate the available page space in the book. In rare cases, even this was not enough, and listings include line-continuation markers (➥). Additionally, comments in the source code have often been removed from the listings when the code is described in the text.

    The code for the examples in this book is available for download from the Manning website at https://www.manning.com/books/data-privacy.

    liveBook discussion forum

    Purchase of Data Privacy includes free access to a private web forum run by Manning Publications where you can make comments about the book, ask technical questions, and receive help from the author and from other users. To access the forum, go to https://livebook.manning.com/#!/book/data-privacy/discussion. You can also learn more about Manning’s forums and the rules of conduct at https://livebook.manning.com/#!/discussion.

    Manning’s commitment to our readers is to provide a venue where a meaningful dialogue between individual readers and between readers and the author can take place. It is not a commitment to any specific amount of participation on the part of the author, whose contribution to the forum remains voluntary (and unpaid). We suggest you try asking the author some challenging questions lest his interest stray! The forum and the archives of previous discussions will be accessible from the publisher’s website as long as the book is in print.

    about the author

    Nishant Bhajaria has a bachelor’s and a master’s degree in computer science. He has been part of the cybersecurity and privacy community since 2010 and has led teams of various sizes in these areas at Nike, Netflix, Google, and Uber, where he currently leads the privacy engineering organization and reports to the CISO. The organizations he has led have included engineers and architects, data analysts and privacy consultants, as well as product managers and incident response specialists.

    He started his career building teams and programs and has since pivoted to more strategic programs: ones that enable privacy maturity, partnerships with core engineering and data platform teams, and tighter alignment with the legal and PR teams. His areas of impact range from helping the board of directors make data-driven decisions to coaching product management to operate with fairness and trust as considerations.

    Nishant is also active on the cybersecurity circuit with published white papers on privacy and multiple speaking engagements for industry bodies. He advises startups on data protection strategy and teaches courses on LinkedIn Learning on data privacy (https://www.linkedin.com/learning/instructors/nishant-bhajaria) as well as other areas that include career development and inclusivity in tech staffing. He also partnered with researchers at MIT to draft the first-ever privacy principles for COVID-19 contact tracing (https://law.mit.edu/pub/commentaryoncovid19contacttracingprivacyprinciples/release/1) in the early days of the pandemic.

    This eclectic set of contributions maps back to his days in college, when he was the rare engineer who wrote editorials for the college paper, was part of the debate team, and worked for political science professors.

    Outside of work, several causes close to his heart center on animal welfare. Helping to rescue dogs from kill shelters, fighting back against wildlife smuggling, and protecting elephants from poaching and abuse serve as his moral purpose.

    about the cover illustration

    The figure on the cover of Data Privacy is captioned Paysanne des Environs de Berne, or a peasant woman from the area around Bern, Switzerland. The illustration is taken from a collection of dress costumes from various countries by Jacques Grasset de Saint-Sauveur (1757–1810), titled Costumes civils actuels de tous les peuples connus, originally published in France in 1788. Each illustration is finely drawn and colored by hand, and the rich variety of drawings in the collection reminds us vividly of how culturally apart the world’s regions, towns, villages, and neighborhoods were just 200 years ago. Isolated from each other, people spoke different dialects and languages. In the streets or in the countryside, it was easy to identify where they lived and what their trade or station in life was just by their dress.

    Dress codes have changed since then and the diversity by region, so rich at the time, has faded away. It is now hard to tell apart the inhabitants of different continents, let alone different towns or regions. Perhaps we have traded cultural diversity for a more varied personal life—certainly for a more varied and fast-paced technological life.

    At a time when it is hard to tell one computer book from another, Manning celebrates the inventiveness and initiative of the computer business with book covers based on the rich diversity of regional life of two centuries ago, brought back to life by pictures from collections such as this one.

    Part 1. Privacy, data, and your business

    The target audience for this book is engineers, but it will also be helpful to leaders in management, media, and government. It is critical that all readers are able to place privacy and data protection in context. They need to understand how the practice of software engineering has changed and how business risk has changed with it. This will help them avoid mistakes that prove hard to undo.

    Chapter 1 will serve as a primer on data flow so that technical leaders can understand how their architecture and data work in conjunction. We will also look at the regulatory risks and dive deep into emerging privacy tech players. This context will help readers approach their privacy challenges with a clear-eyed and informed lens.

    Chapter 2 will explore how various business stakeholders have varying interests in data processing. We will also examine high-profile privacy incidents, thereby giving the reader a sense of the vulnerabilities they need to watch for. Finally, there is context on how to monitor investments and build a program that can scale in line with the business.

    1 Privacy engineering: Why it’s needed, how to scale it

    This chapter covers

    What privacy means

    How privacy is impacted by the flow of data through your tech stack and storage

    Why privacy matters and how it affects your business

    Clarity on privacy tooling, especially the build vs. buy debate

    What this book does not do

    How the role of engineers has changed in recent years

    Over the last few years, privacy seems to have been front and center in the news. There is talk of new laws aimed at protecting customers from harm and reports of data breaches and fines being levied upon companies.

    People at all levels of business are finding this unsettling, and understandably so. Many company founders are engineers or technologists, and they are finding it hard to assess risks related to products that depend on data collection. Mid-level engineers who write code and build automation make many smaller decisions, and their technical outcomes, when multiplied by scale, can create shareholder and investor risk. Such tech leaders are right to wonder, what decisions am I making that may have a privacy impact down the line, just as my strategy is about to bear fruit?

    Anyone in a position that will directly or indirectly impact user privacy will benefit from being conversant around privacy as a concept and as a threat vector. Such people need clear hands-on skills for implementing privacy controls. These skills will help them embed privacy engineering and tooling into a company’s technical offerings, as well as create privacy controls that break through the silos that typically define tech companies.

    Too often, businesses fall into the trap of pitting innovation against privacy: they build digital products on a foundation of user data, only to play catch-up on privacy several cycles later. By this time, there has often been privacy and reputational harm. Privacy harm is an all-purpose term that captures the impact of data leakage, exfiltration, or improper access through which a user’s privacy is compromised. The loss of privacy protection implies that the user has been harmed; hence the use of this common term. Business leaders then have to find resources and bandwidth to staff a privacy program, prioritize its implementation, and alter the rhythm of business to adapt to privacy scrutiny.

    This book will help you avoid this false choice and allow readers—ranging from technical department leaders to hands-on technologists—to think and speak of privacy from a place of knowledge and vision, with an understanding of the big picture as well as brass tacks. After the tools, techniques, and lessons of this book sink in, leaders will be able to adapt to a privacy-centric world. Beyond that, they will also find synergies in their operations to make their privacy posture a competitive differentiator.

    In this chapter, we’ll begin with the fundamentals: what privacy actually means, the privacy implications of data flow within a company, and why privacy matters. The latter part of the chapter will take a brief look at privacy tooling, discuss what this book does not do, and consider how the role of engineers has evolved in recent years—an evolution bringing with it implications for privacy. Let’s start simple; what is privacy?

    1.1 What is privacy?

    In order to understand privacy, it helps to first refer to security. Most companies and leaders have some sort of security apparatus and at least a superficial understanding of the concept.

    For readers of this book, many of whom may need to do double duty as privacy and security specialists, this is an important insight. If you end up with a security issue, it probably involves one of the following scenarios:

    An employee or equivalent insider accesses sensitive business or customer data when they should not have.

    A business partner obtains business or customer data at a time or in a volume that affects the privacy of the customers or the competitive advantage of the business.

    Data that was collected for a benign, defensible purpose gets used for something beyond that purpose. For example, data collected for fraud detection (verifying that a user is real rather than a bot) later gets used for marketing because the access control systems were compromised.

    Each of these examples started with a security compromise that led to the user’s privacy being compromised, in addition to any other damage done to the business and its competitive advantage. Any time you have a security issue, there is a strong possibility of privacy harm as well. This is critical for leaders to understand, lest they take a siloed approach and treat these concepts as disconnected and unrelated. In subsequent chapters, the privacy techniques you’ll learn will aim at improving both privacy and security, helping companies protect their competitive intellectual property as well as their user data.

    IT security involves implementing a set of cybersecurity strategies aimed at preventing unauthorized access to organizational assets: computers, networks, and data. The integrity and confidentiality of sensitive information is maintained by validating the identity of users wishing to access the data and blocking those who do not have access rights. Cisco Systems, for example, defines IT security as “a set of cybersecurity strategies that prevents unauthorized access to organizational assets such as computers, networks, and data. It maintains the integrity and confidentiality of sensitive information, blocking the access of sophisticated hackers.”¹

    Note that the definition covers access to computers (or more broadly, anywhere data can live), networks (where data moves in transit from computer to computer), and the data itself. The goal here is to avoid the data being leaked, modified, or exfiltrated by external bad actors, popularly known as hackers. This definition also introduces the concept of sensitive information, which means different things when it comes to data that belongs to a human being versus data that belongs to a corporation.

    As a leader in the privacy space, I have always built privacy programs by adapting and repurposing security tools. This means that I would place an external bad actor (such as a hacker) on the same mental plane as an insider who may knowingly or unknowingly use data inappropriately. As a result, the goal is protecting the data by managing the collection, access, storage, and use of this data. In that sense, rather than recreating tools and processes for privacy, you can start by adapting the structures aimed at data security, and adjusting them to provide privacy capabilities.
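    The idea of adapting a security structure to provide a privacy capability can be sketched in a few lines of code. The following is a minimal, hypothetical illustration (the policy table, dataset names, and purposes are all invented for this example, not taken from any real system): a single access gate performs the classic security check (is the caller authenticated?) and then layers a privacy check on top (is the declared purpose one the data was collected for?). Insiders and outsiders pass through the same gate.

```python
from dataclasses import dataclass

# Hypothetical purpose-limitation policy: for each dataset, the purposes
# it may legitimately be used for. In a real system this would live in a
# governed policy store, not hardcoded.
ALLOWED_PURPOSES = {
    "user_emails": {"fraud_detection", "account_recovery"},
    "location_history": {"trip_receipts"},
}

@dataclass
class AccessRequest:
    actor: str    # employee, internal service, or external caller -- all treated alike
    dataset: str  # which data store is being read
    purpose: str  # the declared reason for the access

def is_access_allowed(req: AccessRequest, authenticated: bool) -> bool:
    """Security check (identity) plus privacy check (purpose limitation)."""
    if not authenticated:  # the classic security control: block unknown actors
        return False
    # The privacy control layered on top: even an authenticated insider may
    # only use data for the purposes it was collected for.
    return req.purpose in ALLOWED_PURPOSES.get(req.dataset, set())

# Data collected for fraud detection cannot be repurposed for marketing,
# even by an authenticated internal service:
marketing_pull = AccessRequest("svc-marketing", "user_emails", "marketing")
fraud_check = AccessRequest("svc-risk", "user_emails", "fraud_detection")
```

    Note that this gate does not distinguish between a hacker and an employee: both are denied unless identity and purpose both check out, which mirrors the mental model of placing the external bad actor and the insider on the same plane.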

    As an example, if you detect unauthorized access from an outsider, you might shut down that account temporarily to investigate whether the account holder is posing a risk or whether the account has been breached. You may also
