Federated Learning: Privacy and Incentive
About this ebook

This book provides a comprehensive and self-contained introduction to federated learning, ranging from basic concepts and theory to key applications.

Privacy and incentive issues are the focus of this book. It is timely, as federated learning has gained popularity following the release of the General Data Protection Regulation (GDPR). Federated learning aims to enable a machine learning model to be trained collaboratively without any party exposing its private data to others, a setting that adheres to regulatory requirements for data privacy protection such as the GDPR.

This book contains three main parts. Firstly, it introduces different privacy-preserving methods for protecting a federated learning model against different types of attacks, such as data leakage and/or data poisoning. Secondly, it presents incentive mechanisms that aim to encourage individuals to participate in federated learning ecosystems. Last but not least, it describes how federated learning can be applied in industry and business to address data-silo and privacy-preservation problems. The book is intended for readers from both academia and industry who would like to learn about federated learning, practice its implementation, and apply it in their own business. Readers are expected to have some basic understanding of linear algebra, calculus, and neural networks. Additionally, domain knowledge in FinTech and marketing would be helpful.

 


Language: English
Publisher: Springer
Release date: November 25, 2020
ISBN: 9783030630768

    Book preview

    Federated Learning - Qiang Yang

    Privacy

    © Springer Nature Switzerland AG 2020

    Q. Yang et al. (eds.), Federated Learning, Lecture Notes in Computer Science 12500, https://doi.org/10.1007/978-3-030-63076-8_1

    Threats to Federated Learning

    Lingjuan Lyu¹, Han Yu², Jun Zhao² and Qiang Yang³,⁴

    (1)

    National University of Singapore, Singapore, Singapore

    (2)

    School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore

    (3)

    Department of AI, WeBank, Shenzhen, China

    (4)

    Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong

    Lingjuan Lyu (Corresponding author)

    Email: lyulj@comp.nus.edu.sg

    Han Yu

    Email: han.yu@ntu.edu.sg

    Jun Zhao

    Email: junzhao@ntu.edu.sg

    Qiang Yang

    Email: qyang@cse.ust.hk

    Abstract

    As data are increasingly being stored in different silos and societies become more aware of data privacy issues, the traditional centralized approach of training artificial intelligence (AI) models is facing strong challenges. Federated learning (FL) has recently emerged as a promising solution under this new reality. However, existing FL protocol designs have been shown to exhibit vulnerabilities which can be exploited by adversaries both within and outside of the system to compromise data privacy. It is thus of paramount importance to make FL system designers aware of the implications of future FL algorithm design on privacy preservation. Currently, there is no survey on this topic. In this chapter, we bridge this important gap in the FL literature. We provide a concise introduction to the concept of FL and a unique taxonomy covering threat models and the two major attacks on FL: 1) poisoning attacks and 2) inference attacks. We highlight the intuitions, key techniques, and fundamental assumptions adopted by various attacks, and discuss promising future research directions towards more robust privacy preservation in FL.

    Keywords

    Federated learning · Attacks · Privacy · Robustness

    1 Introduction

    As computing devices become increasingly ubiquitous, people generate huge amounts of data through their day-to-day usage. Collecting such data into centralized storage facilities is costly and time consuming. Another important concern is data privacy and user confidentiality, as usage data usually contain sensitive information [1]. Sensitive data such as facial images, data from location-based services, or health information can be used for targeted social advertising and recommendation, posing immediate or potential privacy risks. Hence, private data should not be shared directly without any privacy consideration. As societies become increasingly aware of privacy preservation, legal restrictions such as the General Data Protection Regulation (GDPR) are emerging, which make data aggregation practices less feasible [48].

    Traditional centralized machine learning (ML) cannot support such ubiquitous deployments and applications due to infrastructure shortcomings such as limited communication bandwidth, intermittent network connectivity, and strict delay constraints [26]. In this scenario, federated learning (FL), which pushes model training to the devices from which data originate, has emerged as a promising alternative ML paradigm [35]. FL enables a multitude of participants to construct a joint ML model without exposing their private training data [12, 35]. It can handle unbalanced and non-independent and identically distributed (non-IID) data which naturally arise in the real world [34]. In recent years, FL has benefited a wide range of applications such as next-word prediction [34, 36] and visual object detection for safety [29].
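
    To make the FL training loop described above concrete, the following is a minimal sketch of server-side federated averaging over a toy linear model, written in Python with NumPy. The local optimizer, the weighting scheme, and the absence of participant sampling are simplifying assumptions for illustration, not a description of any particular production protocol.

```python
import numpy as np

def local_update(global_weights, local_data, lr=0.1, epochs=5):
    """Simplified local training: a participant refines the global model
    on its own private (X, y) shard with full-batch gradient descent."""
    w = global_weights.copy()
    X, y = local_data
    for _ in range(epochs):
        grad = 2 * X.T @ (X @ w - y) / len(y)   # squared-error gradient
        w -= lr * grad
    return w

def federated_averaging(global_weights, participants, rounds=30):
    """Server loop: broadcast the model, collect local models, and aggregate
    them weighted by each participant's data size (FedAvg-style)."""
    for _ in range(rounds):
        updates, sizes = [], []
        for shard in participants:              # in practice only a sampled subset
            updates.append(local_update(global_weights, shard))
            sizes.append(len(shard[1]))
        sizes = np.array(sizes, dtype=float)
        global_weights = np.average(updates, axis=0, weights=sizes / sizes.sum())
    return global_weights

# Toy usage: three participants hold private shards of a linear regression task.
rng = np.random.default_rng(0)
true_w = np.array([1.0, -2.0])
participants = []
for _ in range(3):
    X = rng.normal(size=(50, 2))
    participants.append((X, X @ true_w + 0.1 * rng.normal(size=50)))
print(federated_averaging(np.zeros(2), participants))  # approaches true_w
```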

    Table 1.

    Taxonomy for horizontal federated learning (HFL).

    1.1 Types of Federated Learning

    Based on the distribution of data features and data samples among participants, federated learning can be generally classified as horizontal federated learning (HFL), vertical federated learning (VFL) and federated transfer learning (FTL) [47].

    Under HFL, datasets owned by each participant share similar features but concern different users  [24]. For example, several hospitals may each store similar types of data (e.g., demographic, clinical, and genomic) about different patients. If they decide to build a machine learning model together using FL, we refer to such a scenario as HFL. In this chapter, we further classify HFL into HFL to businesses (H2B), and HFL to consumers (H2C). A comparison between H2B and H2C is listed in Table 1. The main difference lies in the number of participants, FL training participation level, and technical capability, which can influence how adversaries attempt to compromise the FL system. Under H2B, there are typically a handful of participants. They can be frequently selected during FL training. The participants tend to possess significant computational power and sophisticated technical capabilities  [48]. Under H2C, there can be thousands or even millions of potential participants. In each round of training, only a subset of them are selected. As their datasets tend to be small, the chance of a participant being selected repeatedly for FL training is low. They generally possess limited computational power and low technical capabilities. An example of H2C is Google’s GBoard application  [36].

    VFL is applicable to the cases in which participants have large overlaps in the sample space but differ in the feature space, i.e., different participants hold different attributes of the same records  [46]. VFL mainly targets business participants. Thus, the characteristics of VFL participants are similar to those of H2B participants.

    FTL deals with scenarios in which FL participants have little overlap in both the sample space and the feature space  [48]. Currently, there is no published research studying threats to FTL models.

    Fig. 1. A typical FL training process, in which both the (potentially malicious) FL server/aggregator and malicious participants may compromise the FL system.

    1.2 Threats to FL

    FL offers a privacy-aware paradigm of model training which does not require data sharing and allows participants to join and leave a federation freely. Nevertheless, recent works have demonstrated that FL may not always provide sufficient privacy guarantees, as communicating model updates throughout the training process can nonetheless reveal sensitive information [8, 37] and even incur deep leakage [52], either to a third party or to the central server [2, 36]. For instance, as shown by [3], even a small portion of gradients may reveal information about local data. A more recent work showed that a malicious attacker can completely steal the training data from gradients in a few iterations [52].

    FL protocol designs may contain vulnerabilities exploitable by both (1) the (potentially malicious) server, which can observe individual updates over time, tamper with the training process and control the participants' view of the global parameters; and (2) any participant, who can observe the global parameters and control its own parameter uploads. For example, malicious participants can deliberately alter their inputs or introduce stealthy backdoors into the global model. Such attacks pose significant threats to FL: in centralized learning only the server can violate participants' privacy, but in FL, any participant may violate the privacy of other participants in the system, even without involving the server. Therefore, it is important to understand the principles behind these attacks. Existing survey papers on FL mostly focus on the broad aspect of how to make FL work [23, 27, 47]. In this chapter, we survey recent advances in threats that compromise FL in order to bridge this important gap in the artificial intelligence (AI) research community's understanding of the topic. In particular, we focus on two specific threats initiated by insiders of FL systems: 1) poisoning attacks, which attempt to prevent a model from being learned at all or to bias the model to produce inferences preferable to the adversary; and 2) inference attacks, which target participant privacy. The properties of these attacks are summarized in Table 2.

    Table 2.

    A summary of attacks against server-based FL.

    2 Threat Models

    Before reviewing attacks on FL, we first present a summary of the threat models.

    2.1 Insider vs. Outsider

    Attacks can be carried out by insiders and outsiders. Insider attacks include those launched by the FL server and the participants in the FL system. Outsider attacks include those launched by the eavesdroppers on the communication channel between participants and the FL server, and by users of the final FL model when it is deployed as a service.

    Insider attacks are generally stronger than outsider attacks, as they strictly enhance the capability of the adversary. For this reason, our discussion of attacks against FL will focus primarily on insider attacks, which can take one of the following three general forms:

    1.

    Single attack: a single, non-colluding malicious participant aims to cause the model to misclassify a set of chosen inputs with high confidence [4, 7];

    2.

    Byzantine attack: Byzantine malicious participants may behave completely arbitrarily and tailor their outputs to have a similar distribution to the correct model updates, making them difficult to detect [11, 14, 15, 40, 49];

    3.

    Sybil attack: the adversaries can simulate multiple dummy participant accounts or select previously compromised participants to mount more powerful attacks on FL  [4, 17].
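
    As a simple illustration of why Sybil accounts are dangerous under plain averaging, the toy NumPy sketch below shows the aggregate drifting towards a poisoned update once the adversary controls several accounts. The participant counts and the poison vector are arbitrary assumptions.

```python
import numpy as np

# Sybil attack sketch: the adversary registers several fake accounts that all
# submit the same poisoned update, so a plain average drifts toward the poison
# even though each individual update looks like just another participant.
rng = np.random.default_rng(1)
honest_updates = [0.1 * rng.normal(size=3) for _ in range(8)]
poison = np.array([5.0, 5.0, 5.0])          # direction the adversary wants
sybil_updates = [poison.copy() for _ in range(4)]

aggregate = np.mean(honest_updates + sybil_updates, axis=0)
print(aggregate)   # pulled strongly toward the poison direction
```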

    2.2 Semi-honest vs. Malicious

    Under the semi-honest setting, adversaries are considered passive or honest-but-curious. They try to learn the private states of other participants without deviating from the FL protocol. Passive adversaries are assumed to only observe the aggregated or averaged gradient, but not the training data or gradients of other honest participants. Under the malicious setting, an active or malicious adversary tries to learn the private states of honest participants and deviates arbitrarily from the FL protocol by modifying, replaying, or removing messages. This strong adversary model allows the adversary to conduct particularly devastating attacks.

    2.3 Training Phase vs. Inference Phase

    Attacks at the training phase attempt to learn, influence, or corrupt the FL model itself [9]. During the training phase, the attacker can run data poisoning attacks to compromise the integrity of training dataset collection, or model poisoning attacks to compromise the integrity of the learning process. The attacker can also launch a range of inference attacks on an individual participant's update or on the aggregate of updates from all participants.

    Attacks at the inference phase are called evasion/exploratory attacks [5]. They generally do not tamper with the targeted model, but instead either cause it to produce wrong outputs (targeted or untargeted) or collect evidence about the model's characteristics. The effectiveness of such attacks is largely determined by the information available to the adversary about the model. Inference-phase attacks can be classified into white-box attacks (i.e., with full access to the FL model) and black-box attacks (i.e., only able to query the FL model). In FL, the model maintained by the server not only suffers from the same evasion attacks as in the general ML setting once it is deployed as a service; the model broadcast step in FL also renders the model accessible to any malicious client. Thus, FL requires extra effort to defend against white-box evasion attacks. In this survey, we omit the discussion of inference-phase attacks and mainly focus on training-phase attacks.

    3 Poisoning Attacks

    Depending on the attacker's objective, poisoning attacks can be either a) random attacks or b) targeted attacks [22]. Random attacks aim to reduce the accuracy of the FL model, whereas targeted attacks aim to induce the FL model to output the target label specified by the adversary. Generally, targeted attacks are more difficult than random attacks, as the attacker has a specific goal to achieve. Poisoning attacks during the training phase can be performed on the data or on the model. Figure 2 shows that poisoned updates can originate from two types of poisoning attacks: (1) data poisoning attacks during local data collection; and (2) model poisoning attacks during the local model training process. At a high level, both poisoning attacks attempt to modify the behavior of the target model in some undesirable way. If adversaries can compromise the FL server, then they can easily perform both targeted and untargeted poisoning attacks on the trained model.

    Fig. 2. Data vs. model poisoning attacks in FL.

    3.1 Data Poisoning

    Data poisoning attacks largely fall into two categories: 1) clean-label [42] and 2) dirty-label [19]. Clean-label attacks assume that the adversary cannot change the label of any training data, as there is a process by which data are certified as belonging to the correct class, so the poisoning of data samples has to be imperceptible. In contrast, in dirty-label poisoning, the adversary can introduce a number of data samples it wishes to have misclassified with the desired target label into the training set.

    One common example of a dirty-label poisoning attack is the label-flipping attack [10, 17]. The labels of honest training examples of one class are flipped to another class while the features of the data are kept unchanged. For example, malicious participants in the system can poison their datasets by flipping all 1s into 7s. A successful attack produces a model that is unable to correctly classify 1s and incorrectly predicts them to be 7s. Another weak but realistic attack scenario is backdoor poisoning [19]. Here, an adversary can modify individual features or small regions of the original training dataset to embed backdoors into the model, so that the model behaves according to the adversary's objective if the input contains the backdoor feature (e.g., a stamp on an image). The performance of the poisoned model on clean inputs, however, is not affected, which makes the attack harder to detect.
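
    A minimal sketch of the label-flipping attack described above follows; the class labels (1 flipped to 7) and the toy label array are illustrative assumptions.

```python
import numpy as np

def flip_labels(y, source_class=1, target_class=7):
    """Dirty-label poisoning: relabel every example of the source class
    as the target class while leaving the features untouched."""
    y_poisoned = y.copy()
    y_poisoned[y == source_class] = target_class
    return y_poisoned

# A malicious participant poisons its local shard before local training;
# the features X are unchanged, only the labels are corrupted.
y_local = np.array([0, 1, 1, 7, 3, 1])
print(flip_labels(y_local))   # [0 7 7 7 3 7]
```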

    Data poisoning attacks can be carried out by any FL participant. The impact on the FL model depends on the extent to which participants in the system engage in the attacks, and the amount of training data being poisoned. Data poisoning is less effective in settings with fewer participants like H2C.

    3.2 Model Poisoning

    Model poisoning attacks aim to poison local model updates before sending them to the server or insert hidden backdoors into the global model  [4].

    In targeted model poisoning, the adversary's objective is to cause the FL model to misclassify a set of chosen inputs with high confidence. Note that these inputs are not modified to induce misclassification at test time, as under adversarial example attacks [45]. Rather, the misclassification is a result of adversarial manipulation of the training process. Recent works have investigated poisoning attacks on model updates in which a subset of updates sent to the server at any given iteration are poisoned [7, 11]. These poisoned updates can be generated by inserting hidden backdoors, and even a single-shot attack may be enough to introduce a backdoor into a model [4].
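
    The scaled-update idea behind such single-shot attacks can be sketched schematically as follows: the attacker amplifies the difference between the model it wants the federation to adopt and the current global model so that the poisoned contribution survives the server's averaging. The boost factor, the attacker's target model, and the plain-averaging server are illustrative assumptions in the spirit of model replacement/explicit boosting attacks [4, 7], not a reproduction of those papers' implementations.

```python
import numpy as np

def malicious_update(global_weights, attacker_target_weights, n_participants, boost=None):
    """Craft a poisoned update: compute the model the attacker wants the
    federation to adopt, then scale its difference from the global model so
    that, after the server averages it with (n_participants - 1) roughly
    unchanged benign updates, the aggregate lands close to the target."""
    if boost is None:
        boost = n_participants          # cancels the 1/n averaging factor
    return global_weights + boost * (attacker_target_weights - global_weights)

# Illustration with a plain-averaging server and benign updates ~= global model.
n = 10
global_w = np.zeros(3)
benign_updates = [global_w + 0.01 * np.random.randn(3) for _ in range(n - 1)]
target_w = np.array([5.0, -3.0, 2.0])   # model behaviour the attacker wants
poisoned = malicious_update(global_w, target_w, n)
aggregate = np.mean(benign_updates + [poisoned], axis=0)
print(aggregate)                        # close to target_w despite 9 benign clients
```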

    Bhagoji et al. [7] demonstrated that model poisoning attacks are much more effective than data poisoning in FL settings by analyzing a targeted model poisoning attack, where a single, non-colluding malicious participant aims to cause the model to misclassify a set of chosen inputs with high confidence. To increase attack stealth and evade detection, they use an alternating minimization strategy that alternately optimizes the training loss and the adversarial objective, and estimate the parameters of the benign participants' updates. This adversarial model poisoning attack can cause targeted poisoning of the FL model while remaining undetected.

    In fact, model poisoning subsumes data poisoning in FL settings, as data poisoning attacks eventually change a subset of updates sent to the model at any given iteration  [17]. This is functionally identical to a centralized poisoning attack in which a subset of the whole training data is poisoned. Model poisoning attacks require sophisticated technical capabilities and high computational resources. Such attacks are generally less suitable for H2C scenarios, but more likely to happen in H2B scenarios.

    4 Inference Attacks

    Exchanging gradients in FL can result in serious privacy leakage [37, 41, 44, 52]. As illustrated in Fig. 3, model updates can leak extra information about unintended features of participants' training data to adversarial participants, as deep learning models appear to internally recognize many features of the data that are not apparently related to the main task. The adversary can also save snapshots of the FL model parameters and conduct property inference by exploiting the difference between consecutive snapshots, which is equal to the aggregated updates from all participants minus those of the adversary (Fig. 4).

    Fig. 3. Attacker infers information unrelated to the learning task.

    Fig. 4. Attacker infers gradients from a batch of training data.

    The main reason is that the gradients are derived from the participants’ private data. In deep learning models, gradients of a given layer are computed using this layer’s features and the error from the layer above. In the case of sequential fully connected layers, the gradients of the weights are the inner products of the error from the layer above and the features. Similarly, for a convolutional layer, the gradients of the weights are convolutions of the error from the layer above and the features  [37]. Consequently, observations of model updates can be used to infer a significant amount of private information, such as class representatives, membership as well as properties associated with a subset of the training data. Even worse, an attacker can infer labels from the shared gradients and recover the original training samples without requiring any prior knowledge about the training set  [52].
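
    This observation can be made concrete for a single fully connected layer: with one training sample, the weight gradient is the outer product of the upstream error and the input features, so the input can be read off directly from the shared gradient. A minimal NumPy illustration follows; the layer sizes and the random error signal are assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)
x = rng.normal(size=4)                 # private input features of one sample
W, b = rng.normal(size=(3, 4)), rng.normal(size=3)

# Forward pass of a single fully connected layer and some upstream error
# signal delta = dL/dy (its exact value does not matter for the argument).
y = W @ x + b
delta = rng.normal(size=3)

# Per-sample gradients have the outer-product structure described in the text.
grad_W = np.outer(delta, x)            # dL/dW = delta x^T
grad_b = delta                         # dL/db = delta

# An observer of (grad_W, grad_b) recovers x exactly from any nonzero row.
i = np.argmax(np.abs(grad_b))
x_recovered = grad_W[i] / grad_b[i]
print(np.allclose(x, x_recovered))     # True: the gradient leaks the input
```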

    4.1 Inferring Class Representatives

    Hitaj et al. [21] devised an active inference attack called the Generative Adversarial Network (GAN) attack on deep FL models. Here, a malicious participant can intentionally compromise any other participant. The GAN attack exploits the real-time nature of the FL learning process, which allows the adversarial participant to train a GAN that generates prototypical samples of the targeted training data, which were meant to be private. The generated samples appear to come from the same distribution as the training data. Hence, the GAN attack is not targeted at reconstructing actual training inputs, but only class representatives. It should be noted that the GAN attack assumes that the entire training corpus for a given class comes from a single participant, and only in the special case where all class members are similar are the GAN-constructed representatives similar to the training data. This resembles model inversion attacks in general ML settings [16]. However, these assumptions may be less practical in FL. Moreover, the GAN attack is less suitable for H2C scenarios, as it requires large computational resources.

    4.2 Inferring Membership

    Given an exact data point, membership inference attacks aim to determine if it was used to train the model  [43]. For example, an attacker can infer whether a specific patient profile was used to train a classifier associated with a disease. FL presents interesting new avenues for such attacks. In FL, the adversary’s objective is to infer if a particular sample belongs to the private training data of a single participant (if target update is of a single participant) or of any participant (if target update is the aggregate). For example, the non-zero gradients of the embedding layer of a deep learning model trained on natural-language text reveal which words appear in the training batches used by the honest participants during FL model training. This enables an adversary to infer whether a given text appeared in the training dataset  [37].
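
    The embedding-layer observation above can be sketched directly: only the rows corresponding to tokens that actually appear in a training batch receive non-zero gradients. A toy NumPy illustration follows (vocabulary size, dimensions, and batch contents are assumptions).

```python
import numpy as np

vocab_size, dim = 10, 4
embedding = np.random.randn(vocab_size, dim)
batch_token_ids = [2, 5, 5, 7]            # private tokens in a participant's batch

# Gradient of the embedding table: only rows for tokens in the batch are touched.
grad = np.zeros_like(embedding)
upstream = np.random.randn(len(batch_token_ids), dim)   # error from the layer above
for t, g in zip(batch_token_ids, upstream):
    grad[t] += g

# An adversary observing the unaggregated update reads off which words appeared.
leaked_tokens = np.nonzero(np.linalg.norm(grad, axis=1))[0]
print(leaked_tokens)    # [2 5 7]
```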

    Attackers in an FL system can conduct both active and passive membership inference attacks [37, 38]. In the passive case, the attacker simply observes the updated model parameters and performs inference without changing anything in the local or global collaborative training procedure. In the active case, however, the attacker can tamper with the FL model training protocol and perform a more powerful attack against other participants. Specifically, the attacker shares malicious updates and forces the FL model to share more information about the local data of the participants the attacker is interested in. This attack, called the gradient ascent attack [38], exploits the fact that SGD optimization updates model parameters in the opposite direction of the gradient of the loss.
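
    A highly simplified sketch of the gradient ascent idea follows, for a toy linear model with a squared-error loss (both are assumptions made for illustration): the attacker's crafted update moves the shared model up the loss gradient of a target record, and the strength of the subsequent correction by other participants hints at whether the record is in their training data.

```python
import numpy as np

def loss_grad(w, x, y):
    """Gradient of a squared-error loss for a toy linear model w . x."""
    return 2 * (w @ x - y) * x

def gradient_ascent_probe(w_global, target_x, target_y, gamma=0.5):
    """Active membership inference (sketch): the attacker submits an update
    that moves the shared model *up* the loss gradient of a target record.
    A participant holding that record will, through SGD, push its loss back
    down much more strongly than happens for a non-member record, which the
    attacker can detect from subsequent rounds."""
    return w_global + gamma * loss_grad(w_global, target_x, target_y)

# The attacker replaces its honest update with the probe for the target record.
w_global = np.zeros(3)
target_x, target_y = np.array([1.0, 2.0, -1.0]), 0.5
print(gradient_ascent_probe(w_global, target_x, target_y))
```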

    4.3 Inferring Properties

    An adversary can launch both passive and active property inference attacks to infer properties of other participants' training data that are independent of the features that characterize the classes of the FL model [37]. Property inference attacks assume that the adversary has auxiliary training data correctly labelled with the property it wants to infer. A passive adversary can only observe or eavesdrop on the updates and perform inference by training a binary property classifier. An active adversary can use multi-task learning to trick the FL model into learning a better separation for data with and without the property, and thus extract more information. An adversarial participant can even infer when a property appears and disappears in the data during training (e.g., identifying when a person first appears in the photos used to train a gender classifier). The auxiliary-data assumption in property inference attacks may limit their applicability in H2C.

    4.4 Inferring Training Inputs and Labels

    A recent work called Deep Leakage from Gradients (DLG) proposed an optimization algorithm that can obtain both the training inputs and the labels in just a few iterations [52]. This attack is much stronger than previous approaches: it can recover pixel-wise accurate original images and token-wise matching original texts. [50] presented an analytical approach called Improved Deep Leakage from Gradients (iDLG), which can extract labels from the shared gradients with certainty by exploiting the relationship between the labels and the signs of the corresponding gradients. iDLG is valid for any differentiable model trained with a cross-entropy loss over one-hot labels, which is the general case for classification.
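
    A minimal DLG-style sketch in PyTorch is given below, assuming a tiny linear classifier, a single private sample, and an attacker who observes that sample's shared gradient. It follows the general recipe of optimizing dummy inputs and labels so that their gradients match the observed ones; it is an illustration of the idea rather than the authors' implementation.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
model = torch.nn.Linear(8, 3)           # assumed tiny model shared in FL

# Victim's private sample and the gradient it would share with the server.
x_true = torch.randn(1, 8)
y_true = torch.tensor([2])
loss = F.cross_entropy(model(x_true), y_true)
true_grads = [g.detach() for g in torch.autograd.grad(loss, model.parameters())]

# Attacker: optimize a dummy input and soft label so that the dummy gradient
# matches the observed gradient (gradient-matching objective).
x_dummy = torch.randn(1, 8, requires_grad=True)
y_dummy = torch.randn(1, 3, requires_grad=True)
opt = torch.optim.LBFGS([x_dummy, y_dummy])

def closure():
    opt.zero_grad()
    dummy_loss = torch.sum(
        -F.softmax(y_dummy, dim=-1) * F.log_softmax(model(x_dummy), dim=-1))
    dummy_grads = torch.autograd.grad(dummy_loss, model.parameters(),
                                      create_graph=True)
    grad_diff = sum(((dg - tg) ** 2).sum()
                    for dg, tg in zip(dummy_grads, true_grads))
    grad_diff.backward()
    return grad_diff

for _ in range(50):
    opt.step(closure)
print(torch.dist(x_true, x_dummy))      # shrinks: the private input is recovered
```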

    Inference attacks generally assume that the adversaries possess sophisticated technical capabilities and large computational resources. In addition, adversaries must be selected for many rounds of FL training. Thus, such attacks are less suitable for H2C scenarios and more likely under H2B scenarios. They also highlight the need to protect the gradients shared during FL training, possibly through mechanisms such as homomorphic encryption [48].

    5 Discussions and Promising Directions

    There are still potential vulnerabilities which need to be addressed in order to improve the robustness of FL systems. In this section, we outline research directions which we believe are promising.

    Curse of Dimensionality: Large models, with high-dimensional parameter vectors, are particularly susceptible to privacy and security attacks [13]. Most FL algorithms require overwriting the local model parameters with the global model. This makes them susceptible to poisoning and backdoor attacks, as the adversary can make small but damaging changes in the high-dimensional models without being detected. Thus, sharing model parameters may not be a strong design choice in FL: it opens the entire internal state of the model to inference attacks and maximizes the model's malleability under poisoning attacks. To address these fundamental shortcomings of FL, it is worthwhile to explore whether sharing model updates is essential. Instead, sharing less sensitive information (e.g., SIGNSGD [6]) or only sharing model predictions [13] in a black-box manner may result in more robust privacy protection in FL.
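
    Sharing only the sign of each gradient coordinate, as in SIGNSGD [6], can be sketched as follows; the majority-vote aggregation and the random updates are illustrative assumptions.

```python
import numpy as np

def sign_compress(gradient):
    """Each participant shares only the elementwise sign of its gradient,
    which reveals far less about the underlying data than raw values."""
    return np.sign(gradient)

def majority_vote_aggregate(signed_updates):
    """Server aggregates by coordinate-wise majority vote (signSGD style)."""
    return np.sign(np.sum(signed_updates, axis=0))

updates = [np.random.randn(5) for _ in range(7)]
server_direction = majority_vote_aggregate([sign_compress(u) for u in updates])
print(server_direction)   # a vector of +/-1 (or 0 on exact ties)
```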

    Vulnerabilities to Free-Riding Participants: In an FL system, there may exist free-riders who aim to benefit from the global model but do not want to contribute any real information. The main incentives for a free-rider to submit fake information may include: (1) the participant does not have any data to train a local model; (2) the participant is too concerned about its privacy to release any information that may compromise it; (3) the participant does not want to spend any local computation power to train a model [32, 33]. In the current FL paradigm [34], all participants receive the same federated model at the end of collaborative model training regardless of their contributions. This makes the paradigm vulnerable to free-riding participants [28, 32, 33].

    Threats to VFL: In VFL  [20], there may only be one participant who owns labels for the given learning task. It is unclear if all the participants have equal capability of attacking the FL model, and if threats to HFL can work on VFL. Most of the current threats still focus on HFL. Thus, threats on VFL, which is important to businesses, are worth exploring.

    FL with Heterogeneous Architectures: Sharing model updates is typically limited only to homogeneous FL architectures, i.e., the same model is shared with all participants. It would be interesting to study how to extend FL to collaboratively train models with heterogeneous architectures  [13, 18, 25], and whether existing attacks and privacy techniques can be adapted to this paradigm.

    Decentralized Federated Learning: Decentralized FL, where no single server is required in the system, is currently being studied [32, 33, 36, 48]. This is a potential learning framework for collaboration among businesses which do not trust any third party. In this paradigm, each participant could be elected as the server in a round-robin manner. It would be interesting to investigate whether existing threats on server-based FL still apply in this scenario. Moreover, it may open new attack surfaces. One possible example is that the last participant elected as the server is more likely to effectively contaminate the whole model if it chooses to insert backdoors. This resembles the observation that server-based FL models are more vulnerable to backdoors inserted in later rounds of training, nearing convergence. Similarly, if decentralized training is conducted in a ring all-reduce manner, then any malicious participant can steal the training data from its neighbors.

    Weakness of Current Defenses: FL with secure aggregation is especially susceptible to poisoning attacks, as the individual updates cannot be inspected. It is still unclear whether adversarial training can be adapted to FL: adversarial training was developed primarily for IID data, and how it performs in non-IID settings remains a challenging problem. Moreover, adversarial training typically requires many epochs, which may be impractical in H2C. Another possible defense is based on differential privacy (DP) [30–33, 36, 51]. Record-level DP bounds the success of membership inference, but does not prevent property inference applied to a group of training records [37]. Participant-level DP, on the other hand, requires thousands of users for training to converge and to achieve an acceptable trade-off between privacy and accuracy [36]. The FL model fails to converge with a small number of participants, making participant-level DP unsuitable for H2B scenarios. Furthermore, DP may hurt the accuracy of the learned model [39], which is not appealing to industry. Further work is needed to investigate whether participant-level DP can protect FL systems with few participants.
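
    A minimal sketch of participant-level DP in this spirit: each participant's update is clipped to a norm bound and Gaussian noise calibrated to that bound is added at aggregation. The clip bound and noise multiplier below are illustrative assumptions, not calibrated privacy parameters, and a proper accountant would be needed to state an (epsilon, delta) guarantee.

```python
import numpy as np

def clip_update(update, clip_norm=1.0):
    """Bound each participant's influence by clipping its update norm."""
    norm = np.linalg.norm(update)
    return update * min(1.0, clip_norm / (norm + 1e-12))

def dp_aggregate(updates, clip_norm=1.0, noise_multiplier=1.0, rng=None):
    """Average clipped updates and add Gaussian noise scaled to the clip bound.
    The resulting privacy guarantee must be tracked separately."""
    rng = rng or np.random.default_rng()
    clipped = [clip_update(u, clip_norm) for u in updates]
    noise = rng.normal(0.0, noise_multiplier * clip_norm / len(updates),
                       size=clipped[0].shape)
    return np.mean(clipped, axis=0) + noise

updates = [np.random.randn(4) for _ in range(100)]
print(dp_aggregate(updates))
```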

    Optimizing Defense Mechanism Deployment: When deploying defense mechanisms to check whether any adversary is attacking the FL system, the FL server incurs extra computational cost. In addition, different defense mechanisms may have different effectiveness against various attacks and incur different costs. It is important to study how to optimize the timing of deploying defense mechanisms or of announcing deterrence measures. Game-theoretic research holds promise in addressing this challenge.

    Federated learning is still in its infancy and will continue to be an active and important research area for the foreseeable future. As FL evolves, so will the attack mechanisms. It is of vital importance to provide a broad overview of current attacks on FL so that future FL system designers are aware of the potential vulnerabilities in their designs. This survey serves as a concise and accessible overview of this topic, and it would greatly help our understanding of the threat landscape in FL. Global collaboration on FL is emerging through a number of workshops in leading AI conferences¹. The ultimate goal of developing a general purpose defense mechanism robust against various attacks without degrading model performance will require interdisciplinary effort from the wider research community.

    References

    1.

    Abadi, M., et al.: Deep learning with differential privacy. In: CCS, pp. 308–318 (2016)

    2.

    Agarwal, N., Suresh, A.T., Yu, F.X.X., Kumar, S., McMahan, B.: cpSGD: communication-efficient and differentially-private distributed SGD. In: NeurIPS, pp. 7564–7575 (2018)

    3.

    Aono, Y., Hayashi, T., Wang, L., Moriai, S., et al.: Privacy-preserving deep learning via additively homomorphic encryption. IEEE Trans. Inf. Forensics Secur. 13(5), 1333–1345 (2018)

    4.

    Bagdasaryan, E., Veit, A., Hua, Y., Estrin, D., Shmatikov, V.: How to backdoor federated learning. CoRR, arXiv:​1807.​00459 (2018)

    5.

    Barreno, M., Nelson, B., Sears, R., Joseph, A.D., Tygar, J.D.: Can machine learning be secure? In: ICCS, pp. 16–25 (2006)

    6.

    Bernstein, J., Zhao, J., Azizzadenesheli, K., Anandkumar, A.: signSGD with majority vote is communication efficient and fault tolerant. CoRR, arXiv:​1810.​05291 (2018)

    7.

    Bhagoji, A.N., Chakraborty, S., Mittal, P., Calo, S.: Analyzing federated learning through an adversarial lens. CoRR, arXiv:​1811.​12470 (2018)

    8.

    Bhowmick, A., Duchi, J., Freudiger, J., Kapoor, G., Rogers, R.: Protection against reconstruction and its applications in private federated learning. CoRR, arXiv:​1812.​00984 (2018)

    9.

    Biggio, B., Nelson, B., Laskov, P.: Support vector machines under adversarial label noise. In: ACML, pp. 97–112 (2011)

    10.

    Biggio, B., Nelson, B., Laskov, P.: Poisoning attacks against support vector machines. CoRR, arXiv:​1206.​6389 (2012)

    11.

    Blanchard, P., Guerraoui, R., Stainer, J., et al.: Machine learning with adversaries: Byzantine tolerant gradient descent. In: NeurIPS, pp. 119–129 (2017)

    12.

    Bonawitz, K., et al.: Practical secure aggregation for privacy-preserving machine learning. In: CCS, pp. 1175–1191 (2017)

    13.

    Chang, H., Shejwalkar, V., Shokri, R., Houmansadr, A.: Cronus: robust and heterogeneous collaborative learning with black-box knowledge transfer. CoRR, arXiv:​1912.​11279 (2019)

    14.

    Chen, L., Wang, H., Charles, Z., Papailiopoulos, D.: Draco: Byzantine-resilient distributed training via redundant gradients. CoRR, arXiv:​1803.​09877 (2018)

    15.

    Chen, Y., Su, L., Xu, J.: Distributed statistical machine learning in adversarial settings: Byzantine gradient descent. Proc. ACM Meas. Anal. Comput. Syst. 1(2), 44 (2017)

    16.

    Fredrikson, M., Jha, S., Ristenpart, T.: Model inversion attacks that exploit confidence information and basic countermeasures. In: CCS, pp. 1322–1333 (2015)

    17.

    Fung, C., Yoon, C.J., Beschastnikh, I.: Mitigating sybils in federated learning poisoning. CoRR, arXiv:​1808.​04866 (2018)

    18.

    Gao, D., Liu, Y., Huang, A., Ju, C., Yu, H., Yang, Q.: Privacy-preserving heterogeneous federated transfer learning. In: IEEE BigData (2019)

    19.

    Gu, T., Dolan-Gavitt, B., Garg, S.: BadNets: identifying vulnerabilities in the machine learning model supply chain. CoRR, arXiv:​1708.​06733 (2017)

    20.

    Hardy, S., et al.: Private federated learning on vertically partitioned data via entity resolution and additively homomorphic encryption. CoRR, arXiv:​1711.​10677 (2017)

    21.

    Hitaj, B., Ateniese, G., Perez-Cruz, F.: Deep models under the GAN: information leakage from collaborative deep learning. In: CCS (2017)
