AI Assurance: Towards Trustworthy, Explainable, Safe, and Ethical AI

About this ebook

AI Assurance: Towards Trustworthy, Explainable, Safe, and Ethical AI provides readers with solutions and a foundational understanding of the methods that can be applied to test AI systems and provide assurance. Anyone developing software systems with intelligence, building learning algorithms, or deploying AI to a domain-specific problem (such as allocating cyber breaches, analyzing causation at a smart farm, reducing readmissions at a hospital, ensuring soldiers’ safety on the battlefield, or predicting exports of one country to another) will benefit from the methods presented in this book.

As AI assurance is now a major piece in AI and engineering research, this book will serve as a guide for researchers, scientists and students in their studies and experimentation. Moreover, as AI is being increasingly discussed and utilized at government and policymaking venues, the assurance of AI systems—as presented in this book—is at the nexus of such debates.

  • Provides readers with an in-depth understanding of how to develop and apply Artificial Intelligence in a valid, explainable, fair and ethical manner
  • Describes various AI methods, including Deep Learning, Machine Learning, Reinforcement Learning, Computer Vision, Agent-Based Systems, Natural Language Processing, Text Mining, Predictive Analytics, Prescriptive Analytics, Knowledge-Based Systems, and Evolutionary Algorithms
  • Presents techniques for efficient and secure development of intelligent systems in a variety of domains, such as healthcare, cybersecurity, government, energy, education, and more
  • Covers complete example datasets that are associated with the methods and algorithms developed in the book
Language: English
Release date: Oct 12, 2022
ISBN: 9780323918824


    AI Assurance - Feras A. Batarseh

    Part 1: Foundations of AI assurance

    Outline

    1. An introduction to AI assurance

    2. Setting the goals for ethical, unbiased, and fair AI

    3. An overview of explainable and interpretable AI

    4. Bias, fairness, and assurance in AI: overview and synthesis

    5. An evaluation of the potential global impacts of AI assurance

    1: An introduction to AI assurance

    Feras A. Batarseh (a); Jaganmohan Chandrasekaran (b); Laura J. Freeman (c)

    (a) Department of Biological Systems Engineering (BSE), College of Engineering (COE) & College of Agriculture and Life Sciences (CALS), Virginia Tech, Arlington, VA, United States

    (b) Commonwealth Cyber Initiative, Virginia Tech, Arlington, VA, United States

    (c) Department of Statistics, National Security Institute, Virginia Tech, Arlington, VA, United States

    Graphical abstract

    Abstract

    In this chapter, we present a brief introduction to the concept, dimensions, and challenges of AI assurance. The goal is to lay the groundwork for the concepts presented in this book. AI is often assessed by its ability to consistently deliver accurate predictions of behavior in a system. A critical, often overlooked, aspect of developing AI algorithms is that performance is a function of the task the algorithm is assigned, the domain over which the algorithm is intended to operate, and changes to these elements over time. These parameters and their constituent parts form the basis of what makes assuring AI a challenge. Algorithms need to be characterized by understanding the factors that contribute to stable performance across an operational environment (e.g., no dramatic perturbation by small changes and/or no measurable degradation over time). This chapter presents a high-level introduction to AI assurance and points readers to related areas of interest in the book.

    Keywords

    AI assurance; testing & evaluation; responsible AI

    Highlights

    •  An introduction to AI assurance

    •  Topics covered and questions answered in this book

    •  Book's main thrusts and high-level summaries

    •  The need for AI assurance

    1.1 Motivation and overview

    To accurately and consistently predict behaviors in systems, AI systems require data for training and for testing the outcomes. The iterative process of improving accuracy and precision in developed models involves trade-offs in performance, data quality, and other environmental factors. AI's predictive power can be affected by changes in the training/testing data, the model, and its context. In this chapter, we discuss sources of change captured within the operational context (Brézillon and Gonzalez, 2014) of an AI's execution, changes that are often the source of inconsistencies in AI systems. Model and data changes are discussed throughout this book, especially those related to concept drift in applied settings, with examples of how these inconsistencies emerge (Žliobaitė et al., 2016; McPherson, 2021; Tucker, 2021).
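
    As a concrete illustration of concept drift, the following minimal sketch (our own, not from the chapter) monitors a deployed classifier's accuracy over successive labeled batches and flags windows whose accuracy falls well below the training baseline; the model, window size, and drift threshold are illustrative assumptions.

```python
# Hedged sketch: flagging possible concept drift by monitoring accuracy per batch.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Train on the original concept: the label depends on x0 + x1.
X_train = rng.normal(0.0, 1.0, size=(1000, 2))
y_train = (X_train[:, 0] + X_train[:, 1] > 0).astype(int)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
baseline_acc = model.score(X_train, y_train)

def labeled_batch(drifted, n=200):
    """Simulate a production batch; under drift the input-label relation changes."""
    X = rng.normal(0.0, 1.0, size=(n, 2))
    y = (X[:, 0] - X[:, 1] > 0).astype(int) if drifted else (X[:, 0] + X[:, 1] > 0).astype(int)
    return X, y

# In production, ground-truth labels typically arrive with a delay; accuracy is
# then recomputed per time window and compared with the baseline.
for t, drifted in enumerate([False, False, True, True]):
    X_new, y_new = labeled_batch(drifted)
    acc = model.score(X_new, y_new)
    status = "possible concept drift, consider retraining" if acc < baseline_acc - 0.10 else "stable"
    print(f"window {t}: accuracy {acc:.2f} -> {status}")
```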

    The need for assurance was reinforced in a recent report by the US National Security Commission on Artificial Intelligence (NSCAI), which proposed that government agencies, such as the National Institute of Standards and Technology (NIST) and the National Institute of Food and Agriculture (NIFA), should provide and regularly refresh a set of standards, performance metrics, and tools for qualified confidence in AI models, data, training environments, and predicted outcomes (NSCAI, 2021). Government, industry, and academia need to work collectively to establish these resources and standards for the AI community. Accordingly, this book's motivation is to provide a vision for the path forward and to serve as a foundational and theoretical manifesto for the assurance of AI systems.

    1.1.1 Book content

    The incremental testing movement was an afterthought in the software engineering world. If we aim to learn from that experience, we ought to develop assurance incrementally and as part of the AI lifecycle. Therefore, assurance should not be treated as a separate component while developing AI systems; instead, it should be part of the incremental learning process of any agent, environment, or algorithm. Chapter 2 of this book discusses the notion that ensuring fair, unbiased, and ethical AI needs to be a continuous endeavor, making AI assurance a process rather than a goal; it also includes a valuable exploration of generalization and other major assurance challenges, such as the control problem, value loading, and human-AI alignment.

    A well-articulated process and a clearly defined set of metrics to categorize and measure the maturity of assurance could go a long way toward establishing a common understanding of these systems' dependability. Similar process structures, e.g., the Capability Maturity Model Integration (CMMI), have been employed to measure an organization's ability to produce high-quality software systems. The advantage of such a model is that it encourages all stakeholders to agree on a set of metrics and processes to measure the quality of the AI systems being produced and deployed. It also shows the path to achieving gradually higher levels of assurance against a consensus set of criteria. We believe that a similar set of metrics and processes under a maturity model framework will not only streamline AI systems' development efforts but also foster the sharing of implementation experiences and best practices. Such standards, however, should be defined in terms of AI-specific metrics; Chapter 3, for instance, presents a rich overview of statistical methods and foundational metrics for measuring the assurance of AI systems, with a focus on explainability and interpretability. When assurance goals such as explainability, fairness, and trustworthiness are considered together, trade-off decisions have to be made. More complex algorithms (such as neural networks) tend to be less interpretable and more prone to various kinds of bias, for instance. As reported by Gunning and Aha (2019), the performance of AI algorithms is inversely proportional to the explainability of the model's decisions. Accordingly, Chapter 4 presents bias reduction methods and compares them in terms of the overall validation of AI systems.
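
    The accuracy-interpretability trade-off can be made concrete with a small sketch of our own (not from the book): a depth-limited decision tree whose rules can be printed and read directly is compared with a random forest, which usually scores higher but offers no comparably simple explanation of its decisions. The dataset and hyperparameters here are illustrative assumptions.

```python
# Hedged sketch of the accuracy vs. interpretability trade-off.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, export_text
from sklearn.ensemble import RandomForestClassifier

data = load_breast_cancer()
X_tr, X_te, y_tr, y_te = train_test_split(
    data.data, data.target, test_size=0.3, random_state=42)

tree = DecisionTreeClassifier(max_depth=2, random_state=42).fit(X_tr, y_tr)         # interpretable
forest = RandomForestClassifier(n_estimators=200, random_state=42).fit(X_tr, y_tr)  # opaque

print("shallow tree accuracy :", round(tree.score(X_te, y_te), 3))
print("random forest accuracy:", round(forest.score(X_te, y_te), 3))
print(export_text(tree, feature_names=list(data.feature_names)))  # human-readable decision rules
```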

    As AI is adopted across all domains, assuring AI systems is becoming a matter of national security; it has effects on manufacturing, cyber-physical systems, the economy, healthcare, government, and many other sectors. Chapter 5 introduces potential short- and long-term global impacts of AI assurance. Accomplishing assurance is nonetheless a complicated endeavor; Chapters 6, 8, and 10 present detailed frameworks and lifecycles that can be used to establish a process for assurance within any domain, by applying algorithm assurance concepts such as inference, causality, resilience, and elasticity. Data assurance, however, is another critical dimension of AI assurance (Kulkarni et al., 2020); Chapters 7 and 9 provide answers and recommendations on data wrangling methods for improved model outcomes, and address outlier detection issues in training data and their effects on the outcomes of learning algorithms. The last part of the book presents a variety of applications and illustrates the need for assurance in many sectors, such as Economics (Chapter 11), Healthcare (Chapter 12), Engineering (Chapter 13), Agriculture (Chapters 14 and 15), and Public Policy (Chapter 16).

    1.2 The need for new assurance methods

    Recent advancements in AI have demonstrated the potential of AI-based software systems to successfully perform tasks that generally require human-level intelligence. A survey by Batarseh et al. (2021) recommends a set of assurance goals, provides a new comprehensive definition of AI assurance, and notes that AI-based software systems are being rapidly adopted across various domains. An AI-based software system consists of one or more machine learning models that are used to perform intelligent tasks, such as object identification, pedestrian detection, speech translation, and decision support. In the AI engineering lifecycle, developing a model is a multi-step process; one of the critical initial steps is algorithm selection. An AI framework, such as scikit-learn, TensorFlow, or PyTorch, provides a collection of off-the-shelf AI algorithms. The selected algorithm analyzes the dataset, infers and learns its hidden patterns, and derives a decision logic for mapping inputs to outputs. This activity is referred to as the training phase, and the derived decision logic is referred to as a trained AI model (Chandrasekaran, 2021; Felderer and Ramler, 2021). In the training phase, multiple assurance challenges can arise, such as data bias, data incompleteness, dark data, or data collection inconsistencies.

    Despite the promising potential demonstrated by AI-based software systems, they are error-prone and tend to fail once deployed in real-world environments (Lee, 2016; Dastin, 2018; Vincent, 2020; Mitchell, 2021; Boudette, 2021). Such failures can have serious, even fatal, consequences in safety-critical domains (Cellan-Jones, 2020; Newman, 2021). However, assurance, testing, validation, and verification of systems is not a new problem; the software engineering community has made major progress and drawn multiple conclusions on these fronts, some of which are very useful for AI, while others do not apply at all. In the remainder of this section, we argue against recycling existing assurance methods and present the case for a new set of AI assurance methods.

    In the software development lifecycle, testing is performed before the software system is released. The objective of the testing activity is to ensure that the software system will behave as intended. According to ISO/IEC/IEEE 29119-1:2013 (IEEE_Software_Testing, 2013), the primary goals of software testing are to provide information about the quality of the test item and any residual risk in relation to how much the test item has been tested, to find defects in the test item prior to its release for use, and to mitigate the risks of poor product quality to the stakeholders. Poor software quality can have adverse effects and cause severe damage to its stakeholders; a Synopsys report estimates that, in 2021, software failures cost the US around $2 trillion (Armerding, 2021). Testing is a complex yet essential activity in the software development lifecycle, and over the years several approaches and methodologies have been developed to effectively test and release software systems. However, they are tailored towards testing and evaluating traditional software systems, not AI. In traditional software systems, the decision logic is written by humans based on the requirements provided by stakeholders. More importantly, the decision logic is deterministic: for a given input, the system is guaranteed to produce the same output on every execution.

    In contrast, AI systems derive their logic from a training dataset, and in most cases the algorithms in an AI-based software system behave in a stochastic manner. Furthermore, an AI software system will exhibit changes in its behavior with different data, different contexts, and different users, all of which exacerbate the assurance challenge (Freeman, 2020). Therefore, the behavior of an AI-based software system is influenced by a combination of factors, all requiring assurance. Additionally, in traditional software systems the decision logic is derived from the requirements; hence, test cases are generated from the business requirements, and each test case has a predetermined expected output. In AI-based software systems, by contrast, there are no written requirements in the traditional sense. Instead, the AI model derives its logic from the training dataset (through supervised or unsupervised training processes). Therefore, AI-based software systems suffer from the test oracle problem (Weyuker, 1982; Murphy et al., 2007). That is, in most cases, the intended system behavior for a test case can hardly be predefined. In other words, the exact intended behavior of an AI software system is not fully known until the scenario occurs in real time (as in reinforcement learning scenarios).
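
    The contrast between a deterministic oracle and the test oracle problem can be illustrated with a short sketch of our own (an assumption, not the authors' example): a traditional function is checked against an exact expected output, while the AI model is checked only against a metamorphic relation that should hold even though the exact output for each input is unknown. The model and the chosen relation (invariance of nearest-neighbor predictions under uniform feature scaling) are illustrative.

```python
# Hedged sketch: exact oracle for traditional code vs. a metamorphic check for a model.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

# Traditional software: the expected output is known in advance (a true oracle).
def add(a, b):
    return a + b
assert add(2, 3) == 5

# AI model: no per-input oracle exists, but a relation between runs can be asserted.
# Scaling every feature by the same positive constant preserves Euclidean neighbor
# ordering, so a k-NN classifier's predictions should be unchanged.
X, y = load_iris(return_X_y=True)
m_original = KNeighborsClassifier(n_neighbors=5, algorithm="brute").fit(X, y)
m_scaled = KNeighborsClassifier(n_neighbors=5, algorithm="brute").fit(X * 2.0, y)

assert np.array_equal(m_original.predict(X), m_scaled.predict(X * 2.0)), \
    "metamorphic relation violated"
print("metamorphic check passed: predictions invariant under uniform scaling")
```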

    An AI-based software system with higher prediction accuracy (closer to 100%) is expected to behave nearly error-free and to make objective, impartial decisions. In practice, however, when deployed in real-world conditions, AI-enabled software systems can inadvertently exhibit discriminatory behavior. For example, an AI algorithm used by a major US-based healthcare institution to identify patients for extra care through an intensive care management program appeared to discriminate against patients of African-American ancestry (Strickland, 2019). The root cause of such unintentional yet discriminatory behavior can often be attributed to inherent bias in the dataset used to train the underlying AI model. Issues reported in Buolamwini and Gebru (2018); Strickland (2019); Caliskan (2021); Zang (2021) indicate that evaluating the quality of AI-based systems requires looking beyond the correctness of those systems. As AI-based software systems are data-intensive, in addition to correctness it is essential to test for bias and variance in the system (to identify over- or under-fitting issues). From an assurance standpoint, it is imperative to develop standardized assurance methods capable of detecting and mitigating biased behavior before AI-based software systems are deployed.
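
    A minimal pre-deployment bias check might look like the following sketch (our own illustration with synthetic data, not the study cited above): positive-prediction rates and error rates are compared across a protected group attribute, and a large gap is flagged for review. The group labels, the bias injected into the predictions, and the tolerance are all assumptions for demonstration.

```python
# Hedged sketch: comparing prediction and error rates across a protected group.
import numpy as np

rng = np.random.default_rng(1)
group = rng.integers(0, 2, size=1000)        # synthetic 0/1 protected attribute
y_true = rng.integers(0, 2, size=1000)
# Synthetic model outputs, deliberately biased against group 1 for illustration.
y_pred = np.where((group == 1) & (rng.random(1000) < 0.3), 0, y_true)

for g in (0, 1):
    mask = group == g
    positive_rate = y_pred[mask].mean()                    # demographic-parity style measure
    error_rate = (y_pred[mask] != y_true[mask]).mean()
    print(f"group {g}: positive rate {positive_rate:.2f}, error rate {error_rate:.2f}")

gap = abs(y_pred[group == 0].mean() - y_pred[group == 1].mean())
if gap > 0.10:                                             # illustrative tolerance
    print(f"selection-rate gap {gap:.2f} exceeds tolerance -> investigate for bias")
```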

    Furthermore, in the case of a traditional software system, when a test case is executed, a deviation of the observed behavior from the expected behavior is considered a failure. In the case of an AI-based software system, however, the correctness of the system's behavior is evaluated based on the prediction accuracy (a statistical score) of the AI system (Zhang et al., 2020; Riccio et al., 2020). A statistical score (for example, correct predictions/total predictions) is calculated over a test dataset, and a model achieving higher accuracy is considered one of higher quality. The acceptable threshold for the prediction accuracy score also varies across domains, users, and models. It follows that, as this book presents, assurance of AI systems can be domain-specific or domain-dependent, model-specific or model-agnostic, but it is certainly needed in all cases, scenarios, and deployments. For traditional software systems, in most cases, the root cause of an unexpected behavior (failure) can be localized to a segment of the source code. When an AI system exhibits a failure, however, it can be caused by the training dataset, missing data, outliers, the choice of hyperparameters, or the trained model and its architecture (Batarseh and Gonzalez, 2018). For example, as reported in Wiggers (2021), inherent bias in the training dataset results in a discriminatory AI software system; in other cases, unexpected behavior can even be attributed to the choice of AI algorithm (Yee et al., 2021). Given these fundamental differences between traditional and AI-based software systems, and the quality assurance challenges that arise from them, it is vital to develop assurance methods that are specifically tailored to assessing and evaluating AI-based systems, a notion that is covered throughout this book.
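
    The statistical pass/fail criterion described above reduces to a simple computation; the following sketch (our own, with made-up values and a threshold that would in practice be a domain-specific assumption) computes accuracy as correct predictions divided by total predictions over a held-out test set and compares it with a release threshold.

```python
# Hedged sketch: accuracy = correct / total, judged against a domain-specific threshold.
import numpy as np

y_true = np.array([1, 0, 1, 1, 0, 1, 0, 0, 1, 1])    # held-out test labels (toy data)
y_pred = np.array([1, 0, 1, 0, 0, 1, 0, 1, 1, 1])    # model predictions on the test set

accuracy = (y_pred == y_true).mean()                  # 8 correct / 10 total = 0.80
THRESHOLD = 0.95                                      # e.g. a safety-critical domain

print(f"accuracy = {accuracy:.2f}")
print("release" if accuracy >= THRESHOLD else "do not release: below the domain threshold")
```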

    1.3 Conclusion

    As reflected by this book, AI assurance is based on a set of trade-offs. An AI model that exhibits better performance is generally a black box, and its reasoning or decision-making process is not easily understandable to users. From an assurance standpoint, in addition to evaluating a model's correctness (accuracy), it is essential to understand why a model makes a specific decision. Regardless of a model's predictive capabilities, when the reasoning behind its decisions is largely opaque, users' trust in the system suffers. From a quality assurance perspective, it is essential to develop approaches and tools that generate fair outcomes that are secure, safe, and easily understandable to all stakeholders (AI engineers, end-users, business owners, data scientists) involved in the process. Accordingly, we provide an updated definition of AI assurance, which extends the one presented in Batarseh et al. (2021) and is adopted across this book: AI assurance is a process that is applied at all stages of the AI engineering lifecycle, ensuring that any intelligent system is producing outcomes that are valid, verified, data-driven, trustworthy, and explainable to a layman, resilient against adversaries, robust within its domain, ethical in the context of its deployment, unbiased in its learning, and fair to its users.

    References

    Armerding, 2021 T. Armerding, What is the cost of poor software quality in the US? https://www.synopsys.com/blogs/software-security/poor-software-quality-costs-us/; 2021.

    Batarseh et al., 2021 F.A. Batarseh, L. Freeman, C.H. Huang, A survey on artificial intelligence assurance, Journal of Big Data 2021;8:1–30.

    Batarseh and Gonzalez, 2018 F.A. Batarseh, A.J. Gonzalez, Predicting failures in agile software development through data analytics, Software Quality Journal 2018;26:49–66.

    Brézillon and Gonzalez, 2014 P. Brézillon, A.J. Gonzalez, Context in Computing: a Cross-Disciplinary Approach for Modeling the Real World. Springer; 2014.

    Buolamwini and Gebru, 2018 J. Buolamwini, T. Gebru, Gender shades: intersectional accuracy disparities in commercial gender classification, Conference on Fairness, Accountability and Transparency. PMLR; 2018:77–91.

    Caliskan, 2021 A. Caliskan, Detecting and mitigating bias in natural language processing, https://www.brookings.edu/research/detecting-and-mitigating-bias-in-natural-language-processing/; 2021.

    Cellan-Jones, 2020 R. Cellan-Jones, Uber's self-driving operator charged over fatal crash, https://www.bbc.com/news/technology-54175359; 2020.

    Chandrasekaran, 2021 J. Chandrasekaran, Testing artificial intelligence-based software systems. [Ph.D. thesis] 2021.

    Dastin, 2018 J. Dastin, Amazon scraps secret AI recruiting tool that showed bias against women, https://www.reuters.com/article/us-amazon-com-jobs-automation-insight-idUSKCN1MK08G; 2018.

    Boudette, 2021 N.E. Boudette, Tesla says Autopilot makes its cars safer. Crash victims say it kills, https://www.nytimes.com/2021/07/05/business/tesla-autopilot-lawsuits-safety.html; 2021.

    Felderer and Ramler, 2021 M. Felderer, R. Ramler, Quality assurance for AI-based systems: overview and challenges (introduction to interactive session), International Conference on Software Quality. Springer; 2021:33–42.

    Freeman, 2020 L. Freeman, Test and evaluation for artificial intelligence, Insight 2020;23:27–30.

    Gunning and Aha, 2019 D. Gunning, D. Aha, DARPA's explainable artificial intelligence (XAI) program, AI Magazine 2019;40:44–58.

    IEEE_Software_Testing, 2013 IEEE_Software_Testing, ISO/IEC/IEEE international standard – Software and systems engineering – Software testing – Part 1: Concepts and definitions, ISO/IEC/IEEE 29119-1:2013(E). 2013:1–64. doi:10.1109/IEEESTD.2013.6588537.

    Kulkarni et al., 2020 A. Kulkarni, D. Chong, F.A. Batarseh, Foundations of data imbalance and solutions for a data democracy, Data Democracy. Elsevier; 2020:83–106.

    Lee, 2016 P. Lee, Learning from Tay's introduction, https://blogs.microsoft.com/blog/2016/03/25/learning-tays-introduction/; 2016.

    McPherson, 2021 S. McPherson, Fixing ‘concept drift’: retraining AI systems to deliver accurate insights at the edge, https://gcn.com/articles/2021/07/16/ai-concept-drift.aspx; 2021.

    Mitchell, 2021 R. Mitchell, Tesla's handling of full self-driving bug raises alarms, https://www.latimes.com/business/story/2021-11-03/teslas-handling-braking-bug-in-public-self-driving-test; 2021.

    Murphy et al., 2007 C. Murphy, G.E. Kaiser, M. Arias, An approach to software testing of machine learning applications. 2007.

    Newman, 2021 R. Newman, It's time to notice Tesla's autopilot death toll, https://news.yahoo.com/its-time-to-notice-teslas-autopilot-death-toll-195849408.html; 2021.

    NSCAI, 2021 NSCAI, Chapter 7 - NSCAI final report, https://reports.nscai.gov/final-report/chapter-7/; 2021.

    Riccio et al., 2020 V. Riccio, G. Jahangirova, A. Stocco, N. Humbatova, M. Weiss, P. Tonella, Testing machine learning based systems: a systematic mapping, Empirical Software Engineering 2020;25:5193–5254.

    Strickland, 2019 E. Strickland, Racial bias found in algorithms that determine health care for millions of patients, https://spectrum.ieee.org/racial-bias-found-in-algorithms-that-determine-health-care-for-millions-of-patients/; 2019.

    Tucker, 2021 B. Tucker, Managing the risks of adopting AI engineering, https://insights.sei.cmu.edu/blog/managing-the-risks-of-adopting-ai-engineering/; 2021.

    Vincent, 2020 J. Vincent, AI camera operator repeatedly confuses bald head for soccer ball during live stream, https://www.theverge.com/tldr/2020/11/3/21547392/ai-camera-operator-football-bald-head-soccer-mistakes; 2020.

    Weyuker, 1982 E.J. Weyuker, On testing non-testable programs, Computer Journal 1982;25:465–470.

    Wiggers, 2021 K. Wiggers, Employees attribute AI project failure to poor data quality, https://venturebeat.com/2021/03/24/employees-attribute-ai-project-failure-to-poor-data-quality/; 2021.

    Yee et al., 2021 K. Yee, U. Tantipongpipat, S. Mishra, Image cropping on Twitter: fairness metrics, their limitations, and the importance of representation, design, and agency, arXiv preprint arXiv:2105.08667; 2021.

    Zang, 2021 J. Zang, Solving the problem of racially discriminatory advertising on Facebook, https://www.brookings.edu/research/solving-the-problem-of-racially-discriminatory-advertising-on-facebook/; 2021.

    Zhang et al., 2020 J.M. Zhang, M. Harman, L. Ma, Y. Liu, Machine learning testing: survey, landscapes and horizons, IEEE Transactions on Software Engineering 2020.

    Žliobaitė et al., 2016 I. Žliobaitė, M. Pechenizkiy, J. Gama, An overview of concept drift applications, Big Data Analysis: New Algorithms for a New Society. 2016:91–114.

    2: Setting the goals for ethical, unbiased, and fair AI

    Antoni Lorente, Department of Digital Humanities, King's College London, London, United Kingdom

    Graphical abstract

    Abstract

    The main goal of AI assurance is to ensure that AI systems are, among other things, ethical, unbiased, and fair. In this chapter, three different approaches to the value alignment problem, i.e., how to ensure that an AI's decisions and behaviors are aligned with our values, are introduced. The chapter claims that AI assurance provides a shared vernacular and a formal framework to meaningfully apply these strategies for dealing with the value alignment problem, motivating several questions that are fundamental to such alignment. A brief overview of three different normative theories pinpoints the dilemmatic nature of defining the good, justifying in turn the need to tackle the problems of implementation, specification, and moral uncertainty. It is argued that even though behavior-based learning allows deferring some of these questions, for AI assurance to attain its goals, both now and in the future, the process of ensuring fair, unbiased, and ethical AI needs to be a continuous endeavor, making AI assurance a process and not a goal.

    Keywords

    Value alignment problem; AI ethics; specification; implementation; uncertainty; CIRL; Artificial Intelligence

    Highlights

    •  This chapter introduces AI assurance as a process that enables fair, unbiased, and ethical AI

    •  Ethical interpretations are explained via the problem of aligning AI systems with our values and interests

    •  The embrace of behavior-based value learning methods motivates the need to further explore AI assurance as a crucial actor within AI systems development

    2.1 Introduction and background

    AI assurance is a field of research entrusted with a crucial task: ensuring that the development and adoption of advanced AI systems do not jeopardize the fundamental pillars on which our society stands. Such assurance consequently involves discussions about development, transparency, commercialization, regulation, control, and use, which need to be addressed at multiple levels of abstraction: from the most fundamental mathematical formalisms to the complexity of everyday language. The aim of AI assurance is thus to make sure that any AI system being developed and deployed produces outcomes that are valid, verified, data-driven, trustworthy and explainable to a layman, ethical in the context of its deployment, unbiased in its learning, and fair to its users (Batarseh et al., 2021).

    The strength of the process suggested here is that it applies across the whole range of technologies that fall under the umbrella term artificial intelligence. It is a claim that compels any and all AI systems to satisfy a bare minimum of requirements for them to be acceptable not only in technical or commercial terms, but also in social, ethical, and legal ones. AI assurance thus provides, on the one hand, the framework and, on the other, the vernacular to meaningfully assess each step of both the development and the adoption of AI systems so that they are fair, safe, unbiased, and ethical.

    AI systems are increasingly embedded in our social context, and regardless of the technical brilliance that underpins them, the possible futures that AI opens up to us are both staggering and uncanny. Original and complex AI systems, capable of undertaking tasks that would otherwise be unfathomable, are often opaque to the general public, giving grounds for anxiety. But once the emotion wears off, the difficult and oftentimes profound philosophical questions remain. How should we align AI systems with our values? How similar to natural intelligence is artificial intelligence? While some of these questions are deeply metaphysical, especially those about conscience, free will, agency, and autonomy, many others touch on ethical dilemmas that have shaped philosophy for millennia.

    AI has evolved via different approaches to learning. However, one of the main problems when training a machine learning algorithm is generalization, or the capacity of a given model to adapt to new datasets. Intuitively, machine learning is the field of study that gives computers the ability to learn without being explicitly programmed (Samuel, 1959). Such learning is materialized via a model of a given portion of the world, articulated by a hypothesis that is inferred from a dataset. If that dataset has previously been partitioned into labeled categories, we call the process supervised learning; otherwise, the learning is unsupervised. Finally, if the agent conditions its actions on the reward it receives, this is called reinforcement learning (RL).
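
    The distinction between these paradigms can be sketched briefly in code (our own illustration, not the chapter's); the dataset and models below are arbitrary assumptions, and the reinforcement learning case is only indicated in a comment, since a full agent-environment loop would be longer.

```python
# Hedged sketch of the learning paradigms named above.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

X, y = load_iris(return_X_y=True)

# Supervised: the dataset is partitioned into labeled categories (X with labels y).
clf = LogisticRegression(max_iter=1000).fit(X, y)
print("supervised training accuracy:", round(clf.score(X, y), 3))

# Unsupervised: only X is given; structure (here, clusters) is inferred without labels.
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print("unsupervised cluster sizes  :", [int((km.labels_ == c).sum()) for c in range(3)])

# Reinforcement learning (indicated only): an agent repeatedly picks an action,
# observes a reward, and updates its policy, e.g.
#   value[action] += learning_rate * (reward - value[action])
```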

    The problem of generalization is troublesome for several reasons. First, it limits the capacity of an AI system to be used beyond the training dataset. But second, and more importantly, it increases the chances of misinterpreting or misclassifying elements of the real world. Given the growing impact of AI algorithms on our everyday lives, trying to align AI systems with our values is a crucial task for researchers. It is in this sense that philosophical approaches to AI provide a meaningful background for reformulating technical problems as concrete ethical and philosophical problems.

    This chapter begins with an overview of three main formulations of the value alignment problem in AI. Section 2.1.1 introduces Nick Bostrom's concerns regarding the control problem and the value-loading problem in the context of the existential risks that AI entails. Section 2.1.2 briefly discusses Stuart Russell's defense of human-compatible AI, paying special attention to his proposal to abandon the standard model of AI, as well as to the possibilities that assistance games, such as cooperative inverse reinforcement learning, bring. Section 2.1.3 discusses Brian Christian's value alignment problem, which provides meaningful insights regarding the role of training data and objective functions in developing aligned AI systems. In Section 2.1.4, AI assurance is interpreted as a process that provides a shared vernacular and a formal framework to implement the discussions above.

    The second part of this chapter is rooted in some fundamental aspects regarding the development of safe and ethical AI. Section 2.2.1 introduces three of the most important normative theories in ethics: duty-based deontology, utilitarianism, and virtue ethics. After this, the implementation problem is discussed in Section 2.2.2, considering two possible strategies to develop a moral sense in a machine: a top-down and a bottom-up approach. Then, in Section 2.2.3, some problems related to the nature of intentional statements are introduced, focusing in particular on the problem of specification and the role of moral uncertainty.

    The chapter concludes by insisting that, for AI assurance to succeed in its goals, the process of ensuring that AI systems are ethical, unbiased, fair, and safe needs to be an iterative, interactive, and deliberative one. Thus AI assurance reformulates AI ethics not as a goal, but as a new relationship with technology.

    2.1.1 Value-loading

    In the book Superintelligence: Paths, Dangers, Strategies, Nick Bostrom (2014) presents two problems that are crucial for AI assurance. On the one hand, what Bostrom calls "the control problem" raises the question of which principles should buttress a framework to harness an artificial general intelligence that could overtake us. On the other hand, and given the provisional nature of control mechanisms, "the value-loading problem" allows us to formalize the puzzle of building intelligent systems whose values are aligned with ours. The two sections that follow provide a brief outline.

    2.1.1.1 The control problem

    Both the control and the value-loading problem are raised from the perspective of the existential risk that an AI (a general, superintelligent one) could pose to humanity in the longer run. And even though AI assurance and machine ethics are primarily concerned with current developments (i.e., those related to machines that are still far inferior to humans in terms of general intelligence), it is both intrinsically and instrumentally enriching to engage in the exercises that Bostrom proposes. The idea of an artificial general intelligence (or AGI) has motivated relevant interdisciplinary research agendas that have contributed both to preventing such an outcome and to ensuring better systems. Moreover, thinking about this existential risk not only allows us to consider the long-term consequences of our current decisions, it also puts our ultimate goals into perspective. It is, perhaps, because of this that Bostrom's work has been so influential.

    But how could we control an AGI? To answer this question, Bostrom introduces the possibility of an artificial intelligence explosion, a putative process in which a moderately intelligent agent improves radically until reaching a superhuman level of intelligence via recursive self-improvement (Bostrom, 2014, p. 408). The main goal of the control problem is to achieve a controlled detonation of such intelligence (Bostrom, 2014, p. 155). To do so, the discussion targets which methods could be used to ensure that such an agent realizes the sponsor's goals.

    Bostrom proposes several methods to ensure a controlled detonation, which can be grouped into two categories: capability control and motivation selection. Capability control methods include the following:

    •  Boxing: This process consists of containing (either physically or informationally) the artificial intelligence.

    •  Incentive methods: These are methods that place the agent in an environment where it finds instrumental reasons to behave consistently with the designer and developer's
