
Ensemble Methods for Machine Learning
Ebook · 813 pages · 6 hours


About this ebook

Ensemble machine learning combines the power of multiple machine learning approaches, working together to deliver models that are highly performant and highly accurate.

Inside Ensemble Methods for Machine Learning you will find:

  • Methods for classification, regression, and recommendations
  • Sophisticated off-the-shelf ensemble implementations
  • Random forests, boosting, and gradient boosting
  • Feature engineering and ensemble diversity
  • Interpretability and explainability for ensemble methods

Ensemble machine learning trains a diverse group of machine learning models to work together, aggregating their output to deliver richer results than a single model. Now in Ensemble Methods for Machine Learning you’ll discover core ensemble methods that have proven records in both data science competitions and real-world applications. Hands-on case studies show you how each algorithm works in production. By the time you're done, you'll know the benefits, limitations, and practical methods of applying ensemble machine learning to real-world data, and be ready to build more explainable ML systems.

About the Technology

Automatically compare, contrast, and blend the output from multiple models to squeeze the best results from your data. Ensemble machine learning applies a “wisdom of crowds” method that dodges the inaccuracies and limitations of a single model. By basing responses on multiple perspectives, this innovative approach can deliver robust predictions even without massive datasets.

About the Book

Ensemble Methods for Machine Learning teaches you practical techniques for applying multiple ML approaches simultaneously. Each chapter contains a unique case study that demonstrates a fully functional ensemble method, with examples including medical diagnosis, sentiment analysis, handwriting classification, and more. There’s no complex math or theory—you’ll learn in a visuals-first manner, with ample code for easy experimentation!

What’s Inside

  • Bagging, boosting, and gradient boosting
  • Methods for classification, regression, and retrieval
  • Interpretability and explainability for ensemble methods
  • Feature engineering and ensemble diversity

About the Reader

For Python programmers with machine learning experience.

About the Author

Gautam Kunapuli has over 15 years of experience in academia and the machine learning industry.

Table of Contents

PART 1 - THE BASICS OF ENSEMBLES
1 Ensemble methods: Hype or hallelujah?
PART 2 - ESSENTIAL ENSEMBLE METHODS
2 Homogeneous parallel ensembles: Bagging and random forests
3 Heterogeneous parallel ensembles: Combining strong learners
4 Sequential ensembles: Adaptive boosting
5 Sequential ensembles: Gradient boosting
6 Sequential ensembles: Newton boosting
PART 3 - ENSEMBLES IN THE WILD: ADAPTING ENSEMBLE METHODS TO YOUR DATA
7 Learning with continuous and count labels
8 Learning with categorical features
9 Explaining your ensembles
Language: English
Publisher: Manning
Release date: May 30, 2023
ISBN: 9781638356707
Author

Gautam Kunapuli

Gautam Kunapuli has over 15 years of experience in academia and the machine learning industry. He has developed several novel algorithms for diverse application domains including social network analysis, text and natural language processing, behavior mining, educational data mining and biomedical applications. He has also published papers exploring ensemble methods in relational domains and with imbalanced data.


    Book preview

    Ensemble Methods for Machine Learning - Gautam Kunapuli

    inside front cover


    Ensemble Methods for Machine Learning

    Gautam Kunapuli

    To comment go to liveBook

    Manning

    Shelter Island

    For more information on this and other Manning titles go to

    www.manning.com

    Copyright

For online information and ordering of these and other Manning books, please visit www.manning.com. The publisher offers discounts on these books when ordered in quantity.

    For more information, please contact

    Special Sales Department

    Manning Publications Co.

    20 Baldwin Road

    PO Box 761

    Shelter Island, NY 11964

    Email: orders@manning.com

    ©2023 by Manning Publications Co. All rights reserved.

    No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by means electronic, mechanical, photocopying, or otherwise, without prior written permission of the publisher.

    Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in the book, and Manning Publications was aware of a trademark claim, the designations have been printed in initial caps or all caps.

    ♾ Recognizing the importance of preserving what has been written, it is Manning’s policy to have the books we publish printed on acid-free paper, and we exert our best efforts to that end. Recognizing also our responsibility to conserve the resources of our planet, Manning books are printed on paper that is at least 15 percent recycled and processed without the use of elemental chlorine.

    ISBN: 9781617297137

    dedication

    To my cousin Bhima,

    who inspired me to board a plane and go far away from home, who made grad school look glamorous (it wasn’t, but was worth it), without whose example, my own journey would have been very different, and this book would probably not exist.

    Wish you were here.

    contents

    front matter

    preface

    acknowledgments

    about this book

    about the author

    about the cover illustration

    Part 1 The basics of ensembles

    1 Ensemble methods: Hype or hallelujah?

    1.1 Ensemble methods: The wisdom of the crowds

    1.2 Why you should care about ensemble learning

    1.3 Fit vs. complexity in individual models

    Regression with decision trees

    Regression with support vector machines

    1.4 Our first ensemble

    1.5 Terminology and taxonomy for ensemble methods

    Part 2 Essential ensemble methods

    2 Homogeneous parallel ensembles: Bagging and random forests

    2.1 Parallel ensembles

    2.2 Bagging: Bootstrap aggregating

    Intuition: Resampling and model aggregation

    Implementing bagging

    Bagging with scikit-learn

    Faster training with parallelization

    2.3 Random forests

    Randomized decision trees

    Random forests with scikit-learn

    Feature importances

    2.4 More homogeneous parallel ensembles

    Pasting

    Random subspaces and random patches

    Extra Trees

    2.5 Case study: Breast cancer diagnosis

    Loading and preprocessing

    Bagging, random forests, and Extra Trees

    Feature importances with random forests

    3 Heterogeneous parallel ensembles: Combining strong learners

    3.1 Base estimators for heterogeneous ensembles

    Fitting base estimators

    Individual predictions of base estimators

    3.2 Combining predictions by weighting

    Majority vote

    Accuracy weighting

    Entropy weighting

    Dempster-Shafer combination

    3.3 Combining predictions by meta-learning

    Stacking

    Stacking with cross validation

    3.4 Case study: Sentiment analysis

    Preprocessing

    Dimensionality reduction

    Blending classifiers

    4 Sequential ensembles: Adaptive boosting

    4.1 Sequential ensembles of weak learners

    4.2 AdaBoost: Adaptive boosting

    Intuition: Learning with weighted examples

    Implementing AdaBoost

    AdaBoost with scikit-learn

    4.3 AdaBoost in practice

    Learning rate

    Early stopping and pruning

    4.4 Case study: Handwritten digit classification

    Dimensionality reduction with t-SNE

    Boosting

    4.5 LogitBoost: Boosting with the logistic loss

    Logistic vs. exponential loss functions

    Regression as a weak learning algorithm for classification

    Implementing LogitBoost

    5 Sequential ensembles: Gradient boosting

    5.1 Gradient descent for minimization

    Gradient descent with an illustrative example

    Gradient descent over loss functions for training

    5.2 Gradient boosting: Gradient descent + boosting

    Intuition: Learning with residuals

    Implementing gradient boosting

    Gradient boosting with scikit-learn

    Histogram-based gradient boosting

    5.3 LightGBM: A framework for gradient boosting

    What makes LightGBM light?

    Gradient boosting with LightGBM

    5.4 LightGBM in practice

    Learning rate

    Early stopping

    Custom loss functions

    5.5 Case study: Document retrieval

    The LETOR data set

    Document retrieval with LightGBM

    6 Sequential ensembles: Newton boosting

    6.1 Newton’s method for minimization

    Newton’s method with an illustrative example

    Newton’s descent over loss functions for training

    6.2 Newton boosting: Newton’s method + boosting

    Intuition: Learning with weighted residuals

    Intuition: Learning with regularized loss functions

    Implementing Newton boosting

    6.3 XGBoost: A framework for Newton boosting

    What makes XGBoost extreme?

    Newton boosting with XGBoost

    6.4 XGBoost in practice

    Learning rate

    Early stopping

    6.5 Case study redux: Document retrieval

    The LETOR data set

    Document retrieval with XGBoost

    Part 3 Ensembles in the wild: Adapting ensemble methods to your data

    7 Learning with continuous and count labels

    7.1 A brief review of regression

    Linear regression for continuous labels

    Poisson regression for count labels

    Logistic regression for classification labels

    Generalized linear models

    Nonlinear regression

    7.2 Parallel ensembles for regression

    Random forests and Extra Trees

    Combining regression models

    Stacking regression models

    7.3 Sequential ensembles for regression

    Loss and likelihood functions for regression

    Gradient boosting with LightGBM and XGBoost

    7.4 Case study: Demand forecasting

    The UCI Bike Sharing data set

    GLMs and stacking

    Random forest and Extra Trees

    XGBoost and LightGBM

    8 Learning with categorical features

    8.1 Encoding categorical features

    Types of categorical features

    Ordinal and one-hot encoding

    Encoding with target statistics

    The category_encoders package

    8.2 CatBoost: A framework for ordered boosting

    Ordered target statistics and ordered boosting

    Oblivious decision trees

    CatBoost in practice

    8.3 Case study: Income prediction

    Adult Data Set

    Creating preprocessing and modeling pipelines

    Category encoding and ensembling

    Ordered encoding and boosting with CatBoost

    8.4 Encoding high-cardinality string features

    9 Explaining your ensembles

    9.1 What is interpretability?

    Black-box vs. glass-box models

    Decision trees (and decision rules)

    Generalized linear models

    9.2 Case study: Data-driven marketing

    Bank Marketing data set

    Training ensembles

    Feature importances in tree ensembles

    9.3 Black-box methods for global explainability

    Permutation feature importance

    Partial dependence plots

    Global surrogate models

    9.4 Black-box methods for local explainability

    Local surrogate models with LIME

    Local interpretability with SHAP

    9.5 Glass-box ensembles: Training for interpretability

    Explainable boosting machines

    EBMs in practice

    epilogue

    E.1 Further reading

    Practical ensemble methods

    Theory and foundations of ensemble methods

    E.2 A few more advanced topics

    Ensemble methods for statistical relational learning

    Ensemble methods for deep learning

    E.3 Thank you!

    index

    front matter

    preface

Once upon a time, I was a graduate student, adrift and rudderless in an ocean of unfulfilling research directions and uncertain futures. Then I stumbled upon a remarkable article titled "Support Vector Machines: Hype or Hallelujah?" This being the early 2000s, support vector machines (SVMs) were, of course, the preeminent machine-learning technique of the time.

    In the article, the authors (one of whom would later become my PhD advisor) took a rather reductionist approach to explaining the considerably complex topic of SVMs, interleaving intuition and geometry with theory and application. The article made a powerful impression on me, at once igniting a lifelong fascination with machine learning and an obsession with understanding how such methods work under the hood. Indeed, the title of the first chapter pays homage to that paper that had so profound an influence over my life.

    Much like SVMs then, ensemble methods are widely considered a preeminent machine-learning technique today. But what many people don’t realize is that some ensemble method or another has always been considered state of the art over the decades: bagging in the 1990s, random forests and boosting in the 2000s, gradient boosting in the 2010s, and XGBoost in the 2020s. In the ever-mutable world of the best machine-learning models, ensemble methods, it seems, are indeed worth the hype.

    I’ve been fortunate to spend a good deal of the past decade training many kinds of ensemble models, making industry applications out of them, and writing academic research papers on them. In this book, I try to showcase as many of these ensemble methods as possible: some that you’ve definitely heard of and some new ones that you should really hear about.

    This book was never intended to be just a tutorial with step-by-step instructions and cut-and-paste code (although you can use it that way, too). There are dozens of such fantastic tutorials on the web, and they can get you going on your data set in an instant. Instead, I talk about each new method using an immersive approach inspired by that first machine-learning paper I ever read and refined in college classrooms during my time as a graduate lecturer.

    I’ve always felt that to understand a technical topic deeply, it helps to strip it down, take it apart, and try to put it back together again. I adopt the same approach in this book: we’ll take ensemble methods apart and (re)create them ourselves. We’ll tweak them and poke them to see how they change. And, in doing so, we’ll see exactly what makes them tick!

    I hope this book will be helpful in demystifying those technical and algorithmic details and get you into the ensemble mindset, be it for your class project, Kaggle competition, or production-quality application.

    acknowledgments

    I never thought that a book on ensemble methods would itself turn into an ensemble effort of family and friends, colleagues, and collaborators, all of whom had a lot to do with this book, from conception to completion.

    To Brian Sawyer, who let me pitch the idea of this book, for believing in this project, for being patient, and for keeping me on track: thank you for giving me this opportunity to do this thing that I’ve always wanted to do.

    To my first development editor, Katherine Olstein, second development editor, Karen Miller, and technical development editor, Alain Couniot: I had a vision for what this book would look like when I started, and you helped make it better. Thank you for the hours and days of meticulous reviews, for your eagle-eyed edits, and for challenging me always to be a better writer. Your efforts have much to do with the final quality of this book.

    To Manish Jain: thank you for painstakingly proofreading the code line by line. To Marija Tudor: thank you for designing this absolutely fantastic cover (which I still think is the best part of this book), for making it orange at my request, and for typesetting it from cover to cover. To the proofing and production team at Manning: thank you for your exceptional craft—this book looks perfect—review editor Mihaela Batinic, production editor Kathleen Rossland, copy editor Julie McNamee, and proofreader Katie Tennant.

    To my reviewers, Al Krinker, Alain Lompo, Biswanath Chowdhury, Chetan Saran Mehra, Eric Platon, Gustavo A. Patino, Joaquin Beltran, Lucian Mircea Sasu, Manish Jain, McHugson Chambers, Ninoslav Cerkez, Noah Flynn, Oliver Korten, Or Golan, Peter V. Henstock, Philip Best, Sergio Govoni, Simon Seyag, Stephen John Warnett, Subhash Talluri, Todd Cook, and Xiangbo Mao: thank you for your fabulous feedback and some truly terrific insights and comments. I tried to take in all of your advice (I really did), and much of it has worked its way into the book.

    To the readers who read the book during early access and who left many comments, corrections, and words of encouragement—you know who you are—thank you for the support!

    To my mentors, Kristin Bennett, Jong-Shi Pang, Jude Shavlik, Sriraam Natarajan, and Maneesh Singh, who have each shaped my thinking profoundly at different stages of my journey as a student, postdoc, professor, and professional: thank you for teaching me how to think in machine learning, how to speak machine learning, and how to build with machine learning. Much of your wisdom and many of your lessons endure in this book. And Kristin, I hope you like the title of the first chapter.

    To Jenny and Guilherme de Oliveira, for your friendship over the years, but especially during the great pandemic, when much of this book was written: thank you for keeping me sane. I will always treasure our afternoons and evenings in that summer and fall of 2020, tucked away in your little backyard, our pod and sanctuary.

    To my parents, Vijaya and Shivakumar, and my brother, Anupam: thank you for always believing in me, and for always supporting me, even from tens of thousands of miles away. I know you’re proud of me. This book is finally finished, and now we can do all those other things we’re always talking about . . . until I start writing the next one, anyway.

    To my wife, best friend, and biggest champion, Kristine: you’ve been an inexhaustible source of comfort and encouragement, especially when things got tough. Thank you for bouncing ideas with me, for proofreading with me, for the tea and snacks, for the Gus, for sacrificing all those weekends (and, sometimes, weeknights) when I was writing. Thank you for hanging in there with me, for always being there for me, and for never once doubting that I could do this. I love you!

    about this book

    There has never been a better time to learn about ensemble methods. The models covered in this book fall into three broad categories:

    Foundational ensemble methods—The classics that everyone has heard of, including historical ensemble techniques such as bagging, random forests, and AdaBoost

    State-of-the-art ensemble methods—The tried and tested powerhouses of the modern ensemble era that form the core of many real-world, in-production prediction, recommendation, and search systems

    Emerging ensemble methods—The latest methods fresh out of the research foundries to handle new needs and emerging priorities such as explainability and interpretability

    Each chapter will introduce a different ensembling technique, using a three-pronged approach. First, you’ll learn the intuition behind each ensemble method by visualizing step by step how learning actually takes place. Second, you’ll implement a basic version of each ensemble method yourself to fully understand the algorithmic nuts and bolts. Third, you’ll learn how to apply powerful ensemble libraries and tools practically.

    Most chapters also come with their own case study on real-world data, drawn from applications such as handwritten digit prediction, recommendation systems, sentiment analysis, demand forecasting, and others. These case studies tackle several real-world issues where appropriate, including preprocessing and feature engineering, hyperparameter selection, efficient training techniques, and effective model evaluation.

    Who should read this book

    This book is intended for a broad audience:

    Data scientists who are interested in using ensemble methods to get the best out of their data for real-world applications

    MLOps and DataOps engineers who are building, evaluating, and deploying ensemble-based, production-ready applications and pipelines

    Students of data science and machine learning who want to use this book as a learning resource or as a practical reference to supplement textbooks

    Kagglers and data science enthusiasts who can use this book as an entry point into learning about the endless modeling possibilities with ensemble methods

    This book is not an introduction to machine learning and data science. This book assumes that you have some basic working knowledge of machine learning and that you’ve used or played around with at least one fundamental learning technique (e.g., decision trees).

    A basic working knowledge of Python is also assumed. Examples, visualizations, and chapter case studies all use Python and Jupyter Notebooks. Knowledge of other commonly used Python packages such as NumPy (for mathematical computations), pandas (for data manipulation), and Matplotlib (for visualization) is useful, but not necessary. In fact, you can learn how to use these packages through the examples and case studies.

    How this book is organized: A road map

    This book is organized into nine chapters in three parts. Part 1 is a gentle introduction to ensemble methods, part 2 introduces and explains several essential ensemble methods, and part 3 covers advanced topics.

    Part 1, The basics of ensembles, introduces ensemble methods and why you should care about them. This part also contains a road map of ensemble methods covered in the rest of the book:

    Chapter 1 discusses ensemble methods and basic ensemble terminology. It also introduces the fit-versus-complexity tradeoff (or the bias-variance tradeoff, as it’s more formally called). You’ll build your very first ensemble in this chapter.

    Part 2, Essential ensemble methods, covers several important families of ensemble methods, many of which are considered essential and are widely used in real-world applications. In each chapter, you’ll learn how to implement different ensemble methods from scratch, how they work, and how to apply them to real-world problems:

    Chapter 2 begins our journey with parallel ensemble methods, specifically, parallel homogeneous ensembles. Ensemble methods covered include bagging, random forests, pasting, random subspaces, random patches, and Extra Trees.

    Chapter 3 continues the journey with more parallel ensembles, but the focus in this chapter is on parallel heterogeneous ensembles. Ensemble methods covered include combining base models by majority voting, combining by weighting, prediction fusion with Dempster-Shafer, and meta-learning by stacking.

    Chapter 4 introduces another family of ensemble methods—sequential adaptive ensembles—in particular, the fundamental concept of boosting many weak models into one powerful model. Ensemble methods covered include AdaBoost and LogitBoost.

    Chapter 5 builds on the foundational concepts of boosting and covers another fundamental sequential ensemble method, gradient boosting, which combines gradient descent with boosting. This chapter discusses how we can train gradient-boosting ensembles with scikit-learn and LightGBM.

    Chapter 6 continues to explore sequential ensemble methods with Newton boosting, an efficient and effective extension of gradient boosting that combines Newton’s descent with boosting. This chapter discusses how we can train Newton boosting ensembles with XGBoost.

    Part 3, Ensembles in the wild: Adapting ensemble methods to your data, shows you how to apply ensemble methods to many scenarios, including data sets with continuous and count-valued labels and data sets with categorical features. You’ll also learn how to interpret your ensembles and explain their predictions:

    Chapter 7 shows how we can train ensembles for different types of regression problems and generalized linear models, where training labels are continuous- or count-valued. Parallel and sequential ensembles for linear regression, Poisson regression, gamma regression, and Tweedie regression are covered.

    Chapter 8 identifies challenges in learning with nonnumeric features, specifically, categorical features, and encoding schemes that will help us train effective ensembles for this kind of data. This chapter also discusses two important practical issues: data leakage and prediction shift. Finally, we’ll see how to overcome these issues with ordered boosting and CatBoost.

    Chapter 9 covers the newly emerging and very important topic of explainable AI from the perspective of ensemble methods. This chapter introduces the notion of explainability and why it’s important. Several common black-box explainability methods are also discussed, including permutation feature importance, partial dependence plots, surrogate methods, Locally Interpretable Model-Agnostic Explanation, Shapley values, and SHapley Additive exPlanations. The glass-box ensemble method, explainable boosting machines, and the InterpretML package are also introduced.

    The epilogue concludes our journey with additional topics for further exploration and reading.

    While most of the chapters in the book can reasonably be read in a standalone manner, chapters 7, 8, and 9 build on part 2 of the book.

    About the code

    All the code and examples in this book are written in Python 3. The code is organized into Jupyter Notebooks and is available in an online GitHub repository (https://github.com/gkunapuli/ensemble-methods-notebooks) and for download from the Manning website (www.manning.com/books/ensemble-methods-for-machine-learning). You can get executable snippets of code from the liveBook (online) version of this book at https://livebook.manning.com/book/ensemble-methods-for-machine-learning.

Several Python scientific and visualization libraries are also used, including NumPy (https://numpy.org/), SciPy (https://scipy.org/), pandas (https://pandas.pydata.org/), and Matplotlib (https://matplotlib.org/). The code also uses several Python machine-learning and ensemble-method libraries, including scikit-learn (https://scikit-learn.org/stable/), LightGBM (https://lightgbm.readthedocs.io/), XGBoost (https://xgboost.readthedocs.io/), CatBoost (https://catboost.ai/), and InterpretML (https://interpret.ml/).

    This book contains many examples of source code both in numbered listings and in line with normal text. In both cases, source code is formatted in a fixed-width font like this to separate it from ordinary text. In many cases, the original source code has been reformatted; we’ve added line breaks and reworked indentation to accommodate the available page space in the book. Additionally, comments in the source code have often been removed from the listings when the code is described in the text. Code annotations accompany many of the listings, highlighting important concepts.

    liveBook discussion forum

    Purchase of Ensemble Methods for Machine Learning includes free access to liveBook, Manning’s online reading platform. Using liveBook’s exclusive discussion features, you can attach comments to the book globally or to specific sections or paragraphs. It’s a snap to make notes for yourself, ask and answer technical questions, and receive help from the author and other users. To access the forum, go to https://livebook.manning.com/book/ensemble-methods-for-machine-learning/discussion. You can also learn more about Manning’s forums and the rules of conduct at https://livebook.manning.com/discussion.

    Manning’s commitment to our readers is to provide a venue where a meaningful dialogue between individual readers and between readers and the author can take place. It’s not a commitment to any specific amount of participation on the part of the author, whose contribution to the forum remains voluntary (and unpaid). We suggest you try asking the author some challenging questions lest his interest stray! The forum and the archives of previous discussions will be accessible from the publisher’s website as long as the book is in print.

    about the author


Gautam Kunapuli has more than 15 years of experience in both academia and the machine-learning industry. His work focuses on human-in-the-loop learning, knowledge-based and advice-taking learning algorithms, and scalable learning for difficult machine-learning problems. Gautam has developed several novel algorithms for diverse application domains, including social network analysis, text and natural language processing, computer vision, behavior mining, educational data mining, insurance and financial analytics, and biomedical applications. He has also published papers exploring ensemble methods in relational domains and with imbalanced data.

    about the cover illustration

The figure on the cover of Ensemble Methods for Machine Learning is "Huonv ou Musiciene Chinoise," or "Huonv, or Chinese musician," taken from a collection by Jacques Grasset de Saint-Sauveur, published in 1788. Each illustration is finely drawn and colored by hand.

    In those days, it was easy to identify where people lived and what their trade or station in life was just by their dress. Manning celebrates the inventiveness and initiative of the computer business with book covers based on the rich diversity of regional culture centuries ago, brought back to life by pictures from collections such as this one.

    Part 1 The basics of ensembles

    You’ve probably heard a lot about random forests, XGBoost, or gradient boosting. Someone always seems to be using one or another of these to build cool applications or win Kaggle competitions. Have you ever wondered what this fuss is all about?

    The fuss, it turns out, is all about ensemble methods, a powerful machine-learning paradigm that has found its way into all kinds of applications in health care, finance, insurance, recommendation systems, search, and a lot of other areas.

    This book will introduce you to the wide world of ensemble methods, and this part will get you going. To paraphrase the incomparable Julie Andrews from The Sound of Music,

    Let’s start at the very beginning,

    A very good place to start.

    When you read, you begin with A-B-C.

    When you ensemble, you begin with fit-versus-complexity.

    The first part of this book will gently introduce ensemble methods with a bit of intuition and a bit of theory on fit versus complexity (or the bias-variance tradeoff, as it’s more formally called). You’ll then build your very first ensemble from scratch.

    When you’re finished with this part of the book, you’ll understand why ensemble models are often better than individual models and why you should care about them.

    1 Ensemble methods: Hype or hallelujah?

    This chapter covers

    Defining and framing the ensemble learning problem

    Motivating the need for ensembles in different applications

    Understanding how ensembles handle fit versus complexity

    Implementing our first ensemble with ensemble diversity and model aggregation

In October 2006, Netflix announced a $1 million prize for the team that could improve movie recommendations by 10% over Netflix's own proprietary recommendation system, CineMatch. The Netflix Grand Prize was one of the first-ever open data science competitions and attracted tens of thousands of teams.

    The training set consisted of 100 million ratings that 480,000 users had given to 17,000 movies. Within three weeks, 40 teams had already beaten CineMatch’s results. By September 2007, more than 40,000 teams had entered the contest, and a team from AT&T Labs took the 2007 Progress Prize by improving upon CineMatch by 8.42%.

    As the competition progressed with the 10% mark remaining elusive, a curious phenomenon emerged among the competitors. Teams began to collaborate and share knowledge about effective feature engineering, algorithms, and techniques. Inevitably, they began combining their models, blending individual approaches into powerful and sophisticated ensembles of many models. These ensembles combined the best of various diverse models and features, and they proved to be far more effective than any individual model.

In June 2009, nearly three years after the contest began, BellKor's Pragmatic Chaos, a merger of three different teams, edged out another merged team, The Ensemble (itself a merger of more than 30 teams!), to improve on the baseline by 10% and take the $1 million prize. "Edged out" is a bit of an understatement, as BellKor's Pragmatic Chaos managed to submit their final models barely 20 minutes before The Ensemble got theirs in (http://mng.bz/K08O). In the end, both teams achieved a final performance improvement of 10.06%.

    While the Netflix competition captured the imagination of data scientists, machine learners, and casual data science enthusiasts worldwide, its lasting legacy has been to establish ensemble methods as a powerful way to build practical and robust models for large-scale, real-world applications. Among the individual algorithms used are several that have become staples of collaborative filtering and recommendation systems today: k-nearest neighbors, matrix factorization, and restricted Boltzmann machines. However, Andreas Töscher and Michael Jahrer of BigChaos, co-winners of the Netflix prize, summed up¹ their keys to success:

    During the nearly 3 years of the Netflix competition, there were two main factors which improved the overall accuracy: the quality of the individual algorithms and the ensemble idea. . . . The ensemble idea was part of the competition from the beginning and evolved over time. In the beginning, we used different models with different parametrization and a linear blending. . . . [Eventually] the linear blend was replaced by a nonlinear one.

    In the years since, the use of ensemble methods has exploded, and they have emerged as a state-of-the-art technology for machine learning.

    The next two sections provide a gentle introduction to what ensemble methods are, why they work, and where they are applied. Then, we’ll look at a subtle but important challenge prevalent in all machine-learning algorithms: the fit versus complexity tradeoff.

    Finally, we jump into training our very first ensemble method for a hands-on view of how ensemble methods overcome this fit versus complexity tradeoff and improve overall performance. Along the way, you’ll become familiar with several key terms that form the lexicon of ensemble methods and will be used throughout the book.

    1.1 Ensemble methods: The wisdom of the crowds

    What exactly is an ensemble method? Let’s get an intuitive idea of ensemble methods and how they work by considering the allegorical case of Dr. Randy Forrest. We can then go on to frame the ensemble learning problem.

    Dr. Randy Forrest is a famed and successful diagnostician, much like his idol Dr. Gregory House of TV fame. His success, however, is due not only to his exceeding politeness (unlike his cynical and curmudgeonly idol) but also his rather unusual approach to diagnosis.

    You see, Dr. Forrest works at a teaching hospital and commands the respect of a large number of doctors-in-training. Dr. Forrest has taken care to assemble a team with a diversity of skills (this is pretty important, and we’ll see why shortly). His residents excel at different specializations: one is good at cardiology (heart), another at pulmonology (lungs), yet another at neurology (nervous system), and so on. All in all, the group is a rather diversely skillful bunch, each with their own strengths.

    Every time Dr. Forrest gets a new case, he solicits the opinions of his residents and collects possible diagnoses from all of them (see figure 1.1). He then democratically selects the final diagnosis as the most common one from among all those proposed.


    Figure 1.1 The diagnostic procedure followed by Dr. Randy Forrest every time he gets a new case is to ask all of his residents their opinions of the case. His residents offer their diagnoses: either the patient does or does not have cancer. Dr. Forrest then selects the majority answer as the final diagnosis put forth by his team.

Dr. Forrest embodies a diagnostic ensemble: he aggregates his residents’ diagnoses into a single diagnosis representative of the collective wisdom of his team. As it turns out, Dr. Forrest is right more often than any individual resident because he knows that his residents are pretty smart, and a large number of pretty smart residents are unlikely to all make the same mistake. Here, Dr. Forrest relies on the power of model aggregation, or model averaging: he knows that the average answer is most likely going to be a good one.

    Still, how does Dr. Forrest know that all his residents aren’t wrong? He can’t know that for sure, of course. However, he has guarded against this undesirable outcome all the same. Remember that his residents all have diverse specializations. Because of their diverse backgrounds, training, specialization, and skills, it’s possible, but highly unlikely, that all his residents are wrong. Here, Dr. Forrest relies on the power of ensemble diversity, or the diversity of the individual components of his ensemble.

    Dr. Randy Forrest, of course, is an ensemble method, and his residents (who are in training) are the machine-learning algorithms that make up the ensemble. The secrets to his success, and indeed the success of ensemble methods as well, are

    Ensemble diversity—He has a variety of opinions to choose from.

    Model aggregation—He can combine those opinions into a single final opinion.

    Any collection of machine-learning algorithms can be used to build an ensemble, which is, literally, a group of machine learners. But why do they work? James Surowiecki, in The Wisdom of Crowds, describes human ensembles or wise crowds thus:

    If you ask a large enough group of diverse and independent people to make a prediction or estimate a probability, the average of those answers will cancel out errors in individual estimation. Each person’s guess, you might say, has two components: information and errors. Subtract the errors, and you’re left with the information.

    This is also precisely the intuition behind ensembles of learners: it’s possible to build a wise machine-learning ensemble by aggregating individual learners.
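To make this intuition concrete, here is a minimal sketch of that idea in code. It is not one of the book's listings: the toy two-moons data set and the three particular base learners are illustrative choices of ours. Three diverse classifiers are trained on the same data, and their predictions are aggregated by a simple majority vote.

```python
# A minimal sketch of the "wisdom of crowds" idea: train three diverse
# classifiers and aggregate their test predictions by majority vote.
import numpy as np
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

X, y = make_moons(n_samples=500, noise=0.3, random_state=42)
Xtrn, Xtst, ytrn, ytst = train_test_split(X, y, random_state=42)

# Three diverse "residents": different algorithms, different inductive biases
learners = [DecisionTreeClassifier(max_depth=3, random_state=42),
            KNeighborsClassifier(n_neighbors=5),
            GaussianNB()]

preds = []
for learner in learners:
    learner.fit(Xtrn, ytrn)
    ypred = learner.predict(Xtst)
    preds.append(ypred)
    print(type(learner).__name__, accuracy_score(ytst, ypred))

# Aggregate by majority vote: with 0/1 labels and three voters,
# a mean of at least 0.5 means at least two learners voted 1
votes = np.stack(preds)
majority = (votes.mean(axis=0) >= 0.5).astype(int)
print('Majority vote', accuracy_score(ytst, majority))
```

On noisy data like this, the majority vote will typically match or beat the weakest individual learners, which is exactly the error-canceling effect Surowiecki describes.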

    Ensemble methods

    Formally, an ensemble method is a machine-learning algorithm that aims to improve predictive performance on a task by aggregating the predictions of multiple estimators or models. In this manner, an ensemble method learns a meta-estimator.

    The key to success with ensemble methods is ensemble diversity, also known by alternate terms such as model complementarity or model orthogonality. Informally, ensemble diversity refers to the fact that individual ensemble components, or machine-learning models, are different from each other. Training such ensembles of diverse individual models is a key challenge in ensemble learning, and different ensemble methods achieve this in different ways.
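scikit-learn ships exactly this kind of meta-estimator off the shelf. The snippet below is a hedged illustration rather than a listing from the book: the breast cancer data set, the choice of base estimators, and their parameters are our own, but the VotingClassifier API shown is the library's standard way of aggregating diverse models.

```python
# Building a simple meta-estimator with scikit-learn's VotingClassifier;
# the data set and base estimators here are illustrative choices.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
Xtrn, Xtst, ytrn, ytst = train_test_split(X, y, random_state=42)

# A meta-estimator that aggregates three diverse base estimators
ensemble = VotingClassifier(
    estimators=[('lr', make_pipeline(StandardScaler(),
                                     LogisticRegression(max_iter=1000))),
                ('svm', SVC(probability=True, random_state=42)),
                ('tree', DecisionTreeClassifier(max_depth=5, random_state=42))],
    voting='soft')  # 'soft' averages class probabilities; 'hard' takes a majority vote

ensemble.fit(Xtrn, ytrn)
print('ensemble test accuracy:', ensemble.score(Xtst, ytst))
```

The meta-estimator behaves like any other scikit-learn estimator: fit once, and it trains and aggregates all of its components for you.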

    1.2 Why you should care about ensemble learning

    What can you do with ensemble methods? Are they really just hype, or are they hallelujah? As we see in this section, they can be used to train and deploy robust and effective predictive models for many different applications.

    One palpable success of ensemble methods is their domination of data science competitions (alongside deep learning), where they have been generally successful on different types of machine-learning tasks and application areas.

    Anthony Goldbloom, CEO of Kaggle, revealed in 2015 that the three most successful algorithms for structured problems were XGBoost, random forest, and gradient boosting, all ensemble methods. Indeed, the most popular way to tackle data science competitions these days is to combine feature engineering with ensemble methods. Structured data is generally organized in tables, relational databases, and other formats most of us are familiar with, and ensemble methods have proven to be very successful on this type of data.

    Unstructured data, in contrast, doesn’t always have a tabular structure. Images, audio, video, waveform, and text data are typically unstructured, and deep learning approaches—including automated feature generation—have been very successful on these types of data. While we focus on structured data for most of this book, ensemble methods can be combined with deep learning for unstructured problems as well.

    Beyond competitions, ensemble methods drive data science in several areas, including financial and business analytics, medicine and health care, cybersecurity, education, manufacturing, recommendation systems, entertainment, and many more.

    In 2018, Olson et al.² conducted a comprehensive analysis of 14 popular machine-learning algorithms and their variants. They ranked each algorithm’s performance on 165 classification benchmark data sets. Their goal was to emulate the standard machine-learning pipeline to provide advice on how to select a machine-learning algorithm.

These comprehensive results are compiled into figure 1.2. Each row shows how often one model outperforms other models across all 165 data sets. For example, XGBoost beats gradient boosting on 34 of 165 benchmark data sets (first row, second column), while gradient boosting beats XGBoost on 12 of 165 benchmark data sets (second row, first column). On the remaining 119 data sets, the two models perform equally well (their prediction accuracies are within 1% of each other).


    Figure 1.2 Which machine-learning algorithm should I use for my data set? The performance of several different machine-learning algorithms, relative to each other on 165 benchmark data sets, is shown here. The final trained models are ranked (top-to-bottom, left-to-right) based on their performance on all benchmark data sets in relation to all other methods. In their evaluation, Olson et al. consider two methods to have the same performance on a data set if their prediction accuracies are within 1% of each other. This figure was reproduced using the codebase and comprehensive experimental results compiled by the authors into a publicly available GitHub repository (https://github.com/rhiever/sklearn-benchmarks) and includes the authors’ evaluation of XGBoost as well.

    In contrast, XGBoost beats multinomial naïve Bayes (MNB) on 157 of 165 data sets (first row, last column), while MNB only beats XGBoost on 2 of 165 data sets (last row, first column) and can only match XGBoost on 6 of 165 data sets!

    In general, ensemble methods (1: XGBoost, 2: gradient boosting, 3: Extra Trees, 4: random forests, 8: AdaBoost) outperformed other methods handily. These results demonstrate exactly why ensemble methods (specifically, tree-based ensembles) are considered state of the art.

    If your goal is to develop state-of-the-art analytics from your data, or to eke out better performance and improve models you already have, this book is for you. If your goal is to start competing more effectively in data science competitions for fame and fortune or to just improve your data science skills, this book is also for you. If you’re excited about adding powerful ensemble methods to your machine-learning arsenal, this book is definitely for you.

    To drive home this point, we’ll build our first ensemble method: a simple model combination ensemble. Before we do, let’s dive into the tradeoff between fit and complexity that most machine-learning methods have to grapple with, as it will help us understand why ensemble methods are so effective.

    1.3 Fit vs. complexity in individual models

    In this section, we look at two popular machine-learning methods: decision trees and support vector machines (SVMs). As we do so, we’ll explore how their fitting and predictive behavior changes as they learn increasingly complex models. This section also serves as a refresher of the training and evaluation practices we usually follow during modeling.
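Before that refresher, here is a small preview of the kind of experiment this section walks through. The setup below is our own illustrative sketch, not one of the book's case studies: it fits decision trees of increasing depth to a noisy synthetic regression problem and compares training and test error. Training error keeps shrinking as the model grows more complex, while test error eventually climbs.

```python
# Illustrative fit-vs-complexity experiment: deeper trees fit the training
# data better, but past some depth the test error starts to rise (overfitting).
import numpy as np
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.RandomState(42)
X = np.sort(rng.uniform(-3, 3, size=(300, 1)), axis=0)
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=300)  # noisy sine wave

Xtrn, Xtst, ytrn, ytst = train_test_split(X, y, random_state=42)

for depth in [1, 2, 4, 8, 16]:
    tree = DecisionTreeRegressor(max_depth=depth, random_state=42)
    tree.fit(Xtrn, ytrn)
    trn_err = mean_squared_error(ytrn, tree.predict(Xtrn))
    tst_err = mean_squared_error(ytst, tree.predict(Xtst))
    print(f'depth={depth:2d}  train MSE={trn_err:.3f}  test MSE={tst_err:.3f}')
```

The same pattern appears with SVMs as their regularization is relaxed; keeping this tradeoff in mind makes it easier to see what ensembles buy us later.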

    Machine-learning tasks are typically

    Supervised learning tasks—These have a data set of labeled examples, where data has been annotated. For
