Predictive Analytics and Machine Learning for Managers

Ebook547 pages6 hours

Predictive Analytics and Machine Learning for Managers

Name: Predictive Analytics and Machine Learning for Managers
Author: J. Alberto Espinosa
ISBN: 9798987654309

By J. Alberto Espinosa

Rating: 0 out of 5 stars

()

Read preview

About this ebook

This book was written by the architect of two MS Analytics programs and one undergraduate specialization in Business Analytics, with over a decade of experience teaching and practicing predictive analytics, and co-chairing premier academic conference mini-track in this field. The author's goal is to provide strong but understandable conceptual foundations and practical material for graduate students and managers, describing how to frame a business question, identify various model specification (i.e., feature engineering) and model methods (explainable and black box), select the optimal model based on the bias, variance, and cross-validation testing, and interpret results with meaningful storytelling for clients and managers. The book contains two components: (1) the main text with two sections—one with conceptual, mathematical, and managerial foundations, the other about advanced predictive modeling methods based on machine learning. The main text is further subdivided into two sections—Section 1 contains basic fundamentals of statistics and predictive modeling; Section 2 provides a deeper discussion of machine learning and advance predictive modeling approaches based on machine learning and cross-validation methods; and (2) a free appendix companion with annotated R Markdown code with hands-on applications, posted in GitHub.

Skip carousel

LanguageEnglish

PublisherJibe4Fun Press

Release dateApr 20, 2023

ISBN9798987654309

Author

J. Alberto Espinosa

Related authors

Skip carousel

Related to Predictive Analytics and Machine Learning for Managers

Related ebooks

Skip carousel

Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
Ebook
Smarter Data Science: Succeeding with Enterprise-Grade Data and AI Projects
byNeal Fishman
Rating: 0 out of 5 stars
0 ratings
Operating AI: Bridging the Gap Between Technology and Business
Ebook
Operating AI: Bridging the Gap Between Technology and Business
byUlrika Jagare
Rating: 0 out of 5 stars
0 ratings
Machine Learning: A Comprehensive, Step-by-Step Guide to Learning and Understanding Machine Learning Concepts, Technology and Principles for Beginners: 1
Ebook
Machine Learning: A Comprehensive, Step-by-Step Guide to Learning and Understanding Machine Learning Concepts, Technology and Principles for Beginners: 1
byPeter Bradley
Rating: 0 out of 5 stars
0 ratings
Machine Learning for Finance
Ebook
Machine Learning for Finance
bySaurav Singla
Rating: 0 out of 5 stars
0 ratings
Thriving in a Data World: A Guide for Leaders and Managers
Ebook
Thriving in a Data World: A Guide for Leaders and Managers
bySangeeta Krishnan
Rating: 0 out of 5 stars
0 ratings
Combining DataOps, MLOps and DevOps: Outperform Analytics and Software Development with Expert Practices on Process Optimization and Automation
Ebook
Combining DataOps, MLOps and DevOps: Outperform Analytics and Software Development with Expert Practices on Process Optimization and Automation
byDr. Kalpesh Parikh
Rating: 0 out of 5 stars
0 ratings
Leading with AI and Analytics: Build Your Data Science IQ to Drive Business Value
Ebook
Leading with AI and Analytics: Build Your Data Science IQ to Drive Business Value
byEric Anderson
Rating: 0 out of 5 stars
0 ratings
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
Ebook
Capitalizing Data Science: A Guide to Unlocking the Power of Data for Your Business and Products (English Edition)
byMathangi Sri Ramachandran
Rating: 0 out of 5 stars
0 ratings
Essentials of Deep Learning and AI: Experience Unsupervised Learning, Autoencoders, Feature Engineering, and Time Series Analysis with TensorFlow, Keras, and scikit-learn
Ebook
Essentials of Deep Learning and AI: Experience Unsupervised Learning, Autoencoders, Feature Engineering, and Time Series Analysis with TensorFlow, Keras, and scikit-learn
byShashidhar Soppin
Rating: 0 out of 5 stars
0 ratings
Mastering Machine Learning: A Comprehensive Guide to Success
Ebook
Mastering Machine Learning: A Comprehensive Guide to Success
byRick Spair
Rating: 0 out of 5 stars
0 ratings
Data Science: What the Best Data Scientists Know About Data Analytics, Data Mining, Statistics, Machine Learning, and Big Data – That You Don't
Ebook
Data Science: What the Best Data Scientists Know About Data Analytics, Data Mining, Statistics, Machine Learning, and Big Data – That You Don't
byHerbert Jones
Rating: 5 out of 5 stars
5/5
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
Ebook
Practical Full Stack Machine Learning: A Guide to Build Reliable, Reusable, and Production-Ready Full Stack ML Solutions
byAlok Kumar
Rating: 0 out of 5 stars
0 ratings
Fun with Machine Learning: Simplify the Data Science process by automating repetitive and complex tasks using AutoML (English Edition)
Ebook
Fun with Machine Learning: Simplify the Data Science process by automating repetitive and complex tasks using AutoML (English Edition)
byArockia Liborious
Rating: 0 out of 5 stars
0 ratings
Operationalizing Machine Learning Pipelines: Building Reusable and Reproducible Machine Learning Pipelines Using MLOps
Ebook
Operationalizing Machine Learning Pipelines: Building Reusable and Reproducible Machine Learning Pipelines Using MLOps
byVishwajyoti Pandey
Rating: 0 out of 5 stars
0 ratings
Data Science Fundamentals and Practical Approaches: Understand Why Data Science Is the Next
Ebook
Data Science Fundamentals and Practical Approaches: Understand Why Data Science Is the Next
byRupam Kumar Sharma
Rating: 0 out of 5 stars
0 ratings
Applied Machine Learning Solutions with Python: Production-ready ML Projects Using Cutting-edge Libraries and Powerful Statistical Techniques (English Edition)
Ebook
Applied Machine Learning Solutions with Python: Production-ready ML Projects Using Cutting-edge Libraries and Powerful Statistical Techniques (English Edition)
bySiddhanta Bhatta
Rating: 0 out of 5 stars
0 ratings
The Freelance Data Scientist and Big Data Analyst: Freelance Jobs and Their Profiles, #3
Ebook
The Freelance Data Scientist and Big Data Analyst: Freelance Jobs and Their Profiles, #3
byThe Gig Economist
Rating: 5 out of 5 stars
5/5
Pragmatic Machine Learning with Python: Learn How to Deploy Machine Learning Models in Production
Ebook
Pragmatic Machine Learning with Python: Learn How to Deploy Machine Learning Models in Production
byAvishek Nag
Rating: 0 out of 5 stars
0 ratings
Introduction to Statistical and Machine Learning Methods for Data Science
Ebook
Introduction to Statistical and Machine Learning Methods for Data Science
byCarlos Andre Reis Pinheiro
Rating: 0 out of 5 stars
0 ratings
Getting Data Science Done: Managing Projects From Ideas to Products
Ebook
Getting Data Science Done: Managing Projects From Ideas to Products
by"John" "Hawkins"
Rating: 0 out of 5 stars
0 ratings
From Data To Profit: How Businesses Leverage Data to Grow Their Top and Bottom Lines
Ebook
From Data To Profit: How Businesses Leverage Data to Grow Their Top and Bottom Lines
byVin Vashishta
Rating: 0 out of 5 stars
0 ratings
The Datapreneurs: The Promise of AI and the Creators Building Our Future
Ebook
The Datapreneurs: The Promise of AI and the Creators Building Our Future
byBob Muglia
Rating: 0 out of 5 stars
0 ratings
Developing Analytic Talent: Becoming a Data Scientist
Ebook
Developing Analytic Talent: Becoming a Data Scientist
byVincent Granville
Rating: 3 out of 5 stars
3/5
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
Ebook
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
byKavita Ganesan
Rating: 0 out of 5 stars
0 ratings
No-Code Artificial Intelligence: The new way to build AI powered applications (English Edition)
Ebook
No-Code Artificial Intelligence: The new way to build AI powered applications (English Edition)
byAmbuj Agrawal
Rating: 1 out of 5 stars
1/5
Microsoft Azure AI: A Beginner’s Guide: Explore Azure Applied AI Services, Azure Cognitive Services and Azure Machine Learning with Practical Illustrations
Ebook
Microsoft Azure AI: A Beginner’s Guide: Explore Azure Applied AI Services, Azure Cognitive Services and Azure Machine Learning with Practical Illustrations
byRekha Kodali
Rating: 0 out of 5 stars
0 ratings
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
Ebook
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
byUtpal Chakraborty
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence in Business and Technology: Accelerate Transformation, Foster Innovation, and Redefine the Future
Ebook
Artificial Intelligence in Business and Technology: Accelerate Transformation, Foster Innovation, and Redefine the Future
byAD Al-Ghourabi
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence for Business: A Roadmap for Getting Started with AI
Ebook
Artificial Intelligence for Business: A Roadmap for Getting Started with AI
byJason L. Anderson
Rating: 0 out of 5 stars
0 ratings
Responsible AI in the Age of Generative Models: Governance, Ethics and Risk Management: Byte-Sized Learning Series
Ebook
Responsible AI in the Age of Generative Models: Governance, Ethics and Risk Management: Byte-Sized Learning Series
byI. Almeida
Rating: 0 out of 5 stars
0 ratings

Intelligence (AI) & Semantics For You

Skip carousel

ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
Ebook
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
byAlec Rowe
Rating: 0 out of 5 stars
0 ratings
AI for Educators: AI for Educators
Ebook
AI for Educators: AI for Educators
byMatt Miller
Rating: 5 out of 5 stars
5/5
Midjourney Mastery - The Ultimate Handbook of Prompts
Ebook
Midjourney Mastery - The Ultimate Handbook of Prompts
byAndreea Todinca
Rating: 5 out of 5 stars
5/5
101 Midjourney Prompt Secrets
Ebook
101 Midjourney Prompt Secrets
byMarcus Byrne
Rating: 3 out of 5 stars
3/5
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
byCea West
Rating: 4 out of 5 stars
4/5
ChatGPT For Dummies
Ebook
ChatGPT For Dummies
byPam Baker
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence: A Guide for Thinking Humans
Ebook
Artificial Intelligence: A Guide for Thinking Humans
byMelanie Mitchell
Rating: 4 out of 5 stars
4/5
Mastering ChatGPT: Unlock the Power of AI for Enhanced Communication and Relationships: English
Ebook
Mastering ChatGPT: Unlock the Power of AI for Enhanced Communication and Relationships: English
byVasyl Kolomiiets
Rating: 0 out of 5 stars
0 ratings
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
byCea West
Rating: 5 out of 5 stars
5/5
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
Ebook
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
byHadelin de Ponteves
Rating: 0 out of 5 stars
0 ratings
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
A Quickstart Guide To Becoming A ChatGPT Millionaire: The ChatGPT Book For Beginners (Lazy Money Series®)
Ebook
A Quickstart Guide To Becoming A ChatGPT Millionaire: The ChatGPT Book For Beginners (Lazy Money Series®)
byS M Howard
Rating: 4 out of 5 stars
4/5
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
Ebook
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
byThe Passive Income Strategist
Rating: 4 out of 5 stars
4/5
ChatGPT For Fiction Writing: AI for Authors
Ebook
ChatGPT For Fiction Writing: AI for Authors
byNova Leigh
Rating: 5 out of 5 stars
5/5
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
Ebook
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
byUtpal Chakraborty
Rating: 0 out of 5 stars
0 ratings
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
Ebook
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
byMatthew Hayes
Rating: 0 out of 5 stars
0 ratings
ChatGPT for Marketing: A Practical Guide
Ebook
ChatGPT for Marketing: A Practical Guide
byJuanjo Ramos
Rating: 3 out of 5 stars
3/5
Dancing with Qubits: How quantum computing works and how it can change the world
Ebook
Dancing with Qubits: How quantum computing works and how it can change the world
byRobert S. Sutor
Rating: 5 out of 5 stars
5/5
The Secrets of ChatGPT Prompt Engineering for Non-Developers
Ebook
The Secrets of ChatGPT Prompt Engineering for Non-Developers
byCea West
Rating: 5 out of 5 stars
5/5
ChatGPT
Ebook
ChatGPT
byRobert Conway
Rating: 1 out of 5 stars
1/5
Python Machine Learning - Third Edition: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd Edition
Ebook
Python Machine Learning - Third Edition: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd Edition
bySebastian Raschka
Rating: 5 out of 5 stars
5/5
TensorFlow in 1 Day: Make your own Neural Network
Ebook
TensorFlow in 1 Day: Make your own Neural Network
byKrishna Rungta
Rating: 4 out of 5 stars
4/5
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
Ebook
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
byMaximus Wilson
Rating: 0 out of 5 stars
0 ratings
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
Ebook
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
byKavita Ganesan
Rating: 0 out of 5 stars
0 ratings
Ways of Being: Animals, Plants, Machines: The Search for a Planetary Intelligence
Ebook
Ways of Being: Animals, Plants, Machines: The Search for a Planetary Intelligence
byJames Bridle
Rating: 4 out of 5 stars
4/5
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
Ebook
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
byTJ Books
Rating: 3 out of 5 stars
3/5
What Makes Us Human: An Artificial Intelligence Answers Life's Biggest Questions
Ebook
What Makes Us Human: An Artificial Intelligence Answers Life's Biggest Questions
byJasmine Wang
Rating: 5 out of 5 stars
5/5
THE CHATGPT MILLIONAIRE'S HANDBOOK: UNLOCKING WEALTH THROUGH AI AUTOMATION
Ebook
THE CHATGPT MILLIONAIRE'S HANDBOOK: UNLOCKING WEALTH THROUGH AI AUTOMATION
byLogan Rivers
Rating: 5 out of 5 stars
5/5
The Algorithm of the Universe (A New Perspective to Cognitive AI)
Ebook
The Algorithm of the Universe (A New Perspective to Cognitive AI)
byAncient Philosophy
Rating: 5 out of 5 stars
5/5

Related podcast episodes

Skip carousel

From ML to AI to Generative AI: get Fully-Connected with Chris & Daniel
Podcast episode
From ML to AI to Generative AI: get Fully-Connected with Chris & Daniel
byPractical AI: Machine Learning, Data Science
100%
100% found this document useful
What is beyond PoCs? ML project-hurdles you should be prepared to take with Balázs Kégl - 016: Why do we do PoCs all the time and why do we struggle with Real projects? We are going to talk about ML project-hurdles with the head of AI at Huawei Paris, Balazs Kegl.
Podcast episode
What is beyond PoCs? ML project-hurdles you should be prepared to take with Balázs Kégl - 016: Why do we do PoCs all the time and why do we struggle with Real projects? We are going to talk about ML project-hurdles with the head of AI at Huawei Paris, Balazs Kegl.
byMachine Learning Cafe
0 ratings
0% found this document useful
Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
Podcast episode
Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
byInvest Like the Best with Patrick O'Shaughnessy
0 ratings
0% found this document useful
Reflections On Designing A Data Platform From Scratch: A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
Podcast episode
Reflections On Designing A Data Platform From Scratch: A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
byData Engineering Podcast
100%
100% found this document useful
Interview with Kunal Das, Chief Architect at SouthState Bank.
Podcast episode
Interview with Kunal Das, Chief Architect at SouthState Bank.
byEnterprise Architecture Podcast
0 ratings
0% found this document useful
Data Operations vs. Data Analytics: Are we doing data and analytics correctly? Self service, centralization vs decentralization, analytics vs operations… so many aspects that data teams need to consider. Join this week’s episode of Catalog & Cocktails with hos...
Podcast episode
Data Operations vs. Data Analytics: Are we doing data and analytics correctly? Self service, centralization vs decentralization, analytics vs operations… so many aspects that data teams need to consider. Join this week’s episode of Catalog & Cocktails with hos...
byCatalog & Cocktails: The Honest, No-BS Data Podcast
0 ratings
0% found this document useful
The Generative AI Inflection Point: Generative AI (genAI) has driven a lot of change in a short period. Senior Analyst Rowan Curran and Vice President and Principal Analyst J. P. Gownder discuss the genAI inflection point and its impact on the future of work.
Podcast episode
The Generative AI Inflection Point: Generative AI (genAI) has driven a lot of change in a short period. Senior Analyst Rowan Curran and Vice President and Principal Analyst J. P. Gownder discuss the genAI inflection point and its impact on the future of work.
byWhat It Means
0 ratings
0% found this document useful
This Week In Machine Learning & AI - 5/20/16: AI at Google I/O, Amazon's Deep Learning DSSTNE: This Week In Machine Learning & AI - May 20, 2016…
Podcast episode
This Week In Machine Learning & AI - 5/20/16: AI at Google I/O, Amazon's Deep Learning DSSTNE: This Week In Machine Learning & AI - May 20, 2016…
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
Managing non-REST APIs like GraphQL and gRPC with Nandan Sridhar and David Feuer: Alexandrina Garcia-Verdin and Stephanie Wong host this week's episode all about managing non-REST APIs.
Podcast episode
Managing non-REST APIs like GraphQL and gRPC with Nandan Sridhar and David Feuer: Alexandrina Garcia-Verdin and Stephanie Wong host this week's episode all about managing non-REST APIs.
byGoogle Cloud Platform Podcast
0 ratings
0% found this document useful
Azure Databricks: I sat down with Ali Ghodsi, CEO and found of Databricks, and John Chirapurath, GM for Data Platform Marketing at Microsoft related to the recent announcement of Azure Databricks. When I heard about the announcement, my first thoughts were...
Podcast episode
Azure Databricks: I sat down with Ali Ghodsi, CEO and found of Databricks, and John Chirapurath, GM for Data Platform Marketing at Microsoft related to the recent announcement of Azure Databricks. When I heard about the announcement, my first thoughts were...
byData Skeptic
0 ratings
0% found this document useful
Build Better Machine Learning Models With Confidence By Adding Validation With Deepchecks: A cross-over episode from The Machine Learning Podcast with the team from Deepchecks, exploring the challenges of testing and validating machine learning applications and their work to make it easier.
Podcast episode
Build Better Machine Learning Models With Confidence By Adding Validation With Deepchecks: A cross-over episode from The Machine Learning Podcast with the team from Deepchecks, exploring the challenges of testing and validating machine learning applications and their work to make it easier.
byThe Python Podcast.__init__
0 ratings
0% found this document useful
Generative AI, cybercrime, and scamability, with Stacey Edmonds
Podcast episode
Generative AI, cybercrime, and scamability, with Stacey Edmonds
byLondon Futurists
100%
100% found this document useful
Publishing - Can YouTube and Reddit be overtaken? STEEMIT, Props, Po.et, Flixxo: Today, we’ll be talking about decentralized publishing platforms vs. centralized publishing platforms and how blockchain technology can improve the future of the publishing industry. Let’s take a look at the current state of centralized...
Podcast episode
Publishing - Can YouTube and Reddit be overtaken? STEEMIT, Props, Po.et, Flixxo: Today, we’ll be talking about decentralized publishing platforms vs. centralized publishing platforms and how blockchain technology can improve the future of the publishing industry. Let’s take a look at the current state of centralized...
byBlockchain 2025
0 ratings
0% found this document useful
Episode 19 (Python for Data Science - Python Files - Scripts and Modules)
Podcast episode
Episode 19 (Python for Data Science - Python Files - Scripts and Modules)
byHow to Data (Joshiverse- Journey of a Budding Data Scientist)
0 ratings
0% found this document useful
Building A Data Mesh Platform At PayPal: There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational challenges that he encountered in the process.
Podcast episode
Building A Data Mesh Platform At PayPal: There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational challenges that he encountered in the process.
byData Engineering Podcast
0 ratings
0% found this document useful
Generative models: exploration to deployment: get Fully-Connected with Chris & Daniel
Podcast episode
Generative models: exploration to deployment: get Fully-Connected with Chris & Daniel
byPractical AI: Machine Learning, Data Science
100%
100% found this document useful
Power and Prediction with Joshua Gans
Podcast episode
Power and Prediction with Joshua Gans
byThinkers & Ideas
0 ratings
0% found this document useful
AI in Insurance - Addressing Compliance Considerations - with Pardeep Bassi of WTW: In this episode, we’re focusing on compliance considerations in the insurance world. Our guest this week is Pardeep Bassi. He is currently Global Proposition Leader of Data Science for WTW, or Willis Towers Watson, a publicly traded financial...
Podcast episode
AI in Insurance - Addressing Compliance Considerations - with Pardeep Bassi of WTW: In this episode, we’re focusing on compliance considerations in the insurance world. Our guest this week is Pardeep Bassi. He is currently Global Proposition Leader of Data Science for WTW, or Willis Towers Watson, a publicly traded financial...
byThe AI in Business Podcast
100%
100% found this document useful
Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657
Podcast episode
Building LLM-Based Applications with Azure OpenAI with Jay Emery - #657
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
Getting Technical about the Data Center Revolution with Jonathan Friedmann, CEO of Speedata
Podcast episode
Getting Technical about the Data Center Revolution with Jonathan Friedmann, CEO of Speedata
byMaking Data Simple
0 ratings
0% found this document useful
Future of Generative AI [David Foster]
Podcast episode
Future of Generative AI [David Foster]
byMachine Learning Street Talk (MLST)
0 ratings
0% found this document useful
This Week In Machine Learning & AI - 5/27/16: The White House on AI & Aggressive Self-Driving Cars: This Week in Machine Learning & AI brings you the…
Podcast episode
This Week In Machine Learning & AI - 5/27/16: The White House on AI & Aggressive Self-Driving Cars: This Week in Machine Learning & AI brings you the…
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
The Undocumented Web: scraping, private APIs, proxies and “alternative solutions”: What is the undocumented web? Scott and Wes dive into it, discussing APIs, faking, scraping, automation, proxies as well as tips and tricks for best practices. Kyle Prinsloo’s Freelancing & Beyond — Sponsor Kyle Prinsloo teaches you everything...
Podcast episode
The Undocumented Web: scraping, private APIs, proxies and “alternative solutions”: What is the undocumented web? Scott and Wes dive into it, discussing APIs, faking, scraping, automation, proxies as well as tips and tricks for best practices. Kyle Prinsloo’s Freelancing & Beyond — Sponsor Kyle Prinsloo teaches you everything...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
How This Data Scientist Used Market Research to Launch A Successful Fertility Business: In this episode of Shopify Masters, Amy Divaraniya of OOVA shares with us the essential questions to ask when conducting market research and the challenges of being a female founder.
Podcast episode
How This Data Scientist Used Market Research to Launch A Successful Fertility Business: In this episode of Shopify Masters, Amy Divaraniya of OOVA shares with us the essential questions to ask when conducting market research and the challenges of being a female founder.
byShopify Masters
0 ratings
0% found this document useful
Why It's Exceedingly Difficult to Build and Adopt AI in Business: A lot of AI in the press is CMOs or marketing people talking about what a company can do in a way that really is aspirational. They're speaking about what they can do, but in reality, the things that they're talking about, the capabilities won't be...
Podcast episode
Why It's Exceedingly Difficult to Build and Adopt AI in Business: A lot of AI in the press is CMOs or marketing people talking about what a company can do in a way that really is aspirational. They're speaking about what they can do, but in reality, the things that they're talking about, the capabilities won't be...
byThe AI in Business Podcast
0 ratings
0% found this document useful
227: Ben M. Bensaou, Innovation Through Customer Collaboration: Welcome to Strategy Skills episode 227, an episode with a well-recognized professor, Ben M. Bensaou. Many people think that you need a genius leader or need to become a start-up to innovate. But we all have the potential to innovate. In this episode,...
Podcast episode
227: Ben M. Bensaou, Innovation Through Customer Collaboration: Welcome to Strategy Skills episode 227, an episode with a well-recognized professor, Ben M. Bensaou. Many people think that you need a genius leader or need to become a start-up to innovate. But we all have the potential to innovate. In this episode,...
byThe Strategy Skills Podcast: Strategy | Leadership | Critical Thinking | Problem-Solving
0 ratings
0% found this document useful
474 The AI Playbook by Eric Siegel: The AI Playbook: Mastering the Rare Art of Machine Learning Deployment by Eric Siegel ABOUT THE BOOK: In his bestselling first book, Eric Siegel explained how machine learning works. Now, in , he shows how to capitalize on it. The greatest tool...
Podcast episode
474 The AI Playbook by Eric Siegel: The AI Playbook: Mastering the Rare Art of Machine Learning Deployment by Eric Siegel ABOUT THE BOOK: In his bestselling first book, Eric Siegel explained how machine learning works. Now, in , he shows how to capitalize on it. The greatest tool...
byThe Marketing Book Podcast
0 ratings
0% found this document useful
554. Barry Saunders: AI Project Case Study: Show Notes: Barry Saunders, a digital expert at McKinsey, discusses his background in the firm and his experience in AI-related projects. He worked in the LEAP practice, which built platforms for video streaming, preventative maintenance, and...
Podcast episode
554. Barry Saunders: AI Project Case Study: Show Notes: Barry Saunders, a digital expert at McKinsey, discusses his background in the firm and his experience in AI-related projects. He worked in the LEAP practice, which built platforms for video streaming, preventative maintenance, and...
byUnleashed - How to Thrive as an Independent Professional
0 ratings
0% found this document useful
487: Innovation Through Customer Collaboration (with Ben M. Bensaou): Welcome to an episode with a well-recognized professor, Ben M. Bensaou. Get Ben's book here: Many people think that you need a genius leader or need to become a start-up to innovate. But we all have the potential to innovate. In this episode, Ben...
Podcast episode
487: Innovation Through Customer Collaboration (with Ben M. Bensaou): Welcome to an episode with a well-recognized professor, Ben M. Bensaou. Get Ben's book here: Many people think that you need a genius leader or need to become a start-up to innovate. But we all have the potential to innovate. In this episode, Ben...
byCase Interview Preparation & Management Consulting | Strategy | Critical Thinking
0 ratings
0% found this document useful
Fast.ai, AutoML, and Software Engineering for ML: Jeremy Howard // Coffee Session #47
Podcast episode
Fast.ai, AutoML, and Software Engineering for ML: Jeremy Howard // Coffee Session #47
byMLOps.community
0 ratings
0% found this document useful

Skip carousel

Top Five AI-ML Books For Business Leaders
Techfastly
Article
Top Five AI-ML Books For Business Leaders
Aug 2, 2021
5 min read
We Are Creating New Businesses And Transforming Old Ones”
Business Today
Article
We Are Creating New Businesses And Transforming Old Ones”
Apr 28, 2023
7 min read
The Risks Of The Generative AI Gold Rush
PC Pro Magazine
Article
The Risks Of The Generative AI Gold Rush
Apr 6, 2023
8 min read
Editor’s Note
Techfastly
Article
Editor’s Note
Jul 1, 2022
Dear Readers, Centralization has contributed to onboarding billions of users to the World Wide Web and creating the solid, reliable infrastructure that supports it. Simultaneously, a few centralized companies control significant swaths of the Interne
1 min read
How does OpenAI’s GPT 3 work?
Techfastly
Article
How does OpenAI’s GPT 3 work?
May 3, 2021
4 min read
Will Generative AI Disrupt Your Company And Your need For Workers?
The European Business Review
Article
Will Generative AI Disrupt Your Company And Your need For Workers?
Jul 31, 2023
5 min read
Machine Learning – With Zero Programming
APC
Article
Machine Learning – With Zero Programming
Aug 12, 2019
6 min read
AWS vs Azure
Linux Format
Article
AWS vs Azure
Aug 22, 2023
9 min read
VR: What Is The Metaverse And How Will It Work?
AppleMagazine
Article
VR: What Is The Metaverse And How Will It Work?
Jan 13, 2023
3 min read
AWS Vs Azure What’s The Difference?
PC Pro Magazine
Article
AWS Vs Azure What’s The Difference?
Sep 11, 2022
7 min read
In Conversation with Surbhi Rathore
Techfastly
Article
In Conversation with Surbhi Rathore
Oct 1, 2021
4 min read
AI Adoption In American Business: Who Is Doing What, And Where?
Rotman Management
Article
AI Adoption In American Business: Who Is Doing What, And Where?
Sep 1, 2023
10 min read
Why Is ELT Better For Cloud Data Warehousing?
Techfastly
Article
Why Is ELT Better For Cloud Data Warehousing?
Apr 1, 2021
2 min read
The Key Success Factors Of A Powerful AI Factory
The European Business Review
Article
The Key Success Factors Of A Powerful AI Factory
Jan 26, 2024
5 min read
Jobs Of The Future
True Love
Article
Jobs Of The Future
Jan 26, 2023
5 min read
Human-centred Design
Facility Management
Article
Human-centred Design
Dec 23, 2018
There was a recent report in The Sydney Morning Herald that was very misleading in its representation of the discipline of ergonomics. Citing the testimony of an authority on back pain, Sydney University professor, Chris Maher, the piece fundamentall
4 min read
Getting The edge
The European Business Review
Article
Getting The edge
Feb 25, 2021
7 min read
AI And Design: Questions Of Ethics
Architecture Australia
Article
AI And Design: Questions Of Ethics
Mar 4, 2024
Artificial intelligence (AI) is a very old idea, but the term AI and the field of AI as it relates to modern programmable digital computing have taken their contemporary forms in the past 70 years.1Today, we interact with AI technologies constantly,
5 min read
Why CEOs Must Delve into Design Thinking
Business Today
Article
Why CEOs Must Delve into Design Thinking
Feb 19, 2018
4 min read
WHAT EVERY MANAGER SHOULD KNOW ABOUT HUMAN-CENTERED AI: A Manager’s Introduction to Human-Centered Artificial Intelligence
The European Business Review
Article
WHAT EVERY MANAGER SHOULD KNOW ABOUT HUMAN-CENTERED AI: A Manager’s Introduction to Human-Centered Artificial Intelligence
Dec 3, 2019
9 min read
Genesis Of The Purple People
Business Today
Article
Genesis Of The Purple People
Oct 28, 2021
11 min read
Q&A
Rotman Management
Article
Q&A
May 1, 2023
Describe the capability that companies like Netflix, UPS, Amazon and Caesars Entertainment have in common. These are all leading firms in their industries with respect to leveraging analytics as a source of competitive advantage. We now have so much
7 min read
How To Make Sense From And With AI ?
The European Business Review
Article
How To Make Sense From And With AI ?
Sep 25, 2021
4 min read
Up Your Skills
India Today
Article
Up Your Skills
Feb 1, 2020
The lines between digital and physical spheres have blurred. Future business leaders will have to go ahead within this dynamic, challenging world now that computers are able to emulate better than human thinking and performance. Progressive managemen
1 min read
Fact-check And Verify Information
Post South Africa
Article
Fact-check And Verify Information
Mar 13, 2024
Q: What is AI? A: AI is the acronym for artificial intelligence (AI) and refers to the development of computer systems capable of performing tasks that typically require human intelligence, such as visual perception, speech recognition, decision-maki
3 min read
Intelligent Artificiality: WHY ‘AI’ DOES NOT LIVE UP TO ITS HYPE – AND HOW TO MAKE IT MORE USEFUL THAN IT CURRENTLY IS
The European Business Review
Article
Intelligent Artificiality: WHY ‘AI’ DOES NOT LIVE UP TO ITS HYPE – AND HOW TO MAKE IT MORE USEFUL THAN IT CURRENTLY IS
Aug 2, 2019
5 min read
Quantum Leap
Marketing
Article
Quantum Leap
Jul 11, 2019
6 min read
01 Ready Or Not, AI Is Here To Assist You
HWM Singapore
Article
01 Ready Or Not, AI Is Here To Assist You
Jul 11, 2023
4 min read
Understanding The POTENTIAL OF AI In A Technology Driven World
The European Business Review
Article
Understanding The POTENTIAL OF AI In A Technology Driven World
Apr 3, 2019
9 min read
Digital Marketing: AI Enables Expanded Roles For Marketers
The European Business Review
Article
Digital Marketing: AI Enables Expanded Roles For Marketers
Jan 25, 2021
8 min read

Related categories

Skip carousel

Reviews for Predictive Analytics and Machine Learning for Managers

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Predictive Analytics and Machine Learning for Managers - J. Alberto Espinosa

Throughout my professional experience building data science practices for the Federal Government and commercial industries, I have realized the need for better and faster data-driven decision-making capabilities. Today, managers and business leaders need to become data literate, understand the power of analytics, and develop their analytical skills. In this book, Professor Espinosa provides a robust analytic roadmap for business leaders, providing practitioners with a better understanding of how advanced data analytics can improve their businesses. The first half of this book is an excellent compendium of statistical concepts to help professionals understand and implement advanced, complex analytical models. The second half of the book explains what business leaders are asking today—how to make better business decisions using data and machine learning algorithms to create business value. The scripts and code presented in this book will enable managers to understand and experiment with various predictive analytics and machine learning methods.

ROD FONTECILLA

Partner and Chief Innovation Officer

Technology Solutions

Guidehouse

Professor Espinosa’s book is a must-read for analysts and managers interested in learning how to use data analytics for decision-making and business problem-solving. Through this book, one learns how to frame a business analytics question, how to identify the right predictors and model, and how to interpret results. Professor Espinosa has been teaching predictive analytics for a decade, and his book has a good balance of technical and managerial insights. He does a wonderful job of explaining fundamental terms in a concise and understandable manner. Further, his deep experience is exemplified in the book, which will help business professionals and analysts understand the analytics lifecycle from a managerial perspective. The accompanying GitHub site appendices provide useful scripts and examples illustrated in the book, which will enhance the learning of the technical aspects presented. This book is a comprehensive and valuable guide for analysts and managers.

WAI FONG BOH

President’s Chair and Professor of Information Systems

Deputy Dean of Nanyang Business School

Nanyang Technological University in Singapore

Title: Predictive Analytics and Machine Learning for Managers

First Edition: 2023

ISBN paperback: 979-8-9876543-1-6

ISBN ebook: 979-8-9876543-0-9

Published by: Jibe4Fun Press

Published in the United States of America

All rights reserved. No part of this book may be used or reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) whatsoever without permission in writing from the author, except in the case of brief quotations embodied in critical articles and reviews.

Cover & Book Design: Alison Rayner

Author services by Pedernales Publishing, LLC

www.pedernalespublishing.com

Trademarks

All brand names and product names referred to in this book are registered trademarks and unregistered trade names of their owners. There is no implied endorsement of any of them.

Disclaimers

This publication aims to provide accurate and reliable information regarding the subject matter covered. However, neither the publisher nor the author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.

PREDICTIVE ANALYTICS AND MACHINE LEARNING FOR MANAGERS

J. Alberto Espinosa, Ph.D.

Dr. Espinosa is a Professor of Information Technology and Analytics (IT&A) at the Kogod School of Business at American University, Washington, DC. He holds Ph.D. and Master of Science degrees in Information Systems from the Tepper School of Business at Carnegie Mellon University, an MBA from Texas Tech University, and a Mechanical Engineering degree from Pontificia Universidad Catòlica del Peru. He is the architect of Kogod’s MS Analytics program (for both campus and online delivery) and of the undergraduate programs in Information Technology and Business Analytics. In addition to this book, he has co-authored two books, I’m Working While They’re Sleeping: Time Zone Separation Challenges and Solutions and Obtaining Value from Big Data for Service Systems: Volume I: Big Data Management and Volume II: Big Data Technology. His research focusses on coordination and performance in technical projects across global boundaries, particularly distance and time-separation (i.e., time zones and schedule shifts). His current research focus is on the visual and quantitative representation and analysis of team knowledge using social network analytics. Dr. Espinosa is a multi-method researcher, but most of his work involves field studies with technical organizations, using quantitative methods. His work has been published in leading scholarly journals, including Management Science, Organization Science, Information Systems Research, The Journal of Management Information Systems, IEEE Transactions on Software Engineering, IEEE Transactions on Engineering Management, Communications of the ACM, Human Factors, Information, Technology and People, and Software Process: Improvement and Practice. Dr. Espinosa’s work has also been presented and featured at leading academic conferences. He teaches predictive analytics, social and organizational network analytics, R programming for analytics, information technology foundations and business process analysis, and programming for business applications. He also has several years of working experience, first as a design engineer for oil and mining projects, and later as a senior manager, VP, and CFO with international organizations directly supporting, supervising, and formulating policy for finance, human resources, global IT, and data management and analytics applications to support geographically distributed work in Africa, Latin America, and Eastern Europe.

From The Author

Iwas motivated to write this book after architecting the Kogod School of Business’ MS program and undergraduate specialization in Business Analytics with my colleagues, and teaching and practicing analytics for over a decade. My goal was to provide strong but understandable conceptual foundations and practical material for graduate students and managers, describing how to frame a business question, identify various model specification (i.e., feature engineering) and model methods (explainable and black box), select the optimal model based on the bias, variance, and cross-validation testing, and interpret results with meaningful storytelling for clients and managers. The book contains two components: (1) the main text with two sections—one with conceptual, mathematical, and managerial foundations, the other about advanced predictive modeling methods based on machine learning; and (2) an appendix companion with annotated R Markdown code with hands-on applications, posted in GitHub.

This book is dedicated to my wife, Delphine Clegg, who has supported me on this book project and in life for many years.

This book was edited by Andrew Erickson and Delphine Clegg.

Andrew was one of my top students and an awesome teaching assistant for my Predictive Analytics course. He has also been a writer for American, the American University magazine, and is now a business analytics professional. Andrew reviewed the book for effective communication, comprehension, clarity, and overall quality of the material.

Delphine is a freelance editor with years of experience in editorial and communications work. She was the final editor of the book. She reviewed all the writing in detail and did an outstanding job ensuring consistency of content and style.

Alison Rayner designed this book. She laid out and presented the book’s content masterfully, including the front and back covers, for digital and print formats.

Overview

SECTION 1: PREDICTIVE ANALYTICS BASICS

CHAPTER 1: Introduction to Predictive Analytics

Introduction

1.1 The Importance of Predictive Analytics

1.2 Analytics and Its Cousins

Analytics

Predictive Analytics

Data Mining

Business Intelligence (BI)

Machine Learning (ML)

Cross Validation (CV)

Unsupervised Learning

Supervised Learning

Data Science

1.3 Key Tradeoffs

Bias vs. Variance

Interpretable vs. Black Box Models

1.4 The Analytics Lifecycle

1.5 Data Structures: Vectors, Matrices, and Data Frames

Vectors

Matrices

Data Frames

1.6 Predictive Analytics Overview

Descriptive, Predictive, and Prescriptive Analytics

Quantitative vs. Classification Prediction

Parametric vs. Non-Parametric Models

Association vs. Tree Methods

1.7 Predictive Modeling Goals

Interpretation

Inference

Prediction

1.8 Modeling Method and Model Specification

Modeling Method

Model Specification

Final Notes

CHAPTER 2: Foundations

Introduction

2.1 Understanding Variability

Variance

2.2 Covariance and Correlation

Variable Types and Statistical Association

Covariance (Quantitative vs. Quantitative)

Correlation (Quantitative vs. Quantitative)

2.3 Analysis of Variance (ANOVA)

Comparing Group Means (Quantitative vs. Categorical)

Evaluating a Single Model

Comparing Two or More Models

2.4 Chi-Square Test of Independence

Statistical Association of Categorical Variables (Categorical vs. Categorical)

2.5 Regression Overview

The Null Model

Simple Linear Regression

Linear Regression Parameters

Model Estimation

OLS Fit Statistics

Regression with Dummy Variables

Multivariate Regression

CHAPTER 3: Basic Models

Introduction

3.1 OLS Assumptions

Assumption 1: The Outcome Variable Y Is Continuous (YC)

Assumption 2: The Errors (Residuals) Are Normally Distributed (EN)

Assumption 3: The Predictors Are Independent (XI)

Assumption 4: The Outcome and Response Variables Have a Linear Relationship (LI)

Assumption 5: Observations Are Independent from Each Other (OI)

Assumption 6: Errors (i.e., Residuals) Are Independent from Each Other (EI)

Assumption 7: The Error Average Is Zero (EA)

Assumption 8: The Error Variance Is Constant (EV)

OLS and Predictive Modeling

3.2 Weighted Least Squares (WLS)

Heteroskedasticity

Testing for Heteroskedasticity

Weighted Least Squares Regression (WLS) Method

3.3 The Generalized Linear Models (GLM)

Maximum Likelihood Estimation (MLE)

The Infamous 2LL or Deviance

Generalized Linear Models (GLM) Specifications

3.4 Logistic Regression

Overview

Probabilities, Odds, and Log-Odds

Logistic Regression

3.5 Decision Trees

Overview

Regression Trees

Growing Trees

Classification Trees

CHAPTER 4: Data Pre-Processing

Introduction

4.1 Rationale for Data Transformations

Why Transform?

What Should Be Transformed? Predictors (P) and/or Outcomes (O)

4.2 Transformation (P) – Categorical to Binary (Dummy) Variables

The Dummy Variable Trap

Interpretation: Why Did You Drop Me?

Can I Use You as a Reference? Reference Level Matters

4.3 Transformation (P) (O) – Polynomials

Polynomial Transformations of Predictors

Polynomial Transformations of Outcomes

4.4 Transformation (P) (O) – Log Models

Properties of Logarithms

Why Log Models?

Log-Transformed Models and Interpretation

Elasticity Models

Illustration

Logit Transformation

Count Data Models

4.5 Transformation (P) (O) – Centering and Standardization

Centering

Standardization

4.6 Transformation (P) (O) – Lagging Data

Why Lag When You Can Lead?

Time Series and Forecasting Models

Serial Correlation

Durbin-Watson (DW) Test for Serial Correlation

Correcting for Serial Correlation: Lagged Data Models

CHAPTER 5: Variable Selection

Introduction

5.1 Dimensionality

Dimensionality Basics

Dimensionality Issues

Addressing Dimensionality

5.2 Multicollinearity

Eigenvectors and Eigenvalues

Testing for Multicollinearity

Correcting for Multicollinearity

5.3 Variable Selection Methods

Overview

Subset Comparison

Step Methods

SECTION 2: ADVANCED MODELS AND MACHINE LEARNING

CHAPTER 6: Machine Learning and Cross Validation

Introduction

What is Machine Learning?

Do You Need Supervision?

6.1 Machine Learning Key Concepts

Cross Validation

Main Uses of CV

6.2 Bias vs. Variance Trade-Off

6.3 Error Measures

Quantitative Models

Classification Models

6.4 Cross Validation, Partitioning, and Resampling

6.5 Random Splitting Cross Validation (RSCV)

6.6 Leave-One-Out Cross Validation (LOOCV)

6.7 K-Fold Cross Validation (KFCV)

6.8 Bootstrapping

6.9 The {caret} R Package

CHAPTER 7: Dimensionality

Overview

7.1 Tuning Parameters

7.2 Regularized (Penalized or Shrinkage) Regression Models

Intuition

Ridge Regression

L2 Norm

LASSO Regression

L1 Norm

Elastic Net Regression

Shrinkage Methods for Logistic Models

7.3 Dimension-Reduction Models

Intuition

Principal Components (PCs)

Principal Components Regression (PCR)

Partial Least Squares Regression (PLSR)

Dimension Reduction Summary

7.4 Dimensionality Summary

CHAPTER 8: Non-Linear Models

Overview

8.1 Interaction Models

B x C Interaction Models

C x C Interaction Models

8.2 Polynomial Models

Fitting Polynomials

8.3 Piecewise and Spline Models

Constructing Piecewise Linear Functions

Constructing Piecewise Polynomial Functions

MARS (Spline) Models

Smoothing Splines

CHAPTER 9: Classification Models

Overview

9.1 Introduction to Classification Models

Binomial Classification and Maximum Likelihood Estimation (MLE)

9.2 Binomial Logistic Models

Binomial Logistic Regression Refresher

9.3 Evaluating Classification Models: The Confusion Matrix, Sensitivity, Specificity, and ROC Curves

The Confusion Matrix

Sensitivity and Specificity

The Receiver Operating Characteristics (ROC) Curve

9.4 Multinomial Logistic Models

Multinomial Logistic Effect Interpretations

Multinomial Classification Confusion Matrix

9.5 Linear (LDA) and Quadratic (QDA) Discriminant Analysis

Bayes’ Theorem

Linear Discriminant Analysis (LDA)

Quadratic Discriminant Analysis (QDA)

CHAPTER 10: Decision Trees

Overview

Decision Tree Issues

Decision Tree Terminology

10.1 Growing and Pruning Trees with Cross Validation

Regression Trees

Classification Trees

Advanced Decision Tree Methods

10.2 Bootstrap Aggregation (Bagging) Trees

10.3 Random Forest Trees

10.4 Boosted Trees

CHAPTER 11: Deep Learning and Neural Networks

Overview

11.1 What We Know from Prior Models

11.2 Biological Neural Networks (ANNs)

11.3 Artificial Neural Networks (ANNs or Simply NNs)

Overview

The Perceptron

Simple Neural Networks (Shallow Learning)

Complex Neural Networks (Deep Learning)

Feedforward Neural Networks and Forward Propagation

Complex Neural Network Configurations

Neural Network Training, Predicting, and Testing

Typical Neural Network Training Data Pre-Processing and Parameters

Wrap-Up

OVERVIEW

Note About the R Code Companion for This Book: Appendices A1 to A11 provide the respective R scripts, code and programming notes associated with each chapter and are available at https://github.com/jibe4fun/paml4m/tree/R-Code. The book is devoted to conceptual issues and the appendices cover hands-on modeling with R. I plan to include appendices with scripts and code using the Python language at some point in the future.

An analyst refused to report his/her partner’s stolen credit card. The analyst told a friend in confidence that the refusal was because a predictive model showed with statistical confidence that the thief spent less money than the partner.

This humorous story illustrates an important point about this book: MD³, which stands for Models Don’t Make Decisions, Managers Do. A model can tell us what the best quantitative solution may be, but it is not necessarily what a rational human would decide. Nevertheless, some predictive models based on machine learning are written to automate such decisions (e.g., recommender systems, self-driving vehicles). It is important, therefore, to understand the goals of the particular predictive model—interpretability, inference, and/or predictive accuracy.

There have been many articles predicting large shortages of professionals with deep data science skills. But the predicted shortages are of a magnitude larger for managers with analytical skills. Analytics and data science today permeate every aspect of business and organizational work and managers are expected to know how to do basic analytic work and how to be good consumers of analytics reports. As such, the goal of this book is to provide knowledge and skills for the analytical manager. I place a strong emphasis on the conceptual foundations of predictive analytics and machine learning. My intent is to provide the analytical manager with the necessary knowledge to: specify business questions of interest and translate them into equivalent analytics questions; define the analytics goals for a project; select and specify the appropriate model to answer the business questions and fulfill the analytics goals; build the model using R open source software; test competing models or tune model parameters using machine learning and cross-validation methods; and interpret results and extract meaning from the data to tell the business story behind the analysis.

I have been teaching predictive analytics and machine learning at a business school for several years, and I have been applying these methods in my own research for about two decades. When looking for an appropriate textbook for my course, I read about a dozen books on predictive analytics and machine learning. Most of these books had this in common: (1) they start with very basic and simple material; (2) at some point the material turns very technical and cryptic for the average beginner to follow; (3) they focus on describing predictive models, rather than on understanding how to select the appropriate modeling method and model specification; and (4) they tend to cover data science and statistical aspects of these models, rather than their business application and interpretation. It is difficult to find a book on predictive analytics and machine learning that is both readable for managers and technically deep. Most books are one or the other. This book attempts to fill these gaps. And while a technical background is not necessary in order to understand the content of this book, some basic understanding of statistics and software programming may be helpful. I recommend that readers brush up on basic concepts like frequency distributions, descriptive statistics, correlation analysis, and linear regression, and also learn the basics of statistical programming languages like R. Again, I cover these topics in detail so they are not pre-requisites to reading this book, but a basic understanding of these topics will facilitate your understanding of the material.

We often hear politicians and people in the media people saying things like: we have to follow the data; decisions should be based on data; models are only as good as their assumptions; etc. What do they really mean by following the data? Data is usually full of anomalies, imperfections, and missing elements, so simply following the data will not always provide an answer to our questions. What does it mean for a model to be correct or incorrect? Models can be tested for statistical fit and accuracy, and nothing is ever certain. All models have levels of statistical confidence and accuracy, but no model can be correct 100% of the time and there is never a guarantee that any model will predict accurately with new data. A perfect example of all this is with weather forecasting in which the data changes continually, so predictions become more and more uncertain when the time horizon is long into the future. Any weather or pandemic prediction expert will agree that as the time horizon for predictions increases, the predictive accuracy of the models diminishes substantially. Another way of saying this is that the confidence in our prediction diminishes sharply. One way to illustrate this notion is to pay attention to hurricane forecasting models. Meteorologists usually show predicted hurricane paths from multiple predictive models (e.g., Global Forecasting System, European Model, Canadian Model, etc.), color coded for TV viewing. When the models agree, there is some degree of certainty as to where the hurricane will make landfall. When models disagree, landfall predictions become very uncertain (i.e., the predictive accuracy confidence goes down). Predictive models for business are no different. For any analytics questions, there will be a wide range of modeling methods, data transformations and model specifications to choose from, plus various ensemble models that aggregate the results from various models. The main goal of this book is to help the reader navigate through the various modeling options and provide one with the ability to test them and compare them in order to select the optimal model and specification to meet the analytics goals.

In this book I begin by discussing general principles and statistical and linear regression concepts, which then become the building blocks to develop more advanced and complex models. To this end, the book is divided into two main sections: Section 1 (Chapters 1-5) – Predictive Analytics Basics; and Section 2 (Chapters 6-11) – Advanced Models and Machine Learning.

In Section 1, Chapter 1, I introduce predictive analytics and machine learning from a business perspective. I discuss key foundational aspects of predictive modeling, such as the analytics life cycle, general model categories (i.e., quantitative vs. classification), classic modeling tensions (e.g., bias vs. variance, explainability vs. accuracy, etc.). In Chapter 2, I provide an overview of the basic statistical foundations necessary to follow the rest of the book. Most of this material is based on descriptive analytics, used to understand the data before building any predictive models. In Chapter 3, I introduce the most basic quantitative models (i.e., regression and regression trees) and classification models (i.e., binomial logistic regression and classification trees). In addition, I discuss the basic assumptions or preconditions for these models, which is important for two reasons—models are built on mathematical assumptions, which need to be tested before using the model; and regardless of the testing results, these models tend to be the most unbiased, thus serving as useful benchmarks to evaluate other models. Most advanced models are departures or derivatives of these basic models, so it really helps to understand them and their respective assumptions well before using them. In Chapter 4, I discuss the importance of data pre-processing. It is estimated that about 80% of the work in an analytics project is extracting and preparing the data for analysis. Some aspects of data pre-processing are necessary (e.g., curation, missing data, cleansing anomalies, etc.) and some are done to improve model fit and accuracy (e.g., transformations, interactions, sub-grouping, etc.). The final chapter of Section 1, Chapter 5, is where I discuss variable selection. Selecting the predictors for a model is one of the most important initial steps when building predictive models. Predictors must be rooted in an understanding of the business domain of the analytics problem. You cannot really undertake the task of healthcare or marketing analytics without understanding the healthcare and marketing domains. At the same time, predictors must have a solid statistical foundation to be incorporated and retained in models. At the end of this chapter, readers should have a basic understanding of the fundamental principles of predictive analytics and machine learning. The combination of data pre-processing (chapter 4) and variable selection (chapter 5) is often referred to as feature engineering.

In Section 2, I discuss more advanced predictive modeling methods. We switch into high gear when I introduce the concepts of machine learning (ML) and cross-validation (CV) in Chapter 6. ML is about training models with data and testing the models for predictive accuracy. As more data comes in, the algorithms learn without the need to be reprogrammed. Because accuracy is critical to model method and specification selection, and because this accuracy will change as new data arrives, CV is a central aspect of machine learning. CV is used for many things, including evaluating single models, tuning models, comparing models and training models. One important predictive modeling tension I discuss in depth is bias vs. variance. Bias is about the effects reported by a model departing from the true effects, which is not good for interpretability. Variance refers to whether we get consistent results when we compare model results with multiple resamples of the data. Stable (low variance) models will yield consistent results across resamples. Conversely, unstable (high variance) will yield widely dissimilar results across resamples, making the model unreliable. Bias and variance represent one of the most fundamental tradeoffs in predictive modeling. Smaller and simpler models tend to have more bias—generally caused by omitted predictors and less variance, whereas larger and more complex models have less bias, but this comes at the expense of increased variance. CV testing helps find the optimal model size and complexity that minimizes the combined effect of bias and variance.

The topic of dimensionality is covered in Chapter 7. Complex business problems often require complex predictive models. As the number of predictors grows large and as the model becomes more complex, the issue of dimensionality will manifest itself in the form of increased variance. But there are effective ways to address this problem, which are covered in this chapter. In Chapter 8, I discuss non-linearity, which is when the predictors have a non-linear relationship (e.g., quadratic, cubic, interactive) with the outcome variable. Classification methods are discussed in more depth in Chapter 9. We depart from the binomial classification model (e.g., yes or no, approve or decline, positive or negative diagnostic, etc.) and look at multinomial classification models (i.e., more than one categorical outcome, e.g., green, amber, or red traffic light recognition). I also discuss classification accuracy evaluation methods based on key concepts like the confusion matrix and ROC curves. In Chapter 10, I discuss decision trees, both quantitative and classification, in more depth. This chapter also covers more advanced tree methods like bootstrap aggregation, random forest, and boosted trees. Finally, Chapter 11 provides an introduction to deep learning predictive models, with a focus on neural networks.

SECTION 1

PREDICTIVE ANALYTICS BASICS

CHAPTER 1

INTRODUCTION TO PREDICTIVE ANALYTICS

Introduction to Predictive Analytics

What do people mean when they say that we have to follow the science? While I agree that decisions must be informed by science, science is never exact. Science is grounded in data and analytical models, both of which are often imperfect. There are good data, bad data, raw data, data with missing or inconsistent values, etc. There are also issues with sample size, sampling methods, measurement, probability distributions, collinearity, etc. Furthermore, often data needs to be pre-processed before it can be used, and bad pre-processing can lead to bad data. Models also have issues, which is why they are tested for compliance to their method assumptions, validated for accuracy and evaluated against competing models. In addition, effects reported by models are influenced by issues like confidence intervals, statistical significance, likelihood of predicting correctly, margins of error, etc. Following the science means understanding all these complex data and modeling issues and rendering optimal models that can help us interpret the effects on outcomes and make sound predictions. But when models do not yield these desirable results, we cannot really blame the models, but the people who built them. So, while data and modeling are essential to decision-making, it is also important that we understand the scope, generalizability, and limitations of these models, regardless of whether we are the analysts who build these models or the managers who consume the resulting analytics reports. Most analysts are well-intentioned and intelligent individuals, but predictive modeling is not trivial. Models are subject to algorithmic issues, model mathematical assumptions (i.e., conditions under which the model can be used), specification (i.e., which predictors to include and in what form), modeling method (e.g., quantitative, classification, regression based, tree based, etc.). And then all these things need to be applicable to the specific business question or model being analyzed. So, in sum, it is not about following the science, but about understanding the math, data, and methods in order to be able to provide answers to specific business questions with some degree of confidence. Figure 1.1 illustrates the various aspects that interrelate to form an appropriate predictive model.

Figure 1.1 Factors Influencing a Predictive Model

In God we trust, others must provide data. According to Quote Investigator (https://quoteinvestigator.com/2017/12/29/god-data/), the first known use of this quote was by a professor of pathology in 1978 who stated in a congressional hearing that he needed good scientific data before he could provide an opinion about whether smoking was hazardous to non-smokers. In this age of big data and analytics, decisions backed by data rather than by intuition or opinion is now the norm. Providing data is now a necessary component of a business professional’s job. These days when you meet with a client or manager, you can no longer use your expertise alone to make a convincing case. Your audience will demand to see evidence from data to back up your claims. Thus, it is important for all business professionals to be able to understand and process data to support their arguments.

Ronald Coase, a British economist and Nobel Prize winner, once said in the 1970s, If you torture the data, it will confess to anything. Essentially this means that skilled statisticians have the ability to manipulate statistics to support their conclusions. Others have argued that the actual quote was if you torture the data long enough, nature will confess, meaning that if you manipulate the data for too long, the truth will eventually emerge. In either case, we need to be mindful of being honest in our predictive modeling and not torture the data to fit a story. The truth will eventually come out. And, by being savvy about data analytics, one can probably figure out when others are torturing the data.

After reading this the book you should be able to look through the data—along with some descriptive statistics, plots, and correlation data—of an analytics report and evaluate the soundness of the models used. Following your review, you should be able to assess whether someone is manipulating the results and be able to determine whether or not the results reported are accurate.

When properly executed, predictive analytics can be very powerful. Take for example one of my favorite books, Moneyball by Michael Lewis. Michael Lewis is a well-known financial analyst turned book author. He has written best sellers like The Big Short and others. Published in 2003, Moneyball is chock-full of interesting details and stories about how baseball statistics were used by the Oakland A’s to enhance on-field performance and cost-effectiveness. For example, they would sell a very expensive player worth millions of dollars and acquire a few rookie players, who individually were not as good as the player they just sold, but collectively those players had an aggregation of skills that were necessary for the team to replace that player. This is one of the first books to document the use of statistical analysis in sports management, and it sparked great interest in understanding big data and analytics.

Hopefully, the present book will not only open your eyes to the power of predictive analytics and machine learning but will also help you get started in this fascinating field. By the time you finish reading the book, you should be able to take a business problem or question and resolve or answer it through analytics. This book is not about mathematics, statistics, or algorithms. It is also not about programming in R or Python. You will learn these things, for sure. But the focus of this book is on learning how

Enjoying the preview?

Page 1 of 1

Predictive Analytics and Machine Learning for Managers

About this ebook

J. Alberto Espinosa

Related authors

Related to Predictive Analytics and Machine Learning for Managers

Related ebooks

Intelligence (AI) & Semantics For You

Related podcast episodes

Related articles

Related categories

Reviews for Predictive Analytics and Machine Learning for Managers

What did you think?

Book preview

Predictive Analytics and Machine Learning for Managers - J. Alberto Espinosa

ROD FONTECILLA

WAI FONG BOH

PREDICTIVE ANALYTICS AND MACHINE LEARNING FOR MANAGERS

J. Alberto Espinosa, Ph.D.

From The Author

TABLE OF CONTENTS

Overview

SECTION 1: PREDICTIVE ANALYTICS BASICS

CHAPTER 1: Introduction to Predictive Analytics

CHAPTER 2: Foundations

CHAPTER 3: Basic Models

CHAPTER 4: Data Pre-Processing

CHAPTER 5: Variable Selection

SECTION 2: ADVANCED MODELS AND MACHINE LEARNING

CHAPTER 6: Machine Learning and Cross Validation

CHAPTER 7: Dimensionality

CHAPTER 8: Non-Linear Models

CHAPTER 9: Classification Models

CHAPTER 10: Decision Trees

CHAPTER 11: Deep Learning and Neural Networks

Wrap-Up

OVERVIEW

Introduction to Predictive Analytics