Introduction to Machine Learning in the Cloud with Python: Concepts and Practices
Ebook · 553 pages · 18 hours

About this ebook

This book provides an introduction to machine learning and cloud computing, both at a conceptual level and in terms of their usage with the underlying infrastructure. The authors emphasize fundamentals and best practices for using AI and ML in a dynamic, secure infrastructure built on cloud computing, preparing readers to select and make use of appropriate techniques. Important topics are demonstrated using real applications and case studies.

Language: English
Publisher: Springer
Release date: Apr 28, 2021
ISBN: 9783030712709

    Book preview

    Introduction to Machine Learning in the Cloud with Python - Pramod Gupta

    Part I: Concepts

    © The Author(s), under exclusive license to Springer Nature Switzerland AG 2021

    P. Gupta, N. K. Sehgal, Introduction to Machine Learning in the Cloud with Python, https://doi.org/10.1007/978-3-030-71270-9_1

    1. Machine Learning Concepts

    Pramod Gupta¹   and Naresh K. Sehgal²

    (1)

    NovaSignal, San Jose, CA, USA

    (2)

    NovaSignal, Santa Clara, CA, USA

    Keywords

    Machine learning · Supervised learning · Unsupervised learning · Reinforcement learning · Prediction · Classification · Clustering · Regression

    Over the last decade, machine learning (ML) has been at the core of our journey toward achieving larger goals in artificial intelligence (AI). It is one of the most influential and important technologies of the present time. ML is considered an application of AI, based on the idea that given sufficient data, machines can learn the necessary operational rules. It impacts every sphere of human life as new AI-based solutions are being developed.

    Recently, machine learning has given us practical speech recognition, effective web search, and a vastly improved understanding of the human genome. Machine learning is so pervasive today that one probably uses it dozens of times daily without realizing it. There is no doubt that ML will continue to make headlines in the foreseeable future; it has the potential to improve as more data, more powerful hardware, and newer algorithms continue to emerge. As we progress through the book, we will see that ML has a lot of benefits to offer.

    The rate of development and the complexity of the field make it difficult even for experts to keep up with new techniques; it can therefore be overwhelming for beginners. This provided sufficient motivation for us to write this text, which offers a conceptual-level understanding of machine learning and the current state of affairs.

    1.1 Terminology

    Dataset: The starting point in ML is a dataset, which contains the measured or collected data values represented as numbers or text: a set of examples whose features describe the behavior of the problem to be solved. There is one important nuance though: if the given data is noisy, or has a low signal-to-noise ratio, then even the best algorithm will not help. This is sometimes referred to as garbage in, garbage out. Thus, we should try to build the dataset as accurately as possible.

    Features/attributes: These are also referred to as parameters or variables. Some examples include car mileage, a user’s gender, and a word’s frequency in text: in other words, properties/information contained in the dataset that help to better understand the problem. These features are the factors for a machine to consider, and they are used as input variables for machine learning algorithms to learn and infer from, so as to take an intelligent action. When the data is stored in tables, it is simple to understand, with features as column names. Selecting the right set of features is very important and will be considered in a later chapter. It is the most important part of a machine learning project’s process and usually takes much longer than all the other ML steps.

    Training data: An ML model is built using the training data. This is data that has been validated and includes the desired output. The outputs or results are generally referred to as labels. The labeled training data helps an ML model to identify the key trends and patterns essential to predicting the output later on.

    Testing data: After the model is trained, it must be tested to evaluate how accurate it is. This is done with the testing data, where the ML-generated output is compared to the desired output; if both match, the tests have passed. It is important for both the training and testing datasets to resemble the situations that the ML algorithms will encounter later in the field. Think of a self-driving car’s ML model that has never seen a stop sign during its training phase: it will not know how to react when it encounters one on an actual drive later on.

    Model: There are many ways to solve a given problem. The basic idea is to build a mathematical representation that captures the relationships between the input and output; in other words, a mapping function from input to output. This is achieved by a process known as training. For example, a logistic regression algorithm may be trained to produce a logistic regression model. The method one chooses will affect the precision, performance, and complexity of the ML model.

    To sum up, an ML process begins by inputting lots of data to a computer, then by using this data, the computer/machine is trained to reveal the hidden patterns and offer insights. These insights are then used to build an ML model, by using one or more algorithms to solve other instances of the same problem.

    Let us consider an example based on the following dataset:

    In the above example, there are five features (i.e., Outlook, Temperature, Humidity, Windy, and Class). There are nine observations or rows. Here, Class is the target or desired output (i.e., whether to play or not), which the ML algorithm learns to predict for unseen new datasets. This is a typical classification problem. We will discuss the various tasks performed by ML later in this book.
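    A dataset like the one described can be represented in Python as a list of rows, separating the input features from the target. The specific values below are hypothetical placeholders for illustration, not the book's actual table:

```python
# A hypothetical encoding of the dataset described above. Each row is one
# observation; "Class" is the target the algorithm must learn to predict.
# The values are illustrative placeholders, not the book's actual table.
dataset = [
    {"Outlook": "sunny", "Temperature": "hot", "Humidity": "high", "Windy": False, "Class": "no play"},
    {"Outlook": "overcast", "Temperature": "mild", "Humidity": "normal", "Windy": False, "Class": "play"},
    {"Outlook": "rainy", "Temperature": "cool", "Humidity": "normal", "Windy": True, "Class": "no play"},
]

# Separate the input features from the target column.
feature_names = ["Outlook", "Temperature", "Humidity", "Windy"]
X = [[row[name] for name in feature_names] for row in dataset]
y = [row["Class"] for row in dataset]

print(feature_names)  # the inputs the model sees
print(y)              # the labels it learns to predict
```

    Splitting the rows into `X` (features) and `y` (target) in this way mirrors how most ML libraries expect their inputs.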

    1.2 What Is Machine Learning?

    To demystify machine learning, and to offer a learning opportunity for those who are new to this domain, we will start by exploring the basics of machine learning and the process involved in developing a machine learning model. Machine learning is about building programs with tunable parameters, which are adjusted automatically. The goal is to improve the behavior of an ML model by adapting to previously seen data.

    Machine learning is a subfield of artificial intelligence (AI). ML algorithms are the building blocks to make computers learn and act intelligently by generalizing, rather than just storing and retrieving data items like a database system.

    Although the field of machine learning has been widely explored only recently, the term was first coined in 1959 [1], and most foundational research was done through the 1970s and 1980s. The popularity of machine learning today can be attributed to the availability of vast amounts of data, faster computers, efficient data storage, and the evolution of newer algorithms.

    At a higher level, machine learning (ML) is the ability of a system to adapt to new data. The learning process advances through iterations offering better quality of response. Applications can learn from previous computations and transactions, by using pattern recognition to produce reliable and better informed results.

    Arthur Samuel, a pioneer in the field of artificial intelligence, coined the term machine learning in 1959 while at IBM [1]. He defined machine learning as a "field of study that gives computers the capability to learn without being explicitly programmed."

    In layman’s words, machine learning (ML) can be explained as automating and improving the learning process of computers based on experiences, without explicit programming. The basic process starts with feeding data to and training the computers (machines). This is achieved by feeding data to an algorithm to build ML models. The choice of algorithm depends upon the nature of the task; machine learning algorithms can perform various tasks using methods such as classification and regression.

    Machine learning algorithms can identify patterns in the given data and build models that capture the relationships between input and output. This is useful to predict the outcome for a new set of inputs without explicit pre-programmed rules or models.

    1.2.1 Mitchell’s Notion of Machine Learning

    Another widely accepted definition of machine learning was proposed by the computer scientist Tom M. Mitchell [2]. His definition states that a machine is said to learn if it is able to take experience and utilize it such that its performance improves upon similar experiences in the future. His definition says little about how machine learning techniques actually learn to transform data into actionable knowledge.

    Machine learning also involves the study of algorithms that improve at a defined category of tasks by optimizing a performance criterion over past experiences. ML uses data and past experiences to realize a given goal or performance criterion.

    The most desirable property of machine learning algorithms is generalization, i.e., a model should perform well on new or unseen data. The real aim of learning is to do well on test data that was not known during learning or training. The objective of machine learning is to model the true regularities in the data and to ignore the noise in it.

    1.3 What Does Learning Mean for a Computer?

    A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks T, as measured by P, improves with experience E. It can be used as a design tool to help us think about which data to collect (E), what decisions software needs to make (T), and how to evaluate its results (P).

    Example: playing tennis

    E = the experience of playing many games of tennis

    T = the task of playing tennis

    P = the probability that the program will win the next game

    1.4 Difference Between ML and Traditional Programming

    Traditional programming: Feed in data and a program (logic), run it on a machine, and get the output.

    Machine learning: Feed in data and its corresponding observed output, and run it on a machine during the learning (training) phase. The machine then generates its own logic, which can be evaluated during the testing phase, as shown in Fig. 1.1.

    Fig. 1.1 Basic differences between traditional programming and machine learning
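    The contrast of Fig. 1.1 can be sketched in a few lines of Python. The Celsius-to-Fahrenheit conversion below is an illustrative choice (not an example from the book): the traditional program encodes the formula by hand, while the "learned" version recovers the same logic from input–output pairs via a least-squares fit.

```python
# Traditional programming: the logic (the formula) is written by hand.
def to_fahrenheit(celsius):
    return celsius * 9 / 5 + 32

# Machine learning: only input/output pairs are supplied; the machine
# derives the logic itself. A least-squares fit recovers slope and intercept.
celsius = [0.0, 10.0, 20.0, 30.0, 40.0]
fahrenheit = [to_fahrenheit(c) for c in celsius]  # the observed outputs

n = len(celsius)
mean_c = sum(celsius) / n
mean_f = sum(fahrenheit) / n
slope = (sum((c - mean_c) * (f - mean_f) for c, f in zip(celsius, fahrenheit))
         / sum((c - mean_c) ** 2 for c in celsius))
intercept = mean_f - slope * mean_c

print(slope, intercept)  # the machine's own "logic": 1.8 and 32.0
```

    The learned parameters (1.8 and 32) reproduce the hand-written formula, which is exactly the point of Fig. 1.1: the logic came from the data, not from the programmer.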

    1.5 How Do Machines Learn?

    Regardless of whether the learner is a human or a machine, basic learning process is similar to that shown in Fig. 1.2. It can be divided into three components as follows:

    Data input: It comprises observations, memory storage, and recall to provide a factual basis for further reasoning.

    Abstraction: It involves interpretation of data into broader representations.

    Generalization: It uses abstracted data to form a basis for insight and taking an intelligent action.

    Fig. 1.2 Basic learning process

    1.6 Steps to Apply ML

    The machine learning process involves building a predictive model that can be used to find a solution for a given problem. The following steps are used in developing an ML model, as shown in Fig. 1.3.

    1.

    Problem definition: This is an important phase, as the choice of the machine learning algorithm/model depends on the problem to be solved; for example, it may call for classification or regression. The problem can be defined only after the system has been studied well. In particular, the study is designed to understand the principles of the system’s behavior in order to make predictions or informed choices. The definition step and the corresponding documentation (deliverables) of the scientific or business problem are both important to focus the analysis on getting results.

    2.

    Data collection/data extraction: The next requirement for a machine learning model is a dataset. This step is the most important and forms the foundation of the learning. The predictive power of a model depends not only on the quality of the modeling technique but also on the ability to choose a good dataset upon which to build the model. The search for data, its extraction, and its subsequent preparation are therefore central to the success of the analysis. Input data must be chosen with the basic purpose of building a predictive model, and its selection is crucial for the success of the analysis as well. A poor choice of data, or performing analysis on a dataset that is not representative of the system, will lead to models that deviate from the system under study. Better variety, density, and volume of relevant data result in better learning prospects for the machine.

    3.

    Prepare the data: Once data has been selected and collected, the next stage is to make sure that the data is in the proper format and of good quality. As mentioned earlier, the quality of the data is important for the predictive power of machine learning algorithms. One needs to spend time determining the quality of the data and then take steps to fix issues such as missing data, inconsistent values, and outliers. Exploratory analysis is one method to study the nuances of the data in detail, thereby enriching its relevant content.

    4.

    Train the algorithm: By the time the data has been prepared for analysis, one is likely to have a sense of what one hopes to learn from it. A specific machine learning task leads to the selection of an appropriate algorithm, which will represent the data in the form of a model. The cleaned-up data is split into two parts, train and test: the first part (training data) is used for developing the model, and the second part (test data) is used as a reference. The proportion of the split depends on prerequisites such as the number of input variables and the complexity of the model.

    5.

    Test the algorithm: Each machine learning model results in a biased solution to the learning problem, so it is important to evaluate how well the algorithm has learned. Depending on the type of model used, one can evaluate the accuracy of the model using a test dataset, or one may need to develop measures of performance specific to the intended application. To test the performance of the model, the second part of the data (test data) is used. This step determines the precision of the chosen algorithm based on the desired outcome.

    6.

    Improving the performance: A better test of a model is to observe its performance on data that was not used while building the model. If better performance is needed, it becomes necessary to utilize more advanced strategies to augment the model. This step may involve choosing a different model altogether or introducing more variables to improve the accuracy. Hence, a significant amount of time needs to be spent on data collection and preparation; one may need to supplement with additional data or perform additional preparatory work, as described in step 2 of this process.

    7.

    Deployment: After the above steps are completed, if the model appears to be performing satisfactorily, it can be deployed for the intended task. The successes and failures of a deployed model might even provide additional data for the next generation of model.

    Fig. 1.3 Machine learning process

    The above steps 1–7 are used iteratively during the development of an algorithm.
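    Steps 4 and 5 can be illustrated with a toy, dependency-free sketch: a 70/30 train/test split and a deliberately simple nearest-mean classifier, both hypothetical stand-ins for whatever algorithm a real problem calls for.

```python
import random

# Toy labeled 1-D data: values below 5 are class "a", values above are "b".
data = [(x / 10.0, "a") for x in range(0, 50)] + \
       [(x / 10.0, "b") for x in range(51, 101)]
random.seed(0)
random.shuffle(data)

# Step 4: split the cleaned-up data into train and test portions (70/30).
cut = int(0.7 * len(data))
train, test = data[:cut], data[cut:]

# "Training" here just computes the mean of each class; the model is the
# pair of means (a nearest-mean classifier).
means = {}
for label in ("a", "b"):
    values = [x for x, lbl in train if lbl == label]
    means[label] = sum(values) / len(values)

def predict(x):
    # Assign x to the class whose mean is closest.
    return min(means, key=lambda label: abs(x - means[label]))

# Step 5: evaluate on the held-out test data the model has never seen.
accuracy = sum(predict(x) == label for x, label in test) / len(test)
print(f"test accuracy = {accuracy:.2f}")
```

    The held-out accuracy, not the fit on the training data, is what drives step 6: if it is unsatisfactory, one revisits the data or tries a different model.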

    1.7 Paradigms of Learning

    Computers learn from data in many different ways, depending upon what we are trying to accomplish. The No Free Lunch theorem is famous in machine learning: it states that there is no single algorithm that will work well for all problems. Each problem has its own characteristics/properties, and there are many algorithms and approaches to suit each problem with its individual quirks. Broadly speaking, there are three types of learning paradigms:

    Supervised learning

    Unsupervised learning

    Reinforcement learning

    Each form of machine learning takes a different approach, but they all follow an underlying iterative process comparing actual vs. desired output, as shown in Fig. 1.4.

    Fig. 1.4 Three types of learning paradigms

    1.7.1 Supervised Machine Learning

    Supervised learning, as shown in Fig. 1.5, is the most popular paradigm for machine learning. It is very similar to teaching a child with flash cards: if you are learning a task under supervision, someone is judging whether you are getting the right answers. Similarly, supervised learning means having a fully labeled dataset while training an algorithm. Fully labeled means that each observation in the dataset is tagged with the answer that the algorithm should learn. Supervised learning is thus a form of machine learning in which input is mapped to output using labeled data, i.e., input–output pairs. In this case, we know the expected response, and the model is trained with a teacher; it is imperative to provide both inputs and outputs to the computer for it to learn from the data. The computer generates a function based on the data that can be used for prediction on unseen data. Once trained, the model will be able to observe a new, never-seen-before example and predict an outcome for it. The trained model no longer expects the target; it will try to predict the most likely outcome from a new set of observations. The solution can use classification or regression, depending on the type of the target.

    Fig. 1.5 A supervised learning model

    Depending upon the nature of the target, supervised learning can be useful for classification as well as regression type of problems.

    If target y takes values in a fixed set of categorical outcomes (e.g., male/female, true/false), the task of predicting y is called classification.

    If target y has continuous values (e.g., representing a price or a temperature), the task of predicting y is called regression.
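    This distinction can be captured in a small helper function. The rule below (strings or booleans mean classification, numbers mean regression) is a deliberate simplification for illustration, not a complete test:

```python
def task_type(targets):
    # Categorical targets (here: strings or booleans) call for classification;
    # continuous numeric targets call for regression. This heuristic is a
    # deliberate simplification for illustration.
    if all(isinstance(t, (str, bool)) for t in targets):
        return "classification"
    return "regression"

print(task_type(["male", "female", "male"]))  # → classification
print(task_type([21.5, 18.0, 30.2]))          # → regression
```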

    1.7.2 Unsupervised Machine Learning

    Unsupervised learning is the opposite of supervised learning: it uses no labels. Instead, the machine is provided with just the inputs to develop a model, as shown in Fig. 1.6. It is a learning method without a target/response; the machine learns through observations and finds structures in the data. Here the task of the machine is to group unsorted information according to similarities, patterns, and differences without any prior training. Unlike supervised learning, no teacher is provided, which means no training answers are given to the machine; it is therefore restricted to finding hidden patterns in unlabeled data. An example would be customer segmentation, or clustering. What makes unsupervised learning an interesting area is that an overwhelming majority of the data in our world is unlabeled. Having intelligent algorithms that can take terabytes of unlabeled data and make sense of it is a huge source of potential profit in many industries. This is still a relatively unexplored field of machine learning, and many big technology companies are currently researching it.

    Fig. 1.6 An unsupervised learning model
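    A minimal sketch of what such a model does, assuming a tiny one-dimensional dataset: k-means with k = 2 discovers the two natural groups without ever being told a label.

```python
# Unlabeled 1-D observations that happen to form two natural groups.
points = [1.0, 1.2, 0.8, 1.1, 8.0, 8.3, 7.9, 8.1]
centers = [points[0], points[4]]  # naive initialization from the data

# k-means with k = 2: alternate between assigning each point to its nearest
# center and moving each center to the mean of its assigned points.
for _ in range(10):
    clusters = {0: [], 1: []}
    for p in points:
        nearest = min((0, 1), key=lambda i: abs(p - centers[i]))
        clusters[nearest].append(p)
    centers = [sum(clusters[i]) / len(clusters[i]) for i in (0, 1)]

print(sorted(centers))  # the two group centers discovered without labels
```

    The centers converge near 1.0 and 8.1, the structure that was implicit in the data; nothing in the input said how many points belong to each group or what they mean.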

    1.7.3 Reinforcement Machine Learning

    Reinforcement learning allows a machine to automatically determine the ideal behavior within a specific context, in order to maximize its performance. Reinforcement learning can be looked upon as learning from mistakes, as shown in Fig. 1.7: over time, the learning algorithm learns to make fewer mistakes than it used to. It is very behavior driven.

    Fig. 1.7 Reinforcement learning

    This learning paradigm is like a dog trainer, who teaches the dog how to respond to specific signs, like catching a ball, jumping, or anything else. Whenever the dog responds correctly, the trainer gives it a reward, which can be a bone or a biscuit.

    Reinforcement learning is said to be the hope of artificial intelligence because the potential it possesses is immense for many complex real-life problems, such as self-driving cars.
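    A minimal sketch of this reward-driven loop, with a hypothetical action set and fixed rewards standing in for the environment: the agent starts knowing nothing, tries actions, and its value estimates converge on the behavior that pays best.

```python
# Hypothetical actions and their (fixed) rewards, standing in for the
# environment; in real reinforcement learning, rewards are observed, not known.
rewards = {"sit": 0.2, "jump": 0.5, "catch_ball": 1.0}
estimates = {action: 0.0 for action in rewards}  # the agent's learned values
counts = {action: 0 for action in rewards}

actions = list(rewards)
for step in range(90):
    if step % 10 == 0:
        action = actions[(step // 10) % 3]           # periodically explore
    else:
        action = max(estimates, key=estimates.get)   # otherwise exploit
    reward = rewards[action]                         # feedback, like the treat
    counts[action] += 1
    # Incrementally update the running mean reward for this action.
    estimates[action] += (reward - estimates[action]) / counts[action]

print(max(estimates, key=estimates.get))  # → catch_ball
```

    The agent settles on the highest-reward behavior purely from trial, error, and reward, with no labeled examples of the "right" action.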

    1.7.3.1 Types of Problems in Machine Learning

    As depicted in Fig. 1.8, there are three main types of problems that can be solved using machine learning:

    Fig. 1.8 Types of problems solved using machine learning

    Classification problem: Classification is the process of predicting the class of a given data point. Classification predictive modeling is the task of approximating a mapping function from input variables to discrete output variables, e.g., spam detection in emails and credit card fraud detection. In these cases, we try to draw a boundary between the different classes, as shown in Fig. 1.9. A classifier utilizes some training data to understand how the given input variables relate to the class. The dataset may simply be bi-class (e.g., is an incoming mail spam or non-spam?) or it may be multi-class (e.g., the health status of a patient). Some other examples of classification problems are speech recognition, fraud detection, document classification, etc. There are various ML algorithms for classification that will be discussed later.

    Fig. 1.9 Classification using machine learning

    Regression problem: Regression is the task of predicting the value of a continuously varying variable (e.g., the sale price of a house or the height of a tree) given some input variables (aka the predictors, features, or regressors). A continuous output variable is a real value, such as an integer or floating-point value; these are often quantities such as amounts and sizes. Regression tries to model the data distribution with the best line/hyperplane through the points, as shown in Fig. 1.10. Regression is based on a hypothesis that can be linear, polynomial, nonlinear, etc.; the hypothesis is a function of some hidden parameters and the input values.

    Fig. 1.10 Regression using machine learning

    Clustering: This type of problem involves grouping the inputs into two or more clusters based on similarity, as shown in Fig. 1.11; for example, clustering customers into similar groups based on their spending habits, age, geography, items they buy, etc. This is unsupervised learning, as no target is available in advance.

    Fig. 1.11 Clustering

    Figure 1.12 sums up the differences between regression, classification, and clustering.

    Fig. 1.12 Regression vs. classification vs. clustering

    1.8 Machine Learning in Practice

    Machine learning algorithms are only a small part of what a data analyst or data scientist does in practice. In reality, the actual process often looks like this:

    Start loop

    Understand the domain, prior knowledge, and goals. Start by talking to the domain experts. Often the goals are unclear. One may have to try multiple approaches before starting to implement.

    Data integration, selection, cleaning, and pre-processing. This is often the most time-consuming part, and it is important to have high-quality data. The more data one has, the more work may be required, because the data can be noisy; remember GIGO (garbage in, garbage out).

    Learning models. This is an exciting phase with availability of many tools to experiment with.

    Interpreting results. Sometimes it does not matter how a model works as long as it delivers the results. Some domains require that the model is understandable, so we need to be prepared to be challenged by the experts.

    Consolidating and deploying discovered knowledge. The majority of projects that succeed in the lab may never be used in practice; in such cases, compare the model’s output with the desired results.

    End loop

    Clearly it is not a one-shot process, but an iterative cycle. It also explains that learning happens by observing. The reason is to learn from the gaps between actual and desired results. The loop iterates until we get a model and the results that can be used in practice. Also, incoming data may change, requiring a new loop.

    1.9 Why Use Machine Learning?

    It is important to remember that machine learning (ML) does not offer solutions to every type of problem at hand. There are
