Introduction to Algorithms for Data Mining and Machine Learning

Ebook392 pages2 hours

Introduction to Algorithms for Data Mining and Machine Learning

Name: Introduction to Algorithms for Data Mining and Machine Learning
Author: Xin-She Yang
ISBN: 9780128172179

By Xin-She Yang

Rating: 0 out of 5 stars

()

Read preview

About this ebook

Introduction to Algorithms for Data Mining and Machine Learning introduces the essential ideas behind all key algorithms and techniques for data mining and machine learning, along with optimization techniques. Its strong formal mathematical approach, well selected examples, and practical software recommendations help readers develop confidence in their data modeling skills so they can process and interpret data for classification, clustering, curve-fitting and predictions. Masterfully balancing theory and practice, it is especially useful for those who need relevant, well explained, but not rigorous (proofs based) background theory and clear guidelines for working with big data.

Presents an informal, theorem-free approach with concise, compact coverage of all fundamental topics
Includes worked examples that help users increase confidence in their understanding of key algorithms, thus encouraging self-study
Provides algorithms and techniques that can be implemented in any programming language, with each chapter including notes about relevant software packages

Skip carousel

Mathematics

LanguageEnglish

PublisherAcademic Press

Release dateJun 17, 2019

ISBN9780128172179

Author

Xin-She Yang

Xin-She Yang obtained his DPhil in Applied Mathematics from the University of Oxford. He then worked at Cambridge University and National Physical Laboratory (UK) as a Senior Research Scientist. He is currently a Reader in Modelling and Simulation at Middlesex University London, Fellow of the Institute of Mathematics and its Application (IMA) and a Book Series Co-Editor of the Springer Tracts in Nature-Inspired Computing. He has published more than 25 books and more than 400 peer-reviewed research publications with over 82000 citations, and he has been on the prestigious list of highly cited researchers (Web of Sciences) for seven consecutive years (2016-2022).

Related to Introduction to Algorithms for Data Mining and Machine Learning

Related ebooks

Skip carousel

Practical Machine Learning for Data Analysis Using Python
Ebook
Practical Machine Learning for Data Analysis Using Python
byAbdulhamit Subasi
Rating: 0 out of 5 stars
0 ratings
Deep Learning in Bioinformatics: Techniques and Applications in Practice
Ebook
Deep Learning in Bioinformatics: Techniques and Applications in Practice
byHabib Izadkhah
Rating: 0 out of 5 stars
0 ratings
Computational Learning Approaches to Data Analytics in Biomedical Applications
Ebook
Computational Learning Approaches to Data Analytics in Biomedical Applications
byKhalid Al-Jabery
Rating: 5 out of 5 stars
5/5
Designing Machine Learning Systems with Python
Ebook
Designing Machine Learning Systems with Python
byDavid Julian
Rating: 0 out of 5 stars
0 ratings
Data Science: Concepts and Practice
Ebook
Data Science: Concepts and Practice
byVijay Kotu
Rating: 3 out of 5 stars
3/5
Deep Learning through Sparse and Low-Rank Modeling
Ebook
Deep Learning through Sparse and Low-Rank Modeling
byZhangyang Wang
Rating: 0 out of 5 stars
0 ratings
Neural Data Science: A Primer with MATLAB® and Python™
Ebook
Neural Data Science: A Primer with MATLAB® and Python™
byErik Lee Nylen
Rating: 5 out of 5 stars
5/5
Python Machine Learning: A Practical Beginner's Guide to Understanding Machine Learning, Deep Learning and Neural Networks with Python, Scikit-Learn, Tensorflow and Keras
Ebook
Python Machine Learning: A Practical Beginner's Guide to Understanding Machine Learning, Deep Learning and Neural Networks with Python, Scikit-Learn, Tensorflow and Keras
byBrandon Railey
Rating: 0 out of 5 stars
0 ratings
Pattern Recognition
Ebook
Pattern Recognition
byKonstantinos Koutroumbas
Rating: 4 out of 5 stars
4/5
Pattern Recognition and Machine Learning
Ebook
Pattern Recognition and Machine Learning
byY. Anzai
Rating: 0 out of 5 stars
0 ratings
TensorFlow A Complete Guide - 2019 Edition
Ebook
TensorFlow A Complete Guide - 2019 Edition
byGerardus Blokdyk
Rating: 0 out of 5 stars
0 ratings
Deep Learning for Data Analytics: Foundations, Biomedical Applications, and Challenges
Ebook
Deep Learning for Data Analytics: Foundations, Biomedical Applications, and Challenges
byHimansu Das
Rating: 0 out of 5 stars
0 ratings
Building Machine Learning Systems Using Python: Practice to Train Predictive Models and Analyze Machine Learning Results with Real Use-Cases (English Edition)
Ebook
Building Machine Learning Systems Using Python: Practice to Train Predictive Models and Analyze Machine Learning Results with Real Use-Cases (English Edition)
byDeepti Chopra
Rating: 0 out of 5 stars
0 ratings
Mastering Machine Learning Algorithms - Second Edition: Expert techniques for implementing popular machine learning algorithms, fine-tuning your models, and understanding how they work, 2nd Edition
Ebook
Mastering Machine Learning Algorithms - Second Edition: Expert techniques for implementing popular machine learning algorithms, fine-tuning your models, and understanding how they work, 2nd Edition
byGiuseppe Bonaccorso
Rating: 0 out of 5 stars
0 ratings
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
Ebook
Convolutional Neural Networks in Python: Beginner's Guide to Convolutional Neural Networks in Python
byFrank Millstein
Rating: 0 out of 5 stars
0 ratings
A Greater Foundation for Machine Learning Engineering: The Hallmarks of the Great Beyond in Pytorch, R, Tensorflow, and Python
Ebook
A Greater Foundation for Machine Learning Engineering: The Hallmarks of the Great Beyond in Pytorch, R, Tensorflow, and Python
byDr. Ganapathi Pulipaka
Rating: 0 out of 5 stars
0 ratings
Practical Machine Learning Cookbook
Ebook
Practical Machine Learning Cookbook
byAtul Tripathi
Rating: 0 out of 5 stars
0 ratings
Trends in Deep Learning Methodologies: Algorithms, Applications, and Systems
Ebook
Trends in Deep Learning Methodologies: Algorithms, Applications, and Systems
byVincenzo Piuri
Rating: 0 out of 5 stars
0 ratings
Machine Learning in Action
Ebook
Machine Learning in Action
byPeter Harrington
Rating: 0 out of 5 stars
0 ratings
Building Machine Learning Systems with Python
Ebook
Building Machine Learning Systems with Python
byWilli Richert
Rating: 4 out of 5 stars
4/5
Introduction to Statistical Machine Learning
Ebook
Introduction to Statistical Machine Learning
byMasashi Sugiyama
Rating: 4 out of 5 stars
4/5
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
Ebook
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
byEyal Wirsansky
Rating: 0 out of 5 stars
0 ratings
Machine Learning: A Bayesian and Optimization Perspective
Ebook
Machine Learning: A Bayesian and Optimization Perspective
bySergios Theodoridis
Rating: 3 out of 5 stars
3/5
Machine Learning and Data Mining
Ebook
Machine Learning and Data Mining
byIgor Kononenko
Rating: 3 out of 5 stars
3/5
Data Mining: Practical Machine Learning Tools and Techniques
Ebook
Data Mining: Practical Machine Learning Tools and Techniques
byIan H. Witten
Rating: 4 out of 5 stars
4/5
Hands-On Deep Learning Algorithms with Python: Master deep learning algorithms with extensive math by implementing them using TensorFlow
Ebook
Hands-On Deep Learning Algorithms with Python: Master deep learning algorithms with extensive math by implementing them using TensorFlow
bySudharsan Ravichandiran
Rating: 0 out of 5 stars
0 ratings
Computer Vision and Applications: A Guide for Students and Practitioners,Concise Edition
Ebook
Computer Vision and Applications: A Guide for Students and Practitioners,Concise Edition
byBernd Jahne
Rating: 5 out of 5 stars
5/5
Deep Learning for Medical Image Analysis
Ebook
Deep Learning for Medical Image Analysis
byS. Kevin Zhou
Rating: 4 out of 5 stars
4/5
TensorFlow Machine Learning Cookbook
Ebook
TensorFlow Machine Learning Cookbook
byNick McClure
Rating: 4 out of 5 stars
4/5
Python Deep Learning
Ebook
Python Deep Learning
byValentino Zocca
Rating: 5 out of 5 stars
5/5

Mathematics For You

Skip carousel

Calculus Made Easy
Ebook
Calculus Made Easy
bySilvanus P. Thompson
Rating: 4 out of 5 stars
4/5
Statistics 101: From Data Analysis and Predictive Modeling to Measuring Distribution and Determining Probability, Your Essential Guide to Statistics
Ebook
Statistics 101: From Data Analysis and Predictive Modeling to Measuring Distribution and Determining Probability, Your Essential Guide to Statistics
byDavid Borman
Rating: 4 out of 5 stars
4/5
Real Estate by the Numbers: A Complete Reference Guide to Deal Analysis
Ebook
Real Estate by the Numbers: A Complete Reference Guide to Deal Analysis
byJ Scott
Rating: 0 out of 5 stars
0 ratings
The Little Book of Mathematical Principles, Theories & Things
Ebook
The Little Book of Mathematical Principles, Theories & Things
byRobert Solomon
Rating: 3 out of 5 stars
3/5
My Best Mathematical and Logic Puzzles
Ebook
My Best Mathematical and Logic Puzzles
byMartin Gardner
Rating: 5 out of 5 stars
5/5
Quantum Physics for Beginners
Ebook
Quantum Physics for Beginners
byMax Thomson
Rating: 4 out of 5 stars
4/5
Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics
Ebook
Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics
byGary Smith
Rating: 4 out of 5 stars
4/5
The Math Book: From Pythagoras to the 57th Dimension, 250 Milestones in the History of Mathematics
Ebook
The Math Book: From Pythagoras to the 57th Dimension, 250 Milestones in the History of Mathematics
byClifford A. Pickover
Rating: 3 out of 5 stars
3/5
Flatland
Ebook
Flatland
byEdwin A. Abbott
Rating: 4 out of 5 stars
4/5
Algebra - The Very Basics
Ebook
Algebra - The Very Basics
byMetin Bektas
Rating: 5 out of 5 stars
5/5
The Everything Guide to Pre-Algebra: A Helpful Practice Guide Through the Pre-Algebra Basics - in Plain English!
Ebook
The Everything Guide to Pre-Algebra: A Helpful Practice Guide Through the Pre-Algebra Basics - in Plain English!
byJane Cassie
Rating: 5 out of 5 stars
5/5
The Everything Guide to Algebra: A Step-by-Step Guide to the Basics of Algebra - in Plain English!
Ebook
The Everything Guide to Algebra: A Step-by-Step Guide to the Basics of Algebra - in Plain English!
byChristopher Monahan
Rating: 4 out of 5 stars
4/5
Mental Math Secrets - How To Be a Human Calculator
Ebook
Mental Math Secrets - How To Be a Human Calculator
byRandy Silverman
Rating: 5 out of 5 stars
5/5
The Thirteen Books of the Elements, Vol. 1
Ebook
The Thirteen Books of the Elements, Vol. 1
byEuclid
Rating: 0 out of 5 stars
0 ratings
Summary of The Black Swan: by Nassim Nicholas Taleb | Includes Analysis
Ebook
Summary of The Black Swan: by Nassim Nicholas Taleb | Includes Analysis
byInstaread Summaries
Rating: 5 out of 5 stars
5/5
Basic Math & Pre-Algebra For Dummies
Ebook
Basic Math & Pre-Algebra For Dummies
byMark Zegarelli
Rating: 4 out of 5 stars
4/5
Game Theory: A Simple Introduction
Ebook
Game Theory: A Simple Introduction
byK.H. Erickson
Rating: 4 out of 5 stars
4/5
The Everything Everyday Math Book: From Tipping to Taxes, All the Real-World, Everyday Math Skills You Need
Ebook
The Everything Everyday Math Book: From Tipping to Taxes, All the Real-World, Everyday Math Skills You Need
byChristopher Monahan
Rating: 5 out of 5 stars
5/5
Build a Mathematical Mind - Even If You Think You Can't Have One: Become a Pattern Detective. Boost Your Critical and Logical Thinking Skills.
Ebook
Build a Mathematical Mind - Even If You Think You Can't Have One: Become a Pattern Detective. Boost Your Critical and Logical Thinking Skills.
byAlbert Rutherford
Rating: 5 out of 5 stars
5/5
The Math of Life and Death: 7 Mathematical Principles That Shape Our Lives
Ebook
The Math of Life and Death: 7 Mathematical Principles That Shape Our Lives
byKit Yates
Rating: 4 out of 5 stars
4/5
Algebra I Workbook For Dummies
Ebook
Algebra I Workbook For Dummies
byMary Jane Sterling
Rating: 3 out of 5 stars
3/5
Geometry For Dummies
Ebook
Geometry For Dummies
byMark Ryan
Rating: 5 out of 5 stars
5/5
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
Ebook
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
byS. Deviant
Rating: 4 out of 5 stars
4/5
The Golden Ratio: The Divine Beauty of Mathematics
Ebook
The Golden Ratio: The Divine Beauty of Mathematics
byGary B. Meisner
Rating: 5 out of 5 stars
5/5
Mathematical Thinking - For People Who Hate Math: Level Up Your Analytical and Creative Thinking Skills. Excel at Problem-Solving and Decision-Making.
Ebook
Mathematical Thinking - For People Who Hate Math: Level Up Your Analytical and Creative Thinking Skills. Excel at Problem-Solving and Decision-Making.
byAlbert Rutherford
Rating: 3 out of 5 stars
3/5
Relativity: The special and the general theory
Ebook
Relativity: The special and the general theory
byAlbert Einstein
Rating: 5 out of 5 stars
5/5
Things to Make and Do in the Fourth Dimension: A Mathematician's Journey Through Narcissistic Numbers, Optimal Dating Algorithms, at Least Two Kinds of Infinity, and More
Ebook
Things to Make and Do in the Fourth Dimension: A Mathematician's Journey Through Narcissistic Numbers, Optimal Dating Algorithms, at Least Two Kinds of Infinity, and More
byMatt Parker
Rating: 4 out of 5 stars
4/5
A Mind for Numbers | Summary
Ebook
A Mind for Numbers | Summary
bySummary Station
Rating: 4 out of 5 stars
4/5
Introducing Game Theory: A Graphic Guide
Ebook
Introducing Game Theory: A Graphic Guide
byIvan Pastine
Rating: 4 out of 5 stars
4/5
Is God a Mathematician?
Ebook
Is God a Mathematician?
byMario Livio
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Build Better Machine Learning Models With Confidence By Adding Validation With Deepchecks: A cross-over episode from The Machine Learning Podcast with the team from Deepchecks, exploring the challenges of testing and validating machine learning applications and their work to make it easier.
Podcast episode
Build Better Machine Learning Models With Confidence By Adding Validation With Deepchecks: A cross-over episode from The Machine Learning Podcast with the team from Deepchecks, exploring the challenges of testing and validating machine learning applications and their work to make it easier.
byThe Python Podcast.__init__
0 ratings
0% found this document useful
Graph Analytic Systems with Zachary Hanif - TWiML Talk #188: In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. Zach led a session at Strata called “Network effects: Working with modern...
Podcast episode
Graph Analytic Systems with Zachary Hanif - TWiML Talk #188: In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. Zach led a session at Strata called “Network effects: Working with modern...
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
#51 Francois Chollet - Intelligence and Generalisation
Podcast episode
#51 Francois Chollet - Intelligence and Generalisation
byMachine Learning Street Talk (MLST)
0 ratings
0% found this document useful
Putting Airflow Into Production With James Meickle - Episode 43: Lessons Learned While Building A Data Science Platform With Airflow (Interview)
Podcast episode
Putting Airflow Into Production With James Meickle - Episode 43: Lessons Learned While Building A Data Science Platform With Airflow (Interview)
byData Engineering Podcast
0 ratings
0% found this document useful
Unlocking The Power of Data Lineage In Your Platform with OpenLineage: An interview with Julien Le Dem about the OpenLineage specification and the opportunity that it offers for simplifying the tracking and analysis of data lineage across your data platform.
Podcast episode
Unlocking The Power of Data Lineage In Your Platform with OpenLineage: An interview with Julien Le Dem about the OpenLineage specification and the opportunity that it offers for simplifying the tracking and analysis of data lineage across your data platform.
byData Engineering Podcast
0 ratings
0% found this document useful
Fast.ai, AutoML, and Software Engineering for ML: Jeremy Howard // Coffee Session #47
Podcast episode
Fast.ai, AutoML, and Software Engineering for ML: Jeremy Howard // Coffee Session #47
byMLOps.community
0 ratings
0% found this document useful
101: Quantum Disruption: The Future of Materials Discovery | (ft. Dr. David Muñoz Ramo): By leveraging the power of quantum computing (QC), scientists can quickly identify promising materials (new or existing) for ANY application. QC enables this while saving on hefty lab operation costs, enabling speedy and cheap materials discovery. In...
Podcast episode
101: Quantum Disruption: The Future of Materials Discovery | (ft. Dr. David Muñoz Ramo): By leveraging the power of quantum computing (QC), scientists can quickly identify promising materials (new or existing) for ANY application. QC enables this while saving on hefty lab operation costs, enabling speedy and cheap materials discovery. In...
byIt's a Material World | Materials Science Podcast
0 ratings
0% found this document useful
Privacy Engineering at CMU and Privacy Decision Making with Dr. Lorrie Cranor: Dr. Lorrie Cranor began her career in privacy 25 years ago and has been a professor at Carnegie Mellon University in the School of Computer Science for 19 years. Today, she serves as director and professor for the CMU privacy engineering program.In this ...
Podcast episode
Privacy Engineering at CMU and Privacy Decision Making with Dr. Lorrie Cranor: Dr. Lorrie Cranor began her career in privacy 25 years ago and has been a professor at Carnegie Mellon University in the School of Computer Science for 19 years. Today, she serves as director and professor for the CMU privacy engineering program.In this ...
byPartially Redacted: Data Privacy, Security & Compliance
0 ratings
0% found this document useful
100: Nanotechnology and the Brain: Fundamentals of Neuromorphic Computing | (ft. Dr. Jean Anne Incorvia): Biological brains can accomplish more than modern computing systems while using much less power. However, computers are much better at dealing with computation, while brains are (unsurprisingly) much better at interacting with ever-changing environme...
Podcast episode
100: Nanotechnology and the Brain: Fundamentals of Neuromorphic Computing | (ft. Dr. Jean Anne Incorvia): Biological brains can accomplish more than modern computing systems while using much less power. However, computers are much better at dealing with computation, while brains are (unsurprisingly) much better at interacting with ever-changing environme...
byIt's a Material World | Materials Science Podcast
0 ratings
0% found this document useful
[Bite] Data Science and the Scientific Method
Podcast episode
[Bite] Data Science and the Scientific Method
byDataCafé
0 ratings
0% found this document useful
#34: AI, vaccines and happy sheep With Adam Bohr and Kaveh Memarzadeh
Podcast episode
#34: AI, vaccines and happy sheep With Adam Bohr and Kaveh Memarzadeh
byThe International Business Podcast
0 ratings
0% found this document useful
Podcast Ep. #18 – Prof. Wenbin Yu on the Structure Genome: On this episode I am speaking to Wenbin Yu, who is a professor at the School of Aeronautics and Astronautics of Purdue University and CTO of AnalySwift, a provider of simulation software for composites. Wenbin has achieved many accolades in both the ac...
Podcast episode
Podcast Ep. #18 – Prof. Wenbin Yu on the Structure Genome: On this episode I am speaking to Wenbin Yu, who is a professor at the School of Aeronautics and Astronautics of Purdue University and CTO of AnalySwift, a provider of simulation software for composites. Wenbin has achieved many accolades in both the ac...
byAerospace Engineering Podcast
0 ratings
0% found this document useful
Optimising the Future
Podcast episode
Optimising the Future
byDataCafé
0 ratings
0% found this document useful
42. Will Grathwohl - Energy-based models and the future of generative algorithms
Podcast episode
42. Will Grathwohl - Energy-based models and the future of generative algorithms
byTowards Data Science
0 ratings
0% found this document useful
PhD goal setting and organisation: How is organising a PhD like an episode of Grand Designs?
Podcast episode
PhD goal setting and organisation: How is organising a PhD like an episode of Grand Designs?
byPhD: Addicted to Research
0 ratings
0% found this document useful
215 — Workplace design in the Covid era: Earlier in the year, a report by academics at Cardiff and Southampton Universities found that a majority of people would like to continue working from home in some capacity, even after social distancing is no longer a requirement. But what would a...
Podcast episode
215 — Workplace design in the Covid era: Earlier in the year, a report by academics at Cardiff and Southampton Universities found that a majority of people would like to continue working from home in some capacity, even after social distancing is no longer a requirement. But what would a...
byThe Mind Tools L&D Podcast
0 ratings
0% found this document useful
Europe & Privacy, Why It Matters to Security Pros - Isabelle Roccia - ESW #302: Europe is a global driver for privacy rules and digital legislation. Which means it is also a force to be reckoned with when it comes to enforcement. With privacy and security being so intertwined, this conversation will focus on the current mindset...
Podcast episode
Europe & Privacy, Why It Matters to Security Pros - Isabelle Roccia - ESW #302: Europe is a global driver for privacy rules and digital legislation. Which means it is also a force to be reckoned with when it comes to enforcement. With privacy and security being so intertwined, this conversation will focus on the current mindset...
byEnterprise Security Weekly (Video)
0 ratings
0% found this document useful
Europe & Privacy, Why It Matters to Security Pros - Isabelle Roccia - ESW #302: Europe is a global driver for privacy rules and digital legislation. Which means it is also a force to be reckoned with when it comes to enforcement. With privacy and security being so intertwined, this conversation will focus on the current mindset...
Podcast episode
Europe & Privacy, Why It Matters to Security Pros - Isabelle Roccia - ESW #302: Europe is a global driver for privacy rules and digital legislation. Which means it is also a force to be reckoned with when it comes to enforcement. With privacy and security being so intertwined, this conversation will focus on the current mindset...
bySecurity Weekly Podcast Network (Video)
0 ratings
0% found this document useful
Understanding Graph Database Patterns
Podcast episode
Understanding Graph Database Patterns
byThe Cloudcast
0 ratings
0% found this document useful
37. Sean Knapp - The brave new world of data engineering
Podcast episode
37. Sean Knapp - The brave new world of data engineering
byTowards Data Science
0 ratings
0% found this document useful
Conversation with Dr. Guido Lang, Associate Professor, Quinnipiac University: Guido shares his insights on developing new ideas and bringing them to the market, especially in the technology space. He reinforces the idea that staying focused on solving one particular program first, before trying to scale, is key to a successful new business.
Podcast episode
Conversation with Dr. Guido Lang, Associate Professor, Quinnipiac University: Guido shares his insights on developing new ideas and bringing them to the market, especially in the technology space. He reinforces the idea that staying focused on solving one particular program first, before trying to scale, is key to a successful new business.
byRetail Revolution
0 ratings
0% found this document useful
#338: Site Selection for Clinical Trials
Podcast episode
#338: Site Selection for Clinical Trials
byGlobal Medical Device Podcast powered by Greenlight Guru
0 ratings
0% found this document useful
596 : Topical English Vocabulary Lesson With Teacher Tiffani about Technology Advancements
Podcast episode
596 : Topical English Vocabulary Lesson With Teacher Tiffani about Technology Advancements
bySpeak English with Tiffani Podcast
0 ratings
0% found this document useful
The Philosopher King: James Mickens is a lifelong hacker and a professor at Harvard, and he knows too well where the gaps are when it comes to training computer scientists to think about the consequences of what they build. He takes Cindy and Danny on a journey through his philosophy of making better tech for all.
Podcast episode
The Philosopher King: James Mickens is a lifelong hacker and a professor at Harvard, and he knows too well where the gaps are when it comes to training computer scientists to think about the consequences of what they build. He takes Cindy and Danny on a journey through his philosophy of making better tech for all.
byHow to Fix the Internet
0 ratings
0% found this document useful
EP 35: How Students Can Use AI to Solve Everyday Problems
Podcast episode
EP 35: How Students Can Use AI to Solve Everyday Problems
byEveryday AI Podcast – An AI and ChatGPT Podcast
0 ratings
0% found this document useful
48: Nanobiosensors: Nanotechnology for Next Generation Diagnostics (ft. Dr. Arben Merkoci): Nanotechnology is having a profound impact on the development of a new class of biosensors known as nanobiosensors. Episode 48 reveals the world of nanobiosensors, how they work, and their impactful applications as diagnostic devices. Check out our MSE...
Podcast episode
48: Nanobiosensors: Nanotechnology for Next Generation Diagnostics (ft. Dr. Arben Merkoci): Nanotechnology is having a profound impact on the development of a new class of biosensors known as nanobiosensors. Episode 48 reveals the world of nanobiosensors, how they work, and their impactful applications as diagnostic devices. Check out our MSE...
byIt's a Material World | Materials Science Podcast
0 ratings
0% found this document useful
474 The AI Playbook by Eric Siegel: The AI Playbook: Mastering the Rare Art of Machine Learning Deployment by Eric Siegel ABOUT THE BOOK: In his bestselling first book, Eric Siegel explained how machine learning works. Now, in , he shows how to capitalize on it. The greatest tool...
Podcast episode
474 The AI Playbook by Eric Siegel: The AI Playbook: Mastering the Rare Art of Machine Learning Deployment by Eric Siegel ABOUT THE BOOK: In his bestselling first book, Eric Siegel explained how machine learning works. Now, in , he shows how to capitalize on it. The greatest tool...
byThe Marketing Book Podcast
0 ratings
0% found this document useful
Episode 75: AI in Academia: Research and Writing Tools to Reduce the Struggle
Podcast episode
Episode 75: AI in Academia: Research and Writing Tools to Reduce the Struggle
byThe Struggling Scientists
0 ratings
0% found this document useful
Open Source Software as a Triumph of Information Hiding, Modularity, and Creating Optionality with Dr. Gail Murphy: In this newest episode of The Idealcast, Gene Kim speaks with Dr. Gail Murphy, Professor of Computer Science and Vice President of Research and Innovation at the University of British Columbia. She is also the co-founder, board member, and former Chi...
Podcast episode
Open Source Software as a Triumph of Information Hiding, Modularity, and Creating Optionality with Dr. Gail Murphy: In this newest episode of The Idealcast, Gene Kim speaks with Dr. Gail Murphy, Professor of Computer Science and Vice President of Research and Innovation at the University of British Columbia. She is also the co-founder, board member, and former Chi...
byThe Idealcast with Gene Kim by IT Revolution
0 ratings
0% found this document useful
88: Flexible Electronics for Better Health (ft. Stan Farnsworth): What defines a flexible electronic? Is it just a squishy computer? Plastic with a leaf printed on it? What about artificial tissue? Well, it could be all of these! But what materials science goes into the production of flexible electronics? Today’s g...
Podcast episode
88: Flexible Electronics for Better Health (ft. Stan Farnsworth): What defines a flexible electronic? Is it just a squishy computer? Plastic with a leaf printed on it? What about artificial tissue? Well, it could be all of these! But what materials science goes into the production of flexible electronics? Today’s g...
byIt's a Material World | Materials Science Podcast
0 ratings
0% found this document useful

Skip carousel

Scikit-Learn: The Ultimate Python Library
APC
Article
Scikit-Learn: The Ultimate Python Library
Jul 15, 2019
4 min read
Tensor Flow 101
APC
Article
Tensor Flow 101
Jan 27, 2020
4 min read
The Fundamental Limits of Machine Learning
Nautilus
Article
The Fundamental Limits of Machine Learning
Sep 20, 2016
5 min read
How AI Algorithms Could Help Design New Drugs
Futurity
Article
How AI Algorithms Could Help Design New Drugs
Apr 6, 2017
A new kind of AI algorithm—designed to work with a small amount of data—may be able to assist in the early stages of drug development. Artificially intelligent algorithms can learn to identify amazingly subtle information, enabling them to distinguis
3 min read
Want A Job In Data Science? You Might Have To Take A Standardized Test When Applying
Chicago Tribune
Article
Want A Job In Data Science? You Might Have To Take A Standardized Test When Applying
Jul 10, 2018
3 min read
Upgrade Your Marketing With Machine Learning
Fast Company
Article
Upgrade Your Marketing With Machine Learning
Sep 9, 2019
2 min read
AI And Design: Questions Of Ethics
Architecture Australia
Article
AI And Design: Questions Of Ethics
Mar 4, 2024
Artificial intelligence (AI) is a very old idea, but the term AI and the field of AI as it relates to modern programmable digital computing have taken their contemporary forms in the past 70 years.1Today, we interact with AI technologies constantly,
5 min read
Quantum Jump
Business Today
Article
Quantum Jump
Dec 25, 2018
2 min read
Why We Need To Fear The Risk Of AI Model Collapse
Evening Standard
Article
Why We Need To Fear The Risk Of AI Model Collapse
Dec 17, 2023
4 min read
About the Authors
The European Business Review
Article
About the Authors
Feb 4, 2019
4 min read
Business applications For Quantum computing
Rotman Management
Article
Business applications For Quantum computing
May 1, 2022
COMPUTERS DO ARITHMETIC. Underlying every amazing application of computers today is math, calculated using binary digits or ‘bits.’ The original computers of the early 1950s could perform about 465 multiplications per second — much faster than the ‘h
11 min read
Finding A New Career In AI
APC
Article
Finding A New Career In AI
Mar 23, 2020
4 min read
How And Where You Use Machine-learning
APC
Article
How And Where You Use Machine-learning
Oct 7, 2019
4 min read
STEM Online Learning Resources
BBC Science Focus Magazine
Article
STEM Online Learning Resources
Sep 3, 2020
4 min read
Learning Code
India Today
Article
Learning Code
Feb 1, 2020
2 min read
What Do Academics Think?
The Big Issue Magazine
Article
What Do Academics Think?
May 19, 2023
3 min read
Why a Hedge Fund Started a Video Game Competition
Nautilus
Article
Why a Hedge Fund Started a Video Game Competition
Nov 30, 2017
There’s a weird way in which a hedge fund is a confluence of everything. There’s the money of course—Two Sigma, located in lower Manhattan, manages over $50 billion, an amount that has grown 600 percent in 6 years and is roughly the size of the econo
9 min read
Innovation – Now And For Generations To Come
The European Business Review
Article
Innovation – Now And For Generations To Come
Aug 1, 2022
6 min read
Why It’s Imperative For Universities To Teach AI To All Students
Forbes Africa
Article
Why It’s Imperative For Universities To Teach AI To All Students
Mar 26, 2020
IN FEBRUARY THIS year, telecommunications company Ericsson announced that it had launched artificial intelligence-powered Energy Infrastructure Operations. This energy management system would leverage artificial intelligence (AI) to optimize energy c
3 min read
Machine Learning in Business: Issues for Society
Rotman Management
Article
Machine Learning in Business: Issues for Society
Jan 1, 2020
11 min read
Changing Dynamics of Healthcare Sector - Quantum Computers Taking A Leap
Techfastly
Article
Changing Dynamics of Healthcare Sector - Quantum Computers Taking A Leap
Oct 1, 2021
5 min read
The Future Is Now
Palm Beach Illustrated
Article
The Future Is Now
Aug 19, 2019
5 min read
You Won’t Believe How Well This Algorithm Spots Clickbait
Futurity
Article
You Won’t Believe How Well This Algorithm Spots Clickbait
Aug 29, 2019
3 min read
Facilitating The Future
Facility Management
Article
Facilitating The Future
Oct 14, 2019
FM: Let’s start with a brief overview of your background. Sam Wishart: I lead the fantastic facilities, assets and services team within the infrastructure and operations division at La Trobe University and have for about seven and a half years. Prior
3 min read
How To Make Sense From And With AI ?
The European Business Review
Article
How To Make Sense From And With AI ?
Sep 25, 2021
4 min read
Technology At The Crossroads
AQ: Australian Quarterly
Article
Technology At The Crossroads
Dec 31, 2018
7 min read
Inform And Enhance Your Business With Open Data
PC Pro Magazine
Article
Inform And Enhance Your Business With Open Data
Jun 10, 2021
7 min read
Quantum Simulators An Overview
Techfastly
Article
Quantum Simulators An Overview
Oct 1, 2021
4 min read
Education 2.0: The Destructive Reconstruction of Higher Learning
Rotman Management
Article
Education 2.0: The Destructive Reconstruction of Higher Learning
Jan 1, 2018
8 min read
Quantum Computing and The Rise Of Machine Learning
Techfastly
Article
Quantum Computing and The Rise Of Machine Learning
Oct 1, 2021
2 min read

Related categories

Skip carousel

Reviews for Introduction to Algorithms for Data Mining and Machine Learning

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Introduction to Algorithms for Data Mining and Machine Learning - Xin-She Yang

Introduction to Algorithms for Data Mining and Machine Learning

First edition

Xin-She Yang

Middlesex University, School of Science and Technology, London, United Kingdom

Cover image

Title page

Copyright

About the author

Preface

Acknowledgments

1: Introduction to optimization

Abstract

1.1. Algorithms

1.2. Optimization

1.3. Unconstrained optimization

1.4. Nonlinear constrained optimization

1.5. Notes on software

Bibliography

2: Mathematical foundations

Abstract

2.1. Convexity

2.2. Computational complexity

2.3. Norms and regularization

2.4. Probability distributions

2.5. Bayesian network and Markov models

2.6. Monte Carlo sampling

2.7. Entropy, cross entropy, and KL divergence

2.8. Fuzzy rules

2.9. Data mining and machine learning

2.10. Notes on software

Bibliography

3: Optimization algorithms

Abstract

3.1. Gradient-based methods

3.2. Variants of gradient-based methods

3.3. Optimizers in deep learning

3.4. Gradient-free methods

3.5. Evolutionary algorithms and swarm intelligence

3.6. Notes on software

Bibliography

4: Data fitting and regression

Abstract

4.1. Sample mean and variance

4.2. Regression analysis

4.3. Nonlinear least squares

4.4. Overfitting and information criteria

4.5. Regularization and Lasso method

4.6. Notes on software

Bibliography

5: Logistic regression, PCA, LDA, and ICA

Abstract

5.1. Logistic regression

5.2. Softmax regression

5.3. Principal component analysis

5.4. Linear discriminant analysis

5.5. Singular value decomposition

5.6. Independent component analysis

5.7. Notes on software

Bibliography

6: Data mining techniques

Abstract

6.1. Introduction

6.2. Hierarchy clustering

6.3. k-Nearest-neighbor algorithm

6.4. k-Means algorithm

6.5. Decision trees and random forests

6.6. Bayesian classifiers

6.7. Data mining for big data

6.8. Notes on software

Bibliography

7: Support vector machine and regression

Abstract

7.1. Statistical learning theory

7.2. Linear support vector machine

7.3. Kernel functions and nonlinear SVM

7.4. Support vector regression

7.5. Notes on software

Bibliography

8: Neural networks and deep learning

Abstract

8.1. Learning

8.2. Artificial neural networks

8.3. Back propagation algorithm

8.4. Loss functions in ANN

8.5. Optimizers and choice of optimizers

8.6. Network architecture

8.7. Deep learning

8.8. Tuning of hyperparameters

8.9. Notes on software

Bibliography

Index

Copyright

Academic Press is an imprint of Elsevier

125 London Wall, London EC2Y 5AS, United Kingdom

525 B Street, Suite 1650, San Diego, CA 92101, United States

50 Hampshire Street, 5th Floor, Cambridge, MA 02139, United States

The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, United Kingdom

No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher's permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions.

This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein).

Notices

Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary.

Practitioners and researchers must always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility.

To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein.

Library of Congress Cataloging-in-Publication Data

A catalog record for this book is available from the Library of Congress

British Library Cataloguing-in-Publication Data

A catalogue record for this book is available from the British Library

ISBN: 978-0-12-817216-2

For information on all Academic Press publications visit our website at https://www.elsevier.com/books-and-journals

Publisher: Candice Janco

Acquisition Editor: J. Scott Bentley

Editorial Project Manager: Michael Lutz

Production Project Manager: Nilesh Kumar Shah

Designer: Miles Hitchen

Typeset by VTeX

About the author

Xin-She Yang obtained his PhD in Applied Mathematics from the University of Oxford. He then worked at Cambridge University and National Physical Laboratory (UK) as a Senior Research Scientist. Now he is Reader at Middlesex University London, and an elected Bye-Fellow at Cambridge University.

He is also the IEEE Computer Intelligence Society (CIS) Chair for the Task Force on Business Intelligence and Knowledge Management, Director of the International Consortium for Optimization and Modelling in Science and Industry (iCOMSI), and an Editor of Springer's Book Series Springer Tracts in Nature-Inspired Computing (STNIC).

With more than 20 years of research and teaching experience, he has authored 10 books and edited more than 15 books. He published more than 200 research papers in international peer-reviewed journals and conference proceedings with more than 36 800 citations. He has been on the prestigious lists of Clarivate Analytics and Web of Science highly cited researchers in 2016, 2017, and 2018. He serves on the Editorial Boards of many international journals including International Journal of Bio-Inspired Computation, Elsevier's Journal of Computational Science (JoCS), International Journal of Parallel, Emergent and Distributed Systems, and International Journal of Computer Mathematics. He is also the Editor-in-Chief of the International Journal of Mathematical Modelling and Numerical Optimisation.

Preface

Xin-She Yang

Both data mining and machine learning are becoming popular subjects for university courses and industrial applications. This popularity is partly driven by the Internet and social media because they generate a huge amount of data every day, and the understanding of such big data requires sophisticated data mining techniques. In addition, many applications such as facial recognition and robotics have extensively used machine learning algorithms, leading to the increasing popularity of artificial intelligence. From a more general perspective, both data mining and machine learning are closely related to optimization. After all, in many applications, we have to minimize costs, errors, energy consumption, and environment impact and to maximize sustainability, productivity, and efficiency. Many problems in data mining and machine learning are usually formulated as optimization problems so that they can be solved by optimization algorithms. Therefore, optimization techniques are closely related to many techniques in data mining and machine learning.

Courses on data mining, machine learning, and optimization are often compulsory for students, studying computer science, management science, engineering design, operations research, data science, finance, and economics. All students have to develop a certain level of data modeling skills so that they can process and interpret data for classification, clustering, curve-fitting, and predictions. They should also be familiar with machine learning techniques that are closely related to data mining so as to carry out problem solving in many real-world applications. This book provides an introduction to all the major topics for such courses, covering the essential ideas of all key algorithms and techniques for data mining, machine learning, and optimization.

Though there are over a dozen good books on such topics, most of these books are either too specialized with specific readership or too lengthy (often over 500 pages). This book fills in the gap with a compact and concise approach by focusing on the key concepts, algorithms, and techniques at an introductory level. The main approach of this book is informal, theorem-free, and practical. By using an informal approach all fundamental topics required for data mining and machine learning are covered, and the readers can gain such basic knowledge of all important algorithms with a focus on their key ideas, without worrying about any tedious, rigorous mathematical proofs. In addition, the practical approach provides about 30 worked examples in this book so that the readers can see how each step of the algorithms and techniques works. Thus, the readers can build their understanding and confidence gradually and in a step-by-step manner. Furthermore, with the minimal requirements of basic high school mathematics and some basic calculus, such an informal and practical style can also enable the readers to learn the contents by self-study and at their own pace.

This book is suitable for undergraduates and graduates to rapidly develop all the fundamental knowledge of data mining, machine learning, and optimization. It can also be used by students and researchers as a reference to review and refresh their knowledge in data mining, machine learning, optimization, computer science, and data science.

January 2019 in London

Acknowledgments

Xin-She Yang

I would like to thank all my students and colleagues who have given valuable feedback and comments on some of the contents and examples of this book. I also would like to thank my editors, J. Scott Bentley and Michael Lutz, and the staff at Elsevier for their professionalism. Last but not least, I thank my family for all the help and support.

January 2019

Introduction to optimization

Abstract

This chapter introduces the fundamentals of algorithms in the context of data mining, optimization, and machine learning, including the feasibility, constraints, optimality, Lagrange multipliers, KKT conditions, and gradient-based techniques.

Keywords

Algorithm; data mining; gradient; machine learning; optimization

Chapter Outline

1.1 Algorithms

1.1.1 Essence of an algorithm

1.1.2 Issues with algorithms

1.1.3 Types of algorithms

1.2 Optimization

1.2.1 A simple example

1.2.2 General formulation of optimization

1.2.3 Feasible solution

1.2.4 Optimality criteria

1.3 Unconstrained optimization

1.3.1 Univariate functions

1.3.2 Multivariate functions

1.4 Nonlinear constrained optimization

1.4.1 Penalty method

1.4.2 Lagrange multipliers

1.4.3 Karush–Kuhn–Tucker conditions

1.5 Notes on software

This book introduces the most fundamentals and algorithms related to optimization, data mining, and machine learning. The main requirement is some understanding of high-school mathematics and basic calculus; however, we will review and introduce some of the mathematical foundations in the first two chapters.

1.1 Algorithms

An algorithm is an iterative, step-by-step procedure for computation. The detailed procedure can be a simple description, an equation, or a series of descriptions in combination with equations. Finding the roots of a polynomial, checking if a natural number is a prime number, and generating random numbers are all algorithms.

1.1.1 Essence of an algorithm

, we can use the following iterative equation:

(1.1)

where k .

Example 1

, then we have

(1.2)

Similarly, we have

(1.3)

(1.4)

. The accuracy of this iterative formula or algorithm is high because it achieves the accuracy of five decimal places after four iterations.

due to division by zero.

is equivalent to solving the equation

(1.5)

. We know that Newton's root-finding algorithm can be written as

(1.6)

. Thus, Newton's formula becomes

(1.7)

which can be written as

(1.8)

This is exactly what we have in Eq. (1.1).

Newton's method has rigorous mathematical foundations, which has a guaranteed convergence under certain conditions. However, in general, Eq. .

1.1.2 Issues with algorithms

The advantage of the algorithm given in Eq. , how can we find the other root −2 in addition to +2?

, not −2.

, we have

(1.9)

(1.10)

, not +2.

This highlights a key issue here: the final solution seems to depend on the initial starting point for this algorithm, which is true for many algorithms.

Now the relevant question is: how do we know where to start to get a particular solution? The general short answer is we do not know. Thus, some knowledge of the problem under consideration or an educated guess may be useful to find the final solution.

In fact, most algorithms may depend on the initial configuration, and such algorithms are often carrying out search moves locally. Thus, this type of algorithm is often referred to as local search. A good algorithm should be able to forget its initial configuration though such algorithms may not exist at all for most types of problems.

What we need in general is the global search, which attempts to find final solutions that are less sensitive to the initial starting point(s).

is necessary for some algorithms such as Newton's method given in Eq. . Some modifications are needed.

There are other issues related to algorithms such as the setting of parameters, the slow rate of convergence, condition numbers, and iteration structures. All these make algorithm designs and usage somehow challenging, and we will discuss these issues in more detail later in this book.

1.1.3 Types of algorithms

An algorithm can only do a specific computation task (at most a class of computational tasks), and no algorithms can do all the tasks. Thus, algorithms can be classified due to their purposes. An algorithm to find roots of a polynomial belongs to root-finding algorithms, whereas an algorithm for ranking a set of numbers belongs to sorting algorithms. There are many classes of algorithms for different purposes. Even for the same purpose such as sorting, there are many different algorithms such as the merge sort, bubble sort, quicksort, and others.

using Eq. (1.1), random initial values (both positive and negative) can allow the algorithm to find both roots. In fact, a major trend in the modern metaheuristics is using some randomization to suit different purposes.

For algorithms to be introduced in this book, we are mainly concerned with

Enjoying the preview?

Page 1 of 1

Introduction to Algorithms for Data Mining and Machine Learning

About this ebook

Xin-She Yang

Read more from Xin She Yang

Related authors

Related to Introduction to Algorithms for Data Mining and Machine Learning

Related ebooks

Mathematics For You

Related podcast episodes

Related articles

Related categories

Reviews for Introduction to Algorithms for Data Mining and Machine Learning

What did you think?

Book preview

Introduction to Algorithms for Data Mining and Machine Learning - Xin-She Yang

Table of Contents

Copyright

Notices

About the author

Preface

Acknowledgments

Abstract

Keywords

Algorithm; data mining; gradient; machine learning; optimization

Chapter Outline

1.1 Algorithms

1.1.1 Essence of an algorithm

Example 1

(1.3)

(1.8)

1.1.2 Issues with algorithms

(1.9)

(1.10)

1.1.3 Types of algorithms