An Elementary Introduction to Statistical Learning Theory

About this ebook

A thought-provoking look at statistical learning theory and its role in understanding human learning and inductive reasoning

A joint endeavor from leading researchers in the fields of philosophy and electrical engineering, An Elementary Introduction to Statistical Learning Theory is a comprehensive and accessible primer on the rapidly evolving fields of statistical pattern recognition and statistical learning theory. Explaining these areas at a level and in a way that is not often found in other books on the topic, the authors present the basic theory behind contemporary machine learning and uniquely utilize its foundations as a framework for philosophical thinking about inductive inference.

Promoting the fundamental goal of statistical learning, knowing what is achievable and what is not, this book demonstrates the value of a systematic methodology when used along with the needed techniques for evaluating the performance of a learning system. First, an introduction to machine learning is presented that includes brief discussions of applications such as image recognition, speech recognition, medical diagnostics, and statistical arbitrage. To enhance accessibility, two chapters on relevant aspects of probability theory are provided. Subsequent chapters feature coverage of topics such as the pattern recognition problem, optimal Bayes decision rule, the nearest neighbor rule, kernel rules, neural networks, support vector machines, and boosting.

Appendices throughout the book explore the relationship between the discussed material and related topics from mathematics, philosophy, psychology, and statistics, drawing insightful connections between problems in these areas and statistical learning theory. All chapters conclude with a summary section, a set of practice questions, and a reference section that supplies historical notes and additional resources for further study.

An Elementary Introduction to Statistical Learning Theory is an excellent book for courses on statistical learning theory, pattern recognition, and machine learning at the upper-undergraduate and graduate levels. It also serves as an introductory reference for researchers and practitioners in the fields of engineering, computer science, philosophy, and cognitive science who would like to further their knowledge of the topic.

Language: English
Publisher: Wiley
Release date: June 9, 2011
ISBN: 9781118023464

    Book preview

    An Elementary Introduction to Statistical Learning Theory - Sanjeev Kulkarni

    Copyright © 2010 by John Wiley & Sons, Inc. All rights reserved.

    Published by John Wiley & Sons, Inc., Hoboken, New Jersey.

    Published simultaneously in Canada.

    No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4744. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008, or online at http://www.wiley.com/go/permission.

    Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives or written sales materials. The advice and strategies contained herein may not be suitable for your situation. You should consult with a professional where appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.

    For general information on our other products and services or for technical support, please contact our Customer Care Department within the United States at (800) 762-2974, outside the United States at (317) 572-3993 or fax (317) 572-4002.

    Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic formats. For more information about Wiley products, visit our web site at www.wiley.com.

    Library of Congress Cataloging-in-Publication Data:

    Kulkarni, Sanjeev.

    An elementary introduction to statistical learning theory / Sanjeev Kulkarni, Gilbert Harman.
    p. cm.

    Includes index.

    ISBN 978-0-470-64183-5 (cloth)

    1. Machine learning-Statistical methods. 2. Pattern recognition systems. I. Harman, Gilbert. II. Title.

    Q325.5.K85 2011

    006.3′1–dc22

    2010045223

    Preface

    This book offers a broad and accessible introduction to the relatively new field of statistical learning theory, a field that has emerged from engineering studies of pattern recognition and machine learning, developments in nonparametric statistics, computer science, the study of language learning in linguistics, developmental and cognitive psychology, the philosophical problem of induction, and the philosophy of science and method.

    The book is the product of a very successful introductory course on Learning Theory and Epistemology that we have been teaching jointly in electrical engineering and philosophy at Princeton University. The course is open to all students and has no specific prerequisites other than some analytical skills and intellectual curiosity. Although much of the material is technical, we have found that the main points are both accessible to and appreciated by a broad range of students. In each class, our students have included freshmen through seniors, with majors from the sciences, engineering, humanities, and social sciences.

    The engineering study of pattern recognition is concerned with developing automated systems to discriminate between various inputs in a useful way. How can the post office develop systems to scan and sort mail on the basis of hand-written addresses? How can a manufacturer design a computerized system to transcribe ordinary conversations? Can computers be used to analyze medical images to make diagnoses?

    Machine learning provides an efficient way to approach some pattern recognition problems. It is possible to train a system to recognize handwritten zip codes. Automated systems can interact with users to learn to perform speech recognition. A computer might use machine learning to develop a system that can analyze medical images in the way that experts do.

    Machine learning and pattern recognition are also concerned with the general principles involved in learning systems. Rather than develop algorithms from scratch and in an ad hoc manner for each new application, a systematic methodology can be extremely useful. It is also important to have techniques for evaluating the performance of a learning system. Knowing what is achievable and what is not helps to provide a benchmark and often suggests new techniques for practical learning algorithms.

    These questions are also related to philosophical questions that arise in epistemology. What can we learn and how can we learn it? What can we learn about other minds and the external world? What can we learn through induction?

    The philosophical problem of induction asks how it is possible to learn anything on the basis of inductive reasoning, given that the truth of the premises of inductive reasoning does not guarantee the truth of its conclusion. There is no single solution to this problem, not because there is no solution, but because there are many, depending on what counts as learning. In this book, we explain how various solutions depend on the way the problem of induction is formulated.

    Thus, we hope this book will serve as an accessible introduction to statistical learning theory for a broad audience. For those interested in more in-depth studies of learning theory or practical algorithms, we hope the book will provide a helpful starting point. For those interested in epistemology or philosophy in general, we hope the book will help draw connections to highly relevant ideas from other fields. And for others, we hope the book will help provide an understanding of some deep and fundamental insights from statistical learning theory that are at the heart of advances in artificial intelligence and shed light on the nature and limits of learning.

    We acknowledge with thanks a Curriculum Development Grant from the 250th Anniversary Fund for Innovation in Undergraduate Education from Princeton University. Rajeev Kulkarni gave us extremely useful comments on the whole book, which have greatly improved the result. Joel Predd and Maya Gupta also provided valuable comments on various parts. We have also benefitted from a careful reading by Joshua Harris. We are also grateful to our teaching assistants over the years and to the many students who have discussed the content of the course with us. Thanks!

    Chapter 1

    Introduction: Classification, Learning, Features, and Applications

    1.1 Scope

    In this book we are concerned mainly with pattern classification—classifying an object into one of several categories on the basis of several observations or measurements of the object. The simplest case is classification of an object into one of two categories, but a more general case allows for any finite number of categories.

    A second closely related task is estimation of a real number that is typically related to some property of the object. As in classification, several observations or measurements of the object are available, and our estimate is based on these observations.

    Most of our discussion concerns issues arising about the first task, classification. But we occasionally say something about the second task, estimation. In either case, we are interested in rules for classifying objects or estimating values, given certain observations or measurements. More specifically, we are interested in methods for learning rules for classification or estimation.

    We discuss some concrete examples further below. For now, think about learning to recognize handwritten characters or faces or other objects from visual data. Or, think about the problem of recognizing spoken words. While humans are extremely good at these types of classification problems in many natural settings, it is quite difficult to design automated algorithms for these tasks with performance and robustness anywhere near those of humans.

    Even after more than a half century of effort in fields such as electrical engineering, mathematics, computer science, statistics, philosophy, and cognitive science, humans can still far outperform the best machine learning algorithms that have ever been developed. That said, enormous progress has been made in learning theory, algorithms, and applications. Results in this area are deep and practical and are relevant to a range of disciplines such as those we have mentioned above. Many of the basic ideas are accessible to a broad audience. However, most treatments of this material are at an advanced level, requiring a rather technical background and expertise.

    Our aim in this book is to provide an accessible introduction to this field, either as a first step for those wishing to pursue the subject in more depth, or for those desiring a broad understanding of the basic ideas. For most of the book, we focus on the problem of two-class pattern classification. This problem arises in many useful applications and is sufficiently rich to explain many of the key ideas in the field, yet removes some unnecessary complications. Although many important aspects of learning are not covered by this model, we provide many good references for more depth, generalizations, and other models. We hope this book will serve as a valuable entry point.

    1.2 Why Machine Learning?

    Algorithms for recognizing patterns would be useful in a wide range of problems. This ability is one aspect of artificial intelligence. But one might reasonably ask why we need to design automated methods for learning good rules for classification, as opposed to just figuring out what is a good rule for a given application and implementing it.

    The main reason is that in many applications, the only way we can find a good rule is to use data to learn one. For example, it is very hard to describe exactly what constitutes a face in an image, and therefore it is hard to come up with a classification rule to decide whether or not a given image contains a face. But, given a good learning algorithm, we might be able to present the algorithm with many examples of images of a face and many examples of images without a face, and then let the algorithm come up with a good rule for recognizing whether or not a face is present. There are other benefits of having a learning algorithm as well, such as robustness to errors in assumptions or modelling, reduced need for explicit programming, and adaptation to changing conditions.
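
    To make this concrete, here is a minimal sketch, in Python, of learning a classification rule from labeled examples. It uses the nearest neighbor rule discussed later in the book; the feature vectors and labels are invented purely for illustration and do not come from any real application.

    import numpy as np

    def nearest_neighbor_classify(train_x, train_y, x):
        # Assign x the label of the closest training example (1-nearest-neighbor rule).
        distances = np.linalg.norm(train_x - x, axis=1)
        return train_y[np.argmin(distances)]

    # Labeled examples: feature vectors with their correct classifications
    # (here, 1 = "face present", 0 = "no face"; the numbers are made up).
    train_x = np.array([[0.9, 0.8], [0.8, 0.9], [0.1, 0.2], [0.2, 0.1]])
    train_y = np.array([1, 1, 0, 0])

    # Classify a new, previously unseen object from its feature vector.
    print(nearest_neighbor_classify(train_x, train_y, np.array([0.85, 0.75])))  # prints 1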

    In general, for a classification problem, we want to decide to which of several categories the object belongs on the basis of some measurements of the object. To learn a good rule, we use data that consist of many examples of objects with their correct classification. The following questions immediately arise:

    1. What do we mean by an object and measurements of the object?

    2. In the classification problem, what are the categories to which we assign objects?

    3. In the estimation problem, what are the values we attempt to estimate?

    4. How do we measure the quality of a classification or estimation rule, and what is the best rule we could hope for?

    5. What information is available to use for learning?

    6. How do we go about learning a good classification or an estimation rule?

    We describe the answers to the first three questions in this chapter. To answer the remaining questions, some background material on probability is provided in Chapters 2 and 3. With this background, the answer to the fourth question is discussed in Chapters 4 and 5. The answer to the fifth question is discussed in Chapter 6. The rest of the book is devoted to various aspects of and approaches to the last question.

    1.3 Some Applications

    Before discussing further details, it may be helpful to have some concrete examples in mind. There are a wide range of applications for learning, classification, and estimation. Here we mention just a few.

    1.3.1 Image Recognition

    There are many applications in which the object to be classified is a digital image. The measurements in this case might describe the outputs of each of the pixels in the image. In the case of a black and white image, the intensity of each pixel serves as one measurement. If the image has N × N pixels, then the total number of pixels (and hence measurements) is N². In the case of a color image, each pixel can be considered as providing three measurements, corresponding to the intensities of each of three color components, say RGB values. Hence, for an N × N color image, there are 3N² measurements.
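
    As a minimal sketch of the arithmetic above, the following Python fragment flattens an N × N grayscale image and an N × N color image into feature vectors of length N² and 3N², respectively. The pixel values here are random placeholders, not a real image.

    import numpy as np

    N = 4
    gray = np.random.rand(N, N)          # N x N grayscale image: one intensity per pixel
    color = np.random.rand(N, N, 3)      # N x N color image: three intensities (RGB) per pixel

    gray_features = gray.reshape(-1)     # N^2 measurements
    color_features = color.reshape(-1)   # 3 * N^2 measurements

    print(gray_features.shape)           # (16,)  = N^2
    print(color_features.shape)          # (48,)  = 3 * N^2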

    Depending on the application, there are many classification tasks based on using these measurements. Face detection or recognition is a common and useful application. In this case, the categories might be face versus no face present, or there might be a separate category for each person in a database of individuals.

    A different application is character recognition. In this case, the writing can be segmented into smaller images that each contain a single character, and the categories might consist of the 26 letters of the alphabet (52 letters, if upper and lower case letters are to be distinguished), the 10 digits, and possibly some special characters (period, question mark, comma, colon, etc.).

    In yet another application, the images might be of industrial parts and the categorization task is to decide whether the current part is defective or not.

    1.3.2 Speech Recognition

    In speech recognition, we are interested in recognizing the words uttered by a speaker. The measurements in this application might be a set of numbers that represent the speech signal. First, the signal is typically segmented into portions that contain distinct words or phonemes. In each segment, the speech signal can be represented in a variety of ways. For example, the signal can be represented by the intensities or energy in different time-frequency bands. Although the details of the signal representation are outside the scope of this book, the signal can be represented ultimately by a set of real values.
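
    The following is a minimal sketch of one such representation: the energy of a short segment in a few frequency bands. The synthetic signal, sampling rate, and band boundaries are all illustrative assumptions, not a prescription for a real recognizer.

    import numpy as np

    fs = 8000                                   # assumed sampling rate in Hz
    t = np.arange(0, 0.2, 1 / fs)               # a 200 ms segment
    # A stand-in for a segment of speech: two sinusoids at different frequencies.
    signal = np.sin(2 * np.pi * 300 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)

    spectrum = np.abs(np.fft.rfft(signal)) ** 2           # power at each frequency
    freqs = np.fft.rfftfreq(len(signal), d=1 / fs)

    band_edges = [0, 500, 1000, 2000, 4000]               # assumed band boundaries in Hz
    features = [spectrum[(freqs >= lo) & (freqs < hi)].sum()
                for lo, hi in zip(band_edges[:-1], band_edges[1:])]

    print(features)   # four real numbers summarizing this segment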

    In the simplest case, the categories might be as simple as deciding whether the utterance is yes versus no. A slightly more complicated task might be to decide which of the 10 digits is uttered. Or there might be a category for each word from a large dictionary of acceptable words and the task might be to decide which, if any, of this large number of words has been uttered.

    1.3.3 Medical Diagnosis

    In medical diagnosis, we are interested in whether or not there is a disease present (and which disease). There is a separate category for each of the diseases under consideration and one category for the case where no disease is present.

    The measurements in this application are typically the results of certain medical tests (e.g., blood pressure, temperature, and various blood tests) or medical diagnostics (such as medical images), presence/absence/intensity of various symptoms, and some basic physical information about the patient (age, sex, weight, etc.).

    On the basis of the results of the measurements, we would like to decide which disease (if any) is present.

    1.3.4 Statistical Arbitrage

    In finance, statistical arbitrage refers to automated trading strategies that are typically of a very short term and involve a large number of securities. In such strategies, one tries to design a trading algorithm for the set of securities on the basis of quantities such as historical correlations among the large set of securities, price movements over recent time horizons, and general economic/financial variables. These can be thought of as the measurements and the prediction can be cast as a classification or estimation problem. In the case of classification, the categories might be buy, sell, or do nothing for each security. In the estimation case, one might try to predict the expected return of each security over some future time horizon. In this case, one typically needs to use the estimates of the expected return to make a trading decision (buy, sell, etc.).

    1.4 Measurements, Features, and Feature Vectors

    As we discussed in Sections 1.1 and 1.3, in classifying an object, we use observations about the object in order to make our decision. For example, when humans wish to classify an object, they might look at the object, pick it up, feel it, listen to it, etc. Or they might use some instruments to measure other properties of the object such as size, weight, and temperature.

    Similarly, when designing a machine to automatically classify (or learn to classify) objects, we assume that the machine has access to measurements of various properties of the object. These measurements come from sensors that capture some physical variables of interest, or features, of the object.

    For simplicity, in this book we model each measurement (or feature) as being captured by a single real number. Although in some applications, certain features may not be very naturally represented by a number, this assumption allows discussion of the most common learning techniques that are useful in the most common applications.

    We assume that all the relevant and available aspects of the objects can be captured in a finite number of measurements/features. This finite set of features can be put together to form a feature vector. Suppose there are d features with the value of the features given by x1, x2, … , xd. The feature vector is x = (x1, x2, … , xd). This feature vector can be thought of as a point or a vector in d-dimensional space R^d, which we call the feature space. Each component of the feature vector, indicating the value of the corresponding feature, is the value along a particular dimension of the feature space.
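
    For instance, here is a minimal sketch of assembling d = 3 measurements of a single object into a feature vector in Python; the particular features (size, weight, temperature) and their values are illustrative assumptions.

    import numpy as np

    # Measurements of one object: size (cm), weight (kg), temperature (deg C).
    x1, x2, x3 = 12.5, 0.31, 37.2
    x = np.array([x1, x2, x3])    # the feature vector x = (x1, x2, x3), a point in R^3

    d = x.shape[0]                # dimension of the feature space
    print(d, x)                   # 3 [12.5  0.31 37.2]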

    In the case of image recognition with an N × N image, the number of features is N² for a black and white image and 3N² for a color image.

    In speech recognition, the number of features is equal to the number of real values used to represent the speech segment to be classified.

    1.5 The Need for Probability

    In most applications, the category of the object is not uniquely and definitively determined by the value of the feature vector. There are some fundamental reasons for this. First, although it would be nice if the measured features capture all the properties of the object important for classification, this is usually not the case. The measured features might fail to capture some important details. This should be clear in the examples given above.

    Second, depending on the application and the specific measurements, the feature values may be noisy. That is, there may be some inherent uncertainty or randomness in the observed values of the features so that even the same object might give rise to different values on different occasions.

    For these reasons, it is helpful to use tools from probability to formulate the problem precisely and guide the solution. In Chapters 2 and 3, we review some of the basic tools from probability that we need for the rest of the book.

    1.6 Supervised Learning

    After providing the necessary background from probability, in Chapter 4, we formulate the pattern recognition problem. In the ideal (and unusual) case, where the underlying probabilistic structure is known, the solution to the classification problem is well known and is a basic result from statistics. This is discussed in Chapter 5.

    However, in the much more typical case in applications, the underlying probability distributions are not known. In this case, we try to overcome this lack of knowledge by resorting to labeled examples as we discuss in Chapter 6. The learning problem, as formulated in Chapter 6, is just one type of machine learning problem known by various terms such as learning from examples, supervised learning, statistical pattern classification, statistical pattern recognition, and statistical learning.

    The term supervised learning arises from the fact that the examples we assume we have access to are properly labeled by a supervisor or teacher. This contrasts with unsupervised learning, in which many examples of objects are available, but the classes to which the objects belong are unknown. There are also other formulations of machine learning problems such as semi-supervised learning and reinforcement learning, as well as many other related problems in statistics, computer science, and other fields. But in this book, we focus exclusively on the case of supervised learning.

    1.7 Summary

    In this chapter, we described the general problems of classification and estimation and discussed several concrete and important applications. We then introduced the terminology of features, feature vectors, and feature space. The need for introducing probability and learning was described.

    We have mentioned both classification and estimation. We focus mainly on classification in this book, with some discussion of extensions to estimation.

    In the next two chapters, we review some principles of probability that are important for aspects discussed in the rest of the book. After this, we formalize the classification (or pattern recognition) problem and discuss general issues in learning from data, before moving on to a discussion of specific learning methods and results.

    1.8 Appendix: Induction

    The appendices at the end of each chapter briefly discuss certain side issues, perhaps of a philosophical nature.

    In this book, we are concerned primarily with inductive learning rather than deductive learning. Deductive learning consists in deriving a new conclusion from premises whose truth guarantees the truth of the conclusion. For example, you might learn that the area of a parallelogram is equal to its base times its height by deducing this from what you already know about rectangles and about how the area of a parallelogram is related to the area of a rectangle with the same base and height. You might then learn that the area of a triangle is equal to its base times half its height, by deducing this from the fact that any triangle is exactly half of a certain parallelogram.
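
    Written out as a small worked equation (in LaTeX notation, assuming only the familiar area formulas mentioned above), the deduction is:

    A_{\text{parallelogram}} = b \cdot h,
    \qquad
    A_{\text{triangle}} = \tfrac{1}{2}\, A_{\text{parallelogram}} = \tfrac{1}{2}\, b \cdot h.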

    Inductive learning consists in reaching a conclusion from evidence that does not guarantee the truth of the conclusion. For example, you might infer from the fact that mail has almost always been delivered before noon on Saturdays up until now to the conclusion that mail will be delivered before noon next Saturday. This is an inductive inference, because the data do not guarantee the truth of the conclusion. Sometimes, the conclusion of an inductive inference is false even though the premises of the inference are all true.

    The philosophical problem of induction asks how one can be justified in believing inductive conclusions from true premises. Certainly, it is not possible to prove deductively that any such inductive conclusion is true if its premises are, since typical inductive inferences do not provide such a guarantee. Even if you are justified inductively in thinking that your mail will be delivered before noon next Saturday, it is compatible with your evidence that your mail is not delivered before noon next Saturday. Induction is not a special case of deduction.

    It might be suggested that induction has almost always led to true conclusions in the past, so it is reasonable to conclude that it will almost always lead to true conclusions in the future. The objection to this suggestion is that this is circular reasoning: we are assuming that induction is justified in order to argue that induction is justified!

    On the other hand, is it possible to offer a noncircular justification of deduction? Wouldn't any such justification take the form of a deductive argument and so also be circular?

    It will emerge that statistical learning theory provides partial deductive mathematical justifications for certain inductive methods, given certain assumptions.

    Questions

    1. What is a feature space? What do the dimensions of such a space represent? What is a vector? What is a feature vector?

    2. If we want to use the values of F different features, in order to classify objects, where each feature can have any of G different values, what is the dimension of the feature space?

    3. For a 12 × 12 grayscale image (256 grayscale levels), how many dimensions are there for the feature vector? How many different possible feature vectors are there?

    4. Is classification a special case of estimation? What differences are there between typical cases of classification and typical cases of estimation?
