Mixture Models and Applications
About this ebook

This book focuses on recent advances, approaches, theories and applications related to mixture models. In particular, it presents recent unsupervised and semi-supervised frameworks that use mixture models as their main tool. The chapters consider mixture models in the context of several interesting and challenging problems, such as parameter estimation, model selection and feature selection. The goal of this book is to summarize the recent advances and modern approaches related to these problems. Each contributor presents novel research, a practical study, novel applications based on mixture models, or a survey of the literature.
  • Reports advances on classic problems in mixture modeling such as parameter estimation, model selection, and feature selection;
  • Presents theoretical and practical developments in mixture-based modeling and their importance in different applications;
  • Discusses perspectives and challenging future work related to mixture modeling.

Language: English
Publisher: Springer
Release date: Aug 13, 2019
ISBN: 9783030238766

    Book preview

    Mixture Models and Applications - Nizar Bouguila

    Part I: Gaussian-Based Models

    © Springer Nature Switzerland AG 2020

    N. Bouguila, W. Fan (eds.), Mixture Models and Applications, Unsupervised and Semi-Supervised Learning, https://doi.org/10.1007/978-3-030-23876-6_1

    1. A Gaussian Mixture Model Approach to Classifying Response Types

    Owen E. Parsons¹

    (1) University of Cambridge, Cambridge, UK

    Email: oep20@cam.ac.uk

    Abstract

    Visual perception is influenced by prior experiences and learned expectations. One example of this is the ability to rapidly resume visual search after an interruption to the stimuli. The occurrence of this phenomenon within an interrupted search task has been referred to as rapid resumption. Previous attempts to quantify individual differences in the extent to which rapid resumption occurs across participants relied on an operationally defined cutoff criterion to classify response types within the task. This approach is potentially limited in its accuracy and could be improved by turning to data-driven alternatives for classifying response types. In this chapter, I present an alternative approach to classifying participant responses on the interrupted search task by fitting a Gaussian mixture model to response distributions. The parameter estimates obtained from fitting this model can then be used in a naïve Bayesian classifier to allow for probabilistic classification of individual responses. The theoretical basis and practical application of this approach are covered, detailing the use of the Expectation-Maximisation algorithm to estimate the parameters of the Gaussian mixture model as well as the application of a naïve Bayesian classifier to the data and the interpretation of the results.

    Keywords

    Visual search · Interrupted search · Rapid resumption · Prior expectations · Attention · Gaussian mixture models · Expectation-maximisation

    1.1 Background

    1.1.1 The Influence of Prior Information During Interrupted Visual Search

    Visual perception is widely regarded to involve processes of unconscious inference about the state of the external world which act upon incoming noisy sensory information [1]. Hermann von Helmholtz was an early pioneer of the view that visual perception involves higher order processing of ambiguous retinal images. He suggested that vision was a process of finding the most likely state of visual stimuli based on both the sensory information being received and the previous experiences of the observer [2]. This view of vision, as a process of testing hypotheses about the state of the world, has since been strongly advocated and perception is now understood to be heavily influenced by our expectations of the external environment [3]. These expectations help to solve any ambiguities in the incoming sensory information and enable us to process visual scenes in a fast and efficient way.

    Prior expectations have been shown to influence performance during visual search tasks [4–9]. One particular set of studies, which were carried out by Lleras and colleagues, demonstrated how periodically removing the search display during visual search tasks results in a unique distribution of response times [4, 10, 11]. These results illustrated the effects of previously acquired information on search performance. The initial paradigm within these studies required participants to complete a visual search task in which the search display was only visible for short intervals, while being intermittently interrupted by a blank screen [4]. By separating responses into those which occurred after a single presentation of the search display and those which occurred after two or more presentations, the authors found that the distributions of these two response types were distinct. Responses which immediately followed the first presentation of the search display showed a typical unimodal distribution, with all responses occurring after 500 ms from the onset of the search display. However, responses that followed subsequent presentations of the search display showed a clear bimodal distribution with a large proportion of responses occurring within 500 ms of the most recent presentation of the search display. This was interpreted as evidence for a predictive aspect of visual processing in the latter response type, as participants were able to use information acquired from previous exposures of the search display to facilitate their search performance on subsequent presentations.

    Lleras and colleagues built on this initial finding by carrying out a number of different manipulations to the original task design in order to better understand the mechanisms of this phenomenon and to rule out alternative explanations for their results [4]. First, they implemented an adaptation of the original paradigm in which the participants had to search for two separate targets in parallel which occurred within distinct search displays that alternated on each presentation. This version of the task produced similar response distributions from participants as the original task, which provided evidence that the results they found in the original task were not simply the product of delayed responses following previous presentations of the display. The authors also experimented with increasing the display time of their search display from 100 ms to 500 ms, which resulted in a stronger influence of prior information on search performance as the participants had longer to accumulate visual information.

    Importantly, they were able to rule out the possibility that the effects they observed in the original study were due to a confirmation bias. This refers to a potential strategy whereby participants would withhold their response following the initial presentation of the search display until they could confirm their decision after viewing a subsequent presentation. The authors assessed whether this strategy was adopted by participants by inserting catch trials into the task (20% of trials) in which the search display did not reappear following the initial presentation. The absence of further presentations of the search display forced participants to respond once they realised that they weren't going to be presented with any additional information. The results from this version of the task indicated that responses which occurred during these catch trials were likely to have been generated by random guessing, suggesting that a confirmation strategy was unlikely to have been the cause of the observed results in the original task.

    1.1.2 Quantifying Individual Differences During the Interrupted Search Task

    It is common that distributions of responses which are obtained from single-condition behavioural tasks (tasks in which the behavioural paradigm is consistent across all trials) are assumed to be a result of a single underlying cognitive process. Distinct cognitive processes are more commonly seen in multiple-condition tasks where two types of condition are presented to participants. A classic example of a multiple-condition task is the Posner cueing task, in which trials may either have valid or invalid cues [12]. In tasks such as this, the data are normally stratified by the type of task condition to allow for statistical comparison. This is straightforward in multiple-condition tasks where the different response types occur as a direct result of task manipulation. However, a different approach is required in the case of single-condition tasks, such as the interrupted search task, as different response types occur throughout the task independently of any task manipulation. This means there are no directly observable labels that indicate which response type occurred in any given trial.

    Previous attempts have been made to classify response types during the interrupted search task in order to quantify the effects of rapid resumption across individual participants. Lleras and colleagues carried out a subsequent study which looked at whether there were age-related differences in the extent to which individuals showed the effects of rapid resumption [10]. In their study, they focused on responses that occurred after subsequent presentations of the search display (in which rapid resumption could occur) and discarded responses that occurred immediately after the first presentation of the search display (in which rapid resumption could not occur). They classified trials where rapid resumption was thought to have occurred using a cutoff value of 500 ms, which was based on their operational definition of rapid resumption. This allowed for a comparison to be made between the reaction time distributions of the two different response types and for the relative proportion of each response type to be calculated. Using this method, they were able to calculate the ratio of trials in which rapid resumption did and did not occur and then used this to assess for age-related effects. While they found increasing age led to an improvement in overall visual search performance, they were unable to find an association between age and the extent to which participants displayed the effects of rapid resumption.

    The method developed by Lleras and colleagues has some potential issues regarding its validity and suitability for classifying response types in the interrupted search paradigm. First, the defined cutoff used to differentiate between response types is somewhat arbitrary, as it wasn't derived empirically from behavioural data. The cutoff used in this approach was chosen primarily based on visual inspection of data [4, 11] and is therefore unlikely to allow for optimal labelling of the different response types. By using more sophisticated statistical methods, empirical data could be used to classify response types more accurately. Second, the use of a cutoff point leads to binary classifications that might lose some of the richness of the behavioural data. To further illustrate the potential variance in performance that this method fails to capture, I generated simulated data for 3 different hypothetical response distributions (see Fig. 1.1). These 3 response distributions were created using distinct underlying generative models. Distributions (a) and (c) were each drawn from single Gaussians. While both of these had a mean reaction time of μ = 0.5, they had differing standard deviations of σ = 0.07 and σ = 0.3, respectively. Distribution (b) was drawn from a mixture of two Gaussians with the same standard deviation (σ = 0.1) but different means (μ = 0.25 and μ = 0.75). Using the approach by Lleras and colleagues [10] to classify these different distributions (in terms of the proportion of rapid resumption responses that they contain) gives the same value (0.5) for all 3 distributions. As they clearly have distinct underlying generative models, this result highlights how this method fails to capture certain types of variation in response distributions that may be indicative of differences in performance on the task.


    Fig. 1.1

    Simulated reaction time distributions. Distribution (a) was drawn from a single Gaussian of μ = 0.5 and σ = 0.07. Distribution (b) was drawn equally (λ = 0.5) from a mixture of two Gaussians with the same variance (σ = 0.1) but different means (μ = 0.25 and μ = 0.75). Distribution (c) was drawn from a single Gaussian of μ = 0.5 and σ = 0.3. The ratio of rapid to non-rapid responses, using the method suggested by Lleras et al. [10], is shown in the top right corner of each plot
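
    To make this limitation concrete, the comparison in Fig. 1.1 can be reproduced with a short simulation. The sketch below is illustrative only: it assumes NumPy is available, expresses reaction times in seconds so that the 500 ms cutoff becomes 0.5, and uses arbitrary sample sizes; it is not the code used to generate the published figure.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
cutoff = 0.5  # the 500 ms cutoff, with reaction times expressed in seconds

# (a) single Gaussian: mean 0.5, sd 0.07
dist_a = rng.normal(0.5, 0.07, n)
# (b) equal mixture (lambda = 0.5) of Gaussians at 0.25 and 0.75, both sd 0.1
from_first = rng.random(n) < 0.5
dist_b = np.where(from_first, rng.normal(0.25, 0.1, n), rng.normal(0.75, 0.1, n))
# (c) single Gaussian: mean 0.5, sd 0.3
dist_c = rng.normal(0.5, 0.3, n)

for label, rt in zip("abc", (dist_a, dist_b, dist_c)):
    ratio = np.mean(rt < cutoff)  # proportion labelled "rapid" by the cutoff rule
    print(f"distribution ({label}): cutoff-based ratio = {ratio:.2f}")
```

    All three distributions yield a cutoff-based ratio close to 0.5, despite their clearly different generative structure.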

    1.1.3 An Alternative Approach to Classifying Response Types During Interrupted Search

    One way in which the evaluation of performance in the interrupted search task could be improved is through the use of an empirical, data-driven approach to classify response types. This chapter presents a novel method which uses behavioural data to drive response classification. Considering the data obtained from the interrupted search task, the overall distribution of responses can be viewed as being composed of two separate distributions. When distributions are derived from two or more distinct processes, the underlying probabilistic structure can be captured using mixture modelling [13]. Based on the evidence put forward by Lleras and colleagues [4, 11], there is strong reason to believe that there are two distinct response types that occur within the interrupted search paradigm: (1) responses which involve rapid resumption and (2) responses which do not.

    In terms of the true underlying cognitive mechanisms responsible for the different response types, there is no direct way of observing which response type occurred in any given trial. Therefore, the response type can be described as a latent variable (or a hidden variable), a variable which is not directly observable but can be inferred from other observed variables. The main observed variable that can be used in the present study is reaction time. The method used by Lleras and colleagues was essentially a way of using a simple classification rule to infer the latent variable, response type, from the observed variable, reaction time. The main concern with this approach, as outlined earlier, is the suitability of the classification rule used to infer the latent variable from the observed data. Here, I present a novel data-driven approach that uses reaction times from trials to infer the most likely response type for any given trial.

    1.1.4 Aims of This Chapter

    This chapter aims to clearly present a method of applying the Expectation-Maximisation algorithm to fit a Gaussian mixture model to behavioural data and to demonstrate how this can then be used to classify response types based on the generative process by which they were most likely produced. Here, I will focus primarily on applying the outcomes from this approach to assessing whether the original cutoff point suggested by Lleras and colleagues is valid. The results produced by this novel method will also be compared with the results from the method used by Lleras et al. to assess whether the classifications produced by the two methods differ significantly. However, as outlined above, this approach also has the potential to provide a number of additional advantages, such as individualised modelling of classification criteria and quantification of the confidence of classifications. While I will not apply these approaches in the present chapter, their additional benefits will be considered during the discussion.

    1.2 Methods

    1.2.1 Data Collection

    The dataset presented here was collected as part of a larger study which used a reproduced version of the original task presented by Lleras et al. [4]. A summary of the procedures used for this experiment is presented in the appendix. For the present analysis, only participant responses that occurred following subsequent presentations of the search display were included.

    1.2.2 Overview of Approach

    This alternative approach to estimating the latent variable from the observed data is based on extracting the parameters of the separate unimodal distributions of the different response types and then using these parameters to calculate which distribution was more likely to have generated each individual response. The outline of this approach is shown in Fig. 1.2. The overall response distribution for the combined distributions is assumed to be bimodal, as illustrated by Fig. 1.2a. The first step is to estimate the distribution parameters of the two individual Gaussian distributions that would generate data similar to the observed bimodal distribution. This step is shown in Fig. 1.2b. Once these parameters have been estimated, individual data points can be assessed to determine which of the two Gaussians they were more likely to have been generated by. Two example data points, x i and x j, are shown in Fig. 1.2c. Both of these example data points are more likely to have been generated by the rightmost Gaussian distribution, as indicated in Fig. 1.2d. One additional advantage of the new approach is that the relative likelihood that a data point was generated by one distribution rather than the other can also be quantified. In this instance, x j is more likely than x i to have been generated by the highlighted Gaussian. The exact details and methodology of the approach are outlined below.


    Fig. 1.2

    Demonstration of the procedure used to classify data generated by a bimodal distribution. Diagram (a) shows a hypothetical bimodal distribution. A Gaussian mixture model can be used to estimate the parameters of the different components of the bimodal distribution, as shown in diagram (b). These can be used to label data points such as x i and x j based on which distribution they were most likely to have been drawn from, as shown in diagrams (c) and (d)
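
    Before walking through the individual steps manually, it may help to see the whole pipeline of Fig. 1.2 in miniature. The following sketch uses scikit-learn's GaussianMixture purely to illustrate the fit-then-classify idea; the synthetic data and variable names are placeholders, and this is not the implementation used in the chapter, which derives the Expectation-Maximisation updates explicitly below.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Illustrative synthetic reaction times (seconds) standing in for observed data.
rng = np.random.default_rng(1)
reaction_times = np.concatenate([rng.normal(0.33, 0.15, 500),
                                 rng.normal(0.73, 0.12, 400)])

X = reaction_times.reshape(-1, 1)        # scikit-learn expects a 2-D array
gmm = GaussianMixture(n_components=2, random_state=1).fit(X)

# Fig. 1.2b: estimated component parameters
print("means:", gmm.means_.ravel(), "weights:", gmm.weights_)

# Fig. 1.2c/d: posterior probability of each component for two example points
x_i, x_j = 0.60, 0.85
print(gmm.predict_proba(np.array([[x_i], [x_j]])))
```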

    1.2.3 Gaussian Mixture Models

    One particular example of a latent variable model is the Gaussian mixture model. A mixture model is a latent variable model in which observations are generated from a mixture of two or more distinct generative processes [14]. A Gaussian mixture model is a common example of this, consisting of a mixture of two or more Gaussian distributions. The Gaussian distribution can be expressed as:

    $$\displaystyle \begin{aligned} \mathcal{N}(x \vert \mu , \sigma^{2} ) = \frac{1}{\sigma(2\pi)^{1/2}} \exp \left(-\frac{(x-\mu)^{2}}{2\sigma^{2}} \right) \end{aligned} $$

    (1.1)

    where μ is the expected value (mean) of the dataset x and σ² is its variance. A mixture model can be defined as:

    $$\displaystyle \begin{aligned} p(x| \{\theta_{k}\}) = \sum_{k=1}^{K} \lambda_k p_{k}(x | \theta_k) \end{aligned} $$

    (1.2)

    Here, λ k represents the relative weight of component k (for a model with K components), where $$\sum \lambda _k = 1$$, and p k(x|θ k) represents the density of the respective subpopulation, with θ k referring to the parameter set for component k. Note that this assumes that λ k > 0 for all values of k; otherwise the model contains non-contributive subpopulations which can be ignored. Gaussian mixture models are a specific case of mixture models in which the distributions of the subpopulations are Gaussian. This can be written as:

    $$\displaystyle \begin{aligned} p(x| \{\theta_{k}\}) = \sum \lambda_k \mathcal{N}(x \vert \mu_k , \Sigma_k ) \end{aligned} $$

    (1.3)

    Within the mixture model, each individual Gaussian density $$\mathcal {N}(x \vert \mu _k , \Sigma _k )$$ is referred to as a component of the mixture and has specific values for its mean μ k and covariance Σk. The parameters λ k are the mixing coefficients, which are the relative weights of each distribution within the mixture model. Integrating equation (1.3) with respect to x, while incorporating the fact that both p(x) and each of the individual Gaussian components are normalised, gives:

    $$\displaystyle \begin{aligned} \sum_{k=1}^{K} \lambda_k = 1 \end{aligned} $$

    (1.4)

    By definition, both p(x) ≥ 0 and $$\mathcal {N}(x \vert \mu _k , \Sigma _k ) \geq 0$$. This indicates that λ k ≥ 0 for all values of k. These statements can be combined with Eq. (1.4) to show that the mixing coefficients meet the criteria to be probabilities:

    $$\displaystyle \begin{aligned} 0 \leq \lambda_k \leq 1 \end{aligned} $$

    (1.5)

    It can also be stated across all the components k that:

    $$\displaystyle \begin{aligned} p(x) = \sum_{k=1}^{K}p(k)p(x\vert k) \end{aligned} $$

    (1.6)

    So, it is clear that λ k is equivalent to p(k), which is the prior probability of a data point coming from the k th component. Additionally, the density $$\mathcal {N}(x \vert \mu _{k}, \Sigma _k) = p(x\vert k)$$ can be regarded as the probability of data point x given component k. The properties of the Gaussian mixture distribution are defined by the parameters λ, μ and Σ, which refer to sets containing the parameters of the individual components λ ≡{λ 1, …, λ K}, μ ≡{μ 1, …, μ K} and Σ ≡{ Σ1, …, ΣK}.

    In the present study, there is no direct information that indicates which of the two underlying processes generated any given response. In order to estimate which underlying process is the most likely cause of an individual response, knowledge of the specific characteristics of the distributions of the different subpopulations is required. In the case of a Gaussian mixture model, estimates need to be obtained for the number of subpopulations, K, the characteristics of each Gaussian, μ k and Σk, as well as the relative weight of each subpopulation distribution within the overall population, λ k. A standard approach for estimating parameters such as these is maximum likelihood estimation, which involves finding the parameter values for which the likelihood function is maximised. The log likelihood function can be written as:

    $$\displaystyle \begin{aligned} \log p(X| \lambda, \mu, \Sigma) = \sum_{n=1}^{N} \log \left \{ \sum_{k=1}^{K} \lambda_k \mathcal{N}(x_n \vert \mu_{k}, \Sigma_k) \right \} \end{aligned} $$

    (1.7)

    This equation includes a summation term within the logarithm, so setting its derivative to zero does not yield a closed-form solution. It is therefore necessary to turn to the Expectation-Maximisation algorithm to estimate the parameter values.
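
    To see why, note that the log likelihood in Eq. (1.7) is easy to evaluate for any candidate parameter set but has no closed-form maximiser. A minimal NumPy sketch of the evaluation, with illustrative parameter values, might look like this:

```python
import numpy as np

def gaussian_pdf(x, mu, sigma):
    """Univariate normal density, Eq. (1.1)."""
    return np.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (sigma * np.sqrt(2 * np.pi))

def mixture_log_likelihood(x, lambdas, mus, sigmas):
    """Eq. (1.7): sum over data points of the log of the weighted component densities."""
    densities = sum(l * gaussian_pdf(x, m, s) for l, m, s in zip(lambdas, mus, sigmas))
    return np.sum(np.log(densities))

# Illustrative two-component parameters (not the fitted values from this chapter)
x = np.array([0.21, 0.34, 0.48, 0.71, 0.80])
print(mixture_log_likelihood(x, lambdas=[0.5, 0.5], mus=[0.3, 0.75], sigmas=[0.1, 0.1]))
```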

    1.2.4 Expectation-Maximisation Algorithm

    The Expectation-Maximisation algorithm is an iterative method which can be used to find the maximum likelihood estimate in models that contain latent variables [15]. It works by starting with initial parameter estimates and then iterating through an Expectation Step and a Maximisation Step until the estimates for the parameters converge on a stable solution. The Expectation Step assumes the current parameter estimates are fixed and uses these to compute the expected values of the latent variables in the model. The Maximisation Step takes the expected values of the latent variables and finds updated values for the previous parameter estimates that maximise the likelihood function.

    In the case of a Gaussian mixture model, the Expectation Step assumes that the values of all three sets of parameters for the Gaussians in the model (the means, covariances and mixing coefficients) are fixed and then computes the probability that each given data point was drawn from each of the individual Gaussians in the model. This property, the probability that a data point was drawn from a specific distribution, is referred to as the responsibility of the distribution for a given data point. Once the responsibility values are calculated, the Maximisation Step assumes these responsibilities are fixed and then attempts to maximise the likelihood function across all the model parameters.

    The responsibilities are equivalent to the posterior probabilities for a given component within the model and can be calculated as follows:

    $$\displaystyle \begin{aligned} \gamma(z_k) = p(z_k = 1|x) &= \frac{p(z_k=1) \cdot p(x | z_k = 1)}{\sum_{j=1}^K p(z_j=1) \cdot p(x | z_j=1) } \end{aligned} $$

    (1.8)

    $$\displaystyle \begin{aligned} & = \frac{\lambda_k \cdot \mathcal{N}(x | \mu_k, \Sigma_k)} {\sum_{j=1}^{K} \lambda_j \cdot \mathcal{N}(x | \mu_j, \Sigma_j) } \end{aligned} $$

    (1.9)

    where $$\sum _{j=1}^{K} \lambda _j \cdot \mathcal {N}(x | \mu _j, \Sigma _j)$$ is the normaliser term across all components. The responsibility of a component of the model for a data point is the probability of that data point belonging to the specific Gaussian within the mixture model, weighted by the estimated mixture proportions (λ k) and normalised across components. This is the posterior probability of a specific distribution given the observed data, x. Using this, it is possible to calculate the distribution of the prior mixture weights. The responsibilities can be summed and normalised to estimate the contribution of the individual Gaussians to the observed data:

    $$\displaystyle \begin{aligned} \lambda_k = \frac{1}{N} \sum_i \gamma(z_k) {} \end{aligned} $$

    (1.10)

    The responsibilities of each data point to the different distributions in the model can be used to estimate the mean and variance of the Gaussians:

    $$\displaystyle \begin{aligned} \mu_k = \frac{\sum_i \gamma(z_k) x_i}{\sum_i \gamma(z_k)} \end{aligned} $$

    (1.11)

    and

    $$\displaystyle \begin{aligned} \sigma_k^{2} = \frac{\sum_i \gamma(z_k)(x_i - \mu_k)(x_i - \mu_k)}{\sum_i \gamma(z_k)} {} \end{aligned} $$

    (1.12)

    It would be straightforward to calculate the posteriors for the components within the model if the distribution parameters were known and, similarly, it would be easy to calculate the parameters if the posteriors were known. The Expectation-Maximisation algorithm overcomes this circularity by alternating between fixing either the posteriors or the parameters while maximising the likelihood. Initially, the parameters are fixed and the posterior distribution is calculated for the hidden variables. Then, the posterior distribution is fixed and the parameters are optimised. These steps are repeated in an alternating fashion until the likelihood value converges.

    1.2.5 Estimation of Mixture Model Parameters

    I used the Expectation-Maximisation algorithm to estimate the parameters for the individual distributions of responses where rapid resumption did occur and responses where rapid resumption did not occur. Once the parameters of these two distributions had been estimated, I would be able to not only reclassify all participant responses using an empirically derived criterion but also quantify the relative likelihood of each individual classification. The Expectation-Maximisation algorithm was carried out by initialising the parameters and then iterating through the Expectation and Maximisation Steps until the parameters converged. The individual steps of the Expectation-Maximisation algorithm are detailed below.

    1.2.5.1 Initialisation

    The means μ k, covariances Σk and mixing coefficients λ k were initialised by using the classification method suggested by Lleras and colleagues [10] to label data points across all participants and then estimating the distribution parameters of the two response types from these labels.

    1.2.5.2 Expectation Step

    The responsibilities (posteriors) for the individual components were evaluated using the current estimates for the parameter values:

    $$\displaystyle \begin{aligned} \gamma(z_{nk}) = \frac{\lambda_k \cdot \mathcal{N}(x_n | \mu_k, \Sigma_k)}{\sum_{j=1}^{K} \lambda_j \cdot \mathcal{N}(x_n | \mu_j, \Sigma_j)} \end{aligned} $$

    (1.13)

    1.2.5.3 Maximisation Step

    The parameters were then updated by re-estimating them based on the current values for the responsibilities. This can be done using Eqs. (1.10)–(1.12), giving the following update equations:

    $$\displaystyle \begin{aligned} \mu_k^{\text{new}} = \frac{1}{N_k} \cdot \sum_{n=1}^N \gamma(z_{nk}) \cdot x_n \end{aligned} $$

    (1.14)

    $$\displaystyle \begin{aligned} \Sigma_k^{\text{new}} = \frac{1}{N_k} \sum_{n=1}^N \gamma(z_{nk}) \cdot (x_n-\mu_k) \cdot (x_n-\mu_k)^T \end{aligned} $$

    (1.15)

    $$\displaystyle \begin{aligned} \lambda_k^{\text{new}} = \frac{N_k}{N} \end{aligned} $$

    (1.16)

    where:

    $$\displaystyle \begin{aligned} N_k = \sum_{n=1}^N \gamma(z_{nk}) \end{aligned} $$

    (1.17)

    1.2.5.4 Convergence Criteria

    Convergence was checked for both the model parameters and the log likelihood. The convergence criteria were all set to 10⁻¹⁵. During each iteration of the Expectation-Maximisation algorithm, the updated parameter and log likelihood estimates were compared to the previous estimates to assess whether the change in values met the convergence criteria. The log likelihood was estimated as follows:

    $$\displaystyle \begin{aligned} \log p(X | \mu, \Sigma, \lambda) = \sum_{n=1}^{N} \log \left \{ \sum_{k=1}^{K} \lambda_k \mathcal{N}(x_n \vert \mu_{k}, \Sigma_k) \right \} \end{aligned} $$

    (1.18)

    If any of the parameters or the log likelihood satisfied the convergence criteria, then the algorithm terminated, otherwise the next iteration was started.
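
    Putting Sects. 1.2.5.1–1.2.5.4 together, one possible implementation of the fitting procedure for the univariate, two-component case is sketched below. It assumes NumPy, initialises from the 500 ms cutoff labels as described above, and uses illustrative synthetic data; the function and variable names are assumptions made for this sketch rather than the study's actual analysis code.

```python
import numpy as np

def gaussian_pdf(x, mu, sigma):
    """Univariate normal density, Eq. (1.1)."""
    return np.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (sigma * np.sqrt(2 * np.pi))

def fit_two_component_gmm(x, cutoff=0.5, tol=1e-15, max_iter=10_000):
    # Initialisation (Sect. 1.2.5.1): start from the cutoff-based labels
    rapid = x < cutoff
    groups = (rapid, ~rapid)                      # component 0 = rapid, 1 = slow
    mus = np.array([x[g].mean() for g in groups])
    sigmas = np.array([x[g].std() for g in groups])
    lambdas = np.array([g.mean() for g in groups])
    prev_ll = -np.inf

    for _ in range(max_iter):
        # Expectation Step (Eq. 1.13): responsibilities under the current parameters
        weighted = lambdas * np.stack(
            [gaussian_pdf(x, m, s) for m, s in zip(mus, sigmas)], axis=1)
        gamma = weighted / weighted.sum(axis=1, keepdims=True)

        # Maximisation Step (Eqs. 1.14-1.17): re-estimate the parameters
        n_k = gamma.sum(axis=0)
        new_mus = (gamma * x[:, None]).sum(axis=0) / n_k
        new_sigmas = np.sqrt((gamma * (x[:, None] - new_mus) ** 2).sum(axis=0) / n_k)
        new_lambdas = n_k / len(x)

        # Convergence check (Sect. 1.2.5.4): parameters and log likelihood, Eq. (1.18)
        ll = np.sum(np.log(weighted.sum(axis=1)))
        converged = (np.max(np.abs(new_mus - mus)) < tol
                     or np.max(np.abs(new_sigmas - sigmas)) < tol
                     or np.max(np.abs(new_lambdas - lambdas)) < tol
                     or abs(ll - prev_ll) < tol)
        mus, sigmas, lambdas, prev_ll = new_mus, new_sigmas, new_lambdas, ll
        if converged:
            break
    return mus, sigmas, lambdas

# Illustrative synthetic reaction times (seconds)
rng = np.random.default_rng(2)
rt = np.concatenate([rng.normal(0.33, 0.15, 600), rng.normal(0.73, 0.12, 500)])
print(fit_two_component_gmm(rt))
```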

    1.2.6 Log Probability Ratio

    Once the parameters for the distributions of rapid and non-rapid responses had been estimated, log probability ratios were calculated for all trials for each participant individually. The log probability ratios could be used to classify responses as either rapid or non-rapid, which in turn allowed for an updated calculation of the proportion of rapid responses for all participants. This updated measure will be referred to as RR-Model, which can then be compared to the RR-Basic scores that were calculated using the cutoff method outlined by Lleras and colleagues [10]. Additionally, the log probability ratios allow a measure of the cumulative confidence of classifications to be calculated for individual participants. For the current dataset, the set of latent variables (which refer to the components of the Gaussian mixture model) is Z ≡{z R, z S}, where z R and z S are multinomial vectors such that z R = 1 is a classification of a rapid response and z S = 1 is a classification of a slow response. For any given response x i, the probabilities of the observed response being classified as a rapid response or as a slow (non-rapid) response can be written, respectively, as:

    $$\displaystyle \begin{aligned} P(z_R = 1 \mid x) = \frac{P(x \mid z_R = 1) \, P(z_R = 1)}{P(x)} \end{aligned} $$

    (1.19)

    and

    $$\displaystyle \begin{aligned} P(z_S = 1 \mid x) = \frac{P(x \mid z_S = 1) \, P(z_S = 1)}{P(x)} \end{aligned} $$

    (1.20)

    As a binary classification (two classes) has been used and slow trials are defined as any trials in which rapid resumption has not occurred, it can also be stated that:

    $$\displaystyle \begin{aligned} P(z_R = 1 \mid x) + P(z_S = 1 \mid x) = 1 \end{aligned} $$

    (1.21)

    Equations (1.19) and (1.20) can then be combined with Eq. (1.21). This gives us an equation for the normaliser term P(x):

    $$\displaystyle \begin{aligned} P(x \mid z_R = 1) P(z_R = 1) + P(x \mid z_S = 1) P(z_S = 1) = P(x) \end{aligned} $$

    (1.22)

    This can be rearranged to give the probability that data point x will be classified as a rapid response:

    $$\displaystyle \begin{aligned} P(z_R = 1 \mid x) = \frac{1}{\frac{P(x \mid z_S = 1)P(z_S = 1)}{P(x \mid z_R = 1)P(z_R = 1)} + 1} \end{aligned} $$

    (1.23)

    All the terms within this equation are computable from the observed data. The prior probabilities P(z R = 1) and P(z S = 1) can be estimated from the observed data, and the likelihood terms P(x|z R = 1) and P(x|z S = 1) can then be calculated by assuming that:

    $$\displaystyle \begin{aligned} P(x \mid z_k = 1) = \mathcal{N}(x \mid \mu_k , \sigma_k^2) \end{aligned} $$

    (1.24)

    where z k = 1 indicates the response type (either rapid, z R = 1, or slow, z S = 1) and μ k and $$\sigma _k^2$$ are the estimates for the mean and variance of the given response distribution, which were calculated using the Expectation-Maximisation algorithm. From this it is possible to expand Eq. (1.24) using Eq. (1.1) to calculate the log probability ratio, which is the log of the ratio of the posterior probabilities of the two Gaussian components of the model.

    $$\displaystyle \begin{aligned} \log \frac{P(z_R = 1 | x)}{P(z_S = 1 | x)} &= - \frac{1}{2} \left ( \frac{(x-\mu_r)^{2}}{\sigma_r^2} - \frac{(x-\mu_s)^{2}}{\sigma_s^2} + \log \sigma_r^{2} - \log \sigma_s^{2} \right )\notag\\ &\quad + \log P(z_R = 1) - \log P(z_S = 1) \end{aligned} $$

    (1.25)

    The final form shows the three main components of the log probability ratio: the variance-weighted squared distances from the means $$\left (\frac {(x-\mu _r)^{2}}{\sigma _r^2} - \frac {(x-\mu _s)^{2}}{\sigma _s^2}\right )$$, the log variances $$\left ( \log \sigma _r^{2} - \log \sigma _s^{2}\right )$$ and the difference in log prior probabilities $$\left (\log P(z_R = 1) - \log P(z_S = 1)\right )$$. A log probability ratio of 0 would suggest that the observed response was equally likely to have been generated by either distribution, with positive values suggesting stronger evidence that the response was a rapid response and negative values suggesting that the observed response was a non-rapid response. These values can be accumulated across all responses for each individual participant using a sequential probability ratio test. This approach rests on the assumption that the outcome on the nth trial is independent of the outcome on the (n − 1)th trial. To verify whether this assumption holds, regression analyses can be used to determine whether the previous trial's response type has an effect on the current trial's response type. Additionally, it is worth considering that the accumulation of directional log probability ratios is not entirely informative, as distributions which are evenly balanced across the classification boundary will have values close to zero regardless of the likelihood of the individual trial classifications. Returning to the simulated distributions in Fig. 1.1, accumulation of the directional log probability ratios would still not be able to differentiate between these 3 distributions. Therefore, the absolute values of the log probability ratio for each individual trial could also be considered. These absolute values can be accumulated across both response types combined, to give an overall measure of classification confidence for each participant, or for each of the response types individually, to create 2 distinct within-subjects measures.
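
    As a concrete illustration, Eq. (1.25) and the two summary measures described above can be transcribed as follows; the parameter values and reaction times are placeholders rather than the fitted estimates reported in the next section.

```python
import numpy as np

def log_probability_ratio(x, mu_r, sigma_r, mu_s, sigma_s, p_r, p_s):
    """Eq. (1.25): log posterior odds of a rapid (R) versus slow (S) classification."""
    quad = (x - mu_r) ** 2 / sigma_r ** 2 - (x - mu_s) ** 2 / sigma_s ** 2
    log_var = np.log(sigma_r ** 2) - np.log(sigma_s ** 2)
    return -0.5 * (quad + log_var) + np.log(p_r) - np.log(p_s)

# Placeholder parameters and reaction times (seconds) for one participant
rt = np.array([0.21, 0.46, 0.52, 0.74, 0.93])
lpr = log_probability_ratio(rt, mu_r=0.32, sigma_r=0.15, mu_s=0.73, sigma_s=0.12,
                            p_r=0.55, p_s=0.45)

is_rapid = lpr > 0              # positive values favour the rapid-response component
rr_model = is_rapid.mean()      # proportion of rapid responses (RR-Model)
confidence = np.abs(lpr).sum()  # accumulated absolute log probability ratios
print(rr_model, confidence)
```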

    1.3 Results

    1.3.1 Parameter Estimation of Response Distributions

    The Expectation-Maximisation algorithm was initialised using the values detailed in the methods section. The algorithm found a two-Gaussian fit for the response distribution. The parameters for the two Gaussians were μ a = 0.324, σ a = 0.155 and μ b = 0.73, σ b = 0.119, with a mixing coefficient λ of 0.551. To ensure the parameter estimates were stable, the Expectation-Maximisation algorithm was run 100 times. The algorithm consistently converged on the same values, taking an average of 631.92 iterations (SD = 30.85) to converge. The fit of the estimated Gaussians to the observed data is shown in Fig. 1.3.
