Mixtures: Estimation and Applications
Ebook, 570 pages (5 hours)


About this ebook

This book uses the EM (expectation maximization) algorithm to simultaneously estimate the missing data and unknown parameter(s) associated with a data set. The parameters describe the component distributions of the mixture; the distributions may be continuous or discrete.

The editors provide a complete account of the applications, mathematical structure and statistical analysis of finite mixture distributions, along with MCMC computational methods, together with a range of detailed discussions covering the applications of the methods, and they feature chapters from the leading experts on the subject. The applications are drawn from a range of scientific disciplines, including biostatistics, computer science, ecology and finance. This area of statistics is important to a range of disciplines, and its methodology attracts interest from researchers in the fields in which it can be applied.

Language: English
Publisher: Wiley
Release date: May 3, 2011
ISBN: 9781119998440


    Book preview

    Mixtures - Kerrie L. Mengersen

    Chapter 1

    The EM algorithm, variational approximations and expectation propagation for mixtures

    D. Michael Titterington

    1.1 Preamble

    The material in this chapter is largely tutorial in nature. The main goal is to review two types of deterministic approximation, variational approximations and the expectation propagation approach, which have been developed mainly in the computer science literature, but with some statistical antecedents, to assist approximate Bayesian inference. However, we believe that it is helpful to preface discussion of these methods with an elementary reminder of the EM algorithm as a way of computing posterior modes. All three approaches have now been applied to many model types, but we shall just mention them in the context of mixtures, and only a very small number of types of mixture at that.

    1.2 The EM algorithm

    1.2.1 Introduction to the algorithm

    Parameter estimation in mixture models often goes hand-in-hand with a discussion of the EM algorithm. This is especially so if the objective is maximum likelihood estimation, but the algorithm is also relevant in the Bayesian approach if maximum a posteriori estimates are required. If we have a set of data D from a parametric model, with parameter θ, probably multidimensional, with likelihood function p(D|θ) and prior density p(θ), then the posterior density for θ is

    p(θ|D) ∝ p(D|θ) p(θ),

    and therefore the posterior mode is the maximiser of p(D|θ) p(θ). Of course, if the prior density is uniform then the posterior mode is the same as the maximum likelihood estimate. If explicit formulae for the posterior mode do not exist then recourse has to be made to numerical methods, and the EM algorithm is a popular general method in contexts that involve incomplete data, either explicitly or by construct. Mixture data fall into this category, with the component membership indicators z regarded as missing values.

    The EM algorithm is as follows. With data D and an initial guess θ^(0) for θ, a sequence of values θ^(1), θ^(2), … is generated by the following double step, which creates θ^(m+1) from θ^(m).

    E-step: Evaluate

    Q(θ; θ^(m)) = Σ_z p(z | D, θ^(m)) log p(θ, z | D).

    M-step: Find θ^(m+1) to maximise Q(θ; θ^(m)) with respect to θ.

    Remarks

    1. In many other incomplete data problems the missing values z are continuous, in which case the summation is replaced by an integration in the above.

    2. Not surprisingly, Q(θ; θ^(m)) is usually very like a complete-data log-posterior, apart from a constant that is independent of θ, so that the M-step is easy or difficult according as calculation of the complete-data posterior mode is easy or difficult.

    3. The usual monotonicity proof of the EM algorithm in the maximum-likelihood context can be used, with minimal adaptation, to show that

    log p(θ^(m+1) | D) ≥ log p(θ^(m) | D).

    Thus, the EM algorithm ‘improves’ at each stage and, provided the posterior density for θ is locally bounded, the values of p(θ^(m) | D) should converge to a local maximum of p(θ | D). The corresponding sequence θ^(m) will also often converge, one hopes to a posterior mode, but convergence may not occur if, for instance, p(θ | D) contains a ridge. The niceties of convergence properties are discussed in detail, for maximum likelihood, in Chapter 3 of McLachlan and Krishnan (1997).
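    The double step above can be written as a short generic loop. The following Python sketch is ours, not the chapter's: e_step, m_step and log_post are hypothetical placeholders for the problem-specific calculations, and the assertion simply checks the monotonicity property of Remark 3.

        # Generic EM loop for computing a posterior mode (a sketch, not library code).
        def em(theta0, e_step, m_step, log_post, max_iter=200, tol=1e-8):
            theta = theta0
            lp_old = log_post(theta)                 # log p(D|theta) + log p(theta)
            for _ in range(max_iter):
                expectations = e_step(theta)         # E-step: E(z | D, theta^(m))
                theta = m_step(expectations)         # M-step: maximise Q(theta; theta^(m))
                lp_new = log_post(theta)
                assert lp_new >= lp_old - 1e-10      # EM 'improves' at each stage
                if lp_new - lp_old < tol:
                    break
                lp_old = lp_new
            return theta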

    1.2.2 The E-step and the M-step for the mixing weights

    Suppose now that the data y_1, …, y_n are a random sample from a distribution with probability density function

    p(y | θ) = Σ_{j=1}^{k} λ_j f_j(y | ϕ_j),

    where λ_1, …, λ_k are the mixing weights, the f_j are the component densities, each corresponding to a subpopulation, and k is finite. The density f_j is parameterised by ϕ_j and the set of all these is to be called ϕ. Often we shall assume that the component densities are of the same type, in which case we shall omit the subscript j from f_j. The complete set of parameters is θ = (λ, ϕ).

    The complete-data joint distribution can be conveniently written as

    p(y, z | θ) = Π_{i=1}^{n} Π_{j=1}^{k} {λ_j f_j(y_i | ϕ_j)}^{z_ij},

    with the help of the indicator notation, where z_ij = 1 if the ith observation comes from component j and z_ij = 0 otherwise. Thus

    log p(y, z | θ) = Σ_i Σ_j z_ij {log λ_j + log f_j(y_i | ϕ_j)}.

    For the E-step of the EM algorithm all that we have to compute are the expectations of the indicator variables. Given θ^(m), we obtain

    w_ij^(m) = E(z_ij | D, θ^(m)) = λ_j^(m) f_j(y_i | ϕ_j^(m)) / Σ_l λ_l^(m) f_l(y_i | ϕ_l^(m)),

    for each i, j. We now have

    Q(θ; θ^(m)) = Σ_j n_j^(m) log λ_j + Σ_i Σ_j w_ij^(m) log f_j(y_i | ϕ_j) + log p(θ) + const.,

    say, where n_j^(m) = Σ_i w_ij^(m), a ‘pseudo’ sample size associated with subpopulation j. (In the case of complete data, n_j is precisely the sample size for subpopulation j.)
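    For concreteness, the E-step quantities can be computed in a few lines. The sketch below is ours: comp_pdfs is an assumed list of callables returning f_j(y_i), and lam holds the current mixing weights.

        # E-step for a k-component mixture: responsibilities w_ij and pseudo sample
        # sizes n_j (a hedged sketch, not the chapter's code).
        import numpy as np

        def responsibilities(y, lam, comp_pdfs):
            dens = np.column_stack([f(y) for f in comp_pdfs])   # n x k matrix of f_j(y_i)
            w = lam * dens                                      # numerators lambda_j f_j(y_i)
            w /= w.sum(axis=1, keepdims=True)                   # normalise over j
            n_pseudo = w.sum(axis=0)                            # pseudo sample sizes n_j
            return w, n_pseudo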

    We now consider the M-step for the mixing weights λ. Before this we make some assumptions about the prior distributions. In particular, we assume that λ and ϕ are a priori independent and that the prior for λ takes a convenient form, namely that which would be conjugate were the data complete. Thus, we assume that, a priori, λ follows a Dirichlet distribution, that is,

    p(λ) ∝ Π_j λ_j^{α_j − 1},

    for prescribed hyperparameters α_1, …, α_k. Clearly, λ^(m+1) must maximise

    Σ_j {n_j^(m) + α_j − 1} log λ_j,

    which is essentially a log-posterior associated with multinomial data. Thus

    λ_j^(m+1) = {n_j^(m) + α_j − 1} / (n + α_0 − k),

    where α_0 = Σ_j α_j and n = Σ_j n_j^(m) is the sample size.

    Example 1

    Mixture of two known densities

    In this simplest case with k = 2, write λ = λ_1, so that λ_2 = 1 − λ. Then the iteration is

    (1.1)   λ^(m+1) = {n_1^(m) + α_1 − 1} / (n + α_1 + α_2 − 2).

    In examples involving mixtures of known densities, this completes the analysis. Otherwise the nature of the M-step for ϕ depends on the model used for the component densities.
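    A compact Python sketch of this case follows. It is illustrative only: the two ‘known’ densities f1 and f2 are arbitrary choices of ours, and alpha1, alpha2 are the hyperparameters of the Be(α_1, α_2) prior on λ.

        # MAP-EM iteration (1.1) for a mixture of two known densities (a sketch).
        import numpy as np
        from scipy.stats import norm

        def em_two_known(y, alpha1=1.0, alpha2=1.0, lam=0.5, n_iter=100):
            f1 = norm(0.0, 1.0).pdf(y)      # illustrative known component densities
            f2 = norm(3.0, 1.0).pdf(y)
            n = len(y)
            for _ in range(n_iter):
                w1 = lam * f1 / (lam * f1 + (1.0 - lam) * f2)                  # E-step
                lam = (w1.sum() + alpha1 - 1.0) / (n + alpha1 + alpha2 - 2.0)  # (1.1)
            return lam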

    1.2.3 The M-step for mixtures of univariate Gaussian distributions

    This is by far the most commonly used finite mixture in practice. Here, for j = 1, …, k, and using the natural notation for means and variances of Gaussian distributions,

    f_j(y | ϕ_j) = N(y; μ_j, σ_j²).

    Thus

    Q(θ; θ^(m)) = Σ_j n_j^(m) log λ_j + Σ_i Σ_j w_ij^(m) log N(y_i; μ_j, σ_j²) + log p(θ) + const.,

    which is very like a sum of k Gaussian log-likelihoods, with the weights w_ij^(m) included.

    For the prior we make the convenient assumptions that the parameters corresponding to the different components are a priori independent, with densities that belong to the appropriate conjugate families. Thus

    p(ϕ) = Π_j p(μ_j, τ_j),

    in which τ_j = σ_j^{−2} denotes the jth component's precision, and

    p(μ_j, τ_j) = p(μ_j | τ_j) p(τ_j),

    where p(μ_j | τ_j) denotes the density of a Gaussian distribution, with precision proportional to τ_j, and p(τ_j) denotes the density of a gamma distribution. Thus, for a given j, μ_j^(m+1) and τ_j^(m+1) are the joint maximisers of

    Σ_i w_ij^(m) log N(y_i; μ_j, τ_j^{−1}) + log p(μ_j, τ_j).

    Straightforward calculus gives μ_j^(m+1) and τ_j^(m+1) in closed form.

    As usual, note the similarity in structure with formulae for posterior modes for Gaussian parameters given complete data.
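    The closed-form updates depend on the precise hyperparameter convention adopted for the Gaussian–gamma prior, which is not reproduced here. The following Python sketch therefore assumes one common convention, μ_j | τ_j ~ N(m0, 1/(κ0 τ_j)) and τ_j ~ Ga(a0, rate b0); the hyperparameter names are ours and may differ from the chapter's.

        # M-step for component j of a univariate Gaussian mixture under an assumed
        # Normal-Gamma prior (a hedged sketch of the 'straightforward calculus').
        import numpy as np

        def gaussian_m_step(y, w_j, m0=0.0, kappa0=1.0, a0=1.0, b0=1.0):
            n_j = w_j.sum()                            # pseudo sample size
            ybar_j = (w_j * y).sum() / n_j             # pseudo sample mean
            ss_j = (w_j * (y - ybar_j) ** 2).sum()     # pseudo within-component sum of squares
            kappa_n = kappa0 + n_j                     # complete-data-style hyperparameters
            m_n = (kappa0 * m0 + n_j * ybar_j) / kappa_n
            a_n = a0 + 0.5 * n_j
            b_n = b0 + 0.5 * ss_j + 0.5 * kappa0 * n_j * (ybar_j - m0) ** 2 / kappa_n
            mu_map = m_n                               # joint mode in mu
            tau_map = (a_n - 0.5) / b_n                # joint mode in tau (requires a_n > 1/2)
            return mu_map, tau_map

    Under this convention the updates are the complete-data posterior-mode formulae with the pseudo quantities n_j, ȳ_j and the weighted sum of squares in place of their complete-data versions, which mirrors the remark above.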

    1.2.4 M-step for mixtures of regular exponential family distributions formulated in terms of the natural parameters

    Here the jth component density takes the regular exponential family form

    f_j(y | ϕ_j) = exp{ϕ_j^T t(y) − ψ(ϕ_j)} a(y),

    and therefore

    Σ_i w_ij^(m) log f_j(y_i | ϕ_j) = ϕ_j^T Σ_i w_ij^(m) t(y_i) − n_j^(m) ψ(ϕ_j) + const.

    Then an appropriate (conjugate) prior for ϕ_j has the form

    p(ϕ_j) ∝ exp{ϕ_j^T ν_j − η_j ψ(ϕ_j)},

    and therefore ϕ_j^(m+1) is the maximiser of

    ϕ_j^T {Σ_i w_ij^(m) t(y_i) + ν_j} − {n_j^(m) + η_j} ψ(ϕ_j).

    Differentiation with respect to ϕ_j leads to the following equation satisfied by ϕ_j^(m+1):

    ∇ψ(ϕ_j^(m+1)) = {Σ_i w_ij^(m) t(y_i) + ν_j} / (n_j^(m) + η_j).

    However, if we parameterise by

    ξ_j = E_{ϕ_j} t(Y),

    in which the expectation is over Y assumed to belong to the jth subpopulation, then ∇ψ(ϕ_j) = ξ_j, so the M-step is just

    ξ_j^(m+1) = {Σ_i w_ij^(m) t(y_i) + ν_j} / (n_j^(m) + η_j).

    Example 2

    Mixture of Poisson distributions

    In this example the jth component density can be written as

    f_j(y | θ_j) = θ_j^y e^{−θ_j} / y!,   y = 0, 1, 2, …,

    so that an appropriate conjugate prior for θ_j is a gamma distribution, Ga(ν_j, η_j) say. Then it is easy to see that the M-step of the EM algorithm leads to

    θ_j^(m+1) = {Σ_i w_ij^(m) y_i + ν_j} / (n_j^(m) + η_j).

    The terms in Q(θ; θ^(m)) that involve θ_j constitute, up to an additive constant, the logarithm of a gamma density function, with mode at the value θ_j^(m+1) given above.
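    As an illustration, one full EM pass for a Poisson mixture can be written as follows; the prior parameterisation (Dirichlet α for the weights, Ga(ν_j, η_j) for the means) follows the notation above, and the code is a sketch of ours rather than the chapter's. The arguments lam, theta, alpha, nu and eta are numpy arrays of length k.

        # One EM iteration for a k-component Poisson mixture with conjugate priors.
        import numpy as np
        from scipy.stats import poisson

        def poisson_em_step(y, lam, theta, alpha, nu, eta):
            dens = np.column_stack([poisson.pmf(y, t) for t in theta])
            w = lam * dens
            w /= w.sum(axis=1, keepdims=True)                      # E-step: w_ij
            n_j = w.sum(axis=0)                                    # pseudo sample sizes
            lam_new = (n_j + alpha - 1.0) / (len(y) + alpha.sum() - len(alpha))
            theta_new = (w.T @ y + nu) / (n_j + eta)               # M-step for the means
            return lam_new, theta_new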

    Example 3

    Mixture of exponential distributions

    In the ‘natural’ parameterisation,

    f_j(y | ϕ_j) = exp{ϕ_j y + log(−ϕ_j)},   y > 0,  ϕ_j < 0,

    so that t(y) = y and ψ(ϕ_j) = −log(−ϕ_j). To fit in with the previous notation, we assume that the prior for ϕ_j is of the conjugate form above, with hyperparameters ν_j and η_j. Thus the M-step is

    ξ_j^(m+1) = −1/ϕ_j^(m+1) = {Σ_i w_ij^(m) y_i + ν_j} / (n_j^(m) + η_j),

    from which ϕ_j^(m+1) can be obtained.

    1.2.5 Application to other mixtures

    Application of EM to various other specific mixture models is discussed in monographs such as Titterington et al. (1985) and McLachlan and Peel (2000), not to mention many individual papers. Included in these special cases is that of hidden Markov models, more precisely called hidden Markov chain models. In a mixture sample the complete data corresponding to the ith observation consist of the observed datum y_i together with the component-membership indicators z_i = (z_i1, …, z_ik); the z_i themselves are missing or ‘hidden’. It is assumed that the ys for different observations are independent, given the zs, and also that the zs are themselves independent. In the hidden Markov model, this second assumption is modified; instead, it is assumed that the z_i come from a homogeneous, first-order Markov chain with states corresponding to the component subpopulations. Thus, dependence is assumed, but it is of the simplest, one-dimensional kind. This model is very popular in areas such as ecology, speech modelling and DNA sequencing, and associated methodology has been developed based on both maximum likelihood (Rabiner, 1989) and the Bayesian approach (Robert et al., 1993). The dependence among the hidden variables leads to additional complications, in comparison to the case of mixture data, but typically not to a severe degree. More precisely, the E-step in the EM algorithm for finding a posterior mode is not explicit, but requires a (terminating) so-called forwards–backwards algorithm.

    1.2.6 EM as a double expectation

    There is an appealing interpretation of EM as a double maximisation rather than as an expectation followed by a maximisation. The idea goes back at least as far as Csiszár and Tusnády (1984) but has more recently been set out clearly, in the context of maximum likelihood estimation, by Neal and Hinton (1999). The version corresponding to calculation of a Bayesian posterior mode with our notation goes as follows. Define

    F(q, θ) = Σ_z q(z) log p(θ, z | D) − Σ_z q(z) log q(z),

    where q is a density function over the missing values, and suppose that we are at stage m in the algorithm.

    The first step is to choose q^(m+1) to maximise F(q, θ^(m)). Since we can write

    F(q, θ^(m)) = log p(θ^(m) | D) − Σ_z q(z) log{q(z) / p(z | D, θ^(m))},

    the solution is to take q^(m+1)(z) = p(z | D, θ^(m)).

    The second step is to choose θ^(m+1) to maximise F(q^(m+1), θ). This amounts to maximising Q(θ; θ^(m)), which is just the EM algorithm.

    It is easy to see that

    F(q^(m+1), θ^(m)) = log p(θ^(m) | D),

    and therefore the above alternating maximisation technique leads to monotonicity:

    (1.2)   log p(θ^(m+1) | D) ≥ F(q^(m+1), θ^(m+1)) ≥ F(q^(m+1), θ^(m)) = log p(θ^(m) | D).
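    The two steps can be checked numerically on a toy example. The sketch below is ours; the joint probabilities are arbitrary illustrative numbers. It evaluates F(q, θ) over a finite set of z-configurations and confirms that q(z) = p(z | D, θ) both maximises F and attains log p(θ | D).

        # Toy illustration of the double-maximisation view of EM.
        import numpy as np

        def free_energy(q, log_joint):
            # F(q, theta) = sum_z q(z) log p(theta, z | D) - sum_z q(z) log q(z)
            return float(np.sum(q * (log_joint - np.log(q))))

        log_joint = np.log(np.array([0.05, 0.20, 0.10, 0.15]))   # log p(theta, z | D), toy values
        q_opt = np.exp(log_joint) / np.exp(log_joint).sum()      # p(z | D, theta)
        q_flat = np.full(4, 0.25)                                # some other q

        assert free_energy(q_opt, log_joint) >= free_energy(q_flat, log_joint)
        # F(q_opt, theta) equals log sum_z p(theta, z | D) = log p(theta | D)
        assert np.isclose(free_energy(q_opt, log_joint),
                          np.log(np.exp(log_joint).sum()))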

    1.3 Variational approximations

    1.3.1 Preamble

    Exact Bayesian analysis of mixture data is complicated, from a computational point of view, because the likelihood function, in expanded form, consists of the sum of a large number of terms. In practice, the use of some form of approximation is inevitable, and much space has been devoted to the idea of approximating the posterior density or predictive density of interest stochastically, in the form of a set of realisations from the distribution, typically created by an MCMC procedure. In principle the resulting inferences will be asymptotically ‘correct’, in that the empirical distribution associated with a very large set of realisations should be very similar to the target distribution. However, practical difficulties may arise, especially for problems involving many observations and many parameters, because of storage costs, computation time, the need to confirm convergence and so on. These considerations have led to the development of deterministic approximations to complicated distributions. In the next two sections we describe two such approaches, based respectively on variational approximations and on expectation propagation. We shall see in this section that, unlike with MCMC methods, the resulting variational approximations are not exact, even asymptotically as the sample size tends to infinity, but they are less unwieldy and this may be sufficiently attractive to outweigh technical considerations. In fact some attractive asymptotic properties do hold, as is discussed in Section 1.3.6.

    1.3.2 Introduction to variational approximations

    The fundamental idea is very natural, namely to identify a best approximating density q to a target density p, where ‘best’ means the minimisation of some ‘distance’ between q and p. There are many measures of distance between density functions, but the one generally used in this context is the Kullback–Leibler directed divergence KL(q ‖ p), defined by

    (1.3)   KL(q ‖ p) = ∫ q(u) log{q(u)/p(u)} du.

    Of course, KL(q ‖ p) is not symmetric in its arguments and is therefore not a metric, but it is nonnegative, and zero only if q and p are the same except on a set of measure zero. Without further constraints, the optimal q is of course p, but in practice the whole point is that the p in question is very complicated and the optimisation is carried out subject to approximating but facilitating constraints being imposed on q. In many applications the target, p, is p(u | D), the conditional density of a set of ‘unobservables’, u, conditional on a set of observed data, D, and q is chosen to minimise

    KL(q ‖ p(· | D)),

    subject to simplifying constraints on q. The same solution yields a lower bound on p(D), the (marginal) probability of the data. This follows because

    log p(D) = KL(q ‖ p(· | D)) + ∫ q(u) log{p(u, D)/q(u)} du,

    as can be seen by combining the two terms on the right-hand side. The properties of the Kullback–Leibler divergence then both provide the desired lower bound, in that

    (1.4)   log p(D) ≥ ∫ q(u) log{p(u, D)/q(u)} du = F(q),

    say, and demonstrate that a q that minimises KL(q ‖ p(· | D)) provides the best lower bound for log p(D). The right-hand side of (1.4), F(q), is known in statistical physics as the free energy associated with q. The optimum q can be interpreted as the maximiser of the free energy, and it is this interpretation that stimulates the methodology described in this chapter.
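    A quick numerical check of the decomposition underlying (1.4) may help fix ideas. In the sketch below (ours), u is discrete with three values and the joint probabilities p(u, D) are arbitrary illustrative numbers; the identity log p(D) = KL(q ‖ p(· | D)) + F(q) and the bound F(q) ≤ log p(D) are verified directly.

        # Verifying log p(D) = KL(q || p(.|D)) + F(q) for a discrete toy example.
        import numpy as np

        p_joint = np.array([0.04, 0.10, 0.06])      # p(u, D) at the observed D (toy values)
        p_D = p_joint.sum()                          # marginal probability p(D)
        p_post = p_joint / p_D                       # p(u | D)

        q = np.array([0.2, 0.5, 0.3])                # a constrained approximating density
        kl = np.sum(q * np.log(q / p_post))          # KL(q || p(.|D)), as in (1.3)
        F = np.sum(q * np.log(p_joint / q))          # free energy, the bound in (1.4)

        assert np.isclose(np.log(p_D), kl + F)
        assert F <= np.log(p_D)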

    In a non-Bayesian context these results permit the approximation of a likelihood function corresponding to a missing-data problem. Here u represents the missing data and (D, u) the complete data. Thus, the target for q is the conditional density of the missing data, given the observed data, evaluated at a specified value θ for the parameters, and (1.4) provides a lower bound for the observed-data loglikelihood, again for the specified value of θ. More discussion of this use of variational approximations, with references, is given in Section 3.1 of Titterington (2004).

    In the Bayesian context, the unobservables include the parameters themselves, as well as any missing values. Thus u = (θ, z), where z denotes the missing data, which for us are the mixture component indicators. The target for q is therefore p(θ, z | D), the posterior density of the parameters and the missing values, and (1.4) provides a lower bound for the marginal loglikelihood for the observed data. This marginal likelihood is called the ‘evidence’ by MacKay (1992) or the Type-II likelihood, and it is a key component of Bayes factors in model-comparison contexts. As we have said, constraints have to be imposed on q in order to create a workable procedure, and the standard approximation is to assume a factorised form,

    q(θ, z) = q_θ(θ) q_z(z),

    for q. We are therefore imposing a (posterior) assumption of independence between the parameters and the missing values. This is clearly a substantive concession, and it is crucial to assess the degree to which the extra computational feasibility counteracts the loss of accuracy. One consequence of the assumption is that the factor q_θ(θ) represents the variational approximation to the ‘true’ posterior, p(θ | D). The lower bound in (1.4) is

    (1.5)   F(q) = Σ_z ∫ q_θ(θ) q_z(z) log{p(θ, z, D) / (q_θ(θ) q_z(z))} dθ,

    say, and can also be written

    Σ_z ∫ q_θ(θ) q_z(z) log p(θ, z, D) dθ − Σ_z q_z(z) log q_z(z) − ∫ q_θ(θ) log q_θ(θ) dθ.

    The optimal q_θ and q_z maximise the right-hand side of (1.5). Thus, the two factors respectively maximise the coupled formulae

    (1.6)   ∫ q_θ(θ) {Σ_z q_z(z) log p(θ, z, D)} dθ − ∫ q_θ(θ) log q_θ(θ) dθ,

    (1.7)   Σ_z q_z(z) {∫ q_θ(θ) log p(θ, z, D) dθ} − Σ_z q_z(z) log q_z(z).

    It follows that the optimum q_θ and q_z satisfy

    (1.8)   q_θ(θ) ∝ exp{Σ_z q_z(z) log p(θ, z, D)}

    and

    (1.9)   q_z(z) ∝ exp{∫ q_θ(θ) log p(θ, z, D) dθ}.

    Explicit solution of these equations will not be possible. However, writing the equations in the shorthand form q_θ = G_θ(q_z), q_z = G_z(q_θ) suggests the following iterative algorithm for computing q_θ and q_z from an initial q_z^(0): for m = 0, 1, 2, …, calculate

    (1.10)   q_θ^(m+1) = G_θ(q_z^(m)),   q_z^(m+1) = G_z(q_θ^(m+1)).

    The construction of the algorithm is such that, so far as the ‘free energy’ in (1.5) is concerned,

    (1.11)   F(q_θ^(m+1) q_z^(m+1)) ≥ F(q_θ^(m+1) q_z^(m)) ≥ F(q_θ^(m) q_z^(m)),

    for m = 0, 1, 2, …, so that the free energy is monotonically increasing and bounded above by log p(D), and therefore the sequence of free energies converges. This behaviour is a direct parallel of that of the EM algorithm; compare (1.11) with (1.2). Whether or not there are multiple local maxima, and so on, is a different issue; in fact the existence of multiple local maxima, or multiple fixed points of the algorithm, is a very common phenomenon.

    Again the reader is referred to Titterington (2004), and to Bishop (2006), for general discussions of variational Bayes approaches to missing-data problems and a substantial body of references.

    1.3.3 Application of variational Bayes to mixture problems

    We shall consider some of the same examples as those used in section 1.2 about the EM algorithm, with the intention of revealing strong methodological similarities between the two approaches.

    Example 4

    Mixture of k known densities

    Recall that here we assume that the observed data are a random sample from a distribution with probability density function

    p(y | θ) = Σ_{j=1}^{k} λ_j f_j(y),

    where the f_j s are known, so that θ just consists of the λ_j s. In this case,

    p(θ, z, D) = p(θ) Π_i Π_j {λ_j f_j(y_i)}^{z_ij},

    where z_ij = 1 if the ith observation comes from component j and z_ij = 0 otherwise, and p(θ) is the prior density for θ. If we assume a Dirichlet prior, with parameters α_1, …, α_k, for θ, then

    log p(θ, z, D) = Σ_j {Σ_i z_ij + α_j − 1} log λ_j + const.,

    so that

    Σ_z q_z(z) log p(θ, z, D) = Σ_j {Σ_i q_ij + α_j − 1} log λ_j + const.,

    where ‘const.’ does not depend on θ and q_ij is the marginal probability that z_ij = 1, according to the distribution q_z. From (1.8), therefore, the optimal density q_θ is that of the Dirichlet distribution with parameters a_1, …, a_k, where

    a_j = α_j + Σ_i q_ij,

    for j = 1, …, k.

    Next we identify the optimal q_z as a function of q_θ. We have

    log p(θ, z, D) = Σ_i Σ_j z_ij {log λ_j + log f_j(y_i)} + const.,

    where now the ‘const.’ is independent of the z_ij. If we substitute this on the right-hand side of (1.9) we see that the optimal q_z takes a factorised form, with one factor for each observation, and that the optimal factors are defined by

    q_ij ∝ f_j(y_i) exp{E_{q_θ}(log λ_j)},

    for j = 1, …, k, subject to Σ_j q_ij = 1, for each i. Properties of the Dir(a) distribution imply that E(log λ_j) = Ψ(a_j) − Ψ(a_0), where Ψ denotes the digamma function and a_0 = Σ_j a_j. Thus, the equations for the q_ij are

    q_ij ∝ f_j(y_i) exp{Ψ(a_j) − Ψ(a_0)},

    for each i and j. As predicted earlier, clearly the equations for the a_j, which define the Dirichlet distribution that represents the variational approximation to the posterior distribution of θ, and the q_ij cannot be solved explicitly, but the version of the iterative algorithm (1.10) works as follows. Initialise, for example by choosing starting values q_ij^(0) for each i and j, and carry out the following steps, for m = 0, 1, 2, ….

    Step 1: For j = 1, …, k, calculate

    (1.12)   a_j^(m+1) = α_j + Σ_i q_ij^(m).

    Step 2: For i = 1, …, n and j = 1, …, k, calculate

    (1.13)   q_ij^(m+1) ∝ f_j(y_i) exp{Ψ(a_j^(m+1)) − Ψ(a_0^(m+1))},

    normalised over j for each i, where a_0^(m+1) = Σ_j a_j^(m+1).

    Note the strong similarity between Step 1 and an EM-algorithm M-step and between Step 2 and an EM-algorithm E-step, the major difference being that the mixing weight λ_j^(m+1) of the E-step is replaced by exp{Ψ(a_j^(m+1)) − Ψ(a_0^(m+1))}. Indeed, in some of the literature these algorithms are called ‘variational EM’.
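    The two steps translate directly into code. The following Python sketch is ours: comp_pdfs is an assumed list of callables returning the known densities f_j(y_i), and alpha holds the Dirichlet prior parameters.

        # Variational algorithm (1.12)-(1.13) for a mixture of k known densities.
        import numpy as np
        from scipy.special import digamma

        def vb_known_densities(y, alpha, comp_pdfs, n_iter=100):
            dens = np.column_stack([f(y) for f in comp_pdfs])        # n x k matrix of f_j(y_i)
            q = np.full_like(dens, 1.0 / dens.shape[1])              # starting values q_ij^(0)
            for _ in range(n_iter):
                a = alpha + q.sum(axis=0)                            # Step 1, Equation (1.12)
                log_q = np.log(dens) + digamma(a) - digamma(a.sum()) # Step 2, Equation (1.13)
                q = np.exp(log_q - log_q.max(axis=1, keepdims=True)) # stabilise, then normalise
                q /= q.sum(axis=1, keepdims=True)
            return a, q      # Dirichlet parameters of q_theta, and the q_ij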

    Example 5

    Mixture of k univariate Gaussian densities

    In this case

    p(y | θ) = Σ_{j=1}^{k} λ_j N(y; μ_j, σ_j²),

    where λ_1, …, λ_k are the mixing weights, μ_1, …, μ_k are the component means and σ_1², …, σ_k² are the component variances, so that θ = (λ, μ, σ²). Given data y_1, …, y_n, we have

    p(θ, z, D) = p(θ) Π_i Π_j {λ_j N(y_i; μ_j, σ_j²)}^{z_ij}.

    At this point it is revealing to return to Example 4 for a couple of remarks.

    1. The variational posterior for θ there took the form of a Dirichlet distribution, as a result of the factorisation assumption about q and of the choice of a Dirichlet prior. In other words, having chosen the traditional complete-data conjugate family of priors, conjugacy was obtained for the variational method.

    2. The joint variational approximation for the distribution of z took a factorised form, essentially because p(θ, z, D) took the form of a product over i; in other words the z_i are independent over i, where z_i = (z_i1, …, z_ik).

    The second remark holds again here, so that we shall have

    (1.14)   q_ij ∝ exp{E_{q_θ}(log λ_j) + E_{q_θ} log N(y_i; μ_j, σ_j²)},

    for each i and j, normalised so that Σ_j q_ij = 1, for each i.

    Also, if we choose a complete-data conjugate prior for θ then the optimal q_θ will be a member of that family. The appropriate hyperparameters will satisfy equations interlinked with those for the q_ij that can be solved iteratively, by alternately updating the hyperparameters and the q_ij. This structure will clearly obtain for a wide range of other examples, but we concentrate on the details for the univariate Gaussian mixture, for which the appropriate prior density takes the form mentioned before, namely

    p(θ) = p(λ) Π_j p(μ_j, τ_j),

    in which τ_j = σ_j^{−2} denotes the jth component precision, and

    p(μ_j, τ_j) = p(μ_j | τ_j) p(τ_j),

    where p(λ) denotes the density of the Dirichlet distribution with parameters α_1, …, α_k, and the other notation has been defined already.

    Often the priors will be exchangeable, so that all the α_j s will be the same, and so on, but we shall work through the more general case. The optimal variational approximation then takes the form

    q_θ(θ) = q_λ(λ) Π_j q_j(μ_j, τ_j),

    within which the factors are of the same Dirichlet and Gaussian–gamma forms as the prior, and the hyperparameters satisfy complete-data-style updating equations in which

    n_j = Σ_i q_ij   and   ȳ_j = Σ_i q_ij y_i / n_j,

    for each j, ȳ_j representing a pseudo sample mean for component j in the same way that n_j represents a pseudo sample size.

    If we write the total set of hyperparameters as h, then the above equations take the form

    (1.15)   h = H(q_z).

    We now have to identify the optimal q_z as a function of q_θ or, to be more specific, as a function of the associated hyperparameters h. As we have seen, for each i we have

    q_ij ∝ exp{E_{q_θ}(log λ_j) + E_{q_θ} log N(y_i; μ_j, τ_j^{−1})}.

    From Example 4 we know that

    E_{q_θ}(log λ_j) = Ψ(a_j) − Ψ(a_0).

    Also, from properties of the gamma distribution, we have an explicit expression for E_{q_θ}(log τ_j) and, averaging out first conditionally on τ_j and then over τ_j, we obtain an explicit expression for E_{q_θ}{τ_j (y_i − μ_j)²}. Thus the q_ij can be evaluated in closed form, with the normalisation carried out over j for each i. The set of relationships can be represented concisely in the form

    (1.16)   q_z = G(h).

    The obvious algorithm for obtaining the variational approximations involves initialising, by setting q_z to some q_z^(0), and then calculating h^(m+1) = H(q_z^(m)) and q_z^(m+1) = G(h^(m+1)), for m = 0, 1, 2, …. Again, this is the form of (1.10) corresponding to this example. Since the component densities are unknown, initialisation is not so straightforward, especially if exchangeable priors are assumed for the parameters in the component densities. An ad hoc approach that seemed to be adequate in practice is to perform a cluster analysis of the data into k clusters and to base the initial q_z^(0) on that. For univariate data the crude method of using sample quantiles to determine the clusters can be adequate, and for multivariate data a k-means clustering can be effective enough.
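    The clustering-based initialisation can be sketched as follows; the code is ours and uses a few hand-written Lloyd (k-means) iterations on univariate data, converting the resulting hard assignments into the starting values q_z^(0).

        # Ad hoc initialisation of q_z by a crude k-means clustering (a sketch).
        import numpy as np

        def init_qz_by_kmeans(y, k, n_iter=20, seed=0):
            rng = np.random.default_rng(seed)
            y = np.asarray(y, dtype=float)
            centres = rng.choice(y, size=k, replace=False)
            for _ in range(n_iter):
                labels = np.argmin(np.abs(y[:, None] - centres[None, :]), axis=1)
                centres = np.array([y[labels == j].mean() if np.any(labels == j)
                                    else centres[j] for j in range(k)])
            q0 = np.zeros((len(y), k))
            q0[np.arange(len(y)), labels] = 1.0      # hard initial responsibilities
            return q0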

    1.3.4 Application to other mixture problems

    As already mentioned, Titterington (2004) and Bishop (2006) list many references to particular cases of variational Bayes. The case of mixtures of multivariate Gaussian distributions has been treated in a number of places. The natural analogue of the univariate case as discussed in Example 5 is considered by McGrory (2005) and McGrory and Titterington (2007). The usual conjugate prior is used for the parameters of the component distributions, namely independent Wishart distributions for the inverses of the covariance matrices and independent Gaussian priors for the mean vectors, conditionally on the inverse covariance matrices. The corresponding variational posterior distribution, q_θ, naturally has the same structure.

    The same mixture model was investigated by Corduneanu and Bishop (2001), but with a different structure for the prior. The prior distributions for the component means were independent Gaussians with zero means, but with covariance matrices proportional to the identity matrix I, the proportionality being governed by a positive scalar β; this is clearly different from the usual conjugate prior structure. Furthermore, they did not assume any prior distribution for the mixing weights. Instead, they kept the mixing weights λ as fixed parameters and chose q(θ, z), where θ consists of the component mean vectors and precision matrices, so as to maximise the corresponding free energy,

    which is a lower bound for the ‘marginal loglikelihood’ log p(D | λ). Corduneanu and Bishop (2001) alternately calculate the optimal q for the current value of λ and then obtain λ so as to maximise the free energy for that q. In fact they only perform one iteration of the algorithm for the calculation of the optimal q before moving on to obtain a new λ. As the algorithm progresses the values of the free energy increase, reflecting the fact that the algorithm is similar to the generalised EM algorithm (Dempster et al., 1977). Hyperparameters such as β are pre-set, although explicit procedures are not stated in the paper.

    The method of Corduneanu and Bishop (2001) is unconventional in statistical terms in not using the usual structure for the prior distributions for the component mean vectors and in not treating the mixing weights in a fully Bayesian way, as was done in McGrory (2005) and McGrory and Titterington (2007). The more traditional structure is also followed by Ueda and Ghahramani (2002), who consider a more complicated mixture model known as the mixture-of-experts model (Jordan and Jacobs, 1994). This model assumes the existence of covariates, the mixing weights depend on the covariates and the component distributions correspond to regression models, usually Gaussian, again depending on the covariates. The calculations for the variational approach involve much careful detail but are again evocative of complete-data conjugate Bayesian analysis.

    Ueda and Ghahramani (2002) emphasise other issues beyond the calculation of approximate posterior distributions, and in particular they cover prediction and model choice. So far as prediction is concerned, in the case of ‘ordinary’ Gaussian mixtures, the calculation of the predictive density of a new observation is straightforward, being a mixture of the corresponding component predictive densities, with mixing weights given by the means of the variational posterior distribution for the mixing weights. In the univariate case this gives a mixture of Student's t distributions. In the case of mixture-of-experts models the complexity of the mixing weights requires a mild amount of approximation in order to obtain a closed-form predictive density. Ueda and Ghahramani deal with model choice by assuming a specified finite class of models, with associated prior probabilities, incorporating a factor in the formula to represent the variational posterior distribution on the class of models, and basing model choice on the resulting values of that variational posterior distribution. In their empirical work on Gaussian mixtures, Corduneanu and Bishop (2001) observed automatic model selection taking place, when analysing data that were actually simulated from mixture distributions. If a model were fitted that contained more components than the true model, then, as the ‘marginal likelihood’ maximisation progressed, at least one of the estimates of the mixing weights would become smaller and smaller. At that stage that component was dropped from the model, the algorithm proceeded and this automatic pruning stopped only when the estimated mixture had the same number of components as the true model. If the fitted model did not have as many components as the true model then no pruning occurred. The same phenomenon occurred in the fully Bayesian approach of McGrory and Titterington (2007) in that, if the fitted model was too rich, then, for some j, n_j became very small, corresponding to a component for which the other parameters were very similar to those of a second component.

    As we have said, n_j is a pseudo sample size of those observations nominally assigned to the jth component, and if that number fell much below 1 the component was dropped from the fitted model. There does not seem to be any formal explanation for this intriguing yet helpful behaviour.

    Application to hidden Markov models is described in MacKay (1997) and Humphreys and Titterington (2001). Software for variational Bayes developed by J. Winn is available at http://vibes.sourceforge.net/.

    1.3.5 Recursive variational approximations

    Example 1 (Revisited): Mixture of two known densities

    We introduce the idea of recursive methods by returning to the very simple case of a mixture of two known densities, with λ denoting the mixing weight, which is the sole unknown parameter. The variational posterior turned out to be a beta distribution, provided that a beta prior was proposed. Alternative ways of deriving beta approximations to the true posterior are given by recursive approximations, in which the observations are processed one by one and the posterior is updated each time, within the beta class. If, after the ith observation has been dealt with, the approximate posterior is the Be(a_i, b_i) distribution, then the recursion computes hyperparameters (a_{i+1}, b_{i+1}) on the basis of (a_i, b_i) and y_{i+1}. Nonvariational versions of these recursions are motivated by the exact Bayesian step for incorporating observation y_{i+1}, given a ‘current’ Be(a_i, b_i) prior. The correct posterior is the mixture

    w_{i+1,1} Be(a_i + 1, b_i) + w_{i+1,2} Be(a_i, b_i + 1),

    where, for j = 1, 2, w_{i+1,j} is the current posterior probability that observation y_{i+1} belongs to component j, and Be(a, b) denotes the beta density with parameters a and b.

    Recursions such as these were discussed in detail in Chapter 6 of Titterington et al. (1985). For this simple, one-parameter problem, two particular versions are the quasi-Bayes method (Makov and Smith, 1977; Smith and Makov, 1978, 1981), in which a_{i+1} = a_i + w_{i+1,1} and b_{i+1} = b_i + w_{i+1,2}, for i = 0, 1, 2, …, and the probabilistic editor (Athans et al., 1977; Makov, 1983), in which a_{i+1} and b_{i+1} are calculated to ensure that the first two moments of the beta approximation match the ‘correct’ moments corresponding to the mixture. This moment-matching is possible because there are two hyperparameters.

    It is easy to define a sequence of beta variational approximations Be(a_i, b_i). Given an approximating Be(a_i, b_i) at stage i, choose starting values q_{i+1,j}^(0), for each j = 1, 2, and carry out the following steps, for m = 0, 1, 2, ….

    Step 1: For j = 1, 2, calculate

    a_{i+1}^(m+1) = a_i + q_{i+1,1}^(m),   b_{i+1}^(m+1) = b_i + q_{i+1,2}^(m).

    Step 2: For j = 1, 2, calculate

    q_{i+1,1}^(m+1) ∝ f_1(y_{i+1}) exp{Ψ(a_{i+1}^(m+1))},   q_{i+1,2}^(m+1) ∝ f_2(y_{i+1}) exp{Ψ(b_{i+1}^(m+1))},

    normalised so that q_{i+1,1}^(m+1) + q_{i+1,2}^(m+1) = 1.

    Note the obvious similarity to Equations (1.12) and (1.13).

    Clearly, at each recursive stage, the iterative procedure may take some time, but for large i very few iterations should be necessary, and even terminating at m = 1 may eventually be adequate. One important factor is that the results obtained will depend on the order in which the observations are incorporated, as tends to be the case with all recursive methods.
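    For comparison, the simpler quasi-Bayes recursion mentioned above can be written in a few lines. The Python sketch below is ours, with arbitrary Gaussian choices for the two known densities; each observation updates the Be(a, b) approximation by its current responsibilities.

        # Quasi-Bayes recursion for a two-component mixture of known densities.
        import numpy as np
        from scipy.stats import norm

        def quasi_bayes(y, a=1.0, b=1.0):
            f1 = norm(0.0, 1.0).pdf        # illustrative known component densities
            f2 = norm(3.0, 1.0).pdf
            for yi in y:
                lam_mean = a / (a + b)                       # current E(lambda)
                w1 = lam_mean * f1(yi)
                w2 = (1.0 - lam_mean) * f2(yi)
                w1, w2 = w1 / (w1 + w2), w2 / (w1 + w2)      # responsibilities for y_i
                a, b = a + w1, b + w2                        # beta hyperparameter update
            return a, b                                      # approximate posterior Be(a, b)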

    Humphreys and Titterington (2001) report an experiment based on a sample of size 50 from a two-component mixture of known densities with a fixed mixing weight, starting from a Un(0, 1) prior, so that a_0 = b_0 = 1. So far as the posterior variances are concerned, the values that were obtained empirically from the nonrecursive variational method, the quasi-Bayes and the recursive variational methods were 0.0043, 0.0044 and 0.0043, respectively. In contrast, the values obtained from Gibbs sampling and the Probabilistic Editor (PE) were 0.0088 and 0.0091, respectively.

    Of particular note is the fact that only the probabilistic editor leads to a posterior variance for λ that is very similar to the ‘correct’ value provided by the Gibbs sampling result; all the other methods, including the nonrecursive variational method, ‘underestimate’ the posterior variability, although they produce a reasonable mean. Further detailed elucidation of this behaviour is provided in Section 1.4.4.

    As remarked above, the recursive results are influenced by the ordering of the data. In the case of a hidden Markov chain, a natural ordering does exist, and indeed recursive (online) analysis could be of genuine practical importance. Humphreys and Titterington
