Linear and Generalized Linear Mixed Models and Their Applications

About this ebook

Now in its second edition, this book covers two major classes of mixed effects models—linear mixed models and generalized linear mixed models—and it presents an up-to-date account of theory and methods in analysis of these models as well as their applications in various fields. It offers a systematic approach to inference about non-Gaussian linear mixed models. Furthermore, it discusses the latest developments and methods in the field, incorporating relevant updates since publication of the first edition. These include advances in high-dimensional linear mixed models in genome-wide association studies (GWAS), advances in inference about generalized linear mixed models with crossed random effects, new methods in mixed model prediction, mixed model selection, and mixed model diagnostics.

This book is suitable for students, researchers, and practitioners who are interested in using mixed models for statistical data analysis with public health applications. It is best for graduate courses in statistics, or for those who have taken a first course in mathematical statistics, are familiar with using computers for data analysis, and have a foundational background in calculus and linear algebra.

Language: English
Publisher: Springer
Release date: Mar 22, 2021
ISBN: 9781071612828

    © Springer Science+Business Media, LLC, part of Springer Nature 2021

    J. Jiang and T. Nguyen, Linear and Generalized Linear Mixed Models and Their Applications, Springer Series in Statistics, https://doi.org/10.1007/978-1-0716-1282-8_1

    1. Linear Mixed Models: Part I

    Jiming Jiang¹   and Thuan Nguyen²

    (1)

    Department of Statistics, University of California, Davis, CA, USA

    (2)

    School of Public Health, Oregon Health & Science University, Portland, OR, USA

    1.1 Introduction

    The best way to understand a linear mixed model, or mixed linear model in some earlier literature, is to first recall a linear regression model. The latter can be expressed as y = Xβ + 𝜖, where y is a vector of observations, X is a matrix of known covariates, β is a vector of unknown regression coefficients, and 𝜖 is a vector of (unobservable random) errors. In this model, the regression coefficients are considered as fixed, unknown constants. However, there are cases in which it makes sense to assume that some of these coefficients are random. These cases typically occur when the observations are correlated. For example, in medical studies observations are often collected from the same individuals over time. It may be reasonable to assume that correlations exist among the observations from the same individual, especially if the times at which the observations are collected are relatively close. In animal breeding, lactation yields of dairy cows associated with the same sire may be correlated. In educational research, test scores of the same student may be related.

    To see how a linear mixed model may be useful for modeling the correlations among the observations, consider the example above regarding medical studies. Assume that each individual is associated with a random effect whose value is unobservable. Let y ij denote the observation from the ith individual collected at time t j, and α i the random effect associated with the ith individual. Assume that there are m individuals. For simplicity, let us assume that the observations from all individuals are collected at a common set of times, say, t 1, …, t k. Then, a linear mixed model may be expressed as

    $$y_{ij}=x_{ij}^{\prime }\beta +\alpha _{i}+\epsilon _{ij},\ i=1,\dots ,m,\ j=1, \dots ,k$$

    , where x ij is a vector of known covariates; β is a vector of unknown regression coefficients; the random effects α 1, …, α m are assumed to be i.i.d. with mean zero and variance σ ²; the 𝜖 ijs are errors that are i.i.d. with mean zero and variance τ ²; and the random effects and errors are independent. It follows that the correlation between any two observations from the same individual is σ ²∕(σ ² + τ ²), whereas observations from different individuals are uncorrelated. This model is a special case of the so-called longitudinal linear mixed model, which is discussed in much detail in the sequel. Of course, this is only a simple model, and it may not capture all of the correlations among the observations. We need to have a richer class of models that allows further complications.
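
    The following is a small simulation sketch (not part of the book; all numerical values are arbitrary choices for illustration) that generates data from this random-intercept model and checks that the correlation between two observations from the same individual is approximately σ²∕(σ² + τ²).

# A minimal simulation sketch of the random-intercept model above.
import numpy as np

rng = np.random.default_rng(0)
m, k = 2000, 4                 # individuals, common observation times
beta = np.array([1.0, 0.5])    # fixed effects (intercept, slope), arbitrary values
sigma2, tau2 = 2.0, 1.0        # Var(alpha_i) and Var(eps_ij)

t = np.arange(1, k + 1)                       # common times t_1, ..., t_k
x = np.stack([np.ones(k), t], axis=1)         # x_ij = (1, t_j)'
alpha = rng.normal(0.0, np.sqrt(sigma2), m)   # random effects
eps = rng.normal(0.0, np.sqrt(tau2), (m, k))  # errors
y = x @ beta + alpha[:, None] + eps           # y_ij = x_ij' beta + alpha_i + eps_ij

# empirical correlation between two observations from the same individual
resid = y - x @ beta
emp = np.corrcoef(resid[:, 0], resid[:, 1])[0, 1]
print("theoretical:", sigma2 / (sigma2 + tau2), " empirical:", round(emp, 3))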

    A general linear mixed model may be expressed as

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y& =&\displaystyle X\beta+Z\alpha+\epsilon, \end{array} \end{aligned} $$

    (1.1)

    where y is a vector of observations, X is a matrix of known covariates, β is a vector of unknown regression coefficients which are now called fixed effects, Z is a known matrix, α is a vector of random effects, and 𝜖 is a vector of errors. Note that both α and 𝜖 are unobservable. Compared with the linear regression model, it is clear that the difference is the additional term Zα, which may take many different forms, thus creating a rich class of models, as we shall see. A statistical model must come with assumptions. The basic assumptions for (1.1) are that the random effects and errors have mean zero and finite variances. Typically, the covariance matrices G = Var(α) and R = Var(𝜖) involve some unknown dispersion parameters, or variance components. It is also assumed that α and 𝜖 are uncorrelated. It is easy to show that the example of medical studies discussed above can be expressed as (1.1).
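
    As a sketch of the last point (illustrative code, not taken from the book), the medical-study example can be assembled in the form (1.1): Z is the individual-incidence matrix, G = σ²I m, R = τ²I n, and Var(y) = ZGZ′ + R is block-diagonal with the intra-individual correlation structure described above.

# Assemble X, Z, G, R for a tiny version of the medical-study example.
import numpy as np

m, k = 3, 2                       # 3 individuals, 2 observation times each
n = m * k
X = np.column_stack([np.ones(n), np.tile(np.arange(1, k + 1), m)])  # covariates x_ij = (1, t_j)'
Z = np.kron(np.eye(m), np.ones((k, 1)))       # maps alpha_i to its k observations
sigma2, tau2 = 2.0, 1.0                       # arbitrary variance components
G = sigma2 * np.eye(m)
R = tau2 * np.eye(n)
V = Z @ G @ Z.T + R                           # marginal covariance of y under (1.1)
print(np.round(V, 2))                         # block-diagonal: tau2*I + sigma2*J within individuals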

    We conclude this section with three examples illustrating applications of the linear mixed models . Many more examples will be discussed later.

    1.1.1 Effect of Air Pollution Episodes on Children

    Laird and Ware (1982) discussed an example of analysis of the effect of air pollution episodes on pulmonary function. About 200 school children were examined under normal conditions, then during an air pollution alert, and on three successive weeks following the alert. One of the objectives was to determine whether the volume of air exhaled in the first second of a forced exhalation, denoted by FEV1, was depressed during the alert.

    Note that in this case the data were longitudinally collected with five observational times common for all of the children. Laird and Ware proposed the following simple linear mixed model for analysis of the longitudinal data: y ij = β j + α i + 𝜖 ij, i = 1, …, m, j = 1, …, 5, where β j is the mean FEV1 for the jth observational time, α i is a random effect associated with the ith child, 𝜖 ij is an error term, and m is the total number of children involved in the study. It is assumed that the random effects are independent and distributed as N(0, σ ²), the errors are independent and distributed as N(0, τ ²), and the random effects and errors are independent. It should be mentioned that some measurements were missing in this study. However, the above model can be modified to take this into account. In particular, the number of observations for the ith individual may be denoted by n i, where 1 ≤ n i ≤ 5.

    Based on the model, Laird and Ware analyzed the data using methods to be described in the sequel, with the following findings: (i) a decline in mean FEV1 was observed on and after the alert day; and (ii) the variances and covariances for the last four measurements were larger than those involving the baseline day. The two authors further studied the problem of identification of sensitive subgroups or individuals most severely affected by the pollution episode using a more complicated linear mixed model.

    1.1.2 Genome-Wide Association Study

    Genome-wide association study (GWAS), which typically refers to examination of associations between up to millions of genetic variants in the genome and certain traits of interest among unrelated individuals, has been very successful for detecting genetic variants that affect complex human traits/diseases in the past decade. As a result, tens of thousands of single-nucleotide polymorphisms (SNPs) have been reported to be associated with at least one trait/disease at the genome-wide significance level (p-value ≤ 5 × 10⁻⁸). In spite of the success, these significantly associated SNPs only account for a small portion of the genetic factors underlying complex human traits/diseases. For example, human height is a highly heritable trait with an estimated heritability of around 80%. This means that 80% of the height variation in the population can be attributed to genetic factors (Visscher et al. 2008). Based on large-scale GWAS, about 180 genetic loci have been reported to be significantly associated with human height (Allen et al. 2010). However, these loci together can explain only about 5–10% of variation of human height (e.g., Allen et al. 2010). This gap between the total genetic variation and the variation that can be explained by the identified genetic loci is universal among many complex human traits/diseases and is referred to in the literature as the missing heritability (e.g., Maher 2008).

    One possible explanation for the missing heritability is that many SNPs jointly affect the phenotype, while the effect of each SNP is too weak to be detected at the genome-wide significance level. To address this issue, Yang et al. (2010) used a linear mixed model (LMM)-based approach to estimate the total amount of human height variance that can be explained by all common SNPs assayed in GWAS. They showed that 45% of the human height variance can be explained by those SNPs, providing compelling evidence for this explanation: A large proportion of the heritability is not missing, but rather hidden among many weak-effect SNPs. These SNPs may require a much larger sample size to be detected at the genome-wide significance level.

    The LMM used by Yang et al. can be expressed as (1.1), where y is a vector of observed phenotypes. A difference is that, here, Z is a matrix of random entries corresponding to the SNPs. Restricted maximum likelihood (REML) analysis of the LMM was carried out to obtain estimates of the variance components of interest, such as the variance of the environmental errors (corresponding to 𝜖) and heritability, defined as the ratio of the total genetic variation, computed under the LMM, to the total variation (genetic plus environmental) in the phenotypes. More detail will be discussed later.
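
    The following sketch illustrates only the variance decomposition just described; the particular genotype standardization used to form Z, and all numerical values, are assumptions made here for concreteness rather than details taken from Yang et al. (2010).

# Illustrative sketch of the genetic/environmental variance decomposition.
import numpy as np

rng = np.random.default_rng(1)
n, p = 500, 1000
maf = rng.uniform(0.05, 0.5, p)                      # allele frequencies (assumed)
geno = rng.binomial(2, maf, size=(n, p)).astype(float)
Z = (geno - 2 * maf) / np.sqrt(2 * maf * (1 - maf))  # standardized genotypes (an assumed convention)

sigma_g2, sigma_e2 = 0.8, 0.2                        # assumed variance components
K = Z @ Z.T / p                                      # genomic relationship matrix
V = sigma_g2 * K + sigma_e2 * np.eye(n)              # marginal covariance of the phenotypes
h2 = sigma_g2 / (sigma_g2 + sigma_e2)                # heritability as a variance ratio
print("mean diagonal of V:", round(float(np.mean(np.diag(V))), 3), " h2 =", h2)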

    1.1.3 Small Area Estimation of Income

    Large-scale sample surveys are usually designed to produce reliable estimates of various characteristics of interest for large geographic areas or populations. However, for effective planning of health, social, and other services, and for apportioning government funds, there is a growing demand to produce similar estimates for smaller geographic areas and subpopulations for which adequate samples are not available. The usual design-based small area estimators (e.g., Cochran 1977) are unreliable because they are often based on very few observations that are available from the area or subpopulation. This makes it necessary to borrow strength from related small areas to find indirect estimators to increase the effective sample size and thus improve the precision. Such indirect estimators are typically based on linear mixed models or generalized linear mixed models (see Chap. 3) that provide a link to a related small area through the use of supplementary data such as recent census data and current administrative records. See Rao and Molina (2015) for details.

    Among many examples of applications, Fay and Herriot (1979) used a linear mixed model for estimating per capita income (PCI) for small places from the 1970 Census of Population and Housing. In the 1970 Census, income was collected on the basis of a 20% sample. However, of the estimates required, more than one-third, or approximately 15,000, were for places with populations of fewer than 500 persons. With such small populations, sampling errors of the direct estimates are quite significant. For example, Fay and Herriot estimated that for a place of 500 persons, the coefficient of variation of the direct estimate of PCI was about 13%; for a place of 100 persons, that coefficient increased to 30%. In order to borrow strength from related places and other sources, Fay and Herriot proposed the following linear mixed model ,

    $$y_{i}=x_{i}^{\prime }\beta +v_{i}+e_{i}$$

    , where y i is the natural logarithm of the sample estimate of PCI for the ith place (the logarithm transformation stabilized the variance); x i is a vector of known covariates related to the place; β is a vector of unknown regression coefficients; v i is a random effect associated with the place; and e i represents the sampling error. It is assumed that v i and e i are distributed independently such that v i ∼ N(0, A), e i ∼ N(0, D i), where A is unknown but D i is assumed known.

    Here, the characteristics of interest are the small area means of the logarithm of PCI, expressed as

    $$\xi _{i}=x_{i}^{\prime }\beta +v_{i}, 1\leq i\leq m$$

    , where m is the total number of small places. This may be viewed as a problem of mixed model prediction, which we shall discuss in detail in the sequel.
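
    Anticipating the mixed model prediction discussed in the sequel, the sketch below (simulated data, illustration only) uses the standard conditional-mean predictor of ξ i under the Fay–Herriot model when β and A are treated as known: the direct estimate y i is shrunk toward the regression fit x i′β by the factor A∕(A + D i). This shrinkage form is a standard result, not something derived in the excerpt above.

# Fay-Herriot model: simulate and apply the known-(beta, A) shrinkage predictor.
import numpy as np

rng = np.random.default_rng(2)
m = 8
x = np.column_stack([np.ones(m), rng.normal(size=m)])  # covariates for each small place
beta = np.array([2.0, 0.3])                             # assumed regression coefficients
A = 0.05                                                # Var(v_i), treated as known here
D = rng.uniform(0.02, 0.2, m)                           # sampling variances, known by design
v = rng.normal(0.0, np.sqrt(A), m)
e = rng.normal(0.0, np.sqrt(D))
y = x @ beta + v + e                                    # direct (log-scale) estimates
xi = x @ beta + v                                       # small area means to be predicted

gamma = A / (A + D)                                     # shrinkage weights
xi_pred = x @ beta + gamma * (y - x @ beta)             # E(xi_i | y_i) given beta, A
print(np.column_stack([xi, xi_pred]).round(3))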

    1.2 Types of Linear Mixed Models

    There are different types of linear mixed models , depending on how these models are classified. One way of classification is according to whether or not a normality assumption is made. As will be seen, normality provides more flexibility in modeling; on the other hand, modeling without normality has the advantage of robustness to violation of distributional assumptions.

    1.2.1 Gaussian Mixed Models

    Under Gaussian linear mixed models, or simply Gaussian mixed models, both the random effects and errors in (1.1) are assumed to be normally distributed. The following are some specific types.

    1.2.1.1 Mixed ANOVA Model

    As usual, ANOVA refers to analysis of variance. Some of the earliest (Gaussian) mixed models that appeared in the literature were of the ANOVA type. Here we consider some simple cases.

    Example 1.1 (One-way random effects model)

    A model is called a random effects model if the only fixed effect is an unknown mean. Suppose that the observations y ij, i = 1, …, m, j = 1, …, k i satisfy y ij = μ + α i + 𝜖 ij for all i and j, where μ is the unknown mean; α i, i = 1, …, m are random effects that are distributed independently as N(0, σ ²); 𝜖 ijs are errors that are distributed independently as N(0, τ ²); and the random effects are independent of the errors. Typically, the variances σ ² and τ ² are unknown. To express the model in terms of (1.1), let

    $$y_{i}= (y_{ij})_{1\leq j\leq k_{i}}$$

    be the (column) vector of observations from the ith group or cluster, and similarly

    $$\epsilon _{i}=( \epsilon _{ij})_{1\leq j\leq k_{i}}$$

    . Then, let

    $$y=(y_{1}^{\prime },\dots , y_{m}^{\prime })',\ \alpha =(\alpha _{i})_{1\leq i\leq m}$$

    , and

    $$\epsilon = (\epsilon _{1}^{\prime },\dots ,\epsilon _{m}^{\prime })'$$

    . It is easy to show that the model can be expressed as (1.1) with β = μ and suitable X and Z (see Exercise 1.1), in which α ∼ N(0, σ ² I m) and 𝜖 ∼ N(0, τ ² I n) with

    $$n=\sum _{i=1}^{m} k_{i}$$

    . One special case is when k i = k for all i. This is called the balanced case. It can be shown that, in the balanced case, the model can be expressed as (1.1) with X = 1m ⊗ 1k = 1mk, Z = I m ⊗ 1k, where ⊗ denotes the Kronecker product (see Appendix A). Note that in this case n = mk.
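
    A quick numerical check of the Kronecker-product representation in the balanced case (illustration only):

# Verify X = 1_m (x) 1_k = 1_{mk} and Z = I_m (x) 1_k for small m, k.
import numpy as np

m, k = 3, 2
one = lambda d: np.ones((d, 1))
X = np.kron(one(m), one(k))          # equals 1_{mk}
Z = np.kron(np.eye(m), one(k))       # group incidence matrix
assert np.array_equal(X, one(m * k))
print(Z)                             # each column marks the k observations of one group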

    Example 1.2 (Two-way random effects model)

    For simplicity, let us consider the case of one observation per cell. In this case, the observations y ij, i = 1, …, m, j = 1, …, k satisfy y ij = μ + ξ i + η j + 𝜖 ij for all i, j, where μ is as in Example 1.1; ξ i, i = 1, …, m, η j, j = 1, …, k are independent random effects such that

    $$\xi _{i}\sim N(0,\sigma _{1}^{2}),\ \eta _{j}\sim N(0,\sigma _{2}^{2})$$

    ; and 𝜖 ijs are independent errors distributed as N(0, τ ²). Again, assume that the random effects and errors are independent. This model can also be expressed as (1.1) (see Exercise 1.2). Note that this model is different from that of Example 1.1 in that, under the one-way model, the observations can be divided into independent blocks, whereas no such division exists under the two-way model.

    A general mixed ANOVA model is defined by (1.1) such that

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} Z\alpha& =&\displaystyle Z_{1}\alpha_{1}+\cdots+Z_{s}\alpha_{s}, \end{array} \end{aligned} $$

    (1.2)

    where Z 1, …, Z s are known matrices and α 1, …, α s are vectors of random effects such that for each 1 ≤ r ≤ s, the components of α r are independent and distributed as $$N(0,\sigma _{r}^{2})$$ . It is also assumed that the components of 𝜖 are independent and distributed as N(0, τ ²) and α 1, …, α s, 𝜖 are independent. For the ANOVA model, a natural set of variance components is

    $$\tau ^{2},\sigma _{1}^{2},\dots ,\sigma _{s}^{2}$$

    . We call this form of variance components the original form. Alternatively, the Hartley–Rao form of variance components (Hartley and Rao 1967) is defined as

    $$\lambda =\tau ^{2}, \gamma _{1}= \sigma _{1}^{2}/\tau ^{2},\dots ,\gamma _{s}=\sigma _{s}^{2}/\tau ^{2}$$

    .

    A special case is the so-called balanced mixed ANOVA models. An ANOVA model is balanced if X and Z r, 1 ≤ r ≤ s can be expressed as

    $$X=\bigotimes _{l=1}^{w+1} 1_{n_{l}}^{a_{l}},\ Z_{r}=\bigotimes _{l=1}^{w+1}1_{n_{l}}^{b_{r,l}}$$

    , where (a 1, …, a w+1) ∈ S w+1 = {0, 1}w+1, (b r,1, …, b r,w+1) ∈ S ⊂ S w+1. In other words, there are w factors in the model; n l represents the number of levels for factor l (1 ≤ l ≤ w); and the (w + 1)st factor corresponds to repetition within cells. Thus, we have a w+1 = 1 and b r,w+1 = 1 for all r. For example, in Example 1.1, the model is balanced if k i = k for all i. In this case, we have w = 1, n 1 = m, n 2 = k, and S = {(0, 1)}. In Example 1.2 the model is balanced with w = 2, n 1 = m, n 2 = k, n 3 = 1, and S = {(0, 1, 1), (1, 0, 1)} (Exercise 1.2).
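
    The sketch below (illustration only) builds the design matrices of the balanced two-way model of Example 1.2 from the exponent vectors in S, using the convention that an exponent of 0 gives an identity matrix and an exponent of 1 gives a vector of ones.

# Balanced two-way model: w = 2, n_1 = m, n_2 = k, n_3 = 1, S = {(0,1,1), (1,0,1)}.
import numpy as np

def factor(n, e):
    """Return I_n if the exponent e is 0, and the n x 1 vector of ones if e is 1."""
    return np.eye(n) if e == 0 else np.ones((n, 1))

def design(dims, exponents):
    """Kronecker product of 1_{n_l}^{e_l} over all factors."""
    out = np.ones((1, 1))
    for n, e in zip(dims, exponents):
        out = np.kron(out, factor(n, e))
    return out

m, k = 3, 2
dims = (m, k, 1)
X  = design(dims, (1, 1, 1))      # 1_m (x) 1_k (x) 1_1 = 1_{mk}
Z1 = design(dims, (0, 1, 1))      # for the random effects xi_i
Z2 = design(dims, (1, 0, 1))      # for the random effects eta_j
print(X.shape, Z1.shape, Z2.shape)   # (6, 1) (6, 3) (6, 2)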

    1.2.1.2 Longitudinal Model

    These models gain their name because they are often used in the analysis of longitudinal data. See, for example, Diggle et al. (2002). One feature of these models is that the observations may be divided into independent groups with one random effect (or vector of random effects ) corresponding to each group. In practice, these groups may correspond to different individuals involved in the longitudinal study. Furthermore, there may be serial correlations within each group which are in addition to that due to the random effect. Another feature of the longitudinal models is that there are often time-dependent covariates, which may appear either in X or in Z (or in both) of (1.1). Example 1.1 may be considered a simple case of the longitudinal model, in which there is no serial correlation within the groups. The following is a more complex model.

    Example 1.3 (Growth curve model )

    For simplicity, suppose that for each of the m individuals, the observations are collected over a common set of times t 1, …, t k. Suppose that y ij, the observation collected at time t j from the ith individual, satisfies y ij = ξ i + η i x ij + ζ ij + 𝜖 ij, where ξ i and η i represent, respectively, a random intercept and a random slope; x ij is a known covariate; ζ ij corresponds to a serial correlation; and 𝜖 ij is an error. For each i, it is assumed that ξ i and η i are jointly normally distributed with means μ 1, μ 2, variances $$\sigma _{1}^{2},\ \sigma _{2}^{2}$$ , respectively, and correlation coefficient ρ; and 𝜖 ijs are independent and distributed as N(0, τ ²). As for the ζ ijs, it is assumed that they satisfy the following relation of the first-order autoregressive process , or AR(1):

    $$\zeta _{ij}= \phi \zeta _{i,j-1}+\omega _{ij}$$

    , where ϕ is a constant such that 0 < ϕ < 1, and ω ijs are independent and distributed as

    $$N\{0,\sigma _{3}^{2}(1-\phi ^{2})\}$$

    . Furthermore, the three random components (ξ, η), ζ, and 𝜖 are independent, and observations from different individuals are independent. There is a slight departure of this model from the standard linear mixed model in that the random intercept and slope may have nonzero means. However, by subtracting the means and thus defining new random effects, this model can be expressed in the standard form (Exercise 1.3). In particular, the fixed effects are μ 1 and μ 2, and the (unknown) variance components are

    $$\sigma _{j}^{2},\ j=1,2,3,\ \tau ^{2},\ \rho $$

    , and ϕ. It should be pointed out that an error term, 𝜖 ij, is included in this model. Standard growth curve models do not include such a term.
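
    The following simulation sketch (parameter values chosen arbitrarily) verifies numerically that the stationary AR(1) recursion above, with innovation variance σ 3²(1 − ϕ²), has marginal variance σ 3² and autocovariance σ 3²ϕ^|s−t|, which is the covariance structure of the serial-correlation component ζ i.

# Check the stationary AR(1) covariance of the zeta process by simulation.
import numpy as np

rng = np.random.default_rng(3)
m, k = 20000, 5                 # many independent series of length k
phi, sigma3 = 0.6, 1.5

zeta = np.empty((m, k))
zeta_prev = rng.normal(0.0, sigma3, m)            # stationary starting values
for j in range(k):
    omega = rng.normal(0.0, sigma3 * np.sqrt(1 - phi**2), m)
    zeta_prev = phi * zeta_prev + omega
    zeta[:, j] = zeta_prev

emp_cov = np.cov(zeta, rowvar=False)              # empirical k x k covariance
theo_cov = sigma3**2 * phi ** np.abs(np.subtract.outer(np.arange(k), np.arange(k)))
print(np.round(emp_cov - theo_cov, 2))            # entries should be close to zero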

    Following Datta and Lahiri (2000), a general form of longitudinal model may be expressed as

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y_{i}& =&\displaystyle X_{i}\beta+Z_{i}\alpha_{i}+\epsilon_{i},\quad i=1,\dots,m, \end{array} \end{aligned} $$

    (1.3)

    where y i represents the vector of observations from the ith individual; X i and Z i are known matrices; β is an unknown vector of regression coefficients; α i is a vector of random effects; and 𝜖 i is a vector of errors. It is assumed that α i, 𝜖 i, i = 1, …, m are independent with α i ∼ N(0, G i), 𝜖 i ∼ N(0, R i), where the covariance matrices G i and R i are known up to a vector θ of variance components. It can be shown that Example 1.3 is a special case of the general longitudinal model (Exercise 1.3). Also note that (1.3) is a special case of (1.1) with y = (y i)1≤i≤m, X = (X i)1≤i≤m, Z = diag(Z 1, …, Z m), α = (α i)1≤i≤m, and 𝜖 = (𝜖 i)1≤i≤m.

    1.2.1.3 Marginal Model

    Alternatively, a Gaussian mixed model may be expressed by its marginal distribution. To see this, note that under (1.1) and the normality assumption, we have

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y& \sim&\displaystyle N(X\beta,\ V),\qquad V\;=\;ZGZ'+R. \end{array} \end{aligned} $$

    (1.4)

    Hence, without using (1.1), one may simply define a linear mixed model by (1.4). In particular, for the ANOVA model, one has (1.4) with

    $$V=\tau ^{2}I_{n}+\sum _{r=1}^{s}\sigma _{r}^{2}Z_{r}Z_{r}^{\prime }$$

    , where n is the sample size (i.e., the dimension of y). As for the longitudinal model (1.3), one may assume that y 1, …, y m are independent with y i ∼ N(X i β, V i), where

    $$V_{i}=R_{i}+Z_{i}G_{i}Z_{i}^{\prime }$$

    . It is clear that the model can also be expressed as (1.4) with R = diag(R 1, …, R m), G = diag(G 1, …, G m), and X and Z defined below (1.3).

    A disadvantage of the marginal model is that the random effects are not explicitly defined. In many cases, these random effects have practical meanings, and the inference about them may be of interest. For example, in small area estimation (e.g., Rao and Molina 2015), the random effects are associated with the small area means, which are often of main interest.

    1.2.1.4 Hierarchical Models

    From a Bayesian point of view, a linear mixed model is a three-stage hierarchy. In the first stage, the distribution of the observations given the random effects is defined. In the second stage, the distribution of the random effects given the model parameters is defined. In the last stage, a prior distribution is assumed for the parameters. Under normality, these stages may be specified as follows. Let θ represent the vector of variance components involved in the model. Then, we have

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y|(\beta,\alpha,\theta)& \sim&\displaystyle N(X\beta+Z\alpha,\ R),\\ \alpha|\theta& \sim&\displaystyle N(0,\ G),\\ (\beta,\theta)& \sim&\displaystyle \pi(\beta,\theta), \end{array} \end{aligned} $$

    (1.5)

    where π is a known distribution. In many cases, it is assumed that π(β, θ) = π 1(β)π 2(θ), where π 1 = N(β 0, D) with both β 0 and D known, and π 2 is a known distribution. The following is an example.

    Example 1.4

    Suppose that (i) conditional on the means μ ij, i = 1, …, m, j = 1, …, k i and variance τ ², y ij, 1 ≤ i ≤ m, 1 ≤ j ≤ k i are independent and distributed as N(μ ij, τ ²); (ii) conditional on β and

    $$\sigma ^{2},\ \mu _{ij}=x_{ij}^{\prime }\beta +\alpha _{i}$$

    , where x ij is a known vector of covariates, and α 1, …, α m are independent and distributed as N(0, σ ²); (iii) the prior distributions are such that β ∼ N(β 0, D), where β 0 and D are known; σ ² and τ ² are both distributed as inverse gamma with pdfs

    $$f_{1}(\sigma ^{2}) \propto (\sigma ^{2})^{-3}e^{-1/\sigma ^{2}},\ f_{0}(\tau ^{2})\propto (\tau ^{2})^{-2}e^{-1/\tau ^{2}}$$

    , respectively; and β, σ ², and τ ² are independent. It is easy to show that (i) and (ii) are equivalent to the model in Example 1.1 with μ replaced by $$x_{ij}^{\prime }\beta $$ (Exercise 1.4). Thus, the difference between a classical linear mixed model and a Bayesian hierarchical model is the prior.

    1.2.2 Non-Gaussian Linear Mixed Models

    Under non-Gaussian linear mixed models , the random effects and errors are assumed to be independent, or at least uncorrelated, but their distributions are not assumed to be normal. As a result, the (joint) distribution of the data may not be fully specified (up to a set of unknown parameters). The following are some specific cases.

    1.2.2.1 Mixed ANOVA Model

    Following Jiang (1996), a non-Gaussian (linear) mixed ANOVA model is defined by (1.1) and (1.2), where the components of α r are i.i.d. with mean 0 and variance

    $$\sigma _{r}^{2},\ 1\leq r\leq s$$

    ; the components of 𝜖 are i.i.d. with mean 0 and variance τ ²; and α 1, …, α s, 𝜖 are independent. All of the other assumptions are the same as in the Gaussian case. Denote the common distribution of the components of α r by F r (1 ≤ r s) and that of the components of 𝜖 by G. If parametric forms of F 1, …, F s, G are not assumed, the distribution of y is not specified up to a set of parameters. In fact, even if the parametric forms of the F rs and G are known, as long as they are not normal, the pdf of y may not have an analytic expression. The vector θ of variance components is defined the same way as in the Gaussian case, having either the original or Hartley–Rao forms.

    Example 1.5

    Suppose that, in Example 1.1, the random effects α 1, …, α m are i.i.d. but their common distribution is t 3 instead of normal. Furthermore, the 𝜖 ijs are i.i.d., but their common distribution is double exponential, rather than normal. It follows that the joint distribution of y ij, i = 1, …, m, j = 1, …, k i does not have a closed-form expression.

    1.2.2.2 Longitudinal Model

    The Gaussian longitudinal model may also be extended to the non-Gaussian case. The typical non-Gaussian case is such that y 1, …, y m are independent and (1.3) holds. Furthermore, for each i, α i and 𝜖 i are uncorrelated with E(α i) = 0, Var(α i) = G i; E(𝜖 i) = 0, Var(𝜖 i) = R i. Alternatively, the independence of y i, 1 ≤ i ≤ m may be replaced by that of (α i, 𝜖 i), 1 ≤ i ≤ m. All of the other assumptions, except the normality assumption, are the same as in the Gaussian case. Again, in this case, the distribution of y may not be fully specified up to a set of parameters; even if it is, the pdf of y may not have a closed-form expression.

    Example 1.6

    Consider Example 1.3. For simplicity, assume that the times t 1, …, t k are equally spaced; thus, without loss of generality, let t j = j, 1 ≤ j ≤ k. Let ζ i = (ζ ij)1≤j≤k, 𝜖 i = (𝜖 ij)1≤j≤k. In the non-Gaussian case, it is assumed that y 1, …, y m are independent, where y i = (y ij)1≤j≤k, or, alternatively, (ξ i, η i), ζ i, 𝜖 i, 1 ≤ i ≤ m are independent. Furthermore, assume that E(ξ i) = μ 1, E(η i) = μ 2, E(ζ i) = E(𝜖 i) = 0;

    $$\mathrm {var}(\xi _{i})= \sigma _{1}^{2},\ \mathrm {var}(\eta _{i})=\sigma _{2}^{2},\ \mathrm {cor}(\xi _{i},\eta _{i}) =\rho ,\ \mathrm {Var}(\zeta _{i})=G_{i},\ \mathrm {Var}(\epsilon _{i})=\tau ^{2}I_{k}$$

    , where G i is the covariance matrix of ζ i under the AR(1) model of Example 1.3, whose (s, t) element is given by

    $$\sigma _{3}^{2}\phi ^{|s-t|},\ 1\leq s,t\leq k$$

    .

    1.2.2.3 Marginal Model

    This is, perhaps, the most general model among all types. Under a marginal model, it is assumed that y, the vector of observations, satisfies E(y) = Xβ and Var(y) = V , where V is specified up to a vector θ of variance components. A marginal model may arise by taking the mean and covariance matrix of a Gaussian mixed model (marginal or otherwise; see Sect. 1.2.1) and dropping the normality assumption. Similar to the Gaussian marginal model, the random effects are not present in this model. Therefore, the model has the disadvantage of not being suitable for inference about the random effects, if the latter are of interest. Also, because the model is so general that few assumptions are made, methods of inference that require specification of the parametric form of the distribution, such as maximum likelihood, may not apply. It may also be difficult to assess uncertainty of the estimators under such a general model. See Sect. 1.4.2.

    On the other hand, by not fully specifying the distribution, a non-Gaussian model may be more robust to violation of distributional assumptions.

    1.3 Estimation in Gaussian Mixed Models

    Standard methods of estimation in Gaussian mixed models are maximum likelihood (ML) and restricted maximum likelihood (REML) . In this section we focus on discussing these two methods. Historically, there have been other types of estimation that are computationally simpler than the likelihood-based methods. These other methods are discussed in Sect. 1.5.

    1.3.1 Maximum Likelihood

    Although the ML method has a long and celebrated history going back to Fisher (1922a), it had not been used in mixed model analysis until Hartley and Rao (1967). The main reason was that estimation of the variance components in a linear mixed model was not easy to handle computationally in the old days, although estimation of the fixed effects given the variance components is fairly straightforward.

    1.3.1.1 Point Estimation

    Under a Gaussian mixed model , the distribution of y is given by (1.4), which has a joint pdf

    $$\displaystyle \begin{aligned} \begin{array}{rcl} f(y)& =&\displaystyle \frac{1}{(2\pi)^{n/2}|V|{}^{1/2}} \exp\left\{-\frac{1}{2}(y-X\beta)'V^{-1}(y-X\beta)\right\}, \end{array} \end{aligned} $$

    where n is the dimension of y. Thus, the log-likelihood function is given by

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} l(\beta,\theta)& =&\displaystyle c-\frac{1}{2}\log(|V|)-\frac{1}{2} (y-X\beta)'V^{-1}(y-X\beta), \end{array} \end{aligned} $$

    (1.6)

    where θ represents the vector of all of the variance components (involved in V ) and c is a constant. By differentiating the log-likelihood with respect to the parameters (see Appendix A), we obtain the following:

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \frac{\partial l}{\partial\beta}& =&\displaystyle X'V^{-1}y-X'V^{-1}X\beta, \end{array} \end{aligned} $$

    (1.7)

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \frac{\partial l}{\partial\theta_{r}}& =&\displaystyle \frac{1}{2}\left\{(y-X\beta)' V^{-1}\frac{\partial V}{\partial\theta_{r}}V^{-1}(y-X\beta)-\mathrm{tr}\left( V^{-1}\frac{\partial V}{\partial\theta_{r}}\right)\right\},\quad r=1,\dots,q, \end{array} \end{aligned} $$

    (1.8)

    where θ r is the rth component of θ, which has dimension q. The standard procedure of finding the ML estimator, or MLE, is to solve the ML equations ∂l∕∂β = 0, ∂l∕∂θ = 0. However, finding the solution may not be the end of the story. In other words, the solution to (1.7) and (1.8) is not necessarily the MLE. See Sect. 1.8 for further discussion. Let p be the dimension of β. For simplicity, we assume that X is of full (column) rank; that is, rank(X) = p (see Sect. 1.8). Let $$(\hat {\beta }, \hat {\theta })$$ be the MLE. From (1.7) one obtains

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \hat{\beta}& =&\displaystyle (X'\hat{V}^{-1}X)^{-1}X'\hat{V}^{-1}y, \end{array} \end{aligned} $$

    (1.9)

    where $$\hat {V}=V(\hat {\theta })$$ , that is, V with the variance components involved replaced by their MLE. Thus, once the MLE of θ is found, the MLE of β can be calculated by the closed-form expression (1.9). As for the MLE of θ, by (1.7) and (1.8), it is easy to show that it satisfies

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y'P\frac{\partial V}{\partial\theta_{r}}Py& =&\displaystyle \mathrm{tr}\left(V^{-1}\frac{\partial V}{\partial\theta_{r}}\right),\quad r=1,\dots,q, \end{array} \end{aligned} $$

    (1.10)

    where

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} P& =&\displaystyle V^{-1}-V^{-1}X(X'V^{-1}X)^{-1}X'V^{-1}. \end{array} \end{aligned} $$

    (1.11)

    Thus, one procedure is to first solve (1.10) for $$\hat {\theta }$$ and then compute $$\hat {\beta }$$ by (1.9). The computation of the MLE is discussed in Sect. 1.6.1.
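
    The sketch below (not the book's code) illustrates this procedure numerically for simulated data from Example 1.1: for each trial value of θ = (σ ², τ ²), β is profiled out via (1.9), the resulting log-likelihood (1.6) is evaluated, and a general-purpose optimizer maximizes it over θ. The log-scale parameterization is simply a device to keep the variances positive.

# Profile maximum likelihood for the one-way random effects model (sketch).
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(4)
m, k = 100, 4
n = m * k
sigma2_true, tau2_true, mu_true = 2.0, 1.0, 3.0
X = np.ones((n, 1))
Z = np.kron(np.eye(m), np.ones((k, 1)))
y = X @ np.array([mu_true]) \
    + Z @ rng.normal(0, np.sqrt(sigma2_true), m) \
    + rng.normal(0, np.sqrt(tau2_true), n)

def neg_profile_loglik(log_theta):
    sigma2, tau2 = np.exp(log_theta)            # log scale keeps the variances positive
    V = sigma2 * Z @ Z.T + tau2 * np.eye(n)
    Vinv = np.linalg.inv(V)
    beta = np.linalg.solve(X.T @ Vinv @ X, X.T @ Vinv @ y)   # (1.9) at the current theta
    r = y - X @ beta
    sign, logdet = np.linalg.slogdet(V)
    return 0.5 * (logdet + r @ Vinv @ r)        # negative of (1.6), up to a constant

res = minimize(neg_profile_loglik, x0=np.log([1.0, 1.0]), method="Nelder-Mead")
sigma2_hat, tau2_hat = np.exp(res.x)
print("MLE of (sigma^2, tau^2):", round(sigma2_hat, 3), round(tau2_hat, 3))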

    In the special case of mixed ANOVA models (Sect. 1.2.1.1) with the original form of variance components , we have

    $$V=\tau ^{2}I_{n}+\sum _{r=1}^{s}\sigma _{r}^{2}Z_{r}Z_{r}^{\prime }$$

    ; hence

    $$\partial V/\partial \tau ^{2}=I_{n},\ \partial V/\partial \sigma _{r}^{2}=Z_{r}Z_{r}^{\prime },\ 1\leq r\leq s$$

    . Similarly, with the Hartley–Rao form , we have

    $$V=\lambda (I_{n}+\sum _{r=1}^{s}\gamma _{r} Z_{r}Z_{r}^{\prime })$$

    ; hence

    $$\partial V/\partial \lambda =V/\lambda ,\ \partial V/\partial \gamma _{r}=\lambda Z_{r}Z_{r}^{\prime },\ 1\leq r\leq s$$

    . With these expressions, the above equations may be further simplified. As for the longitudinal model (Sect. 1.2.1.2), the specification of (1.7)–(1.10) is left as an exercise (see Exercise 1.5).

    For mixed ANOVA models (Sect. 1.2.1.1), asymptotic properties of the MLE, including consistency and asymptotic normality , were first studied by Hartley and Rao (1967). Also see Anderson (1969, 1971a), who studied asymptotic properties of the MLE under the marginal model (Sect. 1.2.1.3) with a linear covariance structure, and Miller (1977). In these papers the authors have assumed that the number of fixed effects (i.e., p) remains fixed or bounded as the sample size n increases. As it turns out, this assumption is critical to ensure consistency of the MLE. See Example 1.7. Jiang (1996) considered asymptotic behavior of the MLE when p increases with n and compared it with that of the REML estimator. The results hold for non-Gaussian mixed ANOVA models (Sect. 1.2.2.1), which, of course, include the Gaussian case. See Sect. 1.8 for more details.

    1.3.1.2 Asymptotic Covariance Matrix

    Under suitable conditions (see Sect. 1.8 for discussion), the MLE is consistent and asymptotically normal with the asymptotic covariance matrix equal to the inverse of the Fisher information matrix . Let ψ = (β′, θ′). Then, under regularity conditions, the Fisher information matrix has the following expressions:

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{Var}\left(\frac{\partial l}{\partial\psi}\right) & =&\displaystyle -\mathrm{E}\left(\frac{\partial^{2}l}{\partial\psi\partial\psi'}\right). \end{array} \end{aligned} $$

    (1.12)

    By (1.7) and (1.8), further expressions can be obtained for the elements of (1.12). For example, assuming that V is twice continuously differentiable (with respect to the components of θ), then, using the results of Appendices B and C, it can be shown (Exercise 1.6) that

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{E}\left(\frac{\partial^{2}l}{\partial\beta\partial\beta'}\right) & =&\displaystyle -X'V^{-1}X, \end{array} \end{aligned} $$

    (1.13)

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{E}\left(\frac{\partial^{2}l}{\partial\beta\partial\theta_{r}}\right) & =&\displaystyle 0,\quad r=1,\dots,q, \end{array} \end{aligned} $$

    (1.14)

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{E}\left(\frac{\partial^{2}l}{\partial\theta_{r}\partial\theta_{s}}\right) & =&\displaystyle -\frac{1}{2}\mathrm{tr}\left(V^{-1}\frac{\partial V}{\partial\theta_{r}}V^{-1}\frac{\partial V}{\partial\theta_{s}}\right),\quad 1\leq r,s\leq q. \end{array} \end{aligned} $$

    (1.15)

    It follows that (1.12) does not depend on β, and therefore may be denoted by I(θ), as we do in the sequel.
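
    As an illustration (not from the book), the sketch below assembles the blocks of the Fisher information from (1.13)–(1.15) for the one-way model of Example 1.1, with assumed values of the variance components; the inverse of I(θ) is the asymptotic covariance matrix referred to above, and the information is block-diagonal because the cross terms vanish by (1.14).

# Fisher information blocks for the one-way model, theta = (sigma^2, tau^2).
import numpy as np

m, k = 50, 4
n = m * k
X = np.ones((n, 1))
Z = np.kron(np.eye(m), np.ones((k, 1)))
sigma2, tau2 = 2.0, 1.0                        # assumed variance components
V = sigma2 * Z @ Z.T + tau2 * np.eye(n)
Vinv = np.linalg.inv(V)
dV = [Z @ Z.T, np.eye(n)]                      # dV/d sigma^2 and dV/d tau^2

I_beta = X.T @ Vinv @ X                        # -E(d^2 l / d beta d beta'), from (1.13)
I_theta = np.array([[0.5 * np.trace(Vinv @ dV[r] @ Vinv @ dV[s])
                     for s in range(2)] for r in range(2)])   # from (1.15)
print("asym. var of beta-hat:", np.round(np.linalg.inv(I_beta), 4))
print("asym. cov of theta-hat:", np.round(np.linalg.inv(I_theta), 4))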

    We now consider some examples.

    Example 1.1 (Continued)

    It can be shown (see Exercise 1.7) that, in this case, (1.6) has the following expression:

    $$\displaystyle \begin{aligned} \begin{array}{rcl} l(\mu,\sigma^{2},\tau^{2})& =&\displaystyle c-\frac{1}{2}(n-m)\log(\tau^{2})- \frac{1}{2}\sum_{i=1}^{m}\log(\tau^{2}+k_{i}\sigma^{2})\\ & &\displaystyle -\frac{1}{2\tau^{2}}\sum_{i=1}^{m}\sum_{j=1}^{k_{i}}(y_{ij}-\mu)^{2} +\frac{\sigma^{2}}{2\tau^{2}}\sum_{i=1}^{m}\frac{k_{i}^{2}}{\tau^{2} +k_{i}\sigma^{2}}(\bar{y}_{i\cdot}-\mu)^{2}, \end{array} \end{aligned} $$

    where c is a constant,

    $$n=\sum _{i=1}^{m}k_{i}$$

    , and

    $$\bar {y}_{i\cdot } =k_{i}^{-1}\sum _{j=1}^{k_{i}}y_{ij}$$

    . Furthermore, (1.7) and (1.8) become

    $$\displaystyle \begin{aligned} \begin{array}{rcl} \frac{\partial l}{\partial\mu}& =&\displaystyle \sum_{i=1}^{m}\frac{k_{i}}{\tau^{2} +k_{i}\sigma^{2}}(\bar{y}_{i\cdot}-\mu),\\ \frac{\partial l}{\partial\tau^{2}}& =&\displaystyle -\frac{n-m}{2\tau^{2}}-\frac {1}{2}\sum_{i=1}^{m}\frac{1}{\tau^{2}+k_{i}\sigma^{2}} +\frac{1}{2\tau^{4}}\sum_{i=1}^{m}\sum_{j=1}^{k_{i}}(y_{ij}-\mu)^{2}\\ & &\displaystyle -\frac{\sigma^{2}}{2\tau^{2}}\sum_{i=1}^{m}\left(\frac{1}{\tau^{2}}+ \frac{1}{\tau^{2}+k_{i}\sigma^{2}}\right)\frac{k_{i}^{2}}{\tau^{2}+ k_{i}\sigma^{2}}(\bar{y}_{i\cdot}-\mu)^{2},\\ \frac{\partial l}{\partial\sigma^{2}}& =&\displaystyle -\frac{1}{2}\sum_{i=1}^{m} \frac{k_{i}}{\tau^{2}+k_{i}\sigma^{2}}+\frac{1}{2}\sum_{i=1}^{m} \left(\frac{k_{i}}{\tau^{2}+k_{i}\sigma^{2}}\right)^{2}(\bar{y}_{i\cdot} -\mu)^{2}. \end{array} \end{aligned} $$

    The specification of the asymptotic covariance matrix in this case is left as an exercise (Exercise 1.7).

    Example 1.7 (Neyman–Scott problem )

    Neyman and Scott (1948) used the following example to show that, when the number of parameters increases with the sample size, the MLE may not be consistent. Suppose that two observations are collected from each of m individuals. Each individual has its own (unknown) mean, say, μ i for the ith individual. Suppose that the observations are independent and normally distributed with variance σ ². The problem of interest is to estimate σ ². The model may be expressed as y ij = μ i + 𝜖 ij, i = 1, …, m, j = 1, 2, where 𝜖 ijs are independent and distributed as N(0, σ ²). Note that this may be viewed as a special case of the linear mixed model (1.1), in which Z = 0. However, it can be shown that, as m → ∞, the MLE of σ ² is inconsistent (Exercise 1.8).

    1.3.2 Restricted Maximum Likelihood (REML)

    The inconsistency of the MLE in Example 1.7 has to do with the bias. In general, the MLEs of the variance components are biased. Such a bias can be so severe that it may not vanish as the sample size increases, if the number of the fixed effects is proportional to the sample size. In fact, in the latter case, the MLE will be inconsistent (Jiang 1996).

    Furthermore, in cases such as Example 1.7, the fixed effects are considered as nuisance parameters, while the main interest is the variance components. However, with maximum likelihood one has to estimate all of the parameters involved, including the nuisance fixed effects. It would be nicer to have a method that can estimate the parameters of main interest without having to deal with the nuisance parameters. To introduce such a method, let us revisit the Neyman–Scott problem (Example 1.7).

    Example 1.7 (Continued)

    In this case, there are m + 1 parameters, of which the means μ 1, …, μ m are nuisance, while the parameter of main interest is σ ². Clearly, the number of parameters is proportional to the sample size, which is 2m. Now, instead of using the original data, consider the following simple transformation: z i = y i1 − y i2. It follows that z 1, …, z m are independent and distributed as N(0, 2σ ²). What makes a difference is that now the nuisance parameters are gone; that is, they are not involved in the distribution of the z is. In fact, the MLE of σ ² based on the new data, z 1, …, z m, is consistent (Exercise 1.8). Note that, after the transformation, one is in a world with a single parameter, σ ², and m observations.
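
    The simulation sketch below (illustrative only) makes the contrast concrete: the full-data MLE of σ ² converges to σ ²∕2, a well-known consequence of the growing number of nuisance means, whereas the MLE based on the differences z i is consistent.

# Neyman-Scott problem: inconsistent full-data MLE vs. consistent difference-based MLE.
import numpy as np

rng = np.random.default_rng(5)
sigma2 = 4.0
for m in (100, 10000, 1000000):
    mu = rng.normal(0.0, 10.0, m)                  # individual-specific nuisance means
    y = mu[:, None] + rng.normal(0.0, np.sqrt(sigma2), (m, 2))
    ybar = y.mean(axis=1, keepdims=True)
    mle_full = np.sum((y - ybar) ** 2) / (2 * m)   # MLE using all 2m observations
    z = y[:, 0] - y[:, 1]
    mle_diff = np.sum(z ** 2) / (2 * m)            # MLE based on z_i ~ N(0, 2 sigma^2)
    print(m, round(mle_full, 3), round(mle_diff, 3))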

    The trick used in the above example is simple: apply a transformation to the data to eliminate the (nuisance) fixed effects; then use the transformed data to estimate the variance component. The method can be illustrated under a general framework as follows.

    1.3.2.1 Point Estimation

    As before, we assume, w.l.o.g., that rank(X) = p. Let A be an n × (n − p) matrix such that

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{rank}(A)=n-p,\;\;A'X=0. \end{array} \end{aligned} $$

    (1.16)

    Then, define z = A′y. It is easy to see that z ∼ N(0, A′V A). It follows that the joint pdf of z is given by

    $$\displaystyle \begin{aligned} \begin{array}{rcl} f_{\mathrm{R}}(z)& =&\displaystyle \frac{1}{(2\pi)^{(n-p)/2}|A'VA|{}^{1/2}}\exp\left\{-\frac {1}{2}z'(A'VA)^{-1}z\right\}, \end{array} \end{aligned} $$

    where and hereafter the subscript R corresponds to restricted. Thus, the log-likelihood based on z, which we call restricted log-likelihood, is given by

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} l_{\mathrm{R}}(\theta)& =&\displaystyle c-\frac{1}{2}\log(|A'VA|)-\frac{1}{2}z'(A'VA)^{-1}z, \end{array} \end{aligned} $$

    (1.17)

    where c is a constant. By differentiating the restricted log-likelihood (see Appendix A), we obtain, expressed in terms of y,

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \frac{\partial l_{\mathrm{R}}}{\partial\theta_{r}}& =&\displaystyle \frac{1}{2}\left\{y'P\frac{\partial V}{\partial\theta_{r}}Py-\mathrm{tr}\left(P\frac{\partial V}{\partial\theta_{r}}\right)\right\},\quad r=1,\dots,q, \end{array} \end{aligned} $$

    (1.18)

    where

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} P& =&\displaystyle A(A'VA)^{-1}A'\;\;=\;\;\mathrm{the}\;\mathrm{right}\;\mathrm{side}\;\mathrm{of} \;(\text{1.11}) \end{array} \end{aligned} $$

    (1.19)

    (see Appendix A). The REML estimator of θ is defined as the maximizer of (1.17). As in the ML case, such a maximizer satisfies the REML equation ∂l R∕∂θ = 0. See Sect. 1.8 for further discussion.

    Remark

    Although the REML estimator is defined through a transforming matrix A, the REML estimator, in fact, does not depend on A. To see this, note that, by (1.18) and (1.19), the REML equations do not depend on A. A more thorough demonstration is left as an exercise (Exercise 1.9). This fact is important because, obviously, the choice of A is not unique, and one does not want the estimator to depend on the choice of the transformation.
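
    A small numerical check of this remark (illustration only): for any A satisfying (1.16), the matrix A(A′V A)⁻¹A′ coincides with the P of (1.11), so the REML equations (1.18) are unaffected by the particular choice of A, including A replaced by AT for a nonsingular T.

# Verify that P = A (A'VA)^{-1} A' does not depend on the choice of A.
import numpy as np
from scipy.linalg import null_space

rng = np.random.default_rng(6)
n, p = 12, 3
X = rng.normal(size=(n, p))
V = np.eye(n) + 0.5 * np.ones((n, n))              # some positive definite V

A = null_space(X.T)                                # n x (n - p), columns satisfy X'a = 0
T = rng.normal(size=(n - p, n - p))                # nonsingular with probability 1
B = A @ T                                          # an alternative transforming matrix

def P_from(Amat):
    return Amat @ np.linalg.inv(Amat.T @ V @ Amat) @ Amat.T

Vinv = np.linalg.inv(V)
P_direct = Vinv - Vinv @ X @ np.linalg.inv(X.T @ Vinv @ X) @ X.T @ Vinv   # (1.11)
print(np.allclose(P_from(A), P_direct), np.allclose(P_from(B), P_direct))  # True True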

    Example 1.7 (Continued)

    It is easy to see that the transformation z i = y i1 − y i2, i = 1, …, m corresponds to

    $$\displaystyle \begin{aligned} \begin{array}{rcl} A& =&\displaystyle \left(\begin{array}{ccccccc} 1& -1& 0& 0& \cdots& 0& 0\\ 0& 0& 1& -1& \cdots& 0& 0\\ \vdots& \vdots& \vdots& \vdots& \vdots& \vdots& \vdots\\ 0& 0& 0& 0& \cdots& 1& -1 \end{array}\right)'. \end{array} \end{aligned} $$

    An alternative transforming matrix may be obtained as B = AT, where T is any m × m nonsingular matrix. But the resulting REML estimator of σ ² is the same (Exercise 1.9).

    1.3.2.2 Historical Note

    The REML method was first proposed by Thompson (1962); it was put on a broad basis by Patterson and Thompson (1971). The method is also known as residual maximum likelihood, although the abbreviation, REML , remains the same. There have been different derivations of REML. For example, Harville (1974) provided a Bayesian derivation of REML. He showed that the restricted likelihood can be derived as the marginal likelihood when β is integrated out under a non-informative, or flat, prior (Exercise 1.10). Also see Verbyla (1990). Barndorff-Nielsen (1983) derived the restricted likelihood as a modified profile likelihood. Jiang (1996) pointed out that the REML equations may be derived under the assumption of a multivariate t-distribution (instead of multivariate normal distribution). More generally, Heyde (1994) showed that the REML equations may be viewed as quasi-likelihood equations. See Heyde (1997) for further details. Surveys on REML can be found in Harville (1977), Khuri and Sahai (1985), Robinson (1987), and Speed (1997).

    Note that the restricted log-likelihood (1.17) is a function of θ only. In other words, the REML method is a method of estimating θ (not β, because the latter is eliminated before the estimation). However, once the REML estimator of θ is obtained, β is usually estimated the same way as the ML, that is, by (1.9), where $$\hat{V}=V(\hat {\theta })$$ with $$\hat {\theta }$$ being the REML estimator. Such an estimator is sometimes referred to as the REML estimator of β.

    1.3.2.3 Asymptotic Covariance Matrix

    Under suitable conditions, the REML estimator is consistent and asymptotically normal (see Sect. 1.8 for discussion). The asymptotic covariance matrix is equal to the inverse of the restricted Fisher information matrix , which, under regularity conditions, has similar expressions as (1.12):

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{Var}\left(\frac{\partial l_{\mathrm{R}}}{\partial\theta}\right)& =&\displaystyle -\mathrm{E}\left(\frac{\partial^{2}l_{\mathrm{R}}}{\partial\theta\partial\theta'} \right). \end{array} \end{aligned} $$

    (1.20)

    Further expressions may be obtained. For example, assuming, again, that V is twice continuously differentiable (with respect to the components of θ), then we have (Exercise 1.11)

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \mathrm{E}\left(\frac{\partial^{2}l_{\mathrm{R}}}{\partial\theta_{r}\partial\theta_{s}}\right)& =&\displaystyle -\frac{1}{2}\mathrm{tr}\left(P\frac{\partial V}{\partial\theta_{r}}P\frac{\partial V}{\partial\theta_{s}}\right),\quad 1\leq r,s\leq q. \end{array} \end{aligned} $$

    (1.21)

    Example 1.1 (Continued)

    For simplicity, consider the balanced case, that is, k i = k, 1 ≤ i m. It can be shown (Exercise 1.12) that, in this case, the REML equations ∂l R∕∂τ ² = 0 and ∂l R∕∂σ ² = 0 are equivalent to the following equation system:

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} \left\{\begin{array}{rcl}\tau^{2}+k\sigma^{2}& =&\mathrm{MSA},\\ \tau^{2}& =&\mathrm{MSE},\end{array}\right. \end{array} \end{aligned} $$

    (1.22)

    where

    $$\mathrm {MSA}=\mathrm {SSA}/(m-1),\ \mathrm {SSA}=k\sum _{i=1}^{m} (\bar {y}_{i\cdot }-\bar {y}_{\cdot \cdot })^{2},\ \bar {y}_{i\cdot }= k^{-1}\sum _{j=1}^{k}y_{ij},\ \bar {y}_{\cdot \cdot }=(mk)^{-1}\sum _{i=1}^{m} \sum _{j=1}^{k}y_{ij}$$

    ;

    $$\mathrm {MSE}=\mathrm {SSE}/m(k-1),\ \mathrm {SSE}= \sum _{i=1}^{m}\sum _{j=1}^{k}(y_{ij}-\bar {y}_{i\cdot })^{2}$$

    . The REML equations thus have an explicit solution:

    $$\dot {\tau }^{2}=\mathrm {MSE},\ \dot {\sigma }^{2}=k^{-1}(\mathrm {MSA}-\mathrm {MSE})$$

    . Note that these are only the solution to the REML equations, which are not necessarily the REML estimators (although, in most cases, the two are identical). For example, it is seen that $$\dot {\sigma }^{2}$$ can be negative; when this happens $$\dot {\sigma }^{2}$$ cannot be the REML estimator because the latter, by definition, has to belong to the parameter space, which is [0, ∞) for a variance.
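
    The sketch below (simulated data, illustration only) computes the closed-form solution (1.22) in the balanced case and, if the solution for σ ² is negative, truncates it at zero; the truncation rule is a common convention rather than a prescription from the text above, which only notes that a negative solution cannot be the REML estimator.

# Closed-form REML solution for the balanced one-way random effects model.
import numpy as np

rng = np.random.default_rng(7)
m, k, mu = 200, 5, 3.0
sigma2, tau2 = 1.5, 2.0                  # true values used only to simulate data
y = mu + rng.normal(0, np.sqrt(sigma2), (m, 1)) + rng.normal(0, np.sqrt(tau2), (m, k))

ybar_i = y.mean(axis=1)                  # group means
ybar = y.mean()                          # overall mean
SSA = k * np.sum((ybar_i - ybar) ** 2)
SSE = np.sum((y - ybar_i[:, None]) ** 2)
MSA, MSE = SSA / (m - 1), SSE / (m * (k - 1))

tau2_reml = MSE
sigma2_reml = max((MSA - MSE) / k, 0.0)  # truncate at zero to stay in the parameter space
print(round(tau2_reml, 3), round(sigma2_reml, 3))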

    The derivation of the asymptotic covariance matrix in this special case is left as an exercise (Exercise 1.12).

    1.4 Estimation in Non-Gaussian Linear Mixed Models

    The methods discussed in the previous section are based on the normality assumption. However, the normality assumption is likely to be violated in practice. For example, Lange and Ryan (1989) gave several examples showing that non-normality of the random effects is, indeed, encountered in practice. The authors also developed a method for assessing normality of the random effects. Due to such concerns, linear mixed models without normality assumption, or non-Gaussian linear mixed models, have been considered. In this section, we focus on estimation in the two types of non-Gaussian linear mixed models described in Sect. 1.2.2, that is, the mixed ANOVA model and longitudinal model. Sections 1.4.1 and 1.4.2 discuss estimation in ANOVA models, and Sects. 1.4.3 and 1.4.4 deal with longitudinal models. It should be noted that, although the methods proposed here for the two types of models are different, it does not mean that one method cannot be applied to a different type of model. In fact, the two methods overlap in some special cases. See the discussion in Sect. 1.4.4. Finally, in Sect. 1.4.5 we consider an application of high-dimensional misspecified mixed model analysis that may be viewed as a case of non-Gaussian mixed ANOVA model.

    1.4.1 Quasi-Likelihood Method

    In this and the next sections, we discuss estimation in non-Gaussian mixed ANOVA models. Some remarks are made at the end of the next section on possible extension of the method to more general models.

    First note that, when normality is not assumed, (fully) likelihood-based inference is difficult, or even impossible. To see this, first note that if the distributions of the random effects and errors are not specified, the likelihood function is simply not available. Furthermore, even if the (non-normal) distributions of the random effects and errors are specified (up to some unknown parameters), the likelihood function is usually complicated. In particular, such a likelihood may not have an analytic expression. Finally, like normality, any other specific distributional assumption may not hold in practice. These difficulties have led to consideration of methods other than maximum likelihood. One such method is Gaussian likelihood, or, as we call it, quasi-likelihood .

    The idea is to use normality-based estimators even if the data are not normal. For the mixed ANOVA models, the REML estimator of θ is defined as the solution to the (Gaussian) REML equations , provided that the solution belongs to the parameter space. See Sect. 1.8 for some discussion on how to handle cases where the solution is outside the parameter space. Similarly, the ML estimators of β and θ are defined as the solution to the (Gaussian) ML equations , provided that they stay in the parameter space. More specifically, under the mixed ANOVA model with the original form of variance components , the REML equations are given by (Exercise 1.13):

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y'P^{2}y\;=\;\mathrm{tr}(P),\qquad y'PZ_{r}Z_{r}'Py\;=\;\mathrm{tr}(Z_{r}'PZ_{r}),\quad r=1,\dots,s. \end{array} \end{aligned} $$

    (1.23)

    With the same model and variance components, the ML equations are

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} & &\displaystyle X'V^{-1}X\beta\;=\;X'V^{-1}y,\qquad (y-X\beta)'V^{-2}(y-X\beta)\;=\;\mathrm{tr}(V^{-1}),\\ & &\displaystyle (y-X\beta)'V^{-1}Z_{r}Z_{r}'V^{-1}(y-X\beta)\;=\;\mathrm{tr}(Z_{r}'V^{-1}Z_{r}),\quad r=1,\dots,s. \end{array} \end{aligned} $$

    (1.24)

    Similarly, the REML equations under the mixed ANOVA model with the Hartley–Rao form of variance components are given by

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} y'Py\;=\;n-p,\qquad y'PZ_{r}Z_{r}'Py\;=\;\mathrm{tr}(Z_{r}'PZ_{r}),\quad r=1,\dots,s; \end{array} \end{aligned} $$

    (1.25)

    the corresponding ML equations are given by

    $$\displaystyle \begin{aligned} \begin{array}{rcl}{} & &\displaystyle X'V^{-1}X\beta\;=\;X'V^{-1}y,\qquad (y-X\beta)'V^{-1}(y-X\beta)\;=\;n,\\ & &\displaystyle (y-X\beta)'V^{-1}Z_{r}Z_{r}'V^{-1}(y-X\beta)\;=\;\mathrm{tr}(Z_{r}'V^{-1}Z_{r}),\quad r=1,\dots,s. \end{array} \end{aligned} $$

    (1.26)

    In order to justify such an approach, let us first point out that, although the REML estimator is defined as the solution to the REML equations, which are derived under normality, normal likelihood is not the only one that can lead to the REML equations. For example, Jiang (1996) showed that exactly the same equations arise if one starts with a multivariate t-distribution, that is, y ∼ t n(Xβ, V, d), which has a joint pdf

    $$\displaystyle \begin{aligned} \begin{array}{rcl} p(y)& =&\displaystyle \frac{\varGamma\{(n+d)/2\}}{(d\pi)^{n/2}\varGamma(d/2)|V|{}^{1/2}} \left\{1+\frac{1}{d}(y-X\beta)'V^{-1}(y-X\beta)\right\}^{-(n+d)/2} \end{array} \end{aligned} $$

    (Exercise 1.14). Here d is the degree of freedom of the multivariate t-distribution. More generally, Heyde (1994, 1997) showed that the REML equations can be derived from a quasi-likelihood . As it turns out, the likelihood under multivariate t is a special case of Heyde’s quasi-likelihood. For such a reason, the (Gaussian) REML estimation may be regarded as a method of quasi-likelihood. Similarly, the (Gaussian) ML estimation may be justified from a quasi-likelihood point of view. For simplicity, the corresponding estimators are still called REML or ML estimators.

    Furthermore, it has been shown (Richardson and Welsh 1994; Jiang 1996, 1997a) that the REML estimator is consistent and asymptotically normal even if normality does not hold. Furthermore, Jiang (1996) showed that the ML estimator has similar
