Exploring Data Analysis: The Computer Revolution in Statistics

Ebook381 pages2 hours

Exploring Data Analysis: The Computer Revolution in Statistics

Name: Exploring Data Analysis: The Computer Revolution in Statistics
ISBN: 9780520338210

By W. J. Dixon

Rating: 0 out of 5 stars

()

Read preview

About this ebook

This title is part of UC Press's Voices Revived program, which commemorates University of California Press’s mission to seek out and cultivate the brightest minds and give them voice, reach, and impact. Drawing on a backlist dating to 1893, Voices Revived makes high-quality, peer-reviewed scholarship accessible once again using print-on-demand technology. This title was originally published in 1974.
This title is part of UC Press's Voices Revived program, which commemorates University of California Press’s mission to seek out and cultivate the brightest minds and give them voice, reach, and impact. Drawing on a backlist dating to 1893, Voices Revived

Skip carousel

LanguageEnglish

PublisherUniversity of California Press

Release dateDec 22, 2023

ISBN9780520338210

Related to Exploring Data Analysis

Related ebooks

Skip carousel

Data Preparation and Exploration: Applied to Healthcare Data
Ebook
Data Preparation and Exploration: Applied to Healthcare Data
byRobert Hoyt
Rating: 0 out of 5 stars
0 ratings
Data and the American Dream: Contemporary Social Controversies and the American Community Survey
Ebook
Data and the American Dream: Contemporary Social Controversies and the American Community Survey
byMatthew J. Holian
Rating: 0 out of 5 stars
0 ratings
Principles of Biomedical Informatics
Ebook
Principles of Biomedical Informatics
byIra J. Kalet
Rating: 0 out of 5 stars
0 ratings
Clinical Decision Support Systems: Theory and Practice
Ebook
Clinical Decision Support Systems: Theory and Practice
byEta S. Berner
Rating: 3 out of 5 stars
3/5
Schaum's Outline of Elements of Statistics I: Descriptive Statistics and Probability
Ebook
Schaum's Outline of Elements of Statistics I: Descriptive Statistics and Probability
byStephen Bernstein
Rating: 0 out of 5 stars
0 ratings
Repurposing Legacy Data: Innovative Case Studies
Ebook
Repurposing Legacy Data: Innovative Case Studies
byJules J. Berman
Rating: 0 out of 5 stars
0 ratings
Biostatistics: A Guide to Design, Analysis and Discovery
Ebook
Biostatistics: A Guide to Design, Analysis and Discovery
byRonald N. Forthofer
Rating: 0 out of 5 stars
0 ratings
Statistical Method from the Viewpoint of Quality Control
Ebook
Statistical Method from the Viewpoint of Quality Control
byWalter A. Shewhart
Rating: 5 out of 5 stars
5/5
Data Analysis: What Can Be Learned From the Past 50 Years
Ebook
Data Analysis: What Can Be Learned From the Past 50 Years
byPeter J. Huber
Rating: 0 out of 5 stars
0 ratings
Data-Centric Biology: A Philosophical Study
Ebook
Data-Centric Biology: A Philosophical Study
bySabina Leonelli
Rating: 0 out of 5 stars
0 ratings
Statistical Design and Analysis of Experiments: With Applications to Engineering and Science
Ebook
Statistical Design and Analysis of Experiments: With Applications to Engineering and Science
byRobert L. Mason
Rating: 0 out of 5 stars
0 ratings
Ways of Knowing in HCI
Ebook
Ways of Knowing in HCI
byJudith S. Olson
Rating: 5 out of 5 stars
5/5
Data Treatment in Environmental Sciences
Ebook
Data Treatment in Environmental Sciences
byValérie David
Rating: 0 out of 5 stars
0 ratings
Quantitative Analysis and Modeling of Earth and Environmental Data: Space-Time and Spacetime Data Considerations
Ebook
Quantitative Analysis and Modeling of Earth and Environmental Data: Space-Time and Spacetime Data Considerations
byJiaping Wu
Rating: 0 out of 5 stars
0 ratings
Practical Biostatistics: A Friendly Step-by-Step Approach for Evidence-based Medicine
Ebook
Practical Biostatistics: A Friendly Step-by-Step Approach for Evidence-based Medicine
byMendel Suchmacher
Rating: 5 out of 5 stars
5/5
Logic of Discovery and Diagnosis in Medicine
Ebook
Logic of Discovery and Diagnosis in Medicine
byKenneth F. Schaffner
Rating: 0 out of 5 stars
0 ratings
Linear and Generalized Linear Mixed Models and Their Applications
Ebook
Linear and Generalized Linear Mixed Models and Their Applications
byJiming Jiang
Rating: 0 out of 5 stars
0 ratings
Multiple Imputation and its Application
Ebook
Multiple Imputation and its Application
byJames Carpenter
Rating: 0 out of 5 stars
0 ratings
Introduction to Data Analysis in Qualitative Research
Ebook
Introduction to Data Analysis in Qualitative Research
byAsher Shkedi
Rating: 0 out of 5 stars
0 ratings
Signal Processing for Neuroscientists, A Companion Volume: Advanced Topics, Nonlinear Techniques and Multi-Channel Analysis
Ebook
Signal Processing for Neuroscientists, A Companion Volume: Advanced Topics, Nonlinear Techniques and Multi-Channel Analysis
byWim van Drongelen
Rating: 0 out of 5 stars
0 ratings
Analysis of Clinical Trials Using SAS: A Practical Guide, Second Edition
Ebook
Analysis of Clinical Trials Using SAS: A Practical Guide, Second Edition
byCSPtrade2
Rating: 0 out of 5 stars
0 ratings
Audit Studies: Behind the Scenes with Theory, Method, and Nuance
Ebook
Audit Studies: Behind the Scenes with Theory, Method, and Nuance
byS. Michael Gaddis
Rating: 0 out of 5 stars
0 ratings
Clinical Research Computing: A Practitioner's Handbook
Ebook
Clinical Research Computing: A Practitioner's Handbook
byPrakash Nadkarni
Rating: 0 out of 5 stars
0 ratings
Designing User Studies in Informatics
Ebook
Designing User Studies in Informatics
byGondy Leroy
Rating: 0 out of 5 stars
0 ratings
Multimethod Research, Causal Mechanisms, and Case Studies: An Integrated Approach
Ebook
Multimethod Research, Causal Mechanisms, and Case Studies: An Integrated Approach
byGary Goertz
Rating: 0 out of 5 stars
0 ratings
Psychophysics: A Practical Introduction
Ebook
Psychophysics: A Practical Introduction
byFrederick A.A. Kingdom
Rating: 0 out of 5 stars
0 ratings
Data Mining for the Social Sciences: An Introduction
Ebook
Data Mining for the Social Sciences: An Introduction
byPaul Attewell
Rating: 0 out of 5 stars
0 ratings
Computational Frameworks: Systems, Models and Applications
Ebook
Computational Frameworks: Systems, Models and Applications
byMamadou Kaba Traore
Rating: 0 out of 5 stars
0 ratings
Sensory Evaluation Practices
Ebook
Sensory Evaluation Practices
byElsevier Books Reference
Rating: 5 out of 5 stars
5/5
Complex Surveys: A Guide to Analysis Using R
Ebook
Complex Surveys: A Guide to Analysis Using R
byThomas Lumley
Rating: 0 out of 5 stars
0 ratings

Data Modeling & Design For You

Skip carousel

Supercharge Power BI: Power BI is Better When You Learn To Write DAX
Ebook
Supercharge Power BI: Power BI is Better When You Learn To Write DAX
byMatt Allington
Rating: 5 out of 5 stars
5/5
DAX Patterns: Second Edition
Ebook
DAX Patterns: Second Edition
byMarco Russo
Rating: 5 out of 5 stars
5/5
Microsoft 365 Excel: The Only App That Matters: Calculations, Analytics, Modeling, Data Analysis and Dashboard Reporting for the New Era of Dynamic Data Driven Decision Making & Insight
Ebook
Microsoft 365 Excel: The Only App That Matters: Calculations, Analytics, Modeling, Data Analysis and Dashboard Reporting for the New Era of Dynamic Data Driven Decision Making & Insight
byMike Girvin
Rating: 3 out of 5 stars
3/5
Hacks To Crush Plc Program Fast & Efficiently Everytime... : Coding, Simulating & Testing Programmable Logic Controller With Examples
Ebook
Hacks To Crush Plc Program Fast & Efficiently Everytime... : Coding, Simulating & Testing Programmable Logic Controller With Examples
byMichael Blake
Rating: 5 out of 5 stars
5/5
R Programming - a Comprehensive Guide: Software
Ebook
R Programming - a Comprehensive Guide: Software
byEditor IJSMI
Rating: 0 out of 5 stars
0 ratings
Ultimate Enterprise Data Analysis and Forecasting using Python
Ebook
Ultimate Enterprise Data Analysis and Forecasting using Python
byShanthababu Pandian
Rating: 0 out of 5 stars
0 ratings
Thinking in Algorithms: Strategic Thinking Skills, #2
Ebook
Thinking in Algorithms: Strategic Thinking Skills, #2
byAlbert Rutherford
Rating: 5 out of 5 stars
5/5
Advanced Deep Learning with Python: Design and implement advanced next-generation AI solutions using TensorFlow and PyTorch
Ebook
Advanced Deep Learning with Python: Design and implement advanced next-generation AI solutions using TensorFlow and PyTorch
byIvan Vasilev
Rating: 0 out of 5 stars
0 ratings
Power Pivot and Power BI: The Excel User's Guide to DAX, Power Query, Power BI & Power Pivot in Excel 2010-2016
Ebook
Power Pivot and Power BI: The Excel User's Guide to DAX, Power Query, Power BI & Power Pivot in Excel 2010-2016
byRob Collie
Rating: 4 out of 5 stars
4/5
Data Analytics for Beginners: Introduction to Data Analytics
Ebook
Data Analytics for Beginners: Introduction to Data Analytics
byAnthony S. Williams
Rating: 4 out of 5 stars
4/5
Data Visualization: a successful design process
Ebook
Data Visualization: a successful design process
byAndy Kirk
Rating: 4 out of 5 stars
4/5
The Secrets of ChatGPT Prompt Engineering for Non-Developers
Ebook
The Secrets of ChatGPT Prompt Engineering for Non-Developers
byCea West
Rating: 5 out of 5 stars
5/5
Bayesian Analysis with Python
Ebook
Bayesian Analysis with Python
byOsvaldo Martin
Rating: 5 out of 5 stars
5/5
Deep Learning: An Essential Guide to Deep Learning for Beginners Who Want to Understand How Deep Neural Networks Work and Relate to Machine Learning and Artificial Intelligence
Ebook
Deep Learning: An Essential Guide to Deep Learning for Beginners Who Want to Understand How Deep Neural Networks Work and Relate to Machine Learning and Artificial Intelligence
byHerbert Jones
Rating: 5 out of 5 stars
5/5
Mastering Agile User Stories
Ebook
Mastering Agile User Stories
byDeEtta Balthazar
Rating: 4 out of 5 stars
4/5
Raspberry Pi :Raspberry Pi Guide On Python & Projects Programming In Easy Steps
Ebook
Raspberry Pi :Raspberry Pi Guide On Python & Projects Programming In Easy Steps
byJason Scotts
Rating: 3 out of 5 stars
3/5
End-to-End Data Science with SAS: A Hands-On Programming Guide
Ebook
End-to-End Data Science with SAS: A Hands-On Programming Guide
byJames Gearheart
Rating: 0 out of 5 stars
0 ratings
The Esri Guide to GIS Analysis, Volume 3: Modeling Suitability, Movement, and Interaction
Ebook
The Esri Guide to GIS Analysis, Volume 3: Modeling Suitability, Movement, and Interaction
byAndy Mitchell
Rating: 0 out of 5 stars
0 ratings
Graph Databases in Action: Examples in Gremlin
Ebook
Graph Databases in Action: Examples in Gremlin
byJosh Perryman
Rating: 0 out of 5 stars
0 ratings
AutoCAD® Pocket Reference
Ebook
AutoCAD® Pocket Reference
byCheryl R. Shrock
Rating: 0 out of 5 stars
0 ratings
Data Fluency: Empowering Your Organization with Effective Data Communication
Ebook
Data Fluency: Empowering Your Organization with Effective Data Communication
byZach Gemignani
Rating: 2 out of 5 stars
2/5
Data Analytics with Python: Data Analytics in Python Using Pandas
Ebook
Data Analytics with Python: Data Analytics in Python Using Pandas
byFrank Millstein
Rating: 3 out of 5 stars
3/5
A Concise Guide to Object Orientated Programming
Ebook
A Concise Guide to Object Orientated Programming
byalasdair gilchrist
Rating: 0 out of 5 stars
0 ratings
The Systems Thinker - Mental Models: The Systems Thinker Series, #3
Ebook
The Systems Thinker - Mental Models: The Systems Thinker Series, #3
byAlbert Rutherford
Rating: 0 out of 5 stars
0 ratings
Learn T-SQL Querying: A guide to developing efficient and elegant T-SQL code
Ebook
Learn T-SQL Querying: A guide to developing efficient and elegant T-SQL code
byPedro Lopes
Rating: 0 out of 5 stars
0 ratings
Principles of Data Science
Ebook
Principles of Data Science
bySinan Ozdemir
Rating: 4 out of 5 stars
4/5
Machine Learning: A Comprehensive, Step-by-Step Guide to Learning and Understanding Machine Learning Concepts, Technology and Principles for Beginners: 1
Ebook
Machine Learning: A Comprehensive, Step-by-Step Guide to Learning and Understanding Machine Learning Concepts, Technology and Principles for Beginners: 1
byPeter Bradley
Rating: 0 out of 5 stars
0 ratings
Learning Python Design Patterns - Second Edition
Ebook
Learning Python Design Patterns - Second Edition
byGiridhar Chetan
Rating: 0 out of 5 stars
0 ratings
Brainstorming and Beyond: A User-Centered Design Method
Ebook
Brainstorming and Beyond: A User-Centered Design Method
byChauncey Wilson
Rating: 0 out of 5 stars
0 ratings
No-Code Data Science: Mastering Advanced Analytics, Machine Learning, and Artificial Intelligence
Ebook
No-Code Data Science: Mastering Advanced Analytics, Machine Learning, and Artificial Intelligence
byDavid Patrishkoff
Rating: 0 out of 5 stars
0 ratings

Related podcast episodes

Skip carousel

Keeping ourselves honest when we work with observational healthcare data: The abundance of data in healthcare, and the valu…
Podcast episode
Keeping ourselves honest when we work with observational healthcare data: The abundance of data in healthcare, and the valu…
byLinear Digressions
0 ratings
0% found this document useful
Data Decisions (w/ Dr. Peter Enns)
Podcast episode
Data Decisions (w/ Dr. Peter Enns)
byThe People Nerds Podcast
0 ratings
0% found this document useful
B. Fong and D. I. Spivak, "An Invitation to Applied Category Theory: Seven Sketches in Compositionality" (Cambridge UP, 2019): Fong and Spivak have written a marvelous and timely new textbook that, as its title suggests, invites readers of all backgrounds to explore what it means to take a compositional approach and how it might serve their needs....
Podcast episode
B. Fong and D. I. Spivak, "An Invitation to Applied Category Theory: Seven Sketches in Compositionality" (Cambridge UP, 2019): Fong and Spivak have written a marvelous and timely new textbook that, as its title suggests, invites readers of all backgrounds to explore what it means to take a compositional approach and how it might serve their needs....
byNew Books in Mathematics
0 ratings
0% found this document useful
058R_An adaptive learning process for developing and applying sustainability indicators with local communities (research summary)
Podcast episode
058R_An adaptive learning process for developing and applying sustainability indicators with local communities (research summary)
byWhat is The Future for Cities?
0 ratings
0% found this document useful
#89 – Owen Cotton-Barratt on epistemic systems and layers of defense against potential global catastrophes: You could think of academia as one big epistemic system — something which processes information, directs people's attention, and finds new ideas. 
Podcast episode
#89 – Owen Cotton-Barratt on epistemic systems and layers of defense against potential global catastrophes: You could think of academia as one big epistemic system — something which processes information, directs people's attention, and finds new ideas. 
by80,000 Hours Podcast
0 ratings
0% found this document useful
Resoundingly Human: Providing decision-makers with the tools they need, featuring AAAS Science & Technology Policy Fellows: Operations research, analytics, data science, and other related disciplines enable individuals and organizations to transform data into insights that facilitate better, more informed decision-making in order to save lives, save money, and solve...
Podcast episode
Resoundingly Human: Providing decision-makers with the tools they need, featuring AAAS Science & Technology Policy Fellows: Operations research, analytics, data science, and other related disciplines enable individuals and organizations to transform data into insights that facilitate better, more informed decision-making in order to save lives, save money, and solve...
byResoundingly Human
0 ratings
0% found this document useful
Crime Prevention: Modellansatz 109
Podcast episode
Crime Prevention: Modellansatz 109
byModellansatz
0 ratings
0% found this document useful
Crime Prevention: Modellansatz 109
Podcast episode
Crime Prevention: Modellansatz 109
byModellansatz - English episodes only
0 ratings
0% found this document useful
S1E31: Interview with Rajeev Dehejia, Professor at NYU and Economist
Podcast episode
S1E31: Interview with Rajeev Dehejia, Professor at NYU and Economist
byThe Mixtape with Scott
0 ratings
0% found this document useful
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
Podcast episode
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
byNew Books in Mathematics
0 ratings
0% found this document useful
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
Podcast episode
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
byNew Books in Science, Technology, and Society
0 ratings
0% found this document useful
#47 Yaneer Bar-Yam on Complex Systems and the War on Values: During this thought provoking episode, Prof. discusses the nature of complex systems and complexity science. Our discussion covers the cacophony of signals within the information environment and how complexity science provides tools for understanding...
Podcast episode
#47 Yaneer Bar-Yam on Complex Systems and the War on Values: During this thought provoking episode, Prof. discusses the nature of complex systems and complexity science. Our discussion covers the cacophony of signals within the information environment and how complexity science provides tools for understanding...
byThe Cognitive Crucible
0 ratings
0% found this document useful
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
Podcast episode
Thomas Huckle and Tobias Neckel, "Bits and Bugs: A Scientific and Historical Review of Software Failures in Computational Science" (SIAM, 2019): An interview with Thomas Huckle and Tobias Neckel
byNew Books in the History of Science
0 ratings
0% found this document useful
[From the Archives] Ep 91: Dr. Mary Ellen Dello Stritto and Dr. William Marelich on the Applied Quantitative Perspective
Podcast episode
[From the Archives] Ep 91: Dr. Mary Ellen Dello Stritto and Dr. William Marelich on the Applied Quantitative Perspective
byResearch in Action | A podcast for faculty & higher education professionals on research design, methods, productivity & more
0 ratings
0% found this document useful
Cerebral Fluid Flow: Modellansatz 134
Podcast episode
Cerebral Fluid Flow: Modellansatz 134
byModellansatz - English episodes only
0 ratings
0% found this document useful
Shared Measurement and Big Data For Good: Traditional tools for evaluation and measurement fail to take into account the complexity of an interconnected and digitized world. Emerging techniques, such as developmental evaluation, improve on traditional linear, cause-and-effect models, while...
Podcast episode
Shared Measurement and Big Data For Good: Traditional tools for evaluation and measurement fail to take into account the complexity of an interconnected and digitized world. Emerging techniques, such as developmental evaluation, improve on traditional linear, cause-and-effect models, while...
byInside Social Innovation
0 ratings
0% found this document useful
Empowering Communities through Local Monitoring: BioScience handling editors Rick Bonney, of Cornell University, and Finn Danielsen, of the Nordic Foundation for Development and Ecology (NORDECO) join us to discuss an open-access special section on community-based monitoring programs and the broader fut
Podcast episode
Empowering Communities through Local Monitoring: BioScience handling editors Rick Bonney, of Cornell University, and Finn Danielsen, of the Nordic Foundation for Development and Ecology (NORDECO) join us to discuss an open-access special section on community-based monitoring programs and the broader fut
byBioScience Talks
0 ratings
0% found this document useful
Sandy Pentland on Social Physics: For Alex “Sandy” Pentland, one of the best-known and widely cited computational social scientists in the world, these are halcyon days for his field. One of the creators of the MIT Media Lab and currently the director of the...
Podcast episode
Sandy Pentland on Social Physics: For Alex “Sandy” Pentland, one of the best-known and widely cited computational social scientists in the world, these are halcyon days for his field. One of the creators of the MIT Media Lab and currently the director of the...
bySocial Science Bites
100%
100% found this document useful
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
Podcast episode
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
byNew Books in Business, Management, and Marketing
0 ratings
0% found this document useful
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
Podcast episode
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
byNew Books in Science, Technology, and Society
0 ratings
0% found this document useful
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
Podcast episode
Renee M. P. Teate, "SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis" (John Wiley & Sons, 2021): An interview with Renee M. P. Teate
byNew Books in Economics
0 ratings
0% found this document useful
Bringing it All Together: Chaining Procedures in AAC
Podcast episode
Bringing it All Together: Chaining Procedures in AAC
bySLP Nerdcast
0 ratings
0% found this document useful
Episode 121 - Megan Levis, full interview (rerun): Life is pretty intense for Paul these days. We present this interview with Megan Levis from the 2019 Society of Catholic Scientists archives, every bit as relevant now as it was then. It was originally presented as two episodes. Megan Levis is a fifth-ye...
Podcast episode
Episode 121 - Megan Levis, full interview (rerun): Life is pretty intense for Paul these days. We present this interview with Megan Levis from the 2019 Society of Catholic Scientists archives, every bit as relevant now as it was then. It was originally presented as two episodes. Megan Levis is a fifth-ye...
byThat's So Second Millennium
0 ratings
0% found this document useful
Dynamical Sampling: Modellansatz 173
Podcast episode
Dynamical Sampling: Modellansatz 173
byModellansatz - English episodes only
0 ratings
0% found this document useful
Dynamical Sampling
Podcast episode
Dynamical Sampling
byModellansatz
0 ratings
0% found this document useful
Using Behavior Analysis for Policy Development and Analysis: Inside JABA 14: Thanks so much for checking out installment number 14 in the Inside JABA Series on Behavioral Observations. Dr. John Borrero, JABA's Editor in Chief, and I are joined by Drs. Brett Gelino and Derek Reed to discuss a novel study that they and their...
Podcast episode
Using Behavior Analysis for Policy Development and Analysis: Inside JABA 14: Thanks so much for checking out installment number 14 in the Inside JABA Series on Behavioral Observations. Dr. John Borrero, JABA's Editor in Chief, and I are joined by Drs. Brett Gelino and Derek Reed to discuss a novel study that they and their...
byThe Behavioral Observations Podcast with Matt Cicoria
0 ratings
0% found this document useful
Challenging the Foundation of Asset Pricing Theory with Andrew Chen and Alejandro Lopez-Lira
Podcast episode
Challenging the Foundation of Asset Pricing Theory with Andrew Chen and Alejandro Lopez-Lira
byExcess Returns
0 ratings
0% found this document useful
Christine L. Borgman, “Big Data, Little Data, No Data: Scholarship in the Networked World” (MIT Press, 2015): Social media and digital technology now allow researchers to collect vast amounts of a variety data quickly. This so-called “big data,” and the practices that surround its collection, is all the rage in both the media and in research circles.
Podcast episode
Christine L. Borgman, “Big Data, Little Data, No Data: Scholarship in the Networked World” (MIT Press, 2015): Social media and digital technology now allow researchers to collect vast amounts of a variety data quickly. This so-called “big data,” and the practices that surround its collection, is all the rage in both the media and in research circles.
byNew Books in Education
0 ratings
0% found this document useful
S1E29: Interview with Noam Angrist, Co-founder and Director of Youth Impact
Podcast episode
S1E29: Interview with Noam Angrist, Co-founder and Director of Youth Impact
byThe Mixtape with Scott
0 ratings
0% found this document useful
Increasing Access to Genomic Data and Driving Data-Driven Medicine—Kevin Puylaert—SOPHiA GENETICS: Kevin Puylaert is general manager of North American operations and VP of business development at SOPHiA GENETICS, a company that’s offering software as a service platform for the democratization of data-driven medicine. They have already connected...
Podcast episode
Increasing Access to Genomic Data and Driving Data-Driven Medicine—Kevin Puylaert—SOPHiA GENETICS: Kevin Puylaert is general manager of North American operations and VP of business development at SOPHiA GENETICS, a company that’s offering software as a service platform for the democratization of data-driven medicine. They have already connected...
byFinding Genius Podcast
0 ratings
0% found this document useful

Skip carousel

NIH-funded Project Aims To Build A ‘Google’ For Biomedical Data
STAT
Article
NIH-funded Project Aims To Build A ‘Google’ For Biomedical Data
Jul 31, 2019
4 min read
The National Academies Illustrates the More Nuanced Value of Transparency in Science
Union of Concerned Scientists
Article
The National Academies Illustrates the More Nuanced Value of Transparency in Science
May 13, 2019
4 min read
Opinion: Machine Learning For Clinical Decision-making: Pay Attention To What You Don’t See
STAT
Article
Opinion: Machine Learning For Clinical Decision-making: Pay Attention To What You Don’t See
Dec 12, 2019
Don't take results from machine learning algorithms at face value. Ask what information isn't available. What subgroups haven't been prioritized? Who is on the research team?
4 min read
Data Doesn’t Speak, People Do!
Union of Concerned Scientists
Article
Data Doesn’t Speak, People Do!
Mar 3, 2022
Science Network guest blogger Professor Barbara Allen describes how scientists can better engage with communities for the best impact from their work.
2 min read
The Trump EPA Is Restricting EPA Science. It’s Somehow Worse than We Expected.
Union of Concerned Scientists
Article
The Trump EPA Is Restricting EPA Science. It’s Somehow Worse than We Expected.
Mar 4, 2020
3 min read
Four New (Old) Ways the White House is Trying to Restrict Science for Policymaking
Union of Concerned Scientists
Article
Four New (Old) Ways the White House is Trying to Restrict Science for Policymaking
Apr 25, 2019
5 min read
Can A Research Accelerator Solve The Psychology Replication Crisis?
NPR
Article
Can A Research Accelerator Solve The Psychology Replication Crisis?
Dec 13, 2019
6 min read
How the Pandemic Has Tested Behavioral Science
Nautilus
Article
How the Pandemic Has Tested Behavioral Science
Jul 6, 2020
5 min read
A Graduate Researcher’s (Brief) Guide to: Creating a Student Science Policy Group
Union of Concerned Scientists
Article
A Graduate Researcher’s (Brief) Guide to: Creating a Student Science Policy Group
Apr 18, 2018
5 min read
With So Many COVID-19 Models, Which Is Best?
Futurity
Article
With So Many COVID-19 Models, Which Is Best?
May 8, 2020
3 min read
Opinion: All Study Participants Have A Right To Know Their Own Results. My Lab Has Been Doing That For Years
STAT
Article
Opinion: All Study Participants Have A Right To Know Their Own Results. My Lab Has Been Doing That For Years
Sep 5, 2018
Giving study participants their individual results can drive greater public participation in research, increased support for science, and better health.
6 min read
EPA Should Cancel Plans to Restrict Science Once and For All
Union of Concerned Scientists
Article
EPA Should Cancel Plans to Restrict Science Once and For All
May 15, 2020
3 min read
People Who Think Further Into The Future Less Likely To Take Risks
Futurity
Article
People Who Think Further Into The Future Less Likely To Take Risks
Feb 6, 2018
People who tend to think further into the future may be more likely to invest money and avoid risks, a new study suggests. Researchers tapped big data tools to conduct text analyses of nearly 40,000 Twitter users and to run online experiments of thei
3 min read
Five Things to Yell About in the EPA’s New Opaque “Transparency” Supplemental Rule
Union of Concerned Scientists
Article
Five Things to Yell About in the EPA’s New Opaque “Transparency” Supplemental Rule
Nov 12, 2019
4 min read
Is Transparency Always A Good Thing? EPA Weighs Controversial New Rule.
The Christian Science Monitor
Article
Is Transparency Always A Good Thing? EPA Weighs Controversial New Rule.
Mar 12, 2020
The Environmental Protection Agency is mulling a proposal to give preference to scientific research whose datasets and models are publicly available.
3 min read
Opinion: What Facebook’s Public Scrutiny Can Teach Us About Artificial Intelligence In Health Care
STAT
Article
Opinion: What Facebook’s Public Scrutiny Can Teach Us About Artificial Intelligence In Health Care
Apr 11, 2018
3 min read
Engaged Science: 6 Tips for the Trump Era
Union of Concerned Scientists
Article
Engaged Science: 6 Tips for the Trump Era
Jun 26, 2018
5 min read
Science Impact
AQ: Australian Quarterly
Article
Science Impact
Dec 30, 2019
To what end are you working? Presumably for the principle that science’s sole aim must be to lighten the burden of human existence. If the scientists, brought to heel by self-interested rulers, limit themselves to piling up knowledge for knowledge’s
10 min read
EPA’s New Scientific Integrity Policy: the Good, the Bad, and the Ugly
Union of Concerned Scientists
Article
EPA’s New Scientific Integrity Policy: the Good, the Bad, and the Ugly
Jan 30, 2024
The Environmental Protection Agency (EPA) recently released its updated scientific integrity policy for public comment. Here are the details and how you can make your voice heard.
5 min read
Machine Learning Could Warn Us About The Next Public Health Threat
Futurity
Article
Machine Learning Could Warn Us About The Next Public Health Threat
Nov 11, 2022
2 min read
Our Aversion to A/B Testing on Humans Is Dangerous
Nautilus
Article
Our Aversion to A/B Testing on Humans Is Dangerous
Jun 24, 2019
5 min read
Six Things You Should Know About The EPA’s New Science Restriction Draft Policy
Union of Concerned Scientists
Article
Six Things You Should Know About The EPA’s New Science Restriction Draft Policy
Apr 25, 2018
5 min read
Wastewater Captures COVID Outbreaks Even Before Test Results
Futurity
Article
Wastewater Captures COVID Outbreaks Even Before Test Results
Oct 17, 2022
2 min read
Policy During a Pandemic: How to Make Research Accessible for Policymakers During the COVID-19 Pandemic
Union of Concerned Scientists
Article
Policy During a Pandemic: How to Make Research Accessible for Policymakers During the COVID-19 Pandemic
Jul 6, 2020
4 min read
Updated Restricted Science Rule Spells Reanalysis Paralysis for the EPA
Union of Concerned Scientists
Article
Updated Restricted Science Rule Spells Reanalysis Paralysis for the EPA
Nov 12, 2019
7 min read
Opinion: Sharing Clinical Trial Data: Lessons From The YODA Project
STAT
Article
Opinion: Sharing Clinical Trial Data: Lessons From The YODA Project
Nov 18, 2019
The culture of clinical research is changing, and there are now expectations that researchers will share data — even when it isn't required.
5 min read
Industry’s Newest Tactics to Undermine EPA Science
Union of Concerned Scientists
Article
Industry’s Newest Tactics to Undermine EPA Science
Apr 11, 2024
Industry is attempting some new tactics to undermine independent science and science-based decisionmaking at the Environmental Protection Agency (EPA). The EPA previously released their updated scientific integrity policy for public comment, and many
5 min read
Why Data Matters For Tracking Biodiversity Changes
Futurity
Article
Why Data Matters For Tracking Biodiversity Changes
Oct 3, 2018
New research highlights the importance of trait variability within species in measuring biodiversity changes and how ecologists can incorporate that data into their assessments. Around the world, ecologists are studying how species are responding to
2 min read
'The Cloud' and Other Dangerous Metaphors
The Atlantic
Article
'The Cloud' and Other Dangerous Metaphors
Jan 20, 2015
4 min read
Twitter Can Reveal The Well-being Of A Whole Community
Futurity
Article
Twitter Can Reveal The Well-being Of A Whole Community
May 4, 2020
3 min read

Related categories

Skip carousel

Reviews for Exploring Data Analysis

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Exploring Data Analysis - W. J. Dixon

EXPLORING DATA ANALYSIS

Exploring Data Analysis

The Computer Revolution In Statistics

Edited by

W. J. DIXON Department of Biomathematics University of California, Los Angeles

and

W. L. NICHOLSON Battelle Pacific Northwest Laboratories and

National Buréau of Standards

UNIVERSITY OF CALIFORNIA PRESS

Berkeley Los Angeles London

University of California Press Berkeley and Los Angeles, California

University of California Press, Ltd.

London, England

ISBN: 0-520-02470-2

Printed in the United States of America

Contents 1

Contents 1

Preface

CHAPTER 1

CHAPTER 2

CHAPTER 3

CHAPTER 4

CHAPTER 5

CHAPTER 6

CHAPTER 7

CHAPTER 8

Citation Index

Preface

The genesis of this book was a conference on statistical computing, organized as a workshop, to examine the frontiers of data analysis based on computer use. It was held in the Health Sciences Computing Facility (HSCF) at the University of California at Los Angeles in September 1971.

The original impetus for such a workshop came from discussions with Wesley Nicholson during an international meeting in London some years earlier. We were dismayed at the current ivory tower trends in statistics. Mimicking the mathematicians, statisticians were increasingly avoiding the real world of application, and were purifying and separating the field from other sciences. The conference was planned as a counterrevolution to that trend.

The Health Sciences Computing Facility provided an excellent place for the workshop. The facility is dedicated to serving biomedical research through research in mathematics, statistics and computer science. It has an IBM 360/91 and numerous typewriter, character scope, and graphics consoles served by a time-sharing operating system. The system specializes in interactive statistical techniques and the programs to serve them. Of special importance to conference participants was the use of graphical statistical techniques.

Participants were limited to a select group of practicing data analysts. The papers presented real problems and included a discussion of the physical mechanisms involved in generating data for the analyses. With a real problem as a focal point, the analyses pursued the needs of the problem rather than stressing particular techniques of statistics. But any new techniques useful for the analyses were emphasized, and the degree to which the derivation and use of the techniques was dependent on the computer was stressed.

Each paper was available to several critics in advance of the meeting. Their comments are included in this volume as well as additional comments by the authors and other critics that developed during the sessions.

The conference revealed many characteristics of a data analyst at work.

In contrast to the biologist who examines his data with the constructs of his own field in mind, the data analyst examines the data for its apparent similarity to a variety of statistical models he has in mind, letting the results of successive analytical attempts guide the direction he pursues (and refines) as he proceeds. The statistician approach might be described as one in which he states: "if we assume normality, independence, and perhaps other fundamentals, then the results indicate the validity of certain stated hypotheses with associated probabilities. ¹¹ In contrast, the data analyst may use many of the same techniques, but he will explore (also with statistical techniques) the degree to which these assumptions might be affecting his conclusions, and the consequences to the applicational field of deviations from reality in the analytical assumptions.

The data analyst seems to be more involved in exploration than in refinement. He is slow to make assumptions before he examines the data. He is quite satisfied if any advance is made in the problem area independent of the sophistication of the analysis, the goodness of agreement of his model, or the presentability of the statistical analysis itself.

He is quite prepared to find that one might arrive at the same conclusion using quite different routes and quite different techniques. The data analyst is almost sure to have a deep involvement in computers since he requires computing power for his freedom to use a wide variety of techniques.

Techniques and analysts are not independent. They interact.

One obtains a maximum result from interactions rather than from main effects. A particular person who uses certain techniques more powerfully than someone else may obtain better answers using those techniques than others can. On the other hand, another person may use his own techniques and do equally well, that is, there is an interaction in the process.

Even when techniques are mathematically equivalent, different analysts use them in different ways. One may think and do analysis of variance, and another may think regression. They may be doing the same thing but their thought processes and the way they proceed through the analysis of the problem differ because of the way they conceptualize analysis of variance and regression; although the language may differ and even communication maybe difficult, the overall analyses may really be very similar.

By the end of the conference it was clear that there is a heavy interaction between analysts and scientists in other fields. In most cases, the analyst has become very involved with the subject matter of the field’s basic theories and problems. The statistical research for his data analysis is truly collaborative— in many cases he enlists the cooperation of other statisticians as well. The statistical analysis is not separated and pursued for its mathematical elegance, rather it is oriented toward the needs of the problem.

Perhaps this team work and cooperation is the most important and far-reaching revelation of the conference.

A short definition of data analysis was proposed at the conference: Data analysis is the application of one or more techniques to a set of data steered by the problem.

Computer facilities at HSCF were available to participants before and during the conference, and a UCLA,rbuddyn was assigned to each participant to help in any way necessary. Data presented at the conference is available from HSCF in machine readable form. A data set description containing at least a partial listing of the data from each paper is given in this book.

The computational support was made possible by grant RR-3 of the Biotechnology Resources Branch, Division of Research and Resources of the National Institutes of Health. The conference itself was supported by grant GJ-29844 from the National Science Foundation.

Acknowledgements are due several members of my staff for their help with the conference and in preparing material for this book. Ed Chen, Dolores Adams and Ellen Sommers assisted in preparations for and during the conference. Ellen Sommers prepared and edited the associated data sets. Lyda Boyer edited, and Betsy Potter typed the manuscripts.

Much of the work or organization of the conference itself and working with the authors on the preparation of their manuscripts was done by Wesley Nicholson.

W. J. Dixon

CHAPTER 1

ADVANCED BREAST CANCER DATA JAMES DICKEY

Statistics Department, State University of New York at Buffalo and

JUDY WALRATH

Department of Epidemiology and Public Health, Yale University

The majority of medical data-analysis problems arise from a physician’s hope that his records of past cases will yield useful information. The real problems are mathematically vague, but tangible: What lessons are to be learned from past experience for future clinical practice? What patient subpopulations have distinctive behavior patterns? What treatments should be used in what kinds of cases?

In the language of John Tukey (1962, 1970), these are problems of exploratory data analysis — problems of how to Find Interesting Reportable Effects (FIRE).

FIRE problems, however, are not the subject of the bulk of statistical theory, which is devised for After The Revelation Orderly Pickling of HYpotheses (ATROPHY), and to Guard Against Silly Selection Effects by Definition (GASSED).

Research for this study was supported by NIGMS-NIH Grant GM 16557.

Linear discrimination procedures have not been very productive in real medical problems (Radhakrishna, 1964). Even the FIRE- problem-motivated stepwise linear procedures (regression and discrimination) deliver linear functions that tend to be almost meaningless as final answers to physicians and statisticians alike, especially linear functions of three or more variables. They may, however, be useful in pointing out the few important variables.

In this paper we strive to concentrate on FIRE problems of clinical-experience data, with the aim of contributing to a general systematic approach involving the use of computer programs as steps in an analytic sequence. We discuss exploratory data analysis for an important class of problems — the prediction of a dichotomized treatment-response variable.

Prof. Wilfrid J. Dixon’s (1969, 1970) BMD biomedical computer programs are widely used for practical data analysis. Contributions to a systematized approach, inspired by the BMD programs, are put forth here, together with a few rough predecessor FORTRAN language programs, and programs not yet available.

In the following section we introduce, as concrete motivation, the well-studied (Armitage et al, 1969) advanced breast cancer data analysis, and the clinical-decision problem of Bulbrook et al (I960), and Atkins et al (1968). Each of the remaining sections describes a type of computer program:

• First Look At Graphs (FLAG);

• Subsample Histograms Or Plots (SHOP);

• Shop In Full Totality (SIFT); and

• a discussion of discriminant analysis per se, with an emphasis on recent nonparametric procedures.

The typical medical data set features a few (1 — 10) response variables and many (10 — 100) mixed-type (dichotomous to practically continuous) predictor variables, for a precious few (10 — 100Ó) observed cases. Missing values abound. The definitions of individual variables are ambiguous and ill-conceived. The data embody histories of undisciplined clerks’ misunderstandings. In short, the statistics teacher’s nightmare: imperfect data and vague problems.

We consider here a decision problem in the management of advanced breast cancer, and a related data set from Guy’s Hospital, London (Atkins et al, 1968), unusual for the painstaking care with which it was collected. This concrete data- analysis problem is put forth as representative of many in being suited to a general systematic approach.

Two hundred and ten advanced breast cancer patients were included in the study. Approximately two-thirds (139/210) of them had undergone attempted cure by radical (116/210) or simple (23/210) mastectomy, and then a year or so later had a recurrence of tumor growth locally or at a distant site. The other one-third (71/210) had been first diagnosed as already advanced. Three- fifths (132/210) began the palliative stage of their treatment with the administration of hormones, which were useful in some cases (17/132) for up to one year in controlling tumor growth.

Then it was a question of whether or not surgery should be used to alter the hormonal environment of the tumors. If so, which of two operations should be performed: bilateral adrenalectomy with oophorectomy (removal of all adrenals and ovaries), or hypophysectomy (removal of pituitary). Each patient underwent an operation, about half each kind (115/210, 95/210).

For one-quarter of the patients (54/210), the surgery was successful (complete remission of symptoms for over six months); for another one-quarter (53/210), intermediate results (partial remission); and for the other half (103/210), failure (no improvement).

Both surgical procedures are radical attempts to prolong life. Hypophysectomy is a more involved and dangerous operation, but its whole-sample remission percentages (28/95 and 24/95) were essentially the same as those for adrenalectomy (26/115 and 29/115).

Natural suggestions for variables related to surgical success include:

1. measures of tumor growth rate

a) age of patient

b) extent of disease at mastectomy

c) time from mastectomy to recurrence;

2. tumor histology;

3. menopausal status;

4. history of mastectomy; and

5. systemic (hence urinary) hormone levels.

In I960, Dr. R. D. Bulbrook and his coinvestigators at Guy’s Hospital developed a linear discriminant function of two 24-hour- urinary-steroid levels, aetiocholanolone (E) and 17-hydroxy- corticosteroid (17 OHCS),

80 — 80(17 OHCS) + E, (1)

positive values of which tend to predict favorable response to surgery. After further prospective studies, Atkins et al (1968) reported the discriminant function by itself provides an efficient guide to response to hypophysectomy but does not do so for adrenalectomy in this series. They also found small effects for the factors l.c), 3., and 4. above.

Armitage et al (1969) carried out extensive FIRE-like analyses of these same data. First, each of three response variables was dichotomized and fit by Hills’ (1967) stepwise sample-splitting discrimination procedure for dichotomized predictor variables. Then they performed special analyses, each suited to each original response variable.

The response, a clinical assessment of success (as success, intermediate, and failure, defined above), was dichotomized into nonfailure and failure, and then related to various sets of predictor variables. Our discussion is restricted to this choice of a dichotomous response variable and to dichotomized responses in general, thus neglecting other important developments of methodology, for example, survival-time data.

At the suggestion, and through the kindness, of Prof. Marvin Zelen, a card copy of the Armitage et al (1969) data was obtained from John Copas, and a slightly updated version of the original patient records (including 16 new cases) from Dr. R. D. Bulbrook. The updated records of all 210 cases are on file at HSCF under the title Advanced Breast Cancer Data (J. Dickey). A complete listing of the cancer data in card image form is given in the Data Set Description at the end of this chapter. This includes a description of the 50 variables associated with each patient, and, parenthetically, single word acronyms which identify variables.

FIRST LOOK AT GRAPHS (FLAG)

Newly punched data will, with high probability, contain mistaken values appearing as

1. over punches and illegal characters;

2. data-to-format mismatches;

3. nonsense values of a variable

a) off-range numeric values

b) meaningless multiple-choice values;

4. nonsense combinations of variable values, e. g., autopsy date preceding date of death;

5. multivariate outliers; and

6. undetectable-per-se mistaken values.

Computer program-processing systems tend to abort program runs when data input contains mistakes of types 1 and 2. Many data-analysis programs abort or deliver unacceptable output from input mistakes of type 3, and less commonly, of type 4.

One of the functions of our computer program, FLAG (Goldman et al, 1971) is to detect, and identify by flagged output, mistaken

Enjoying the preview?

Page 1 of 1

Exploring Data Analysis: The Computer Revolution in Statistics

About this ebook

Related to Exploring Data Analysis

Related ebooks

Data Modeling & Design For You

Related podcast episodes

Related articles

Related categories

Reviews for Exploring Data Analysis

What did you think?

Book preview

Exploring Data Analysis - W. J. Dixon

Contents 1

Preface

CHAPTER 1