A Primer in Biological Data Analysis and Visualization Using R

Ebook351 pages2 hours

A Primer in Biological Data Analysis and Visualization Using R

Name: A Primer in Biological Data Analysis and Visualization Using R
Author: Gregg Hartvigsen
ISBN: 9780231537049

By Gregg Hartvigsen

Rating: 0 out of 5 stars

()

Read preview

About this ebook

R is a popular programming language that statisticians use to perform a variety of statistical computing tasks. Rooted in Gregg Hartvigsen's extensive experience teaching biology, this text is an engaging, practical, and lab-oriented introduction to R for students in the life sciences.

Underscoring the importance of R and RStudio to the organization, computation, and visualization of biological statistics and data, Hartvigsen guides readers through the processes of entering data into R, working with data in R, and using R to express data in histograms, boxplots, barplots, scatterplots, before/after line plots, pie charts, and graphs. He covers data normality, outliers, and nonnormal data and examines frequently used statistical tests with one value and one sample; paired samples; more than two samples across a single factor; correlation; and linear regression. The volume also includes a section on advanced procedures and a final chapter on possible extensions into programming, featuring a discussion of algorithms, the art of looping, and combining programming and output.

Skip carousel

Biology

LanguageEnglish

PublisherColumbia University Press

Release dateFeb 18, 2014

ISBN9780231537049

Author

Gregg Hartvigsen

Related authors

Skip carousel

Related to A Primer in Biological Data Analysis and Visualization Using R

Related ebooks

Skip carousel

Python for the Life Sciences: A Gentle Introduction to Python for Life Scientists
Ebook
Python for the Life Sciences: A Gentle Introduction to Python for Life Scientists
byAlexander Lancaster
Rating: 0 out of 5 stars
0 ratings
Probably Overthinking It: How to Use Data to Answer Questions, Avoid Statistical Traps, and Make Better Decisions
Ebook
Probably Overthinking It: How to Use Data to Answer Questions, Avoid Statistical Traps, and Make Better Decisions
byAllen B. Downey
Rating: 0 out of 5 stars
0 ratings
R Object-oriented Programming
Ebook
R Object-oriented Programming
byKelly Black
Rating: 3 out of 5 stars
3/5
A Biologist's Guide to Mathematical Modeling in Ecology and Evolution
Ebook
A Biologist's Guide to Mathematical Modeling in Ecology and Evolution
bySarah P. Otto
Rating: 4 out of 5 stars
4/5
Machine Learning in Bioinformatics
Ebook
Machine Learning in Bioinformatics
byYanqing Zhang
Rating: 0 out of 5 stars
0 ratings
Modern Experimental Design
Ebook
Modern Experimental Design
byThomas P. Ryan
Rating: 0 out of 5 stars
0 ratings
Advanced R Statistical Programming and Data Models: Analysis, Machine Learning, and Visualization
Ebook
Advanced R Statistical Programming and Data Models: Analysis, Machine Learning, and Visualization
byMatt Wiley
Rating: 0 out of 5 stars
0 ratings
Data Science Solutions with Python: Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn
Ebook
Data Science Solutions with Python: Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn
byTshepo Chris Nokeri
Rating: 0 out of 5 stars
0 ratings
Computational Immunology: Models and Tools
Ebook
Computational Immunology: Models and Tools
byJosep Bassaganya-Riera
Rating: 0 out of 5 stars
0 ratings
Applied Longitudinal Analysis
Ebook
Applied Longitudinal Analysis
byGarrett M. Fitzmaurice
Rating: 3 out of 5 stars
3/5
Ecological Models and Data in R
Ebook
Ecological Models and Data in R
byBenjamin M. Bolker
Rating: 5 out of 5 stars
5/5
Bayesian Models: A Statistical Primer for Ecologists
Ebook
Bayesian Models: A Statistical Primer for Ecologists
byN. Thompson Hobbs
Rating: 4 out of 5 stars
4/5
Epigenetic Regulation and Epigenomics
Ebook
Epigenetic Regulation and Epigenomics
byRobert A. Meyers
Rating: 5 out of 5 stars
5/5
Statistical Design and Analysis of Experiments: With Applications to Engineering and Science
Ebook
Statistical Design and Analysis of Experiments: With Applications to Engineering and Science
byRobert L. Mason
Rating: 0 out of 5 stars
0 ratings
Life Out of Sequence: A Data-Driven History of Bioinformatics
Ebook
Life Out of Sequence: A Data-Driven History of Bioinformatics
byHallam Stevens
Rating: 4 out of 5 stars
4/5
Introduction to Bioinformatics Using Action Labs
Ebook
Introduction to Bioinformatics Using Action Labs
byJean-Louis Lassez
Rating: 0 out of 5 stars
0 ratings
Robustness and Evolvability in Living Systems
Ebook
Robustness and Evolvability in Living Systems
byAndreas Wagner
Rating: 5 out of 5 stars
5/5
Concepts and Techniques in Genomics and Proteomics
Ebook
Concepts and Techniques in Genomics and Proteomics
byN Saraswathy
Rating: 0 out of 5 stars
0 ratings
Essential Computational Modeling in Chemistry
Ebook
Essential Computational Modeling in Chemistry
byPhilippe G. Ciarlet
Rating: 0 out of 5 stars
0 ratings
Production of Biologicals from Animal Cells in Culture
Ebook
Production of Biologicals from Animal Cells in Culture
byR. E. Spier
Rating: 0 out of 5 stars
0 ratings
Bioinformatics Algorithms: Design and Implementation in Python
Ebook
Bioinformatics Algorithms: Design and Implementation in Python
byMiguel Rocha
Rating: 0 out of 5 stars
0 ratings
Computational Non-coding RNA Biology
Ebook
Computational Non-coding RNA Biology
byYun Zheng
Rating: 0 out of 5 stars
0 ratings
Frontiers in Computational Chemistry: Volume 5
Ebook
Frontiers in Computational Chemistry: Volume 5
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Protein Bioinformatics: From Sequence to Function
Ebook
Protein Bioinformatics: From Sequence to Function
byM. Michael Gromiha
Rating: 5 out of 5 stars
5/5
Probabilistic Methods for Bioinformatics: with an Introduction to Bayesian Networks
Ebook
Probabilistic Methods for Bioinformatics: with an Introduction to Bayesian Networks
byRichard E. Neapolitan
Rating: 0 out of 5 stars
0 ratings
Genes and Genomes
Ebook
Genes and Genomes
byElsevier Books Reference
Rating: 0 out of 5 stars
0 ratings
The Social Amoebae: The Biology of Cellular Slime Molds
Ebook
The Social Amoebae: The Biology of Cellular Slime Molds
byJohn Tyler Bonner
Rating: 5 out of 5 stars
5/5
Statistical Issues in Drug Development
Ebook
Statistical Issues in Drug Development
byStephen S. Senn
Rating: 0 out of 5 stars
0 ratings
Statistics for Research
Ebook
Statistics for Research
byShirley Dowdy
Rating: 0 out of 5 stars
0 ratings
Cancer Genomics: From Bench to Personalized Medicine
Ebook
Cancer Genomics: From Bench to Personalized Medicine
byGraham Dellaire
Rating: 0 out of 5 stars
0 ratings

Biology For You

Skip carousel

Anatomy and Physiology For Dummies
Ebook
Anatomy and Physiology For Dummies
byMaggie Norris
Rating: 4 out of 5 stars
4/5
Sapiens: A Brief History of Humankind
Ebook
Sapiens: A Brief History of Humankind
byYuval Noah Harari
Rating: 4 out of 5 stars
4/5
Anatomy 101: From Muscles and Bones to Organs and Systems, Your Guide to How the Human Body Works
Ebook
Anatomy 101: From Muscles and Bones to Organs and Systems, Your Guide to How the Human Body Works
byKevin Langford
Rating: 4 out of 5 stars
4/5
Dopamine Detox: Biohacking Your Way To Better Focus, Greater Happiness, and Peak Performance
Ebook
Dopamine Detox: Biohacking Your Way To Better Focus, Greater Happiness, and Peak Performance
byNick Trenton
Rating: 3 out of 5 stars
3/5
Why We Sleep: Unlocking the Power of Sleep and Dreams
Ebook
Why We Sleep: Unlocking the Power of Sleep and Dreams
byMatthew Walker
Rating: 4 out of 5 stars
4/5
The Rise and Fall of the Dinosaurs: A New History of a Lost World
Ebook
The Rise and Fall of the Dinosaurs: A New History of a Lost World
bySteve Brusatte
Rating: 4 out of 5 stars
4/5
The Obesity Code: the bestselling guide to unlocking the secrets of weight loss
Ebook
The Obesity Code: the bestselling guide to unlocking the secrets of weight loss
byJason Fung
Rating: 4 out of 5 stars
4/5
Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career
Ebook
Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career
byScott H. Young
Rating: 4 out of 5 stars
4/5
The Grieving Brain: The Surprising Science of How We Learn from Love and Loss
Ebook
The Grieving Brain: The Surprising Science of How We Learn from Love and Loss
byMary-Frances O'Connor
Rating: 4 out of 5 stars
4/5
How Emotions Are Made: The Secret Life of the Brain
Ebook
How Emotions Are Made: The Secret Life of the Brain
byLisa Feldman Barrett
Rating: 4 out of 5 stars
4/5
Homo Deus: A Brief History of Tomorrow
Ebook
Homo Deus: A Brief History of Tomorrow
byYuval Noah Harari
Rating: 4 out of 5 stars
4/5
The Seven Sins of Memory: How the Mind Forgets and Remembers
Ebook
The Seven Sins of Memory: How the Mind Forgets and Remembers
byDaniel L. Schacter
Rating: 4 out of 5 stars
4/5
Gut: The Inside Story of Our Body's Most Underrated Organ (Revised Edition)
Ebook
Gut: The Inside Story of Our Body's Most Underrated Organ (Revised Edition)
byGiulia Enders
Rating: 4 out of 5 stars
4/5
This Will Make You Smarter: 150 New Scientific Concepts to Improve Your Thinking
Ebook
This Will Make You Smarter: 150 New Scientific Concepts to Improve Your Thinking
byJohn Brockman
Rating: 4 out of 5 stars
4/5
Peptide Protocols: Volume One
Ebook
Peptide Protocols: Volume One
byMD William A. Seeds
Rating: 4 out of 5 stars
4/5
Lifespan: Why We Age—and Why We Don't Have To
Ebook
Lifespan: Why We Age—and Why We Don't Have To
byDavid A. Sinclair
Rating: 4 out of 5 stars
4/5
Mother of God: An Extraordinary Journey into the Uncharted Tributaries of the Western Amazon
Ebook
Mother of God: An Extraordinary Journey into the Uncharted Tributaries of the Western Amazon
byPaul Rosolie
Rating: 4 out of 5 stars
4/5
The Soul of an Octopus: A Surprising Exploration into the Wonder of Consciousness
Ebook
The Soul of an Octopus: A Surprising Exploration into the Wonder of Consciousness
bySy Montgomery
Rating: 4 out of 5 stars
4/5
All That Remains: A Renowned Forensic Scientist on Death, Mortality, and Solving Crimes
Ebook
All That Remains: A Renowned Forensic Scientist on Death, Mortality, and Solving Crimes
bySue Black
Rating: 4 out of 5 stars
4/5
The Winner Effect: The Neuroscience of Success and Failure
Ebook
The Winner Effect: The Neuroscience of Success and Failure
byIan H. Robertson
Rating: 5 out of 5 stars
5/5
The Code Breaker: Jennifer Doudna, Gene Editing, and the Future of the Human Race
Ebook
The Code Breaker: Jennifer Doudna, Gene Editing, and the Future of the Human Race
byWalter Isaacson
Rating: 4 out of 5 stars
4/5
Lies My Gov't Told Me: And the Better Future Coming
Ebook
Lies My Gov't Told Me: And the Better Future Coming
byRobert W. Malone
Rating: 4 out of 5 stars
4/5
Outlive Diet Recipes: Over 60 Delicious and Healthy Recipes To Help You Live 10 Decades Younger in The Outlive Plan
Ebook
Outlive Diet Recipes: Over 60 Delicious and Healthy Recipes To Help You Live 10 Decades Younger in The Outlive Plan
byJesse Smith
Rating: 4 out of 5 stars
4/5
Jaws: The Story of a Hidden Epidemic
Ebook
Jaws: The Story of a Hidden Epidemic
bySandra Kahn
Rating: 4 out of 5 stars
4/5
The Blood of Emmett Till
Ebook
The Blood of Emmett Till
byTimothy B. Tyson
Rating: 4 out of 5 stars
4/5
Suicidal: Why We Kill Ourselves
Ebook
Suicidal: Why We Kill Ourselves
byJesse Bering
Rating: 4 out of 5 stars
4/5
The Coming Plague: Newly Emerging Diseases in a World Out of Balance
Ebook
The Coming Plague: Newly Emerging Diseases in a World Out of Balance
byLaurie Garrett
Rating: 4 out of 5 stars
4/5
Woman: An Intimate Geography
Ebook
Woman: An Intimate Geography
byNatalie Angier
Rating: 4 out of 5 stars
4/5
Vax-Unvax: Let the Science Speak
Ebook
Vax-Unvax: Let the Science Speak
byRobert F. Kennedy, Jr.
Rating: 5 out of 5 stars
5/5
The Sixth Extinction: An Unnatural History
Ebook
The Sixth Extinction: An Unnatural History
byElizabeth Kolbert
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Untangling Why Knots Are Important: Everyone knows what a knot is. But they have special significance in math and science because their properties can help unlock hidden secrets like the biochemistry of DNA or the geometry of three-dimensional spaces.
Podcast episode
Untangling Why Knots Are Important: Everyone knows what a knot is. But they have special significance in math and science because their properties can help unlock hidden secrets like the biochemistry of DNA or the geometry of three-dimensional spaces.
byThe Joy of Why
0 ratings
0% found this document useful
Causal inference when you can't experiment: difference-in-differences and synthetic controls: When you need to untangle cause and effect, but y…
Podcast episode
Causal inference when you can't experiment: difference-in-differences and synthetic controls: When you need to untangle cause and effect, but y…
byLinear Digressions
0 ratings
0% found this document useful
Mathematicians Set Numbers in Motion to Unlock Their Secrets: A new proof demonstrates the power of arithmetic dynamics, an emerging discipline that combines insights from number theory and dynamical systems.
Podcast episode
Mathematicians Set Numbers in Motion to Unlock Their Secrets: A new proof demonstrates the power of arithmetic dynamics, an emerging discipline that combines insights from number theory and dynamical systems.
byQuanta Science Podcast
0 ratings
0% found this document useful
Causal Trees: What do you get when you combine the causal infer…
Podcast episode
Causal Trees: What do you get when you combine the causal infer…
byLinear Digressions
0 ratings
0% found this document useful
Aubrey Clayton, "Bernoulli's Fallacy: Statistical Illogic and the Crisis of Modern Science" (Columbia UP, 2021): An interview with Aubrey Clayton
Podcast episode
Aubrey Clayton, "Bernoulli's Fallacy: Statistical Illogic and the Crisis of Modern Science" (Columbia UP, 2021): An interview with Aubrey Clayton
byNew Books in Science, Technology, and Society
0 ratings
0% found this document useful
What Are Gravitational Waves And LIGO For Dummies? with Professor Alan Weinstein
Podcast episode
What Are Gravitational Waves And LIGO For Dummies? with Professor Alan Weinstein
byGetting Curious with Jonathan Van Ness
0 ratings
0% found this document useful
Does Not Compute: Scientific journal articles have a lot of numbers. Scientists are smart people with even smarter computers, so an outsider might think that, if nothing else, you can count on the math checking out. But modern data analysis is complicated, and computation...
Podcast episode
Does Not Compute: Scientific journal articles have a lot of numbers. Scientists are smart people with even smarter computers, so an outsider might think that, if nothing else, you can count on the math checking out. But modern data analysis is complicated, and computation...
byThe Black Goat
0 ratings
0% found this document useful
Probe Data: The Good, The Bad, and The Ugly
Podcast episode
Probe Data: The Good, The Bad, and The Ugly
bySLP Nerdcast
0 ratings
0% found this document useful
Writing Studies Research in Practice: Welcome to Mere Rhetoric, the podcast for beginners and outsider about the ideas, people and movements who have shaped rhetorical history. Today we’re going to talk about the method to the madness, if madness were writing studies research. That’s...
Podcast episode
Writing Studies Research in Practice: Welcome to Mere Rhetoric, the podcast for beginners and outsider about the ideas, people and movements who have shaped rhetorical history. Today we’re going to talk about the method to the madness, if madness were writing studies research. That’s...
byMere Rhetoric
0 ratings
0% found this document useful
Scientific Dog Studies
Podcast episode
Scientific Dog Studies
byDog Talk by Happy Dog Training
0 ratings
0% found this document useful
39 Jordan Ellenberg - Why Math Is The Ultimate BS Detector: Chances are that when you think about math—which, for most of us, happens pretty infrequently—you don't think of it in anything like the way that Jordan Ellenberg does. Ellenberg is a rare scholar who is both a math professor (at the University of Wiscon
Podcast episode
39 Jordan Ellenberg - Why Math Is The Ultimate BS Detector: Chances are that when you think about math—which, for most of us, happens pretty infrequently—you don't think of it in anything like the way that Jordan Ellenberg does. Ellenberg is a rare scholar who is both a math professor (at the University of Wiscon
byInquiring Minds
0 ratings
0% found this document useful
#147 – Spencer Greenberg on stopping valueless papers from getting into top journals: Can you trust the things you read in published scientific research? Not really. 
Podcast episode
#147 – Spencer Greenberg on stopping valueless papers from getting into top journals: Can you trust the things you read in published scientific research? Not really. 
by80,000 Hours Podcast
0 ratings
0% found this document useful
Episode 104 - Scraping Facts Online: If You Can’t Beat ’Em, Datum: At the time of this taping, Paul was in the middle of the Metis “bootcamp” program learning the capabilities, tools, and insights of data science. This conversation ranged widely in the realm of data analysis and management, examining its relevance to Pa...
Podcast episode
Episode 104 - Scraping Facts Online: If You Can’t Beat ’Em, Datum: At the time of this taping, Paul was in the middle of the Metis “bootcamp” program learning the capabilities, tools, and insights of data science. This conversation ranged widely in the realm of data analysis and management, examining its relevance to Pa...
byThat's So Second Millennium
0 ratings
0% found this document useful
The Inaugural Inside JABA Series: Session 102 with Drs. LeBlanc, St. Peter, and Tiger: Welcome to the first installment of The Inside JABA Series. A few months ago, Drs. Linda LeBlanc and Dorothea Lerman approached me about creating an ongoing podcast series that highlights and disseminates the work of The Journal of Applied Behavior...
Podcast episode
The Inaugural Inside JABA Series: Session 102 with Drs. LeBlanc, St. Peter, and Tiger: Welcome to the first installment of The Inside JABA Series. A few months ago, Drs. Linda LeBlanc and Dorothea Lerman approached me about creating an ongoing podcast series that highlights and disseminates the work of The Journal of Applied Behavior...
byThe Behavioral Observations Podcast with Matt Cicoria
0 ratings
0% found this document useful
I Felt Like a Real Scientist: As scientists we are accustomed to knowing the results when we evaluate the quality of research. But is that a good thing? How would it change the way we edit and review research if we had to make our evaluations without knowing the results? And beyond t...
Podcast episode
I Felt Like a Real Scientist: As scientists we are accustomed to knowing the results when we evaluate the quality of research. But is that a good thing? How would it change the way we edit and review research if we had to make our evaluations without knowing the results? And beyond t...
byThe Black Goat
0 ratings
0% found this document useful
Our Most Significant Episode Ever: p-values. Love them or hate them, they are everywhere in science. In this episode we talk about some of our thoughts and feelings about this ubiquitous statistics. What are the drawbacks and benefits to dichotomizing results into "significant" and "nonsi...
Podcast episode
Our Most Significant Episode Ever: p-values. Love them or hate them, they are everywhere in science. In this episode we talk about some of our thoughts and feelings about this ubiquitous statistics. What are the drawbacks and benefits to dichotomizing results into "significant" and "nonsi...
byThe Black Goat
0 ratings
0% found this document useful
Inside JABA Series #2: Session 106: If you missed the first installment of the Inside JABA Series, let me explain what’s going on here: Once a quarter, I’ll be joined by Drs. Linda Leblanc, Clair St. Peter, and Jeff Tiger to discuss the latest issue of The Journal of Applied...
Podcast episode
Inside JABA Series #2: Session 106: If you missed the first installment of the Inside JABA Series, let me explain what’s going on here: Once a quarter, I’ll be joined by Drs. Linda Leblanc, Clair St. Peter, and Jeff Tiger to discuss the latest issue of The Journal of Applied...
byThe Behavioral Observations Podcast with Matt Cicoria
0 ratings
0% found this document useful
Four Most Commonly Asked Questions About AI with Dr. Jerry Smith: Dr. Jerry Smith welcomes you to another episode of AI Live and Unbiased to explore the breadth and depth of Artificial Intelligence and to encourage you to change the world, not just observe it! Dr. Jerry is talking today about questions and...
Podcast episode
Four Most Commonly Asked Questions About AI with Dr. Jerry Smith: Dr. Jerry Smith welcomes you to another episode of AI Live and Unbiased to explore the breadth and depth of Artificial Intelligence and to encourage you to change the world, not just observe it! Dr. Jerry is talking today about questions and...
byAI Live & Unbiased
0 ratings
0% found this document useful
Do baboons understand death?: A conversation with Alecia Carter and Elise Huchard
Podcast episode
Do baboons understand death?: A conversation with Alecia Carter and Elise Huchard
byMany Minds
0 ratings
0% found this document useful
23: Don't Touch My Circles! (Geometry): In the study of mathematics, there are many abstractions that we deal with. For example, we deal with the notion of a real number with infinitesimal granularity and infinite range, even though we have no evidence for this existing in nature besides...
Podcast episode
23: Don't Touch My Circles! (Geometry): In the study of mathematics, there are many abstractions that we deal with. For example, we deal with the notion of a real number with infinitesimal granularity and infinite range, even though we have no evidence for this existing in nature besides...
byBreaking Math Podcast
0 ratings
0% found this document useful
Survey Raking: It's quite common for survey respondents not to b…
Podcast episode
Survey Raking: It's quite common for survey respondents not to b…
byLinear Digressions
0 ratings
0% found this document useful
Just Be Cause: Many important questions about cause and effect are impractical to answer with a randomized experiment. What should we do instead? In this episode we talk about doing causal inference with observational data. Has psychology's historical obsession with in...
Podcast episode
Just Be Cause: Many important questions about cause and effect are impractical to answer with a randomized experiment. What should we do instead? In this episode we talk about doing causal inference with observational data. Has psychology's historical obsession with in...
byThe Black Goat
0 ratings
0% found this document useful
Alignment Newsletter #163: Using finite factored sets for causal and temporal inference: Using finite factored sets for causal and temporal inference
Podcast episode
Alignment Newsletter #163: Using finite factored sets for causal and temporal inference: Using finite factored sets for causal and temporal inference
byAlignment Newsletter Podcast
0 ratings
0% found this document useful
Even More Data Collection: Data Collection Systems and Strategies
Podcast episode
Even More Data Collection: Data Collection Systems and Strategies
bySLP Nerdcast
0 ratings
0% found this document useful
#87 – Russ Roberts on whether it's more effective to help strangers, or people you know: If you want to make the world a better place, would it be better to help your niece with her SATs, or try to join the State Department and lower the risk that the US and China go to war? 
Podcast episode
#87 – Russ Roberts on whether it's more effective to help strangers, or people you know: If you want to make the world a better place, would it be better to help your niece with her SATs, or try to join the State Department and lower the risk that the US and China go to war? 
by80,000 Hours Podcast
0 ratings
0% found this document useful
Jim Guszcza: Data Science AND Behavioral Science, New Wine in a New Bottle: Jim Guszcza is the chief data scientist at Deloitte Analytics. His title paints a picture that he’s a total numbers geek. And that would be a fair, but single-dimensional assessment. What it doesn’t speak to is Jim’s passion for behavioral science and, m...
Podcast episode
Jim Guszcza: Data Science AND Behavioral Science, New Wine in a New Bottle: Jim Guszcza is the chief data scientist at Deloitte Analytics. His title paints a picture that he’s a total numbers geek. And that would be a fair, but single-dimensional assessment. What it doesn’t speak to is Jim’s passion for behavioral science and, m...
byBehavioral Grooves Podcast
0 ratings
0% found this document useful
Ep 175: How to Use Lists to Transform Your Writing (and your life): Tis the season for lists, even for those who aren't naturally checklist and to-do list types. For the holidays, people will make packing lists, shopping lists, cleaning lists, address lists, and wish lists. - Lists are useful and practical,
Podcast episode
Ep 175: How to Use Lists to Transform Your Writing (and your life): Tis the season for lists, even for those who aren't naturally checklist and to-do list types. For the holidays, people will make packing lists, shopping lists, cleaning lists, address lists, and wish lists. - Lists are useful and practical,
byAnn Kroeker, Writing Coach
0 ratings
0% found this document useful
James D. Stein, "The Fate of Schrodinger's Cat: Using Math and Computers to Explore the Counterintuitive" (World Scientific, 2020): Stein shows how high-school algebra and basic probability theory, with the invaluable assistance of computer simulations, can be used to investigate both the intuitive and the counterintuitive....
Podcast episode
James D. Stein, "The Fate of Schrodinger's Cat: Using Math and Computers to Explore the Counterintuitive" (World Scientific, 2020): Stein shows how high-school algebra and basic probability theory, with the invaluable assistance of computer simulations, can be used to investigate both the intuitive and the counterintuitive....
byNew Books in Mathematics
0 ratings
0% found this document useful
300: Why Research Isn’t Always Right: If you’re into looking at research to figure out more natural ways to support your skin condition (eczema, psoriasis, rosacea, hives, seborrheic dermatitis), this episode is for you.
Podcast episode
300: Why Research Isn’t Always Right: If you’re into looking at research to figure out more natural ways to support your skin condition (eczema, psoriasis, rosacea, hives, seborrheic dermatitis), this episode is for you.
byThe Healthy Skin Show
0 ratings
0% found this document useful
304 — The seductive allure of neuroscientific podchat: People are more satisfied by explanations that contain neuroscientific jargon and images. Why? Because dopamine fires up the hippocampus, and that’s a fact! (Warning: It’s not.) This week on The Mind Tools L&D Podcast, Owen and Ross G...
Podcast episode
304 — The seductive allure of neuroscientific podchat: People are more satisfied by explanations that contain neuroscientific jargon and images. Why? Because dopamine fires up the hippocampus, and that’s a fact! (Warning: It’s not.) This week on The Mind Tools L&D Podcast, Owen and Ross G...
byThe Mind Tools L&D Podcast
0 ratings
0% found this document useful

Skip carousel

RNA Prepares The Immune System To Fight
Science Illustrated
Article
RNA Prepares The Immune System To Fight
Mar 31, 2021
1 min read
2 Common Plant Extracts Shield Cells From COVID
Futurity
Article
2 Common Plant Extracts Shield Cells From COVID
Feb 13, 2023
4 min read
Why Is Chaos Theory Important?
All About Space
Article
Why Is Chaos Theory Important?
Jan 3, 2020
1 min read
Viruses Have a Secret, Altruistic Social Life
Nautilus
Article
Viruses Have a Secret, Altruistic Social Life
Apr 16, 2019
4 min read
FAQs
Family Tree
Article
FAQs
Nov 27, 2023
6 min read
The Problem with the Way Scientists Study Reason
Nautilus
Article
The Problem with the Way Scientists Study Reason
Mar 13, 2020
5 min read
SLOW DOWN & Plan Your Research
Family Tree UK
Article
SLOW DOWN & Plan Your Research
Aug 11, 2023
7 min read
Join The Family Tree Academy & Become A Skilled Family Historian
Family Tree UK
Article
Join The Family Tree Academy & Become A Skilled Family Historian
Mar 10, 2020
4 min read
The Problem with the Way Scientists Study Reason
Nautilus
Article
The Problem with the Way Scientists Study Reason
Apr 29, 2019
5 min read
Research Logs:
Family Tree UK
Article
Research Logs:
Mar 8, 2024
You’ve probably heard the story of Theseus and the Minotaur: how the young hero wound his way through a fiendish labyrinth, to slay the fearsome beast hidden in its confines. But do you recall how Theseus escaped from the maze, when others had been t
9 min read
Steven Pinker Has His Reasons
Nautilus
Article
Steven Pinker Has His Reasons
Nov 17, 2021
A few years ago, at the Princeton Club in Manhattan, I chanced on a memorable chat with the Harvard psychologist Steven Pinker. His spouse, the philosopher Rebecca Goldstein, with whom he was tagging along, had been invited onto a panel to discuss th
19 min read
The 'Genome Hacker' Who Mapped a 13-Million-Person Family Tree
The Atlantic
Article
The 'Genome Hacker' Who Mapped a 13-Million-Person Family Tree
Mar 1, 2018
5 min read
Three Ways to Tell If Research Is Bunk
The Atlantic
Article
Three Ways to Tell If Research Is Bunk
Nov 30, 2023
5 min read
WRITE to KNOW
Family Tree
Article
WRITE to KNOW
Apr 25, 2023
7 min read
Our Brains Tell Stories So We Can Live
Nautilus
Article
Our Brains Tell Stories So We Can Live
Aug 8, 2019
We are all storytellers; we make sense out of the world by telling stories. And science is a great source of stories. Not so, you might argue. Science is an objective collection and interpretation of data. I completely agree. At the level of the stud
8 min read
Pro Case Study
Photography Week
Article
Pro Case Study
May 12, 2022
3 min read
Organizing Your Research Process
Family Tree
Article
Organizing Your Research Process
Dec 22, 2020
3 min read
What Tech Can Learn from the Fruit Fly’s Search Algorithm
Nautilus
Article
What Tech Can Learn from the Fruit Fly’s Search Algorithm
Nov 13, 2017
5 min read
How Big Data Creates False Confidence
Nautilus
Article
How Big Data Creates False Confidence
Apr 23, 2016
4 min read
Your DNA Workshop
Family Tree UK
Article
Your DNA Workshop
Feb 9, 2024
10 min read
20 Things You NEED To Know About FamilySearch.org
Family Tree UK
Article
20 Things You NEED To Know About FamilySearch.org
Jul 9, 2021
6 min read
Pro Case Study
Digital Photographer
Article
Pro Case Study
Apr 19, 2022
3 min read
Ian And The Limits Of Rationality
Nautilus
Article
Ian And The Limits Of Rationality
Sep 22, 2021
Setting: Chesterfield High, an unusual school in the suburbs of Ohio. The teacher writes on the board: 2, 3, 5, 7, ... How, he asks, do we complete this pattern? Now a student might say that the next term is 12. When the teacher asks him why, he says
9 min read
TIPS & TACTICS TO PROVE YOUR FAMILY TREE IS CORRECT
Family Tree UK
Article
TIPS & TACTICS TO PROVE YOUR FAMILY TREE IS CORRECT
Mar 10, 2023
It’s tempting to bemoan the inaccurate and tangled tree branches that we (sometimes? often?) find online. However, it’s worth stepping back and asking ourselves, firstly whether we are absolutely sure our own research is correct? And secondly how to
5 min read
Monumental Yet Accessible
Equus
Article
Monumental Yet Accessible
Nov 28, 2023
Deb Bennett, PhD, has been a columnist and consulting editor at EQUUS for more than three decades. During that time, she has helped thousands of readers better understand their horses from the skeleton out—what makes them work and how humans can best
4 min read
HOW TO BUILD A research plan
Family Tree UK
Article
HOW TO BUILD A research plan
Feb 10, 2023
8 min read
Why Happiness Is Hard to Find—in the Brain
Nautilus
Article
Why Happiness Is Hard to Find—in the Brain
May 3, 2018
I arrived for my meeting with Professor Chambers at the pleasant Cardiff pub near his office where we’d agreed to have lunch. He was already sitting at the back of the room, and waved me a hello as I entered. Professor Chris Chambers is a disarmingly
9 min read
Researchers Develop AI-based Intervention For Kids With Autism
STAT
Article
Researchers Develop AI-based Intervention For Kids With Autism
Feb 26, 2020
A new AI-based intervention could help children with #autism gain the social skills they need to better communicate with their family and peers.
2 min read
Documenting Your Life – creating Your Legacy
Family Tree UK
Article
Documenting Your Life – creating Your Legacy
Jan 14, 2022
9 min read
Family History In The AI Era
Family Tree UK
Article
Family History In The AI Era
Apr 12, 2024
7 min read

Related categories

Skip carousel

Reviews for A Primer in Biological Data Analysis and Visualization Using R

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

A Primer in Biological Data Analysis and Visualization Using R - Gregg Hartvigsen

INTRODUCTION

We face danger whenever information growth outpaces our understanding of how to process it.

(Silver, 2012)

In our effort to understand and predict patterns and processes in biology we usually develop an idea or, more formally, a conceptual model of how our system works. We generally frame our models as testable hypotheses that we challenge with data. As the science of biology has matured our questions of how nature works have gotten more sophisticated and complex. Unfortunately, we are not able to simply look at a table of raw data that we get from an experiment and see an answer to an interesting question with any quantitative level of confidence. Instead, to accomplish this we will learn how to use the R statistical and programming software package to process these data (summarize, analyze, and visualize our results). We also will go a step further and work to understand what these results mean biologically.

Data, graphs, and statistics, oh my! Isn’t the interesting stuff in biology really just the cool, living things all around us? It is that stuff but it’s so much more beautiful when we understand it. Maybe you want to be a vet. Perhaps an early memory for you was loving a little furry thing that purred. However, maybe now you’ve become a little more concerned about what impact these lovable pets might have on populations of other cute animals that live outside. I recently took a break from writing and looked at an issue of the journal PLoS ONE (a well-respected, open-access, online journal). In this journal I saw an article on predation by urban cats in the UK (Thomas et al. (2012)). I own three cats and was surprised by the number of prey items that cats brought back to their owners (see Figure 1). It seems that there is a lot of variability in predation rates (the histogram) and that predation rates decrease with increasing urbanization (housing density). Specifically, as seen in the inset graph, the authors state that There was a significant negative correlation between housing density and annual predation rates on birds (r = 20.699, p = 0.036).

When we have questions that we want to answer, such as what are cats up to when they’re outside?, we might read books of fiction, such as the series on Warrior cats (see books by Erin Hunter, which is actually a pseudonym!). In biology, however, we seek to understand things like cats by collecting, interpreting, analyzing, and visualizing data. This book is designed to help you to be able to do this. If you’re interested in other disciplines I hope the examples in this book help you, too! I also hope that as you use this book you lose any fear you might have of data and instead seek out and work with data and understand what they tell you about the things that got you interested in biology in the first place, like cats (or, more likely, dogs).

WHAT THIS BOOK IS (AND ISN’T)

This book is designed to help you collect, organize, analyze, and visualize data. I assume you have not heard of the free, open-source program R and I will, therefore, introduce you to how to use it to accomplish these goals. Although I imagine you have had some experience making graphs and calculating a few descriptive statistics (e.g., mean and standard deviation in Excel) I assume you haven’t done this. If you don’t know Excel, or don’t have access to it, you will be able to do all the heavy lifting in this book. I assume you have not taken a course in statistics.

This book, therefore, aims to give you a foundation upon which to become a better student of science and a better consumer of scientific information. More specifically you will learn how to

• formulate hypotheses,

• design better experiments,

• do many standard statistical procedures,

• interpret your results,

• create publication-quality visualizations of your results,

• find help so you can solve your own problems, and

• write a simple computer program.

You shouldn’t expect to read this book and become a quantitative guru. Instead, you should hope to become competent at finding answers to some of your questions, such as are these two samples different? and is there a significant linear relationship between my variables? You will become a resource to the people around you. And if you put in some time playing with R you will be the go-to person for data.

Figure 1: Two figures from a recent paper on urban cat predation rates (Thomas et al. [2012]). The larger graph is a histogram showing percentages (instead of the usual frequencies, or counts) for the number of prey returned to households. Black and white bars are for households with a single-cat versus multiple-cats, respectively. The insert is a scatterplot with best-fit straight lines added for birds, mammals, and for both animal groups combined. The combined data points have been omitted! The relationships are analyzed and discussed in the paper as correlations and, therefore, adding lines is inappropriate (see the box on page 138). The graphs and resulting analyses were likely done using R, but that doesn’t mean they are correct! After you work through this introduction you should be able to comfortably assess these data, correctly perform the analyses and create more appropriate visualizations.

I have written this book primarily with the hope that you’ll feel more comfortable with complex biological problems. It has grown out of what I have seen challenge my own undergraduate students. But it also covers some topics that I think are fun and valuable to know how to do (e.g., programming). The chapters end with problem sets for you to challenge yourself to use what you have learned. Some of the data are real while some are merely realistic. I also have included solutions to the odd-numbered problems at the end of the book. Finally, the book is filled with R code. You should type this is in yourself because this helps with the learning process. You can, however, go to https://github.com/GreggHartvigsen/PrimerBiostats and download all the code from this book.

This book is neither a formal introduction to R nor a statistics textbook. Instead, this book helps you to you solve problems you’re likely to encounter in your undergraduate program in biology. I work to explain what statistics are and how to share and interpret scientific results. After working through this book you should be able to solve a variety of problems with the most widely used statistical and programming environment. I hope you will no longer be afraid of data and will be more able to enter data into the computer, test hypotheses, and present your findings.

So, this book should help you make more appropriate and professional, scientific visualizations and discover findings that might have otherwise been missed. You will no longer be satisfied with hearing from anyone things like Well, it looks significant or there seems to be a trend in the data. So, for the rest of your career, I hope you become the person who says We can test that! Let me get my laptop.

WHO REALLY NEEDS THIS?

In this book I work not only to present visualization and analytical techniques but to explain why we do all this. There’s an unfortunate misconception that we don’t really need all this quantitative stuff in biology. I have heard several times the following line of thinking:

Why do we need to use statistics in biology? If the hypothesis is clear, the experiment is designed correctly, and the data are carefully collected, anyone should be able to just look at the data and clearly see whether or not the hypothesis is supported. Statistical procedures are simply safety nets for sloppy science.

As you work your way through this book you’ll see why the above thinking limits scientific exploration, understanding, and the ability to make predictions about natural phenomena. Here is a brief list of reasons why statistics, mathematics, and appropriate visualizations are critical for understanding biological systems:

1. Statistical procedures help us determine whether data are consistent with hypotheses. Data from modern biological experiments are unable to speak for themselves. Data, instead, require rigorous evaluation, which is appropriate because they are often hard to collect. Statements based on opinion, such as I don’t believe global warming is happening or I believe this drug will cure cancer, fall outside the realm of science.

2. Based on our results from data analyses we often develop formal mathematical models that help us to understand and explain how systems work. We do this by developing quantitative predictions that we assess with data.

3. Biologists often work to understand how multiple factors work together, often in complex, non-linear ways, to affect biological systems. To determine the individual effects and the combined interactive effects we need to develop and conduct complex experiments to illuminate biological patterns and mechanisms that cause these patterns. We then use sophisticated data analysis procedures and visualization techniques to answer today’s challenging questions.

Biology is one of the more complex sciences. I will admit that, at times, some questions can be pretty simple. Imagine, for instance, that we have 100 randomly selected pea pods and expect a 3:1 phenotypic ratio of yellow to green peas. We should expect to see a ratio of 75 to 25 yellow to green peas. We, however, are unlikely to see exactly this ratio. If, instead, we find a ratio of 78:22 we can see immediately (without statistics!) that this is not a 3:1 ratio. Are you prepared, based on this finding, to conclude that this system does not follow the well established rules of segregation? Scientists are predisposed by their profession to be skeptical and, therefore, will not accept a statement like Trust me that our finding of a 78:22 ratio demonstrates that Mendel was wrong!

Our goal is to understand biological systems. Unfortunately, anything interesting nowadays is complex (even determining if our data adhere to a simple 3:1 ratio!). With quantitative tools we can better understand how natural systems work. Only then might we be able to make accurate and useful predictions. Science relies on a strong foundation of statistics, mathematics, and the visualization of results, all of which are available to you through the R statistical and programming environment.

ADDITIONAL RESOURCES

There are far too many great sources of information on data analysis, statistics, visualizing information, and programming to list them all here. This book is a very basic introduction to all of these topics. I hope you seek more information in all of these areas. If you do, here are a few recommendations that go more deeply into different subsets of the topics covered in this book:

General introductions to R

1. An introduction to R. Venables and Smith (2009)

2. A beginner’s guide to R. Zuur et al. (2009)

3. R for dummies. Meys and de Vries (2012)

4. The R book. Crawley (2012)

5. R in a nutshell: A desktop quick reference. Adler (2012)

Statistics books

1. A primer of ecological statistics. Gotelli and Ellison (2012)

2. Statistical methods. Snedecor and Cochran (1989)

3. Biostatistical analysis. Zar (2009)

Statistics books specifically using R

1. Introductory statistics: a conceptual approach using R. Ware et al. (2012)

2. Foundations and applications of statistics: an introduction using R. Pruim (2011)

3. Probability and statistics with R. Ugarte et al. (2008)

Visualization using R

1. ggplot2: elegant graphics for data analysis. Wickham (2009)

2. R graphics cookbook. Chang (2013)

Programming using R

1. The art of R programming. Matloff (2011)

2. http://manuals.bioinformatics.ucr.edu/home/programming-in-r

CHAPTER 1 INTRODUCING OUR SOFTWARE TEAM

In science we are interested in understanding systems that are complicated. Our use of quantitative approaches gives us the ability to not only understand these systems but also to predict how a system might behave in the future (or maybe even how it behaved in the past). As we work to understand and predict complex biological systems we need computational help. You probably have written lab reports using only a calculator. This should be avoided for a variety of important reasons:

1. Difficulty in verifying that you entered the data correctly. (I think the numbers are right.)

2. Difficulty in repeating the analysis. (I’m not doing it again because I might get a different answer!)

3. Inability to share your analytical approaches and results. (Sorry, I hit the all-clear button! You have to trust me.)

4. Inflexibility in how the data are analyzed. (You wanted me to do what?).

5. Inability to make and share appropriate graphs. (Can I take a picture of the graph on my calculator with my phone and incorporate that in my lab report?)

To solve these shortcomings we will use Excel and R.

You may be somewhat familiar with Excel but probably have little or no experience with R. Therefore, I welcome you to the world of R! I know this might be a scary place for you at first. I bet R is really different from all the programs you’ve used. Fortunately, this introduction is intended for newcomers. But as you proceed you will learn how to do some really amazing things with R. You’ll gain independence with practice. R is like playing an instrument, a sport, or learning a foreign language—they all require practice. I have confidence that you are capable of using R to solve interesting problems. And the more time you spend at it the better you will get.

1.1 SOLVING PROBLEMS WITH EXCEL AND R

For many analytical problems we will be able to use just R. However, in biology, we often test our ideas, or hypotheses, with large amounts of data. We, therefore, will try to use Excel for what it does well (allows us to enter and organize our data). But we will not use Excel to do what it doesn’t do well (statistical analyses, modeling, and visualizing data). Instead, these core scientific skills are best done with R. If you love Excel then you’ll be happy to know we’re not abandoning it—Excel has its place.

It is important to recognize that doing things well is rarely easy. Writing a good poem, playing tennis well, or doing ballet well are all hard. And conducting hypothesis tests correctly and making professional-quality graphs are not simple, one-click operations.

At first you will likely think that making graphs and performing statistical tests in R are absolute nightmares. (And when you become a skilled R programmer you’ll still be challenged at times!) But the days of skipping an analysis or accepting a ungly or incorrect graph because that’s the best I can do with Excel are over. You can do it in R! Therefore, in this introduction we will discuss Excel but focus mainly on R. It is the combination of using Excel to organize our data and R for analyses and visualizations that will allow you to ask and answer questions in biology.

You still may be wondering why you can’t just do this all in Excel. Here is a sampling of reasons why R is clearly better than Excel for problem solving in biology. With R you can:

1. create professional, publication-quality visualizations;

2. conduct quantitative analyses, both analytical and statistical (e.g., do a t-test, solve systems of differential equations, conduct non-linear regression, use matrix algebra, conduct signal processing, perform wavelet analysis, analyze fMRI data, do genome analyses, and create phylogenetic reconstructions, to name a few);

3. build statistical tests that can be repeated easily and shared with anyone. These tests might rely on their own data, data read from a file, or data acquired directly from a website;

4. do the same thing and work the same way on computers running Mac, Windows, and Linux;

5. write computer programs, such as modeling a population growing over time, using an object-oriented language;

6. access modern analytical tools for biologists that are being developed right now, right here, and no where else;

7. use and receive widely available help from the R open-source community;

8. use open-source software that provides solutions that are auditable, meaning you can understand and explain to others how you got your results (there are no black boxes - it’s open software!);

9. write a document like this. This environment allows one to compile together in one document words, mathematical equations, computer code, statistical tests and output, and professional-quality graphs, all within the free, open-source LATEX typesetting environment;

10. carry a research project, paper, all the data, AND carry the entire software package for doing the analysis on a low-capacity flash drive;

11. rest assured that your investment in skill building will pay off well into the future. You don’t have to hope you’ll have access to the program when you move on to your next stage of life (which could be in a hospital in Ghana!);

12. enjoy these benefits because open-source means R is free!

Your ability to use R to make informed, evidence-based conclusions likely will provide you the most valuable set of skills you’ll learn as an undergraduate science major. If you keep this skill set you will be highly marketable. R helps you speak the language of science, which is written in mathematics, statistics, and data evaluation and visualization. This ability to answer scientific questions and present your results professionally is finally in your hands.

Your ability to use R helps fulfill an important goal that was synthesized in the report Scientific Foundations for Future Physicians produced by the American Association of American Medical Colleges and the Howard Hughes Medical Institute, 2009. The authors

Enjoying the preview?

Page 1 of 1

A Primer in Biological Data Analysis and Visualization Using R

About this ebook

Gregg Hartvigsen

Related authors

Related to A Primer in Biological Data Analysis and Visualization Using R

Related ebooks

Biology For You

Related podcast episodes

Related articles

Related categories

Reviews for A Primer in Biological Data Analysis and Visualization Using R

What did you think?

Book preview

A Primer in Biological Data Analysis and Visualization Using R - Gregg Hartvigsen

INTRODUCTION

WHAT THIS BOOK IS (AND ISN’T)

WHO REALLY NEEDS THIS?

ADDITIONAL RESOURCES

CHAPTER 1

INTRODUCING OUR SOFTWARE TEAM

1.1 SOLVING PROBLEMS WITH EXCEL AND R