CRAN Recipes: DPLYR, Stringr, Lubridate, and RegEx in R

Ebook433 pages2 hours

CRAN Recipes: DPLYR, Stringr, Lubridate, and RegEx in R

Name: CRAN Recipes: DPLYR, Stringr, Lubridate, and RegEx in R
Author: William Yarberry
ISBN: 9781484268766

By William Yarberry

Rating: 0 out of 5 stars

()

Read preview

About this ebook

Want to use the power of R sooner rather than later? Don’t have time to plow through wordy texts and online manuals? Use this book for quick, simple code to get your projects up and running. It includes code and examples applicable to many disciplines. Written in everyday language with a minimum of complexity, each chapter provides the building blocks you need to fit R’s astounding capabilities to your analytics, reporting, and visualization needs.

CRAN Recipes recognizes how needless jargon and complexity get in your way. Busy professionals need simple examples and intuitive descriptions; side trips and meandering philosophical discussions are left for other books.

Here R scripts are condensed, to the extent possible, to copy-paste-run format. Chapters and examples are structured to purpose rather than particular functions (e.g., “dirty data cleanup” rather than the R package name “janitor”). Everyday language eliminatesthe need to know functions/packages in advance.

What You Will Learn

Carry out input/output; visualizations; data munging; manipulations at the group level; and quick data exploration
Handle forecasting (multivariate, time series, logistic regression, Facebook’s Prophet, and others)
Use text analytics; sampling; financial analysis; and advanced pattern matching (regex)
Manipulate data using DPLYR: filter, sort, summarize, add new fields to datasets, and apply powerful IF functions
Create combinations or subsets of files using joins
Write efficient code using pipes to eliminate intermediate steps (MAGRITTR)
Work with string/character manipulation of all types (STRINGR)
Discover counts, patterns, and how to locate whole words
Do wild-card matching, extraction, and invert-match
Work with dates using LUBRIDATE
Fix dirty data; attractive formatting; bad habits to avoid

Who This Book Is For

Programmers/data scientists with at least some prior exposure to R.

Skip carousel

LanguageEnglish

PublisherApress

Release dateApr 23, 2021

ISBN9781484268766

Author

William Yarberry

Related authors

Skip carousel

Related to CRAN Recipes

Related ebooks

Skip carousel

R Data Science Quick Reference: A Pocket Guide to APIs, Libraries, and Packages
Ebook
R Data Science Quick Reference: A Pocket Guide to APIs, Libraries, and Packages
byThomas Mailund
Rating: 0 out of 5 stars
0 ratings
Propeller Programming: Using Assembler, Spin, and C
Ebook
Propeller Programming: Using Assembler, Spin, and C
bySridhar Anandakrishnan
Rating: 0 out of 5 stars
0 ratings
MATLAB Machine Learning Recipes: A Problem-Solution Approach
Ebook
MATLAB Machine Learning Recipes: A Problem-Solution Approach
byMichael Paluszek
Rating: 0 out of 5 stars
0 ratings
Pointers in C Programming: A Modern Approach to Memory Management, Recursive Data Structures, Strings, and Arrays
Ebook
Pointers in C Programming: A Modern Approach to Memory Management, Recursive Data Structures, Strings, and Arrays
byThomas Mailund
Rating: 0 out of 5 stars
0 ratings
Raku Recipes: A Problem-Solution Approach
Ebook
Raku Recipes: A Problem-Solution Approach
byJ.J. Merelo
Rating: 0 out of 5 stars
0 ratings
Oracle Database Transactions and Locking Revealed: Building High Performance Through Concurrency
Ebook
Oracle Database Transactions and Locking Revealed: Building High Performance Through Concurrency
byDarl Kuhn
Rating: 0 out of 5 stars
0 ratings
TensorFlow 2.x in the Colaboratory Cloud: An Introduction to Deep Learning on Google’s Cloud Service
Ebook
TensorFlow 2.x in the Colaboratory Cloud: An Introduction to Deep Learning on Google’s Cloud Service
byDavid Paper
Rating: 0 out of 5 stars
0 ratings
JavaScript Data Structures and Algorithms: An Introduction to Understanding and Implementing Core Data Structure and Algorithm Fundamentals
Ebook
JavaScript Data Structures and Algorithms: An Introduction to Understanding and Implementing Core Data Structure and Algorithm Fundamentals
bySammie Bae
Rating: 0 out of 5 stars
0 ratings
Modern Full-Stack Development: Using TypeScript, React, Node.js, Webpack, and Docker
Ebook
Modern Full-Stack Development: Using TypeScript, React, Node.js, Webpack, and Docker
byFrank Zammetti
Rating: 0 out of 5 stars
0 ratings
C in 30 Pages
Ebook
C in 30 Pages
byU.Q. Magnusson
Rating: 5 out of 5 stars
5/5
C# 7 Quick Syntax Reference: A Pocket Guide to the Language, APIs, and Library
Ebook
C# 7 Quick Syntax Reference: A Pocket Guide to the Language, APIs, and Library
byMikael Olsson
Rating: 0 out of 5 stars
0 ratings
Developing Web Components with TypeScript: Native Web Development Using Thin Libraries
Ebook
Developing Web Components with TypeScript: Native Web Development Using Thin Libraries
byJörg Krause
Rating: 0 out of 5 stars
0 ratings
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
Ebook
C# Package Mastery: 100 Essentials in 1 Hour - 2024 Edition
byTenko
Rating: 0 out of 5 stars
0 ratings
Introducing Vala Programming: A Language and Techniques to Boost Productivity
Ebook
Introducing Vala Programming: A Language and Techniques to Boost Productivity
byMichael Lauer
Rating: 0 out of 5 stars
0 ratings
Practical Test Automation: Learn to Use Jasmine, RSpec, and Cucumber Effectively for Your TDD and BDD
Ebook
Practical Test Automation: Learn to Use Jasmine, RSpec, and Cucumber Effectively for Your TDD and BDD
byPanos Matsinopoulos
Rating: 0 out of 5 stars
0 ratings
Joe Celko's Trees and Hierarchies in SQL for Smarties
Ebook
Joe Celko's Trees and Hierarchies in SQL for Smarties
byJoe Celko
Rating: 0 out of 5 stars
0 ratings
Learn R By Coding
Ebook
Learn R By Coding
byThomas Kurnicki
Rating: 0 out of 5 stars
0 ratings
C# 8 Quick Syntax Reference: A Pocket Guide to the Language, APIs, and Library
Ebook
C# 8 Quick Syntax Reference: A Pocket Guide to the Language, APIs, and Library
byMikael Olsson
Rating: 0 out of 5 stars
0 ratings
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
Ebook
Good Habits for Great Coding: Improving Programming Skills with Examples in Python
byMichael Stueben
Rating: 0 out of 5 stars
0 ratings
42 Astoundingly Useful Scripts and Automations for the Macintosh
Ebook
42 Astoundingly Useful Scripts and Automations for the Macintosh
byJerry Stratton
Rating: 0 out of 5 stars
0 ratings
Clean C++20: Sustainable Software Development Patterns and Best Practices
Ebook
Clean C++20: Sustainable Software Development Patterns and Best Practices
byStephan Roth
Rating: 0 out of 5 stars
0 ratings
Domain-Specific Languages in R: Advanced Statistical Programming
Ebook
Domain-Specific Languages in R: Advanced Statistical Programming
byThomas Mailund
Rating: 0 out of 5 stars
0 ratings
Deep Belief Nets in C++ and CUDA C: Volume 3: Convolutional Nets
Ebook
Deep Belief Nets in C++ and CUDA C: Volume 3: Convolutional Nets
byTimothy Masters
Rating: 0 out of 5 stars
0 ratings
Pro C# 8 with .NET Core 3: Foundational Principles and Practices in Programming
Ebook
Pro C# 8 with .NET Core 3: Foundational Principles and Practices in Programming
byAndrew Troelsen
Rating: 0 out of 5 stars
0 ratings
BigNum Math: Implementing Cryptographic Multiple Precision Arithmetic
Ebook
BigNum Math: Implementing Cryptographic Multiple Precision Arithmetic
byTom St Denis
Rating: 3 out of 5 stars
3/5
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
Ebook
DRBD-Cookbook: How to create your own cluster solution, without SAN or NAS!
byJoerg Christian Seubert
Rating: 0 out of 5 stars
0 ratings
Database-Driven Web Development: Learn to Operate at a Professional Level with PERL and MySQL
Ebook
Database-Driven Web Development: Learn to Operate at a Professional Level with PERL and MySQL
byThomas Valentine
Rating: 0 out of 5 stars
0 ratings
Beginning Ada Programming: From Novice to Professional
Ebook
Beginning Ada Programming: From Novice to Professional
byAndrew T. Shvets
Rating: 0 out of 5 stars
0 ratings
Raspberry Pi Assembly Language Programming: ARM Processor Coding
Ebook
Raspberry Pi Assembly Language Programming: ARM Processor Coding
byStephen Smith
Rating: 0 out of 5 stars
0 ratings
Pro C# 9 with .NET 5: Foundational Principles and Practices in Programming
Ebook
Pro C# 9 with .NET 5: Foundational Principles and Practices in Programming
byAndrew Troelsen
Rating: 0 out of 5 stars
0 ratings

Computers For You

Skip carousel

Deep Search: How to Explore the Internet More Effectively
Ebook
Deep Search: How to Explore the Internet More Effectively
byAlan Pearce
Rating: 5 out of 5 stars
5/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
byWalter Shields
Rating: 4 out of 5 stars
4/5
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
byCea West
Rating: 5 out of 5 stars
5/5
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
Ebook
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
byAlex Parkinson
Rating: 4 out of 5 stars
4/5
Network+ Study Guide & Practice Exams
Ebook
Network+ Study Guide & Practice Exams
byRobert Shimonski
Rating: 4 out of 5 stars
4/5
Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad
Ebook
Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad
byAaron Smith
Rating: 0 out of 5 stars
0 ratings
The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology
Ebook
The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology
byTJ Books
Rating: 0 out of 5 stars
0 ratings
Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work
Ebook
Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work
bySteven Cooper
Rating: 4 out of 5 stars
4/5
101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters
Ebook
101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters
byTriumph Books
Rating: 4 out of 5 stars
4/5
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
byCea West
Rating: 4 out of 5 stars
4/5
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
Ebook
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
byHadelin de Ponteves
Rating: 0 out of 5 stars
0 ratings
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands
Ebook
Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands
byTriumph Books
Rating: 5 out of 5 stars
5/5
AP Computer Science Principles Premium, 2024: 6 Practice Tests + Comprehensive Review + Online Practice
Ebook
AP Computer Science Principles Premium, 2024: 6 Practice Tests + Comprehensive Review + Online Practice
bySeth Reichelson
Rating: 0 out of 5 stars
0 ratings
CompTIA Security+ Practice Questions
Ebook
CompTIA Security+ Practice Questions
byIP Specialist
Rating: 2 out of 5 stars
2/5
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
Ebook
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
bySeth Stephens-Davidowitz
Rating: 4 out of 5 stars
4/5
CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61
Ebook
CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61
byQuentin Docter
Rating: 0 out of 5 stars
0 ratings
Childhood Unplugged: Practical Advice to Get Kids Off Screens and Find Balance
Ebook
Childhood Unplugged: Practical Advice to Get Kids Off Screens and Find Balance
byKatherine Johnson Martinko
Rating: 0 out of 5 stars
0 ratings
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
Ebook
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
byMaximus Wilson
Rating: 0 out of 5 stars
0 ratings
The Simulation Hypothesis: An MIT Computer Scientist Shows Why AI, Quantum Physics and Eastern Mystics All Agree We Are In a Video Game
Ebook
The Simulation Hypothesis: An MIT Computer Scientist Shows Why AI, Quantum Physics and Eastern Mystics All Agree We Are In a Video Game
byRizwan Virk
Rating: 5 out of 5 stars
5/5
Practical Lock Picking: A Physical Penetration Tester's Training Guide
Ebook
Practical Lock Picking: A Physical Penetration Tester's Training Guide
byDeviant Ollam
Rating: 5 out of 5 stars
5/5
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
Elon Musk
Ebook
Elon Musk
byWalter Isaacson
Rating: 4 out of 5 stars
4/5
Dark Aeon: Transhumanism and the War Against Humanity
Ebook
Dark Aeon: Transhumanism and the War Against Humanity
byJoe Allen
Rating: 5 out of 5 stars
5/5
The Professional Voiceover Handbook: Voiceover training, #1
Ebook
The Professional Voiceover Handbook: Voiceover training, #1
byPeter Baker
Rating: 5 out of 5 stars
5/5
Master Builder Roblox: The Essential Guide
Ebook
Master Builder Roblox: The Essential Guide
byTriumph Books
Rating: 4 out of 5 stars
4/5
CompTIA Certification: The Ultimate Guide To Discover CompTIA. Certified Quickly And Easily Passing The Certification Exam. Real Practice Test With Detailed Screenshots, Answers And Explanations
Ebook
CompTIA Certification: The Ultimate Guide To Discover CompTIA. Certified Quickly And Easily Passing The Certification Exam. Real Practice Test With Detailed Screenshots, Answers And Explanations
byDavid Mayer
Rating: 0 out of 5 stars
0 ratings
Hacking: Ultimate Beginner's Guide for Computer Hacking in 2018 and Beyond: Hacking in 2018, #1
Ebook
Hacking: Ultimate Beginner's Guide for Computer Hacking in 2018 and Beyond: Hacking in 2018, #1
byDexter Jackson
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

331: Why Computers Suck: How learning OpenBSD makes computers suck a little less, How Unix works, FreeBSD 12.1 Runs Well on Ryzen Threadripper 3970X, BSDCan CFP, HardenedBSD Infrastructure Goals, and more.
Podcast episode
331: Why Computers Suck: How learning OpenBSD makes computers suck a little less, How Unix works, FreeBSD 12.1 Runs Well on Ryzen Threadripper 3970X, BSDCan CFP, HardenedBSD Infrastructure Goals, and more.
byBSD Now
0 ratings
0% found this document useful
Episode 273: A Thoughtful Episode | BSD Now 273: Thoughts on NetBSD 8.0, Monitoring love for a GigaBit OpenBSD firewall, cat’s source history, X.org root permission bug, thoughts on OpenBSD as a desktop, and NomadBSD review.
Podcast episode
Episode 273: A Thoughtful Episode | BSD Now 273: Thoughts on NetBSD 8.0, Monitoring love for a GigaBit OpenBSD firewall, cat’s source history, X.org root permission bug, thoughts on OpenBSD as a desktop, and NomadBSD review.
byBSD Now
0 ratings
0% found this document useful
312: Why Package Managers: The UNIX Philosophy in 2019, why use package managers, touchpad interrupted, Porting wine to amd64 on NetBSD second evaluation report, Enhancing Syzkaller Support for NetBSD, all about the Pinebook Pro, killing a process and all of its descendants, fast software the best software, and more.
Podcast episode
312: Why Package Managers: The UNIX Philosophy in 2019, why use package managers, touchpad interrupted, Porting wine to amd64 on NetBSD second evaluation report, Enhancing Syzkaller Support for NetBSD, all about the Pinebook Pro, killing a process and all of its descendants, fast software the best software, and more.
byBSD Now
0 ratings
0% found this document useful
393: ZFS dRAID: Lessons learned from a 27 years old UNIX book, Finally dRAID, Setting up a Signal Proxy using FreeBSD, Annotate your PDF files on OpenBSD, Things You Should Do Now, Just: More unixy than Make, and more
Podcast episode
393: ZFS dRAID: Lessons learned from a 27 years old UNIX book, Finally dRAID, Setting up a Signal Proxy using FreeBSD, Annotate your PDF files on OpenBSD, Things You Should Do Now, Just: More unixy than Make, and more
byBSD Now
0 ratings
0% found this document useful
Rack-scale Networking
Podcast episode
Rack-scale Networking
byOxide and Friends
0 ratings
0% found this document useful
Things Coming Down the Pipe From TC39 - JSJ 590
Podcast episode
Things Coming Down the Pipe From TC39 - JSJ 590
byJavaScript Jabber
0 ratings
0% found this document useful
The Rapid Rise of Vector Databases with Ram Sriharsha: Ram Sriharsha, VP of Engineering and R&D at Pinecone, joins Corey on Screaming in the Cloud to discuss Pinecone’s creation of Vector Databases, the challenges they solve, and why their customer adoption has seen such a rapid rise. Ram reveals the the comm
Podcast episode
The Rapid Rise of Vector Databases with Ram Sriharsha: Ram Sriharsha, VP of Engineering and R&D at Pinecone, joins Corey on Screaming in the Cloud to discuss Pinecone’s creation of Vector Databases, the challenges they solve, and why their customer adoption has seen such a rapid rise. Ram reveals the the comm
byScreaming in the Cloud
0 ratings
0% found this document useful
Whiteboard Confessional: Naming Is Hard, Don’t Make it Worse: Join me as I continue the Whiteboard Confessional series with a look the importance of owning your own domain names while touching upon what split-horizon DNS is and why companies use it, what the Route 53 Resolver is actually designed to do, why it is im
Podcast episode
Whiteboard Confessional: Naming Is Hard, Don’t Make it Worse: Join me as I continue the Whiteboard Confessional series with a look the importance of owning your own domain names while touching upon what split-horizon DNS is and why companies use it, what the Route 53 Resolver is actually designed to do, why it is im
byAWS Morning Brief
0 ratings
0% found this document useful
374: OpenBSD’s 25th anniversary: OpenBSD 6.8 has been released, NetBSD 9.1 is out, OpenZFS devsummit report, BastilleBSD’s native container management for FreeBSD, cleaning up old tarsnap backups, Michael W. Lucas’ book sale, and more.
Podcast episode
374: OpenBSD’s 25th anniversary: OpenBSD 6.8 has been released, NetBSD 9.1 is out, OpenZFS devsummit report, BastilleBSD’s native container management for FreeBSD, cleaning up old tarsnap backups, Michael W. Lucas’ book sale, and more.
byBSD Now
0 ratings
0% found this document useful
Episode 272: Detain the bhyve | BSD Now 272: Byproducts of reading OpenBSD’s netcat code, learnings from porting your own projects to FreeBSD, OpenBSD’s unveil(), NetBSD’s Virtual Machine Monitor, what 'dependency' means in Unix init systems, jailing bhyve, and more.
Podcast episode
Episode 272: Detain the bhyve | BSD Now 272: Byproducts of reading OpenBSD’s netcat code, learnings from porting your own projects to FreeBSD, OpenBSD’s unveil(), NetBSD’s Virtual Machine Monitor, what 'dependency' means in Unix init systems, jailing bhyve, and more.
byBSD Now
0 ratings
0% found this document useful
371: Wildcards running wild
Podcast episode
371: Wildcards running wild
byBSD Now
0 ratings
0% found this document useful
How To Get Better At Problem Solving: In this episode of Syntax, Scott and Wes talk about how to get better at problem solving — one of the most important skills to build as a developer. Netlify - Sponsor Netlify is the best way to deploy and host a front-end website. All the features...
Podcast episode
How To Get Better At Problem Solving: In this episode of Syntax, Scott and Wes talk about how to get better at problem solving — one of the most important skills to build as a developer. Netlify - Sponsor Netlify is the best way to deploy and host a front-end website. All the features...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
288: Turing Complete Sed: Software will never fix Spectre-type bugs, a proof that sed is Turing complete, managed jails using Bastille, new version of netdata, using grep with /dev/null, using GMail with mutt, and more.
Podcast episode
288: Turing Complete Sed: Software will never fix Spectre-type bugs, a proof that sed is Turing complete, managed jails using Bastille, new version of netdata, using grep with /dev/null, using GMail with mutt, and more.
byBSD Now
0 ratings
0% found this document useful
Episode 454: RR 446: Development Environments
Podcast episode
Episode 454: RR 446: Development Environments
byRuby Rogues
0 ratings
0% found this document useful
299: The NAS Fleet: Running AIX on QEMU on Linux on Windows, your NAS fleet with TrueCommand, Unleashed 1.3 is available, LLDB: CPU register inspection support extension, V7 Unix programs often not written as expected, and more.
Podcast episode
299: The NAS Fleet: Running AIX on QEMU on Linux on Windows, your NAS fleet with TrueCommand, Unleashed 1.3 is available, LLDB: CPU register inspection support extension, V7 Unix programs often not written as expected, and more.
byBSD Now
0 ratings
0% found this document useful
Episode 308: JSJ 305: Continuous Integration, Processes, and DangerJS with Orta Therox
Podcast episode
Episode 308: JSJ 305: Continuous Integration, Processes, and DangerJS with Orta Therox
byJavaScript Jabber
0 ratings
0% found this document useful
Whiteboard Confessional: Don’t Run a Database on Top of NFS: Join me as I continue a new series called Whiteboard Confessional by focusing on the wild world of databases and touching upon three-tiered web apps, how scaling an app to 200 million users is a massive challenge, the time Corey’s boss suggested running a
Podcast episode
Whiteboard Confessional: Don’t Run a Database on Top of NFS: Join me as I continue a new series called Whiteboard Confessional by focusing on the wild world of databases and touching upon three-tiered web apps, how scaling an app to 200 million users is a massive challenge, the time Corey’s boss suggested running a
byAWS Morning Brief
0 ratings
0% found this document useful
Acorns for AWS
Podcast episode
Acorns for AWS
byThe Cloudcast
0 ratings
0% found this document useful
Episode 244: C is a Lie | BSD Now 244: Arcan and OpenBSD, running OpenBSD 6.3 on RPI 3, why C is not a low-level language, HardenedBSD switching back to OpenSSL, how the Internet was almost broken, EuroBSDcon CfP is out, and the BSDCan 2018 schedule is available.
Podcast episode
Episode 244: C is a Lie | BSD Now 244: Arcan and OpenBSD, running OpenBSD 6.3 on RPI 3, why C is not a low-level language, HardenedBSD switching back to OpenSSL, how the Internet was almost broken, EuroBSDcon CfP is out, and the BSDCan 2018 schedule is available.
byBSD Now
0 ratings
0% found this document useful
React + TypeScript: In this episode of Syntax, Scott and Wes talk about using React with Typescript — how to set it up, components, state, props, passing data, custom hooks, and more! Freshbooks - Sponsor Get a 30 day free trial of Freshbooks at and put...
Podcast episode
React + TypeScript: In this episode of Syntax, Scott and Wes talk about using React with Typescript — how to set it up, components, state, props, passing data, custom hooks, and more! Freshbooks - Sponsor Get a 30 day free trial of Freshbooks at and put...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Deserted Island DevOps with Austin Parker: Austin Parker is a principal developer advocate at LightStep. Prior to this position, he worked as a software architect at Apprenda, an adjunct instruction and researcher at the University of Albany, a telecommunications specialist at Alltech, and as a su
Podcast episode
Deserted Island DevOps with Austin Parker: Austin Parker is a principal developer advocate at LightStep. Prior to this position, he worked as a software architect at Apprenda, an adjunct instruction and researcher at the University of Albany, a telecommunications specialist at Alltech, and as a su
byScreaming in the Cloud
0 ratings
0% found this document useful
311: Conference Gear Breakdown: NetBSD 9.0 release process has started, xargs, a tale of two spellcheckers, Adapting TriforceAFL for NetBSD, Exploiting a no-name freebsd kernel vulnerability, and more.
Podcast episode
311: Conference Gear Breakdown: NetBSD 9.0 release process has started, xargs, a tale of two spellcheckers, Adapting TriforceAFL for NetBSD, Exploiting a no-name freebsd kernel vulnerability, and more.
byBSD Now
0 ratings
0% found this document useful
8: Exploring Dart & Polymer: Dart was originally a Google language revealed in 2011 and is now an ECMA Standard known as TC52. When Dart first came into being it was annoounced it's purpose was to "ultimately to replace JavaScript as the 'lingua franca' of web development on the...
Podcast episode
8: Exploring Dart & Polymer: Dart was originally a Google language revealed in 2011 and is now an ECMA Standard known as TC52. When Dart first came into being it was annoounced it's purpose was to "ultimately to replace JavaScript as the 'lingua franca' of web development on the...
byThe Web Platform Podcast
0 ratings
0% found this document useful
Hasty Treat - Effortless Custom GraphQL with GraphQL Codegen: In this Hasty Treat, Scott and Wes talk about GraphQL tooling, and specifically a couple tools we use that will change your experience with GraphQL. .TECH Domains - Sponsor .TECH is taking the tech industry by storm. A domain that shows the world...
Podcast episode
Hasty Treat - Effortless Custom GraphQL with GraphQL Codegen: In this Hasty Treat, Scott and Wes talk about GraphQL tooling, and specifically a couple tools we use that will change your experience with GraphQL. .TECH Domains - Sponsor .TECH is taking the tech industry by storm. A domain that shows the world...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Will we be writing Hare in 2099? (with Drew DeVault)
Podcast episode
Will we be writing Hare in 2099? (with Drew DeVault)
byDeveloper Voices
0 ratings
0% found this document useful
Scalable Python for Everyone, Everywhere // Matthew Rocklin // MLOps Meetup #38
Podcast episode
Scalable Python for Everyone, Everywhere // Matthew Rocklin // MLOps Meetup #38
byMLOps.community
0 ratings
0% found this document useful
Episode 421: RR 413: When Your Tools Interrupt Your Coding Process
Podcast episode
Episode 421: RR 413: When Your Tools Interrupt Your Coding Process
byRuby Rogues
0 ratings
0% found this document useful
Understanding Arduino Syntax: Learn Programming and Electronics with Arduino
Podcast episode
Understanding Arduino Syntax: Learn Programming and Electronics with Arduino
byLearn Programming and Electronics with Arduino
0 ratings
0% found this document useful
Hasty Treat - What is the n+1 problem?: In this Hasty Treat, Scott and Wes talk about a common problem you’ll encounter in your development career — the n+1 problem. Hasura - Sponsor With Hasura, you can get a fully managed, production-ready GraphQL API as a service to help you...
Podcast episode
Hasty Treat - What is the n+1 problem?: In this Hasty Treat, Scott and Wes talk about a common problem you’ll encounter in your development career — the n+1 problem. Hasura - Sponsor With Hasura, you can get a fully managed, production-ready GraphQL API as a service to help you...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
185: InstructorEx for LLMs: Explore InstructorEx's approach to harnessing LLMs for structured JSON data and Elixir's role in refining AI interactions. Uncover strategies for enhancing tasks and integrating Python skills with Elixir potential, and more!
Podcast episode
185: InstructorEx for LLMs: Explore InstructorEx's approach to harnessing LLMs for structured JSON data and Elixir's role in refining AI interactions. Uncover strategies for enhancing tasks and integrating Python skills with Elixir potential, and more!
byThinking Elixir Podcast
0 ratings
0% found this document useful

Skip carousel

Entropy Isn’t What It Used To Be
Linux Format
Article
Entropy Isn’t What It Used To Be
Nov 14, 2023
10 min read
Next-gen Terminals
Linux Format
Article
Next-gen Terminals
Jan 12, 2021
9 min read
Coding Secure Rust System Tools
Linux Format
Article
Coding Secure Rust System Tools
Apr 5, 2022
8 min read
Coding Secure Rust System Tools
Linux Format
Article
Coding Secure Rust System Tools
Apr 5, 2022
8 min read
Answers
Linux Format
Article
Answers
Sep 20, 2022
Q Bloated Debian This Debian 11 is my first attempt to explore Debian and I truly loved it for its stability and user-friendliness. I’m dual-booting Debian with Windows 10. I was wrong to set the Debian partition of 40GB only (2.6GB/41.8GB available
10 min read
How To Develop Multi-threaded Code
Linux Format
Article
How To Develop Multi-threaded Code
Jul 26, 2022
Get the code for this tutorial from the Linux Format archive: www. linuxformat. com/archives ?issue=292. You can learn more about Rust at www. rust-lang.org. This month’s instalment of our ongoing Rust series will cover concurrent programming. The di
10 min read
8 Batch Files You Must Create
Computeractive
Article
8 Batch Files You Must Create
Jul 6, 2022
5 min read
Quick Tip
Linux Format
Article
Quick Tip
Sep 24, 2019
The best way to follow along with this guide is to get the files for this tutorial from the DVD or from https:// github.com/jschwartzman/ asm-tutorial. The stack is critical for making programs run. Linux allocates a stack for every program that it
1 min read
Answers
Linux Format
Article
Answers
Feb 8, 2022
Neil Bothwick finds the fault in our stars and fixes em! I have a music collection that’s stored on both local and external drives. I have just sorted through all the duplicates with fdupes. Now my local collection is only two-thirds of my external c
8 min read
Poisoning The Well
Linux Format
Article
Poisoning The Well
Jan 11, 2022
4 min read
HotPicks
Linux Format
Article
HotPicks
Feb 11, 2020
13 min read
Generate And Then Solve Mazes With C
Linux Format
Article
Generate And Then Solve Mazes With C
May 30, 2023
Credit: https://github.com/joewing David Bolton has been programming since before the war. Which war? Don’t ask. He wrote a 1,000-player RPG game called Quest in 1989 that is still running today. Open another terminal while in the maze and run Scrot
10 min read
Observability Of The Kernel And Containers
Linux Format
Article
Observability Of The Kernel And Containers
Apr 4, 2023
Mihalis Tsoukalos is currently working on Time Series. You can reach him at: @mactsouk. For our final delve into eBPF, we’re tackling applications, the kernel and Docker containers. At the end of the day, all Linux machines execute code for applicat
10 min read
Microcontrollers In Amateur Radio
CQ Amateur Radio
Article
Microcontrollers In Amateur Radio
May 1, 2022
When you hit the compile button for your compiler, there’s a whole bunch of stuff that takes place that isn’t obvious while the code compiles. In general terms, the C compiler: 1) invokes a preprocessor pass on the code;2) performs syntax/semantic ch
4 min read
Answers
Linux Format
Article
Answers
Jun 4, 2019
7 min read
Answers
Linux Format
Article
Answers
Apr 7, 2020
7 min read
Website And RSS Feed Python Scraping
Linux Format
Article
Website And RSS Feed Python Scraping
Oct 18, 2022
Matt Holder has worked in IT support for over a decade, and is keen to utilise Linux alongside other installed systems. All the Python scripts that we’ve discussed in this tutorial are all available at https://github.com/mattmole/LXF295. Before we b
8 min read
Solve Problems
Linux Format
Article
Solve Problems
Jan 12, 2021
4 min read
Develop Linux Filesystem Tools In Rust
Linux Format
Article
Develop Linux Filesystem Tools In Rust
May 3, 2022
Part Two Missed part one? Turn to page 62 to get hold of it! The subject of this second Rust tutorial is working with files and directories as filesystem entities. This means that we’re going to learn how to move, delete and copy files, explore direc
8 min read
Develop Linux Filesystem Tools In Rust
Linux Format
Article
Develop Linux Filesystem Tools In Rust
May 3, 2022
Part Two Missed part one? Turn to page 62 to get hold of it! The subject of this second Rust tutorial is working with files and directories as filesystem entities. This means that we’re going to learn how to move, delete and copy files, explore direc
8 min read
Answers
Linux Format
Article
Answers
Mar 9, 2021
8 min read
Answers
Linux Format
Article
Answers
Dec 14, 2021
8 min read
Manage Your Apps!
Linux Format
Article
Manage Your Apps!
Nov 14, 2023
17 min read
HotPicks
Linux Format
Article
HotPicks
Nov 19, 2019
12 min read
Code Read/write System File Tools
Linux Format
Article
Code Read/write System File Tools
May 31, 2022
The subject of this tutorial is file input and output (I/O) in Rust. File I/O is an important part of every operating system. An OS or even a database system wouldn’t be able to function without being able to process, read, write and append to files.
8 min read
Code Read/write System File Tools
Linux Format
Article
Code Read/write System File Tools
May 31, 2022
The subject of this tutorial is file input and output (I/O) in Rust. File I/O is an important part of every operating system. An OS or even a database system wouldn’t be able to function without being able to process, read, write and append to files.
8 min read
Mailserver
Linux Format
Article
Mailserver
May 31, 2022
3 min read
Mailserver
Linux Format
Article
Mailserver
May 31, 2022
3 min read
Mailserver
Linux Format
Article
Mailserver
Dec 12, 2023
4 min read
Hacking 101
Linux Format
Article
Hacking 101
May 31, 2022
5 min read

Related categories

Skip carousel

Reviews for CRAN Recipes

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

CRAN Recipes - William Yarberry

W. YarberryCRAN Recipeshttps://doi.org/10.1007/978-1-4842-6876-6_1

1. DPLYR

William Yarberry¹

(1)

Kingwood, TX, USA

DPLYR is one of my favorite R packages. Its logical and consistent rules replace the older, motley collection of syntactically inconsistent packages and functions. It’s like a Swiss Army knife in the woods—don’t leave home without it.

Most of the book’s code examples use built-in R datasets or toy dataframe hard-coded into the program. For practice, you should substitute your own data when running the snippets of code.

1.1 Filter Commands

The filter command is used to eliminate rows (records) you do not want. The following commands use built-in datasets as the input dataframe. The dataset mtcars is used in the following. The output shows cars with six cylinders only.

Note

The following shown libraries will be used in all code unless otherwise noted. DPLYR is included in the mega-package tidyverse.

1.1.1 Single-Condition Filter

library(tidyverse)

data(mtcars)

#select only cars with six cylinders

six.cyl.only <- filter(mtcars, cyl == 6)

six.cyl.only

## mpg cyl disp hp drat wt qsec vs am gear carb

## Mazda RX4 21.0 6 160.0 110 3.90 2.620 16.46 0 1 4 4

## Mazda RX4 Wag 21.0 6 160.0 110 3.90 2.875 17.02 0 1 4 4

## Hornet 4 Drive 21.4 6 258.0 110 3.08 3.215 19.44 1 0 3 1

## Valiant 18.1 6 225.0 105 2.76 3.460 20.22 1 0 3 1

## Merc 280 19.2 6 167.6 123 3.92 3.440 18.30 1 0 4 4

## Merc 280C 17.8 6 167.6 123 3.92 3.440 18.90 1 0 4 4

## Ferrari Dino 19.7 6 145.0 175 3.62 2.770 15.50 0 1 5 6

In the filter command, equals is a double equals sign ==.

1.1.2 Multiple-Condition Filter

Filter the dataset mtcars for both six cylinders and 110 horsepower:

six.cylinders.and.110.horse.power <- filter(mtcars, cyl == 6,

hp == 110)

six.cylinders.and.110.horse.power

## mpg cyl disp hp drat wt qsec vs am gear carb

## Mazda RX4 21.0 6 160 110 3.90 2.620 16.46 0 1 4 4

## Mazda RX4 Wag 21.0 6 160 110 3.90 2.875 17.02 0 1 4 4

## Hornet 4 Drive 21.4 6 258 110 3.08 3.215 19.44 1 0 3 1

1.1.3 OR Logic for Filtering

You can use as many OR symbols (pipe |) as needed.

Filter based on the OR logical operator:

gear.eq.4.or.more.than.8 <- filter(mtcars, gear == 4|cyl > 6)

gear.eq.4.or.more.than.8

## mpg cyl disp hp drat wt qsec vs am gear carb

## Mazda RX4 21.0 6 160.0 110 3.90 2.620 16.46 0 1 4 4

## Mazda RX4 Wag 21.0 6 160.0 110 3.90 2.875 17.02 0 1 4 4

## Datsun 710 22.8 4 108.0 93 3.85 2.320 18.61 1 1 4 1

## Hornet Sportabout 18.7 8 360.0 175 3.15 3.440 17.02 0 0 3 2

## Duster 360 14.3 8 360.0 245 3.21 3.570 15.84 0 0 3 4

## Merc 240D 24.4 4 146.7 62 3.69 3.190 20.00 1 0 4 2

## Merc 230 22.8 4 140.8 95 3.92 3.150 22.90 1 0 4 2

## Merc 280 19.2 6 167.6 123 3.92 3.440 18.30 1 0 4 4

## Merc 280C 17.8 6 167.6 123 3.92 3.440 18.90 1 0 4 4

## Merc 450SE 16.4 8 275.8 180 3.07 4.070 17.40 0 0 3 3

## Merc 450SL 17.3 8 275.8 180 3.07 3.730 17.60 0 0 3 3

## Merc 450SLC 15.2 8 275.8 180 3.07 3.780 18.00 0 0 3 3

## Cadillac Fleetwood 10.4 8 472.0 205 2.93 5.250 17.98 0 0 3 4

## Lincoln Continental 10.4 8 460.0 215 3.00 5.424 17.82 0 0 3 4

## Chrysler Imperial 14.7 8 440.0 230 3.23 5.345 17.42 0 0 3 4

## Fiat 128 32.4 4 78.7 66 4.08 2.200 19.47 1 1 4 1

## Honda Civic 30.4 4 75.7 52 4.93 1.615 18.52 1 1 4 2

## Toyota Corolla 33.9 4 71.1 65 4.22 1.835 19.90 1 1 4 1

## Dodge Challenger 15.5 8 318.0 150 2.76 3.520 16.87 0 0 3 2

## AMC Javelin 15.2 8 304.0 150 3.15 3.435 17.30 0 0 3 2

## Camaro Z28 13.3 8 350.0 245 3.73 3.840 15.41 0 0 3 4

## Pontiac Firebird 19.2 8 400.0 175 3.08 3.845 17.05 0 0 3 2

## Fiat X1-9 27.3 4 79.0 66 4.08 1.935 18.90 1 1 4 1

## Ford Pantera L 15.8 8 351.0 264 4.22 3.170 14.50 0 1 5 4

## Maserati Bora 15.0 8 301.0 335 3.54 3.570 14.60 0 1 5 8

## Volvo 142E 21.4 4 121.0 109 4.11 2.780 18.60 1 1 4 2

1.1.4 Filter by Minimums, Maximums, and Other Numeric Criteria

The output shows, as one would expect, a single row with the smallest engine displacement:

smallest.engine.displacement <- filter(mtcars, disp ==

min(disp))

smallest.engine.displacement

## mpg cyl disp hp drat wt qsec vs am gear carb

## Toyota Corolla 33.9 4 71.1 65 4.22 1.835 19.9 1 1 4 1

Filter with conditions separated by commas:

data(ChickWeight)

chick.subset <- filter(ChickWeight, Time < 3, weight > 53)

chick.subset

## weight Time Chick Diet

## 1 55 2 22 2

## 2 55 2 40 3

## 3 55 2 43 4

## 4 54 2 50 4

1.1.5 Filter Out Missing Values (NAs) for a Specific Column

The built-in dataset airquality has a missing value in the fifth row of the first column (Ozone):

data(airquality)

head(airquality,10) #before filter

## Ozone Solar.R Wind Temp Month Day

## 1 41 190 7.4 67 5 1

## 2 36 118 8.0 72 5 2

## 3 12 149 12.6 74 5 3

## 4 18 313 11.5 62 5 4

## 5 NA NA 14.3 56 5 5

## 6 28 NA 14.9 66 5 6

## 7 23 299 8.6 65 5 7

## 8 19 99 13.8 59 5 8

## 9 8 19 20.1 61 5 9

## 10 NA 194 8.6 69 5 10

Remove any row with missing values in the Ozone column:

no.missing.ozone = filter(airquality, !is.na(Ozone))

head(no.missing.ozone,8) #after filter

## Ozone Solar.R Wind Temp Month Day

## 1 41 190 7.4 67 5 1

## 2 36 118 8.0 72 5 2

## 3 12 149 12.6 74 5 3

## 4 18 313 11.5 62 5 4

## 5 28 NA 14.9 66 5 6

## 6 23 299 8.6 65 5 7

## 7 19 99 13.8 59 5 8

## 8 8 19 20.1 61 5 9

Note that although the row with NA for Ozone has been eliminated, the row with an NA for Solar.R is still there.

1.1.6 Filter Rows with NAs Anywhere in the Dataset

Use complete.cases() to remove any rows containing an NA in any column:

airqual.no.NA.anywhere <- filter(airquality[1:10,],

complete.cases(airquality[1:10,]))

airqual.no.NA.anywhere

## Ozone Solar.R Wind Temp Month Day

## 1 41 190 7.4 67 5 1

## 2 36 118 8.0 72 5 2

## 3 12 149 12.6 74 5 3

## 4 18 313 11.5 62 5 4

## 5 23 299 8.6 65 5 7

## 6 19 99 13.8 59 5 8

## 7 8 19 20.1 61 5 9

1.1.7 Filter by %in%

%in% is a powerful operator, providing a convenient shorthand for including/excluding specified values:

data(iris)

table(iris$Species) #counts of species in the dataset

## setosa versicolor virginica

## 50 50 50

iris.two.species <- filter(iris,

Species %in% c(setosa, virginica))

table(iris.two.species$Species)

## setosa versicolor virginica

## 50 0 50

Show the number of rows before and after filtering:

nrow(iris); nrow(iris.two.species)

## [1] 150

## [1] 100

1.1.8 Filter for Ozone > 29 and Include Only Three Columns

data(airquality)

airqual.3.columns <- filter(airquality, Ozone > 29)[,1:3]

head(airqual.3.columns)

## Ozone Solar.R Wind

## 1 41 190 7.4

## 2 36 118 8.0

## 3 34 307 12.0

## 4 30 322 11.5

## 5 32 92 12.0

## 6 45 252 14.9

1.1.9 Filter by Total Frequency of a Value Across All Rows

This logic uses group_by to enable counting of rows based on number of gears. After the counts of gears are made, then only those rows whose total counts exceed ten are included in the output. All you want to see here are records that have at least 11 rows with a specific number of gears in the car. The filter is driven solely by frequency of occurrence. Your question may be phrased as just show me records where common gear configurations occur. Five gears are not nearly as common as three and four, so in the filtered dataframe, they are omitted. In the following first table, there are 15 records with a car having three gears, 12 records for four gears, and five records for five gears. After applying the filter and creating a new dataframe, there are no records having five gears:

table(mtcars$gear)

## 3 4 5

## 15 12 5

more.frequent.no.of.gears <- mtcars %>%

group_by(gear) %>%

filter(n() > 10) #

table(more.frequent.no.of.gears$gear)

## 3 4

## 15 12

Additional criteria can be added to the filter by including a requirement that the horsepower be less than 105:

more.frequent.no.of.gears.and.low.horsepower <- mtcars %>%

group_by(gear) %>%

filter(n() > 10, hp < 105)

table(more.frequent.no.of.gears.and.low.horsepower$gear)

## 3 4

## 1 7

1.1.10 Filter by Column Name Using starts with

In this code, records are selected where the column name starts with an S:

names(iris) #show the column names

## [1] Sepal.Length Sepal.Width Petal.Length Petal.Width Species

iris.display <- iris %>% dplyr::select(starts_with(S))

head(iris.display) #use head to reduce number of rows output

## Sepal.Length Sepal.Width Species

## 1 5.1 3.5 setosa

## 2 4.9 3.0 setosa

## 3 4.7 3.2 setosa

## 4 4.6 3.1 setosa

## 5 5.0 3.6 setosa

## 6 5.4 3.9 setosa

1.1.11 Filter Rows: Columns Meet Criteria (filter_at)

Use filter_at to find rows which meet some criteria such as maximum:

new.mtcars <- mtcars %>% filter_at(vars(cyl, hp),

all_vars(. == max(.)))

new.mtcars

## mpg cyl disp hp drat wt qsec vs am gear carb

## Maserati Bora 15 8 301 335 3.54 3.57 14.6 0 1 5 8

Note that only one car, the Maserati Bora, had both the maximum number of cylinders and the maximum horsepower for each column, respectively.

Another example dataset comes from Suzan Baert’s blog (https://suzan.rbind.io/2018/02/dplyr-tutorial-3/#filter-at), using sleep study research.

Load the msleep dataframe from the package ggplot2:

msleep <- ggplot2::msleep

msleep

## # A tibble: 83 x 11

## name genus vore order conservation sleep_total sleep_rem sleep_cycle awake

## 1 Chee~ Acin~ carni Carn~ lc 12.1 NA NA 11.9

## 2 Owl ~ Aotus omni Prim~ 17 1.8 NA 7

## 3 Moun~ Aplo~ herbi Rode~ nt 14.4 2.4 NA 9.6

## 4 Grea~ Blar~ omni Sori~ lc 14.9 2.3 0.133 9.1

## 5 Cow Bos herbi Arti~ domesticated 4 0.7 0.667 20

## 6 Thre~ Brad~ herbi Pilo~ 14.4 2.2 0.767 9.6

## 7 Nort~ Call~ carni Carn~ vu 8.7 1.4 0.383 15.3

## 8 Vesp~ Calo~ Rode~ 7 NA NA 17

## 9 Dog Canis carni Carn~ domesticated 10.1 2.9 0.333 13.9

## 10 Roe ~ Capr~ herbi Arti~ lc 3 NA NA 21

## # ... with 73 more rows, and 2 more variables: brainwt , bodywt

msleep.over.5 <- msleep %>%

select(name, sleep_total:sleep_rem, brainwt:bodywt) %>%

filter_at(vars(contains(sleep)), all_vars(.>5))

msleep.over.5

## # A tibble: 2 x 5

## name sleep_total sleep_rem brainwt bodywt

## 1 Thick-tailed opposum 19.4 6.6 NA 0.37

## 2 Giant armadillo 18.1 6.1 0.081 60

For the preceding code, ignore the select statement for the moment (covered later). The filter_at function says to look at only variables containing the word sleep. Within those variables (in this case, two of them), filter for any values greater than 5. The . means any variable with sleep in the name. Only two rows met the criteria for the filter in this case.

1.2 Arrange (Sort)

Arrange, the sorting function, is as old as the alphabet. Based on the defined ASCII order, it rearranges a dataframe or vector in a sequence defined as either ascending or descending. Sort keys are defined as primary, secondary, and so on.

Load the msleep dataframe from the package ggplot2:

msleep <- ggplot2::msleep

msleep[,1:4]

## # A tibble: 83 x 4

## name genus vore order

## 1 Cheetah Acinonyx carni Carnivora

## 2 Owl monkey Aotus omni Primates

## 3 Mountain beaver Aplodontia herbi Rodentia

## 4 Greater short-tailed shrew Blarina omni Soricomorpha

## 5 Cow Bos herbi Artiodactyla

## 6 Three-toed sloth Bradypus herbi Pilosa

## 7 Northern fur seal Callorhinus carni Carnivora

## 8 Vesper mouse Calomys Rodentia

## 9 Dog Canis carni Carnivora

## 10 Roe deer Capreolus herbi Artiodactyla

## # ... with 73 more rows

1.2.1 Ascending

animal.name.sequence <- arrange(msleep, vore, order)

animal.name.sequence[,1:4]

## # A tibble: 83 x 4

## name genus vore order

## 1 Cheetah Acinonyx carni Carnivora

## 2 Northern fur seal Callorhinus carni Carnivora

## 3 Dog Canis carni Carnivora

## 4 Domestic cat Felis carni Carnivora

## 5 Gray seal Halichoerus carni Carnivora

## 6 Tiger Panthera carni Carnivora

## 7 Jaguar Panthera carni Carnivora

## 8 Lion Panthera carni Carnivora

## 9 Caspian seal Phoca carni Carnivora

## 10 Genet Genetta carni Carnivora

## # ... with 73 more rows

1.2.2 Descending

animal.name.sequence.desc <- arrange(msleep, vore, desc(order))

head(animal.name.sequence.desc[,1:4])

## # A tibble: 6 x 4

## name genus vore order

## 1 Northern grasshopper mouse Onychomys carni Rodentia

## 2 Slow loris Nyctibeus carni Primates

## 3 Thick-tailed opposum Lutreolina carni Didelphimorphia

## 4 Long-nosed armadillo Dasypus carni Cingulata

## 5 Pilot whale Globicephalus carni Cetacea

## 6 Common porpoise Phocoena carni Cetacea

In section Mutate, you’ll see how a variable can be created on the fly and then used in the same statement for sorting.

1.3 Rename

Rename allows you to change the name of one or more columns. It is a convenience function and changes no data.

Rename one or more columns in a dataset:

names(iris)

## [1] Sepal.Length Sepal.Width Petal.Length Petal.Width Species

Show new column names:

renamed.iris <- rename(iris, width.of.petals = Petal.Width,

various.plants.and.animals = Species)

names(renamed.iris)

## [1] Sepal.Length Sepal.Width

## [3] Petal.Length width.of.petals

## [5] various.plants.and.animals

1.4 Mutate

Mutate adds new variables to a dataframe. It requires the original dataframe as the first argument and then arguments to create new variables as the remaining arguments. The following example adds the natural log of length and weight to the dataframe created earlier that contains just the length and weight variables.

Add a new, calculated variable to a dataframe:

data(ChickWeight)

ChickWeight[1:2,] #first two rows

## weight Time Chick Diet

## 1 42 0 1 1

## 2 51 2 1 1

First two rows, with new field added:

Chickweight.with.log <- mutate(ChickWeight,

log.of.weight = log10(weight))

Chickweight.with.log[1:2,]

## weight Time Chick Diet log.of.weight

## 1 42 0 1 1 1.623249

## 2 51 2 1 1 1.707570

1.4.1 mutate_all to Add New Fields All

Enjoying the preview?

Page 1 of 1

CRAN Recipes: DPLYR, Stringr, Lubridate, and RegEx in R

About this ebook

William Yarberry

Related authors

Related to CRAN Recipes

Related ebooks

Computers For You

Related podcast episodes

Related articles

Related categories

Reviews for CRAN Recipes

What did you think?

Book preview

CRAN Recipes - William Yarberry

1. DPLYR

1.1 Filter Commands

1.1.1 Single-Condition Filter

1.1.2 Multiple-Condition Filter

1.1.3 OR Logic for Filtering

1.1.4 Filter by Minimums, Maximums, and Other Numeric Criteria

1.1.5 Filter Out Missing Values (NAs) for a Specific Column

1.1.6 Filter Rows with NAs Anywhere in the Dataset

1.1.7 Filter by %in%

1.1.8 Filter for Ozone > 29 and Include Only Three Columns

1.1.9 Filter by Total Frequency of a Value Across All Rows

1.1.10 Filter by Column Name Using starts with

1.1.11 Filter Rows: Columns Meet Criteria (filter_at)

1.2 Arrange (Sort)

1.2.1 Ascending

1.2.2 Descending

1.3 Rename

1.4 Mutate

1.4.1 mutate_all to Add New Fields All