Machine Learning in Earth, Environmental and Planetary Sciences: Theoretical and Practical Applications
Ebook · 804 pages · 6 hours


About this ebook

Machine Learning in Earth, Environmental and Planetary Sciences: Theoretical and Practical Applications is a practical guide to implementing a variety of extreme learning machine algorithms on Earth and environmental data. The book provides guided examples using real-world data for numerous novel and mathematically detailed machine learning techniques that can be applied in Earth, environmental, and planetary sciences, including detailed MATLAB coding coupled with line-by-line descriptions of the advantages and limitations of each method. The book also presents common postprocessing techniques required for correct data interpretation.

This book provides students, academics, and researchers with a detailed understanding of how machine learning algorithms can be applied to solve real-world problems, how to prepare data, and how to interpret the results.

  • Describes how to develop different schemes of machine learning techniques and apply them to Earth, environmental, and planetary data
  • Provides detailed, guided line-by-line examples using real-world data, including the appropriate MATLAB codes
  • Includes numerous figures, illustrations and tables to help readers better understand the concepts covered
Language: English
Release date: Jul 3, 2023
ISBN: 9780443152856
Author

Hossein Bonakdari

Dr. Bonakdari obtained his PhD in Civil Engineering from the University of Caen Normandy (France). He has worked for several organizations, most recently as an Associate Professor in the Department of Civil Engineering at the University of Ottawa (Canada). He is one of the most influential scientists in the field of developing novel algorithms for solving practical problems through the decision-making abilities of AI. His research also focuses on creating comprehensive methodologies in the areas of simulation modeling, optimization, and machine learning algorithms. The results of his research have been published in international journals and presented at international conferences. He was included in the list of the world's top 2% scientists published by Stanford University and serves on the editorial boards of several journals.


    Book preview


    Chapter 1

    Dataset preparation

    Abstract

    The machine learning (ML) approach, a powerful tool for solving complex nonlinear problems, has attracted the attention of many scholars in various fields of applied sciences, including social science, chemical engineering, physics and astronomy, agriculture and biological science, mathematics, earth and planetary sciences, environmental science, computer science, etc. The primary objective of the ML approach is to generate an intelligent model capable of producing solutions to complex problems, problems which humans would be unable to solve without the help of an expert system. In this chapter, the prerequisite steps required to model using ML techniques are presented in detail. This chapter begins with an overview of the modeling process for ML applications, a process which may be implemented regardless of the specific ML technique considered. The reader is introduced to five different real-world problems which contain two to six input variables and 100 to more than 1000 sample points.

    Keywords

    Data; machine learning; artificial intelligence; MATLAB; barplot

    1.1 The modeling process

    Prior to implementing any machine learning (ML) technique, the modeler must first understand the general approach to be followed when faced with any modeling problem. In the analysis and resolution of such problems, a consistent modeling paradigm or methodology may often be followed. Fig. 1.1 presents a schematic detailing all of the steps involved in the resolution of ML-based modeling problems. As shown in this figure, the ML application process is composed of four main components: (1) data collection, (2) preprocessing, (3) modeling by ML techniques, and (4) postprocessing. During data collection, a set of data relating independent variables to one or more target variables is identified.

    Figure 1.1 The modeling process/paradigm in machine learning.

    In Fig. 1.1, for example, three independent input variables are identified as raw data. Data collection/data generation can be done in many ways, including through personal experimentation (laboratory experimental results), from open data sources (such as USGS or Statistics Canada), or from review documentation and/or existing published data sets (Fig. 1.2).

    Figure 1.2 Different types of data collections.

    The second step is preprocessing. This is considered one of the most critical steps when modeling with ML techniques, as data transformation and cleaning can result in more meaningful modeling results and better model performance. In fact, without preprocessing, it may prove difficult in some circumstances to fit an adequate and generalizable model to the data set (Niu et al., 2020; Obaid et al., 2019). Consider the following example. Suppose the ranges of the different input variables within a dataset are not the same and span several orders of magnitude. In this case, the model will place more effort into optimizing the adjustable parameters associated with the variables of higher magnitude than those associated with the variables of smaller magnitude. The changes obtained during optimization for the variables with smaller values will be negligible, and the model may even ignore them altogether. One of the most well-known approaches to overcoming this limitation of ML models is to apply preprocessing in the form of normalization (Ebtehaj & Bonakdari, 2016a; Ivanyuk & Soloviev, 2019; Qasem et al., 2017) or standardization (Ebtehaj et al., 2019; Gómez-Escalonilla et al., 2022; Zeynoddin et al., 2019) of the raw data. From this simple example, it can be readily appreciated that familiarity with preprocessing methods is fundamental to modeling with ML methods. In addition to improving model accuracy, preprocessing may make the input data simpler for the modeler to understand and easier to compare (Bonakdari et al., 2019; Moeeni et al., 2017). Besides data scaling in the form of normalization (Zeynoddin, Bonakdari et al., 2020) or standardization (Zeynoddin, Ebtehaj et al., 2020; Zhang et al., 2018), data splitting (Ebtehaj et al., 2020) and cross-validation (Bonakdari & Zeynoddin, 2022; Ebtehaj & Bonakdari, 2016b; Ferdinandy et al., 2020) are also used during the preprocessing phase to divide the data into training and testing samples (Fig. 1.3).

    Figure 1.3 Summary of fundamental preprocessing techniques.
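    As a concrete illustration of the scaling operations mentioned above, the following minimal MATLAB sketch applies min-max normalization and z-score standardization to a raw input matrix. The variable names and the synthetic data are illustrative only and are not taken from the book's listings.

        % Minimal sketch: scaling a raw input matrix X (rows = samples,
        % columns = variables). X here is synthetic placeholder data.
        % Requires MATLAB R2016b or later for implicit expansion.
        X  = [rand(100,1)*20, rand(100,1)*0.05, rand(100,1)*1000];  % very different ranges
        Xn = (X - min(X)) ./ (max(X) - min(X));   % min-max normalization to [0, 1]
        Xs = (X - mean(X)) ./ std(X);             % standardization (zero mean, unit SD)

    Newer MATLAB releases (R2018a and later) also provide the built-in normalize function, for example normalize(X,'range') and normalize(X,'zscore'), which perform the same operations.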

    The third step in the paradigm is modeling using ML approaches. In this step, the best model should be identified through the use of optimization techniques coupled with preprocessing of the input data. To find the optimum model, a set of quantitative postprocessing tools, such as statistical indices, and qualitative tools, such as scatter plots (Kim et al., 2019), box plots (Jato-Espino et al., 2019), Taylor diagrams (Hu et al., 2021), uncertainty analysis (Herrera et al., 2022; Sharafati et al., 2020), and reliability analysis (Hariri-Ardebili & Pourkamali-Anaraki, 2018a,b), must be considered. After selecting the best model with these tools, the final model can be applied to new data sets in practical tasks. A summary of the fundamental postprocessing techniques is presented in Fig. 1.4.

    Figure 1.4 Summary of fundamental postprocessing techniques.
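    To make the quantitative side of postprocessing concrete, the sketch below compares a vector of model predictions against observations using a few common statistical indices and a scatter plot. The choice of indices (RMSE, MAE, and the correlation coefficient), the variable names, and the synthetic data are assumptions for illustration; they are not prescribed by the book.

        % Minimal sketch of quantitative postprocessing with synthetic data.
        Obs  = 10*rand(48,1);                  % placeholder observed values
        Pred = Obs + 0.5*randn(48,1);          % placeholder model predictions
        RMSE = sqrt(mean((Pred - Obs).^2));    % root mean square error
        MAE  = mean(abs(Pred - Obs));          % mean absolute error
        C    = corrcoef(Obs, Pred);            % 2 x 2 correlation matrix
        R    = C(1,2);                         % Pearson correlation coefficient
        scatter(Obs, Pred, 'filled'); hold on
        plot(xlim, xlim, 'k--')                % 1:1 line for visual comparison
        xlabel('Observed'); ylabel('Predicted')
        title(sprintf('RMSE = %.3f, MAE = %.3f, R = %.3f', RMSE, MAE, R))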

    1.2 Data description

    Throughout this text, five different sample data sets (all collected from real-world projects) are considered to demonstrate the development, application, and performance of different ML models. A description of each dataset, termed Examples 1–5 (the data are provided in Appendix 1A), is given in the following subsections. The number of input variables and the total number of samples for each example are provided in Fig. 1.5.

    Figure 1.5 Characteristics of example data. NIV, Number of input variables; NS, number of all samples.

    1.3 Different types of problems

    1.3.1 Example 1: a problem with six input variables

    The dataset considered in Example 1 is a composite set formed by aggregating data from two different studies (Bagheri et al., 2014; Cheong, 1991). The collected data for Example 1 is therefore a combination of two published data sets. In order for a modeler to aggregate datasets, the laboratory conditions under which they were produced must be practically identical, so that the data were collected in the same way. Given that it is time-consuming to perform experiments spanning the full ranges of all variables affecting the investigation, it is often the case that scholars cannot examine all conditions in a single study. By juxtaposing several studies in which the experimental conditions are consistent with each other, the limitations of each individual study can be overcome. In addition, the use of ML methods requires a wide range of independent input values to train the desired model(s), as well as a typically large number of samples. This breadth is required to provide the model with the experience needed to estimate the target variable with acceptable accuracy for unseen samples (i.e., testing samples). For example, if one dataset only covers a range of input values from 0 to 20, while another ranges from 15 to 70, it may prove valuable to develop a model that spans the greater range defined by the juxtaposed set (i.e., from 0 to 70) so that it has a greater range of application in solving real-world problems. Considering more than one data set is a well-known approach in real-world practical applications of ML by scholars (Azimi et al., 2016; Ebtehaj & Bonakdari, 2014; Ebtehaj et al., 2015, 2016, 2017; Gholami et al., 2017).

    In Example 1, the total number of samples is 161, with 113 samples randomly selected to train the model (i.e., training samples), while the remaining 48 samples are used to check the performance of the developed ML-based model when faced with unseen samples (i.e., testing samples). Modeling is thus performed with 70% of all samples, while 30% of the samples are reserved to verify the generalizability of the developed model. It should be noted that the modeling process for ML-based models should be controlled in such a way that the developed model performs well in both the training and testing phases, so that it is generalizable to a range of future tasks.

    Different splitting ratios may be considered for assigning training and testing data, where the maximum proportion of test data is about 50% (i.e., 50% for the training stage and 50% for the testing stage) and the minimum is 10% (i.e., 90% for the training stage and 10% for the testing stage) (Ebtehaj et al., 2020). Reserving 30% of the total data as testing samples is a well-known choice for the testing stage, and this ratio is therefore used throughout this text. The reader should be aware, however, that in some instances the nature of the data set necessitates that different splitting ratios be studied to obtain the optimal distribution of training and testing samples. The optimal percentage is defined such that neither the training nor the testing subset is too small, while the performance of the model in the training and testing stages remains very close.
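    The 70/30 split described above can be implemented in a few lines of MATLAB. The sketch below is illustrative only; the variable names Data and Target, the placeholder values, and the use of randperm are assumptions rather than the book's own code.

        % Minimal sketch: random 70/30 train/test split for the 161 samples
        % of Example 1 (113 training, 48 testing). Data/Target are placeholders.
        Data   = rand(161, 6);   Target = rand(161, 1);
        N      = size(Data, 1);
        idx    = randperm(N);                    % shuffle the sample indices
        nTrain = round(0.70 * N);                % 70% of samples -> 113
        TrainInputs  = Data(idx(1:nTrain), :);
        TrainTargets = Target(idx(1:nTrain), :);
        TestInputs   = Data(idx(nTrain+1:end), :);
        TestTargets  = Target(idx(nTrain+1:end), :);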

    1.3.1.1 Statistical description of Example 1 data using barplot analysis

    The minimum (Min.), average (Avg.), maximum (Max.), and standard deviation (SD) values for all independent inputs (In1, In2, In3, In4, In5, In6), as well as the dependent output (Out) for the training, testing, and total data, are provided in Fig. 1.6A–G for Example 1 data.

    Figure 1.6 Statistical indices of Example 1 training, testing, and total data. (A) Input 1, (B) input 2, (C) input 3, (D) input 4, (E) input 5, (F) input 6, and (G) output.

    From this figure, it can be seen that the ranges of the input values are significantly different from each other, as well as from the range of the output variable. For example, the maximum value of In3 is 10, while this same value is 4 (i.e., In2) or less than 4 (In1, In4, In5, In6) for all other inputs. If the range of values for the different input and output variables differs greatly from one variable to another, the modeler may need to apply normalization during the preprocessing stage. Another consideration for the modeler is the similarity of the ranges of the data used for the training and testing subsets. It is desirable to split the data such that the training subset provides all the necessary experience to the developed model through exposure to the full range of the input variables. If the data distribution spans different ranges in the training and testing subsets, the model may yield poor results.
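    One quick way to check this similarity is to tabulate the minimum and maximum of each input variable for the training and testing subsets side by side. The following sketch uses placeholder data in place of TrainInputs and TestInputs; the table layout is an illustrative choice, not the book's code.

        % Minimal sketch: compare the per-variable ranges of the training and
        % testing subsets. Placeholder data stand in for the Example 1 samples.
        TrainInputs = rand(113, 6);   TestInputs = rand(48, 6);
        RangeCheck  = [min(TrainInputs); max(TrainInputs); ...
                       min(TestInputs);  max(TestInputs)];
        disp(array2table(RangeCheck, ...
             'RowNames',      {'TrainMin', 'TrainMax', 'TestMin', 'TestMax'}, ...
             'VariableNames', {'In1', 'In2', 'In3', 'In4', 'In5', 'In6'}))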

    1.3.1.2 The barplot coding using MATLAB®

    In Fig. 1.6, several statistical indices (minimum, average, maximum, and standard deviation) were computed for each independent input variable as well as for the output variable. In the following subsection, the detailed steps required to code and generate the barplots, including the aggregation of training and test data, the calculation of the indices, and the plotting of the figures, are presented.

    The coding syntax for the generation of a barplot can be divided into several general categories: (1) load the data; (2) merge all samples; (3) calculate the statistical indices; (4) prepare the data for plotting; and (5) plot the results. First, the data must be read or loaded into the MATLAB environment. To do so, the data is first saved within a Microsoft Excel file that contains four sheets (i.e., sheet1, sheet2, sheet3, sheet4), where sheets 1–4 contain the training input, training output, testing input, and testing output, respectively (Fig. 1.7). For the Example 1 data, the number of input variables is six, while the number of output variables is one, as previously shown in the dataset description in Fig. 1.6. In addition, 70% of the data was considered as training data, while the other 30% was reserved as the testing subset. This results in 113 training and 48 testing data samples, respectively.

    Figure 1.7 Data preparation in Microsoft Excel file. (A) Training inputs, (B) training targets, (C) testing inputs, and (D) testing target.

    Code 1.1 presents the required syntax for the load data and merge all samples steps. Before providing the details of the code, its function is conceptualized in Fig. 1.8. According to this figure, using the xlsread command in the MATLAB environment, four different variables are loaded from the previously developed Excel spreadsheet (i.e., TrainInputs, TrainTargets, TestInputs, and TestTargets). In the next step, the training and testing inputs, as well as their corresponding targets, are merged as Inputs and Targets.

    Figure 1.8 The conceptual coding process of Code 1.1.

    Below, the coding details related to loading the data (i.e., Code 1.1), calculating the indices (i.e., Code 1.2), and plotting the figures (i.e., Code 1.3) are explained. In lines 1–3 of Code 1.1, presented next, some general MATLAB functions are used to prepare the MATLAB environment prior to the execution of any programming. These commands are used in almost all MATLAB scripts and are the real-life equivalent of wiping a whiteboard clean: a clean slate. In line 1, clc is used to clear the command window, which erases the text that was previously displayed. Once this command has been executed, the earlier output can no longer be seen using the scroll bar, but previous statements can still be recalled using the up-arrow key (↑). In line 2, the clear command clears variables and functions from the program's memory. Alternatively, the clear all command may be used, which also removes other stored items from memory, such as cached memory, breakpoints, and persistent variables. It is often unnecessary to employ clear all, and the clear command is sufficient. The third command, shown in line 3, is close all, which removes all figures whose handles are not hidden.

    Code 1.1

    In lines 6–9 of Code 1.1, the xlsread command is used to load the data from the Excel spreadsheet developed previously (i.e., Fig. 1.7). The general format of this built-in MATLAB function is xlsread(filename, sheet), where filename is the name of the saved file and sheet is the name of the reference sheet where the data is contained.

    In the case of the Example 1 data set, the data is saved under the Excel filename Example1, which is used as the filename argument, while the sheet argument is specified as either sheet1, sheet2, sheet3, or sheet4, for the training input, training target, testing input, and testing target, respectively. Because the training and testing data are independently read into the MATLAB environment, it is necessary to define a variable capable of storing the data features for the entire set. To achieve this, the training inputs and testing inputs are merged and stored into the Inputs variable in line 12, while the training and testing output are merged and stored under the Targets variable in line 13.
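    Putting the pieces described above together, a minimal sketch of what Code 1.1 does (not the book's verbatim listing) might look as follows, assuming the workbook is saved as Example1.xlsx in the current folder. Recent MATLAB releases recommend readmatrix over xlsread, but xlsread is kept here to match the text.

        % Minimal sketch consistent with the description of Code 1.1.
        clc              % clear the command window
        clear            % clear variables from the workspace
        close all        % close all open figure windows

        % Load the four sheets prepared in the Excel file (Fig. 1.7)
        TrainInputs  = xlsread('Example1.xlsx', 'sheet1');   % 113 x 6 training inputs
        TrainTargets = xlsread('Example1.xlsx', 'sheet2');   % 113 x 1 training targets
        TestInputs   = xlsread('Example1.xlsx', 'sheet3');   %  48 x 6 testing inputs
        TestTargets  = xlsread('Example1.xlsx', 'sheet4');   %  48 x 1 testing targets

        % Merge the training and testing samples into the total data set
        Inputs  = [TrainInputs;  TestInputs];    % 161 x 6
        Targets = [TrainTargets; TestTargets];   % 161 x 1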

    Once the training, testing, and total data have been read into the MATLAB environment, the statistical indices, including the minimum, the average, the maximum, and the standard deviation, may be calculated (Code 1.2). Code 1.2 includes four different sections that are independently discussed as Code 1.2.A (finding the minimum), Code 1.2.B (finding the mean), Code 1.2.C (finding the maximum), and Code 1.2.D (finding the standard deviation). A simple graphical definition of Code 1.2 is provided in Fig. 1.9. The statistical indices are computed for each of the training input, training output, testing input, and testing output subsets, as well as the total input and total output. This results in 24 different parameters computed by the MATLAB code. The statistical indices are computed using the built-in MATLAB functions min(x), mean(x), max(x), and std(x) where x contains the data set of interest. For example, x is TrainInputs for calculating the minimum, maximum, mean, and standard deviations of the training inputs.

    Figure 1.9 A graphical definition of Code 1.2.

    Code 1.2.A

    Code 1.2.B

    Code 1.2.C

    Code 1.2.D
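    The listings for Code 1.2.A through 1.2.D are not reproduced in this preview; a minimal sketch of the calculations they are described as performing is given below. The variable names mirror those referenced later in the text (e.g., Min_TrIn); the exact naming in the book's own listings may differ. The sketch assumes the variables created in Code 1.1 are already in the workspace.

        % Minimal sketch of Code 1.2: the four statistical indices computed
        % for the training, testing, and total inputs and outputs
        % (4 indices x 6 data groups = 24 parameters in all).
        Min_TrIn  = min(TrainInputs);    Min_TsIn  = min(TestInputs);    Min_allIn  = min(Inputs);
        Avg_TrIn  = mean(TrainInputs);   Avg_TsIn  = mean(TestInputs);   Avg_allIn  = mean(Inputs);
        Max_TrIn  = max(TrainInputs);    Max_TsIn  = max(TestInputs);    Max_allIn  = max(Inputs);
        SD_TrIn   = std(TrainInputs);    SD_TsIn   = std(TestInputs);    SD_allIn   = std(Inputs);

        Min_TrOut = min(TrainTargets);   Min_TsOut = min(TestTargets);   Min_allOut = min(Targets);
        Avg_TrOut = mean(TrainTargets);  Avg_TsOut = mean(TestTargets);  Avg_allOut = mean(Targets);
        Max_TrOut = max(TrainTargets);   Max_TsOut = max(TestTargets);   Max_allOut = max(Targets);
        SD_TrOut  = std(TrainTargets);   SD_TsOut  = std(TestTargets);   SD_allOut  = std(Targets);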

    During the fourth step, the data is prepared for plotting using Code 1.3, which is schematically represented in Fig. 1.10. For each input and output variable, the minimum, average, maximum, and standard deviation values computed for the training, testing, and total data are merged in Code 1.3. Considering line 3 of Code 1.3.A, for example, the minimum values for the training and testing subsets (i.e., Min_TrIn(1) and Min_TsIn(1)) for In1, as well as for the total data set (Min_allIn(1)), are merged into the variable Min1. Similarly, the mean, maximum, and standard deviation of In1 (i.e., input one) are merged and saved into the variables Avg1, Max1, and SD1 in lines 4 through 6, respectively. This process is repeated for the remaining input variables (i.e., In2, In3, In4, In5, In6) in Codes 1.3.B to 1.3.F and for the output variable (i.e., Out) in Code 1.3.G. Following this, the information for each input and output variable is stored in an array format in Code 1.3.H. These final variables (i.e., In1, In2, In3, In4, In5, In6, and Out) are employed in the next step to plot all of the input and output characteristics. Indeed, each of the newly generated variables (i.e., In1, In2, In3, In4, In5, In6, and Out) is a matrix that contains four rows and three columns. As seen in Fig. 1.11, each row is associated with a given data statistic (i.e., minimum, average, maximum, standard deviation), while the columns present the results for each subset of data (i.e., training, testing, total).

    Figure 1.10 Schematic of Code 1.3.

    Figure 1.11 The size and type of the stored parameters in variable
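    A minimal sketch of the preparation and plotting steps described for Code 1.3 is given below for the first input variable only; the same pattern repeats for In2 through In6 and for Out. The grouped-bar layout and the axis and legend labels are illustrative assumptions, not the book's verbatim figure code, and the sketch assumes the variables from Code 1.2 are in the workspace.

        % Minimal sketch of Code 1.3 for In1: merge the indices computed in
        % Code 1.2 into a 4 x 3 matrix (rows = Min./Avg./Max./SD, columns =
        % training/testing/total) and draw a grouped bar plot in the spirit
        % of Fig. 1.6A.
        Min1 = [Min_TrIn(1), Min_TsIn(1), Min_allIn(1)];
        Avg1 = [Avg_TrIn(1), Avg_TsIn(1), Avg_allIn(1)];
        Max1 = [Max_TrIn(1), Max_TsIn(1), Max_allIn(1)];
        SD1  = [SD_TrIn(1),  SD_TsIn(1),  SD_allIn(1)];
        In1  = [Min1; Avg1; Max1; SD1];           % 4 x 3 array for plotting

        figure
        bar(In1)                                  % grouped bars per statistic
        set(gca, 'XTickLabel', {'Min.', 'Avg.', 'Max.', 'SD'})
        legend({'Training', 'Testing', 'Total'}, 'Location', 'best')
        ylabel('Value');  title('Input 1 (In1)')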
