
Machine Learning for Subsurface Characterization
Ebook · 774 pages · 5 hours


About this ebook

Machine Learning for Subsurface Characterization develops and applies neural networks, random forests, deep learning, unsupervised learning, Bayesian frameworks, and clustering methods for subsurface characterization. Machine learning (ML) focuses on developing computational methods/algorithms that learn to recognize patterns and quantify functional relationships by processing large datasets, also referred to as "big data." Deep learning (DL) is a subset of machine learning that processes "big data" to construct numerous layers of abstraction to accomplish the learning task. DL methods do not require the manual step of extracting/engineering features; however, they require large amounts of data along with high-performance computing to obtain reliable results in a timely manner. This reference helps engineers, geophysicists, and geoscientists become familiar with data science and analytics terminology relevant to subsurface characterization and demonstrates the use of data-driven methods for outlier detection, geomechanical/electromagnetic characterization, image analysis, fluid saturation estimation, and pore-scale characterization in the subsurface.
  • Learn from 13 practical case studies using field, laboratory, and simulation data
  • Become knowledgeable with data science and analytics terminology relevant to subsurface characterization
  • Learn frameworks, concepts, and methods important for the engineer's and geoscientist's toolbox
Language: English
Release date: Oct 12, 2019
ISBN: 9780128177372
Author

Siddharth Misra

Siddharth Misra is currently associate professor at the Harold Vance Department of Petroleum Engineering, Texas A&M University, College Station, Texas. His research work is in the area of data-driven predictive models, machine learning, geosensors, and subsurface characterization. He earned a PhD in petroleum engineering from the University of Texas and a bachelor of technology in electrical engineering from the Indian Institute of Technology in Bombay. He received the Department of Energy Early Career Award in 2018 to promote geoscience research.



    Chapter 1

    Unsupervised outlier detection techniques for well logs and geophysical data

    Siddharth Misra⁎; Oghenekaro Osogba†,a; Mark Powers‡

    ⁎ Harold Vance Department of Petroleum Engineering, Texas A&M University, College Station, TX, United States

    † Texas A&M University, College Station, TX, United States

    ‡ The University of Oklahoma, Norman, OK, United States

    a Formerly at the University of Oklahoma, Norman, OK, United States

    Abstract

    Outliers in well logs and other borehole-based subsurface measurements are often due to poor borehole conditions, problems in data acquisition, irregularity in operating procedures, the presence of rare geological formations, or certain rare processes/phenomena in the subsurface. Detection of outliers is an important step prior to building a robust data-driven or machine learning-based model. We perform a comparative study of the performances of four unsupervised outlier detection techniques (ODTs) on various original and synthetic well-log datasets. The four unsupervised ODTs compared in this study are isolation forest (IF), one-class SVM (OCSVM), local outlier factor (LOF), and density-based spatial clustering of applications with noise (DBSCAN). The unsupervised ODTs are evaluated on four labeled outlier-prone validation datasets using the precision-recall curve, F1 score, area under the curve (AUC) score, and receiver operating characteristic (ROC) curve. Isolation forest is the most robust unsupervised ODT for detecting various types of outliers, whereas DBSCAN is particularly effective in detecting noise in a well-log dataset. Efficient feature engineering and feature selection are important to ensure robust detection of outliers in well logs and subsurface measurements using unsupervised outlier detection methods.

    Keywords

    Isolation forest; DBSCAN; Support vector; Local outlier factor; ROC; AUC; Precision; Recall; Outliers; Precision-recall curve

    Chapter outline

    1 Introduction

    1.1 Basic terminologies in machine learning and data-driven models

    1.2 Types of machine learning techniques

    1.3 Types of outliers

    2 Outlier detection techniques

    3 Unsupervised outlier detection techniques

    3.1 Isolation forest

    3.2 One-class SVM

    3.3 DBSCAN

    3.4 Local outlier factor

    3.5 Influence of hyperparameters on the unsupervised ODTs

    4 Comparative study of unsupervised outlier detection methods on well logs

    4.1 Description of the dataset used for the comparative study of unsupervised ODTs

    4.2 Data preprocessing

    4.3 Validation dataset

    4.4 Metrics/scores for the assessment of the performances of unsupervised ODTs on the conventional logs

    5 Performance of unsupervised ODTs on the four validation datasets

    5.1 Performance on Dataset #1 containing noisy measurements

    5.2 Performance on Dataset #2 containing measurements affected by bad holes

    5.3 Performance on Dataset #3 containing shaly layers and bad holes with noisy measurements

    5.4 Performance on Dataset #4 containing manually labeled outliers

    6 Conclusions

    Appendix A Popular methods for outlier detection

    Appendix B Confusion matrix to quantify the inlier and outlier detections by the unsupervised ODTs

    Appendix C Values of important hyperparameters of the unsupervised ODT models

    Appendix D Receiver operating characteristics (ROC) and precision-recall (PR) curves for various unsupervised ODTs on Dataset #1

    Acknowledgments

    References

    Acknowledgments

    Workflows and visualizations used in this chapter are based upon the work supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division, under Award Number DE-SC-00019266.

    1 Introduction

    From a statistical standpoint, outliers are data points (samples) that are significantly different from the general trend of the dataset. From a conceptual standpoint, a sample is considered an outlier when it does not represent the behavior of the phenomenon/process as represented by most of the samples in a dataset. Outliers are indicative of issues in the data collection/measurement procedure or of unexpected events in the operation/process that generated the data. Detection and removal of outliers is an important step prior to building a robust data-driven (DD) or machine learning-based (ML) model. Outliers skew the descriptive statistics that data analysis and DD/ML methods use to build a model. A model developed on data containing outliers will not accurately represent the normal behavior of the data, because the model picks up the unrepresentative patterns exhibited by the outliers. As a result, there will be nonuniqueness in the model predictions. Data-driven models affected by outliers have lower predictive accuracy and generalization capability.

    Outlier handling refers to all the steps taken to negate the adverse effects of outliers in a dataset. After detecting the outliers in a dataset, how they are handled depends on the immediate use of the dataset. Outliers can be removed, replaced, or transformed depending on the type of dataset and its use. Outlier handling is particularly important as outliers could enhance or mask relevant statistical characteristics of the dataset. For instance, outliers in weather data could be early signs of a weather disaster; ignoring this could have catastrophic consequences. However, before considering outlier handling, we must first detect them.

    Outliers in well logs and other borehole-based subsurface measurements occur due to wellbore conditions, logging tool deployment, and physical characteristics of the geological formations. For example, washed-out zones in the wellbore and borehole rugosity significantly affect the readings of shallow-sensing logs, such as density, sonic, and photoelectric factor (PEF) logs, resulting in outlier responses. Along with wellbore conditions, uncommon beds and sudden changes in physical/chemical properties at certain depths in a formation also result in outlier behavior of the subsurface measurements. In this chapter, we perform a comparative study of the performances of four unsupervised outlier detection techniques (ODTs) on various original and synthetic well-log datasets.

    1.1 Basic terminologies in machine learning and data-driven models

    Before discussing more about outliers, the authors would like to clearly distinguish the following terms: dataset, sample, feature, and target. Data-driven (DD) and machine learning-based (ML) methods find statistical/probabilistic functions by processing a relevant dataset to either relate features to targets (referred to as supervised learning) or appropriately transform features and/or samples (referred to as unsupervised learning). A dataset is a collection of values corresponding to features and/or targets for several samples. Features are physical properties or attributes that can be measured or computed for each sample in the dataset. Targets are the observable/measurable outcomes, and the target values for a sample are consequences of certain combinations of features for that sample. For purposes of unsupervised learning, a relevant dataset is a collection of only the features for all the available samples, whereas for purposes of supervised learning, a dataset is a collection of features and the corresponding targets for all the available samples. An increase in the number of samples increases the size of the dataset, whereas an increase in the number of features increases the dimensionality of the dataset. A DD/ML model becomes more robust with an increase in the size of the dataset. However, with an increase in the dimensionality of the dataset, a model tends to overfit and becomes less generalizable, unless the increase in dimensionality is due to the addition of informative, relevant, uncorrelated features. Prior to building a DD/ML model using supervised learning, a dataset is split into training and testing datasets to ensure the model does not overfit the training dataset and generalizes well to the testing dataset. Further, the training dataset is divided into a certain number of splits to perform cross-validation, which ensures the model learns from and is evaluated on all the statistical distributions present in the training dataset. When evaluating the model on the testing dataset, it is of utmost importance to avoid any form of mixing (leakage) between the training and testing datasets, and one should select relevant evaluation metrics from the several available metrics, each of which has its own assumptions and limitations.
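The split-and-cross-validate workflow described above can be sketched with scikit-learn as follows; the feature matrix, target, and model choice here are synthetic stand-ins for illustration, not data or settings from this chapter:

```python
import numpy as np
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))                        # 200 samples, 5 features
y = X[:, 0] * 2.0 + rng.normal(scale=0.1, size=200)  # target driven by feature 0

# Hold out a testing dataset; it is never touched during training (no leakage)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# 5-fold cross-validation on the training dataset only
model = RandomForestRegressor(n_estimators=50, random_state=0)
cv_scores = cross_val_score(model, X_train, y_train, cv=5)

# Final, one-time evaluation on the untouched testing dataset
model.fit(X_train, y_train)
test_r2 = model.score(X_test, y_test)  # R^2 on unseen samples
```

The key discipline the sketch encodes is that cross-validation scores come only from the training split, and the testing split is used once, at the end.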

    1.2 Types of machine learning techniques

    Machine learning (ML) models can be broadly categorized into three techniques: supervised learning, unsupervised learning, and reinforcement learning. In supervised learning (e.g., regression and classification), a data-driven model is developed by first training the model on samples with known features/attributes and corresponding targets/outcomes from the training dataset; following that, the trained model is evaluated on the testing dataset; and finally, the data-driven model is used to predict targets/outcomes based on the features/attributes of new, unseen samples during model deployment. In unsupervised learning (e.g., clustering and transformation), a data-driven model learns to generate an outcome based on the features/attributes of samples without any prior information about the outcomes. In reinforcement learning (which tends to be very challenging), a data-driven model learns to perform a specific task by interacting with an environment and receiving rewards based on the actions the model performs toward accomplishing the task; the model learns the policy for the task by optimizing the cumulative reward obtained from the environment. These three learning techniques have several day-to-day applications. For instance, supervised learning is commonly used in spam detection: the spam detection model is trained on mails labeled as spam or not spam, and after gaining knowledge from the training dataset and subsequent evaluation on the testing dataset, the trained model can detect whether a new mail is spam. Unsupervised learning is used in marketing, where customers are categorized/segmented based on the similarity/dissimilarity of their purchasing trends; for instance, Netflix's recommendation engine uses the similarity/dissimilarity between users' viewing histories when recommending movies. Reinforcement learning was used to train DeepMind's AlphaGo to beat world champions at the game of Go; it has also been used to train chess-playing engines, where the model is penalized for moves that lead to losing a piece and rewarded for moves that lead to a checkmate.
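The supervised/unsupervised distinction can be illustrated with a toy scikit-learn sketch; the two-cluster data and labels below are invented for illustration and stand in for problems like spam detection and customer segmentation, not for any system named above:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.cluster import KMeans

rng = np.random.default_rng(1)

# --- Supervised learning: features AND labeled targets are available ---
X = np.vstack([rng.normal(0, 1, (50, 2)),    # class 0, e.g., "not spam"
               rng.normal(5, 1, (50, 2))])   # class 1, e.g., "spam"
y = np.array([0] * 50 + [1] * 50)
clf = KNeighborsClassifier(n_neighbors=3).fit(X, y)
pred = clf.predict([[5.1, 4.9]])             # label for a new, unseen sample

# --- Unsupervised learning: only features, no targets ---
# KMeans groups similar samples (e.g., customer segments) without any labels
labels = KMeans(n_clusters=2, n_init=10, random_state=1).fit_predict(X)
```

The classifier needs `y` to learn; KMeans never sees it and discovers the two groups from the feature values alone.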

    A machine learning method first processes the training dataset to build a data-driven model; following that, the performance of the newly developed model is evaluated against the testing dataset. After confirming the accuracy and precision of the data-driven model on the testing dataset, the model is deployed on the new dataset. These three types of dataset, namely, the training, testing, and new datasets, comprise measurements of certain specific features for numerous samples. The training and testing datasets, when used in supervised learning, contain additional measurements of the targets/outcomes. A supervised learning technique tries to functionally relate the features to the targets for all the samples in the dataset. In contrast, for unsupervised learning, the data-driven model development takes place without the targets; in other words, there are no targets to be considered during the training and testing stages of unsupervised learning. Information about the targets is never available in the new dataset, because the trained models are deployed on the new dataset precisely to compute the desired targets or certain outcomes.

    1.3 Types of outliers

    In the context of this work, outliers can be broadly categorized into three types: point/global, contextual, and collective outliers [1]. Point/global outliers refer to individual data points or samples that significantly deviate from the overall distribution of the entire dataset or from the distribution of a certain combination of features. These outliers exist at the tail end of a distribution and vary largely from the mean of the distribution, generally lying beyond 2 standard deviations away from the mean; for example, subsurface depths where porosity is > 40 porosity units or permeability is > 5 Darcy should be considered point/global outliers. From an event perspective, a house getting hit by a meteorite is an example of a point outlier. The second category is the contextual/conditional outliers, which deviate significantly from the data points within a specific context; for example, a large gamma ray reading in sandstone due to an increase in potassium-rich minerals (feldspar). Snow in summer is an example of a contextual outlier. Points labeled as contextual outliers are valid outliers only for a specific context; a change in the context will result in a similar point being considered an inlier. Collective outliers are a small cluster of data points that as a whole deviate significantly from the entire dataset; for example, log measurements from regions affected by borehole washout. Similarly, it is not rare that people move from one residence to the next; however, when an entire neighborhood relocates at the same time, it will be considered a collective outlier. Contextual and collective outliers need a domain expert to guide the outlier detection.
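A minimal sketch of the "beyond 2 standard deviations" rule for point/global outliers mentioned above; the porosity values (in porosity units) are hypothetical:

```python
import numpy as np

# Hypothetical porosity log samples; 45.0 is the planted point/global outlier
porosity = np.array([12.0, 14.5, 13.2, 15.1, 11.8, 13.9, 45.0, 12.6])

mean, std = porosity.mean(), porosity.std()
# Flag samples lying more than 2 standard deviations from the mean
is_outlier = np.abs(porosity - mean) > 2 * std
outliers = porosity[is_outlier]
```

Note that contextual and collective outliers cannot be caught by such a one-feature threshold, which is why they require domain expertise and the multivariate methods discussed later in this chapter.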

    2 Outlier detection techniques

    An outlier detection technique (ODT) is used to detect anomalous observations/samples that do not fit the typical/normal statistical distribution of a dataset. Simple methods for outlier detection use statistical tools, such as the boxplot and the Z-score, on each individual feature of the dataset. A boxplot is a standardized way of representing the distributions of samples corresponding to various features using boxes and whiskers. The box represents the interquartile range (IQR) of the data, and the whiskers extend a multiple of the IQR (typically 1.5 × IQR) beyond the first and third quartiles; any data point/sample outside these limits is considered an outlier. The next simple statistical tool for feature-specific outlier detection is the Z-score, which indicates how far the value of a data point/sample is from the mean of a specific feature. A Z-score of 1 means the sample point is 1 standard deviation away from the mean. Typically, Z-score values greater than +3 or less than −3 are considered outliers. The Z-score is expressed as Z = (x − μ)/σ, where x is the value of the sample for the feature, μ is the mean of the feature, and σ is its standard deviation.
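The two feature-specific rules described above, the boxplot (1.5 × IQR whisker) rule and the |Z| > 3 rule, can be sketched as follows on a synthetic feature with one planted outlier:

```python
import numpy as np

rng = np.random.default_rng(2)
# 100 well-behaved values near 10, plus one planted outlier at 25.0 (last entry)
x = np.concatenate([rng.normal(10.0, 0.5, 100), [25.0]])

# Boxplot rule: flag points lying more than 1.5 x IQR outside the quartiles
q1, q3 = np.percentile(x, [25, 75])
iqr = q3 - q1
box_outlier = (x < q1 - 1.5 * iqr) | (x > q3 + 1.5 * iqr)

# Z-score rule: flag points with |Z| = |x - mean| / std greater than 3
z = (x - x.mean()) / x.std()
z_outlier = np.abs(z) > 3
```

One caveat worth noting: the planted outlier inflates the very mean and standard deviation it is judged against, so with few samples or many outliers the Z-score rule can miss extreme points, which motivates the more robust unsupervised ODTs compared in this chapter.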
