Machine Learning for Oracle Database Professionals: Deploying Model-Driven Applications and Automation Pipelines

Ebook, 425 pages, 3 hours

Author: Heli Helskyaho
ISBN: 9781484270325

By Heli Helskyaho, Jean Yu and Kai Yu

About this ebook

Database developers and administrators will use this book to learn how to deploy machine learning models in Oracle Database and in Oracle’s Autonomous Database cloud offering. The book covers the technologies that make up the Oracle Machine Learning (OML) platform, including OML4SQL, OML Notebooks, OML4R, and OML4Py. The book focuses on Oracle Machine Learning as part of the Oracle Autonomous Database collaborative environment. Also covered are advanced topics such as delivery and automation pipelines.

Throughout the book you will find practical details and hands-on examples showing you how to implement machine learning and automate its deployment. Discussion around the examples helps you gain a conceptual understanding of machine learning. Important concepts discussed include the methods involved, the algorithms to choose from, and mechanisms for process and deployment. Seasoned database professionals looking to make the leap into machine learning as a growth path will find much to like in this book, as it helps them step up and use their current knowledge of Oracle Database to transition into providing machine learning solutions.

What You Will Learn

Use the Oracle Machine Learning (OML) Notebooks for data visualization and machine learning model building and evaluation
Understand Oracle offerings for machine learning
Develop machine learning with Oracle Database using the built-in machine learning packages
Develop and deploy machine learning models using OML4SQL and OML4R
Leverage the Oracle Autonomous Database and its collaborative environment for Oracle Machine Learning
Develop and deploy machine learning projects in Oracle Autonomous Database
Build an automated pipeline that can detect and handle changes in data/model performance

Who This Book Is For
Database developers and administrators who want to learn about machine learning, developers who want to build models and applications using Oracle Database’s built-in machine learning feature set, and administrators tasked with supporting applications on Oracle Database that make use of the Oracle Machine Learning feature set.

Language: English

Publisher: Apress

Release date: Jun 11, 2021




Skip carousel



Machine Learning for Oracle Database Professionals - Heli Helskyaho

H. Helskyaho et al., Machine Learning for Oracle Database Professionals, https://doi.org/10.1007/978-1-4842-7032-5_1

1. Introduction to Machine Learning

Heli Helskyaho¹, Jean Yu² and Kai Yu²

(1) Helsinki, Finland

(2) Austin, TX, USA

We live in exciting times with smartphones and watches, smart clothes, robots, drones, face recognition, smart personal assistants, recommender systems, self-driving cars, and 24/7 service chatbots, all of which use artificial intelligence (AI). But what is intelligence? Intelligence might be defined as the ability to acquire and apply knowledge and skills; in other words, to learn and to use what has been learned. Artificial intelligence is exactly that, but done by computers and software. In real life, people would like to have intelligent machines that can do things people find boring, do inefficiently, or cannot do at all. Artificial intelligence can thus be seen as an extension of human intelligence through computers. The core of artificial intelligence is the ability to learn, that is, to acquire knowledge and skills, and this is machine learning. In machine learning, the machine learns, reasons, and self-corrects. In 1959, Arthur Samuel defined machine learning as a field of study that gives computers the ability to learn without being explicitly programmed, a definition that still describes machine learning very well.

Why Machine Learning?

When Arthur Samuel defined machine learning in 1959, much of the mathematics and statistics needed had already been invented. Still, there was neither the technology nor enough data to put the theory into practice. Today, there are hardware solutions, including GPUs and TPUs for matrix calculation, inexpensive storage solutions for storing data, open data sets, pre-trained models for transfer learning, and so on. All this makes it possible to use machine learning in the most interesting and useful ways. But it is not only that we are now able to use machine learning; it is also necessary to use it. With its volume, velocity, variety, veracity, viability, value, variability, and visualization, big data has made it necessary to change traditional data processing into something more efficient and faster: machine learning.

Machine learning is not a silver bullet, and it should not be seen as such. Machine learning should be used only when it brings value. Typical use cases are when the rules and equations are complex or constantly changing. If the rules are understandable and can be programmed with if-then-else structures, machine learning might not be the best solution.

Classic examples of machine learning use cases are image recognition, speech recognition, fraud detection, predicting shopping trends, spam filters, medical diagnosis, and robotics. Some examples of machine learning use cases for businesses are churn prediction, predicting customer behavior, anticipating voluntary employee attrition, and identifying cross-sell and up-sell opportunities.

An important requirement for machine learning is that you have data; otherwise, it makes no sense. The data is given to the machine, or the machine produces it, as it does in reinforcement learning. The better the quality of the data is, the better it can be used by machine learning. But even when the data is of excellent quality and machine learning works like a charm, a machine learning prediction is never a fact; it is always a sophisticated guess. Sometimes that guess is good and even useful, but sometimes it is not.

Also, a well-working machine learning model will no longer work well if something has changed: perhaps there is more noise in the data, the amount of data is larger, or the quality of data has lessened. In other words, it is important to understand that machine learning models need to be monitored against their defined metrics to make sure they still work as planned, and tuned if necessary.

What Is Machine Learning?

Machine learning can be divided into different categories based on the nature of the training data, the problem type, and the technique used to solve it. This book divides machine learning into three main categories: supervised learning, unsupervised learning, and semi-supervised learning.

Supervised Learning

Supervised machine learning is supervised by a human. Typically, that means that somebody has labeled the data to show the output or the correct answer. For example, somebody manually checks 1000 pictures and labels them to identify which of the pictures show cats, dogs, or horses.

Supervised learning is used when there is enough high-quality data and you know the target (i.e., the data is labeled). The models are trained and tested on known input and known output data so that they can predict outputs for new data. When testing the models, each prediction is compared to the true output to evaluate the model. To make this process meaningful, the training data must be separate from the data used for testing. Each model is built using a different algorithm, and each algorithm processes the data differently. A model applies its algorithm to the data and produces the prediction. Depending on the chosen metrics, the evaluation process identifies which algorithm performed best, and the model using that algorithm can be put into production. The selection of an algorithm depends on the size and type of the data, the insights you want to get from the data, and how those insights will be used. The decision is a trade-off between many things, such as the predictive accuracy on new data, the speed of training, memory usage, transparency (black box vs. clear box, how decisions are made), and interpretability (the ability of a human to understand the model).
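The evaluation process described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the data set, the two candidate models (majority_model and threshold_model), and all helper names are hypothetical, and accuracy is just one of many possible metrics.

```python
import random

def train_test_split(rows, test_fraction=0.25, seed=42):
    """Shuffle the labeled rows and split them into separate training and test sets."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

def accuracy(model, test_rows):
    """Share of test rows where the prediction equals the true label."""
    correct = sum(1 for x, label in test_rows if model(x) == label)
    return correct / len(test_rows)

def majority_model(train_rows):
    """Hypothetical baseline: always predict the most common training label."""
    labels = [label for _, label in train_rows]
    majority = max(set(labels), key=labels.count)
    return lambda x: majority

def threshold_model(train_rows):
    """Hypothetical model: split at the midpoint between the two class means."""
    zeros = [x for x, label in train_rows if label == 0]
    ones = [x for x, label in train_rows if label == 1]
    threshold = (sum(zeros) / len(zeros) + sum(ones) / len(ones)) / 2
    return lambda x: int(x > threshold)

# Hypothetical labeled data: (input, label), where the label is 1 when the input exceeds 50.
data = [(x, int(x > 50)) for x in range(100)]
train, test = train_test_split(data)

# Train each candidate on the training set only, then score it on the unseen test set.
candidates = {"majority": majority_model(train), "threshold": threshold_model(train)}
scores = {name: accuracy(model, test) for name, model in candidates.items()}
best = max(scores, key=scores.get)
print(scores, best)
```

Keeping the test rows out of training is what makes the comparison meaningful: a model scored on data it has already seen would look better than it really is.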

Regression and classification are the most common methods for supervised learning. Regression predicts numeric values and works with continuous data. Classification works with categorized data and assigns data points to classes. So, if you want to predict a quantity, you should use regression. If you want to predict a class or a group, you should use classification. An example of regression is predicting the price of a house over time. An example of classification is predicting the rating of a beer on a scale of 1 to 5, with 1 being poor quality and 5 being excellent. Figure 1-1 is a simple example of regression. From the line shown in Figure 1-1, you can see that for value 3, the prediction of the target value is 1.5.
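As a rough sketch of how such a regression line can be obtained, the following Python code fits a straight line with ordinary least squares and reads off the prediction for the value 3. The data points are hypothetical and were chosen to lie on the line y = 0.5x, which is consistent with the prediction of 1.5 mentioned above; the actual data behind Figure 1-1 is not known.

```python
def fit_line(xs, ys):
    """Ordinary least squares for a single predictor: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training data (assumption: the points lie on y = 0.5x).
xs = [1, 2, 4, 6]
ys = [0.5, 1.0, 2.0, 3.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 3 + intercept  # predicted target for the input value 3
print(round(prediction, 2))  # 1.5
```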

Figure 1-1. An example of regression

Figure 1-2 is an example of classification. The data points are classified as orange or blue. The red line is the boundary between the two categories and shows to which category each data point belongs. You can see that point (4,1) belongs to the orange group, and point (9,2) belongs to the blue group.

Figure 1-2. An example of classification

Time series forecasting can be a supervised learning problem. The machine learning model predicts the value of the next time step by using the value of a previous time step. This method is called the sliding window method. You need data that is suitable for the purpose. For example, the following is a small part of a data set.

You can reconstruct this data set to be useful in supervised learning by setting the next value as the prediction of the value, as follows.

The first and the last rows cannot be used because some of the information is missing, so we remove those rows. Afterward, there is a solid data set that can be used in supervised machine learning.
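The restructuring step can be sketched in Python as follows. The measurements are hypothetical, and the function name and the width parameter (how many previous values form one input) are assumptions made for this illustration. Pairs are built only where both the input and the target exist, which corresponds to dropping the incomplete rows.

```python
def sliding_window(series, width=1):
    """Turn a series into (inputs, target) rows: `width` previous values predict the next one."""
    return [
        (series[i:i + width], series[i + width])
        for i in range(len(series) - width)
    ]

# Hypothetical measurements, ordered by time.
measurements = [100, 110, 108, 115, 120]

rows = sliding_window(measurements)
for inputs, target in rows:
    print(inputs, "->", target)
# [100] -> 110
# [110] -> 108
# [108] -> 115
# [115] -> 120

# A wider window uses more history per row and therefore yields fewer rows.
wide_rows = sliding_window(measurements, width=2)
```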

Time series forecasting can be used in weather forecasting, inventory planning, or resource allocation, for example. Time series prediction can be very complex, and understanding the data is very important. For example, trends in data might be different in summer than in winter, or on weekdays than on weekends. That must be considered when building the model or maybe several models for different trends.

Deep learning has become very popular as a technique for mainly supervised machine learning. Deep learning is typically used with more complex machine learning tasks on text, voice, recommender systems, or images and videos. Text can be transformed into speech using deep learning. Speech can be transformed into text, which can be used as input to another machine learning task, such as translating from the Finnish language to English.

Automatic speech recognition and natural language processing are also typical tasks for deep learning. Recommender systems produce recommendations for users to make their decision process easier and more fluent. There are three kinds of recommender systems: collaborative filtering, content-based, and hybrid recommender systems. A collaborative filtering recommender system uses the decisions of other users with a similar profile as a base for a recommendation for another user. Content-based recommender systems create recommendations based on similarities of new items to those that the user liked in the past. Hybrid recommender systems use multiple approaches when creating recommendations. Visual recognition and computer vision are very typical and useful tasks for deep learning; examples include image or action classification, object detection or recognition, image captioning, and image segmentation.
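To make the collaborative filtering idea concrete, here is a deliberately small Python sketch. It is not a deep learning model: it uses plain cosine similarity between users, and the users, items, ratings, and the min_rating threshold are all hypothetical. It finds the most similar other user and recommends the items that user rated highly and that the target user has not rated yet.

```python
import math

# Hypothetical ratings on a scale of 1 to 5: user -> {item: rating}.
ratings = {
    "anna": {"A": 5, "B": 4, "C": 1},
    "ben": {"A": 5, "B": 5, "D": 4},
    "cara": {"C": 5, "E": 4},
}

def cosine_similarity(a, b):
    """Cosine similarity of two rating dictionaries; unrated items count as 0."""
    common = set(a) & set(b)
    dot = sum(a[item] * b[item] for item in common)
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    return dot / (norm_a * norm_b)

def recommend(user, ratings, min_rating=4):
    """Recommend items that the most similar user liked and `user` has not rated."""
    others = [name for name in ratings if name != user]
    neighbor = max(others, key=lambda name: cosine_similarity(ratings[user], ratings[name]))
    return sorted(
        item
        for item, rating in ratings[neighbor].items()
        if rating >= min_rating and item not in ratings[user]
    )

print(recommend("anna", ratings))  # ['D']
```

A content-based system would instead compare item descriptions with the items a user liked before, without looking at other users at all.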

One difference between classical supervised learning and deep learning is that in deep learning you do not need to perform feature extraction at all; it is done by the machine as part of the deep learning process. In classical supervised learning, feature extraction is time-consuming manual work. Of course, that means that deep learning needs more data and, in general, more resources and time. Deep learning has become more popular and useful because of improvements in many different areas. There is a lot of digital data (photos, videos, voice, etc.) available. The technology has improved: existing data sets and pre-trained models, transfer learning, research such as combining convolutional layers with a neural network, and much more is available. Things that were difficult or nearly impossible to perform using deep learning have become easy and almost trivial. There are plenty of code examples that programmers can use to start building their first deep learning projects.

Deep learning uses neural networks for the prediction process and backpropagation to learn (i.e., to tune the network). A neural network consists of layers of neurons. In each neuron, every input is multiplied by its weight, the products are summed, and a bias is added. The result is passed through an activation function, and the output is passed on to the next layer until the last layer, and with it the prediction, is reached. The weights and the biases are the parameters of the network. Their initial values are set before the learning process starts and are only a guess, but by using backpropagation and an optimizer function, the process tunes those parameters to produce a better-performing model.
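The forward pass described above can be sketched in plain Python. This is a toy illustration under stated assumptions: the network shape (two inputs, two hidden neurons, one output), the sigmoid activation, and every weight and bias value are hypothetical, and a real project would use a deep learning library.

```python
import math

def sigmoid(z):
    """A common activation function that squashes any number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def dense_layer(inputs, weights, biases, activation=sigmoid):
    """Compute one layer: weights[j] holds the weights of neuron j, biases[j] its bias."""
    outputs = []
    for neuron_weights, bias in zip(weights, biases):
        weighted_sum = sum(w * x for w, x in zip(neuron_weights, inputs)) + bias
        outputs.append(activation(weighted_sum))
    return outputs

# Hypothetical input values and initial guesses for the weights and biases.
inputs = [0.5, -1.0]
hidden = dense_layer(inputs, weights=[[0.1, 0.2], [-0.3, 0.4]], biases=[0.0, 0.1])
prediction = dense_layer(hidden, weights=[[0.7, -0.5]], biases=[0.2])
print(prediction)
```

Each call to dense_layer feeds the outputs of one layer into the next, so the last call returns the prediction of the network.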

In a neural network, there are plenty of hyperparameters that need to be defined before starting the training process, and they are typically tuned by comparing the results of different training runs. Some examples of hyperparameters are the number of layers, the number of epochs, the batch size, the number of neurons in each layer, and the choice of activation function, optimizer, and loss function. Training first computes the loss function for the initial guess, and backpropagation computes the gradient of the loss function with respect to the weights and biases. Using that information, the optimizer takes steps in the direction of the negative gradient to reduce the loss. This is repeated for as long as needed to make the weights as good as possible. A convolutional neural network complements the neural network with convolutional layers. Convolutional neural networks are especially useful with image processing. A convolutional neural network consists of several convolutional layers (filter, output, pooling) and a flattening layer to pass the data to a neural network for further processing.
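The idea of stepping in the direction of the negative gradient can be shown with the smallest possible model: one weight and one bias fitted with gradient descent on a mean squared error loss. This is a sketch, not a full backpropagation implementation; the data and the values of learning_rate and epochs are hypothetical. Note that learning_rate and epochs are fixed before training starts, while the weight and the bias are tuned by the process itself.

```python
def train(xs, ys, learning_rate=0.05, epochs=2000):
    """Fit y = w * x + b by gradient descent on the mean squared error."""
    w, b = 0.0, 0.0  # initial guess
    n = len(xs)
    for _ in range(epochs):
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Gradient of the loss mean(error^2) with respect to w and b.
        grad_w = 2 / n * sum(e * x for e, x in zip(errors, xs))
        grad_b = 2 / n * sum(errors)
        # Step against the gradient to reduce the loss.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b

# Hypothetical data generated from y = 2x + 1.
xs = [0, 1, 2, 3]
ys = [1, 3, 5, 7]

w, b = train(xs, ys)
print(round(w, 3), round(b, 3))  # 2.0 1.0
```

In a deep network the same update is applied to every weight and bias, with backpropagation supplying the gradients layer by layer.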

Algorithms for Supervised Learning

A model uses an algorithm to produce a prediction. The goal is to find the best algorithm for the use case. There are plenty of algorithms to be used with supervised learning.

For classification, examples of algorithms include k-nearest neighbors (kNN), naïve Bayes, neural networks, decision trees, and support-vector machines (SVM). kNN categorizes objects based on the classes of their nearest neighbors that have already been categorized. It assumes that objects near each other are similar. kNN is a simple algorithm, but it consumes a lot of memory, and the prediction speed can be slow if the amount of data is large or the data has many dimensions. Naïve Bayes assumes that, given the class, the presence (or absence) of a particular feature is unrelated to the presence (or absence) of any other feature. It classifies new data based on the highest probability of its belonging to a particular class. For example, if a fruit is red, it could be an apple, and if a fruit is round, it could be an apple, but if it is both red and round, there is a stronger probability that the fruit is an apple.
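A minimal kNN classifier fits in a few lines of Python, which also shows why prediction can be slow: every prediction measures the distance to every stored training point. The training points, the class names, and the choice of k = 3 are hypothetical.

```python
import math
from collections import Counter

def knn_predict(training_rows, point, k=3):
    """Predict the class of `point` by a majority vote of its k nearest labeled neighbors."""
    nearest = sorted(training_rows, key=lambda row: math.dist(row[0], point))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical labeled points: ((x, y), class).
training_rows = [
    ((1, 1), "orange"), ((2, 1), "orange"), ((1, 2), "orange"),
    ((8, 8), "blue"), ((9, 8), "blue"), ((8, 9), "blue"),
]

print(knn_predict(training_rows, (2, 2)))  # orange
print(knn_predict(training_rows, (7, 9)))  # blue
```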

Naïve Bayes works well for a data set containing many features (e.g., the dimensionality of the inputs is high). It is simple to implement and easy to interpret. A neural network imitates the way biological nervous systems and the brain process information. A large number of highly interconnected processing elements (neurons) work together to solve specific problems. Neural networks are good for modeling highly nonlinear systems when the interpretability of the model is not important. They are useful when data is available incrementally, you wish to constantly update the model, and unexpected changes in your input data may occur.

Decision trees are very typical classification algorithms. Decision trees, bagged decision trees, or boosted decision trees are tree structures that consist of branching conditions. They predict responses to data by following the decisions in the tree from the root down to a leaf node.
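Following the decisions from the root down to a leaf can be sketched as below. The tree is written by hand for illustration only (a real tree would be learned from data), and the features, values, and class names are hypothetical.

```python
# A hand-written tree: inner nodes test one feature, leaves are class names.
tree = {
    "feature": "color", "equals": "red",
    "yes": {
        "feature": "shape", "equals": "round",
        "yes": "apple",
        "no": "chili",
    },
    "no": "banana",
}

def predict(node, sample):
    """Walk from the root to a leaf, taking the branch that matches the sample."""
    while isinstance(node, dict):
        branch = "yes" if sample[node["feature"]] == node["equals"] else "no"
        node = node[branch]
    return node

print(predict(tree, {"color": "red", "shape": "round"}))    # apple
print(predict(tree, {"color": "yellow", "shape": "long"}))  # banana
```

Because each prediction is just a short chain of readable conditions, a tree like this is easy to interpret.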

A bagged decision tree consists of several trees that are trained independently on data. Boosting involves reweighting misclassified events and building a new tree with the reweighted events. Decision trees are used when there is a need for an algorithm that is easy to interpret and fast to fit, and you want to minimize memory usage, but high predictive accuracy is not a requirement.

A support-vector machine (SVM) classifies data by finding the linear decision boundary, or hyperplane, that separates all the data points of one class from those of another class. The best hyperplane for an SVM is the one with the largest margin between the two classes when the data is linearly separable. If the data is not linearly separable, a loss function penalizes points on the wrong side of the hyperplane. Sometimes SVMs use a kernel to transform nonlinearly separable data into higher dimensions where a linear decision boundary can be found. SVMs work best for high-dimensional, nonlinearly separable data that has exactly two classes. For multiclass classification, an SVM can be used with a technique called error-correcting output codes. An SVM is very useful as a simple classifier: it is easy to interpret, and it is accurate.

For regression tasks, some examples of algorithms are linear regression, nonlinear regression, generalized linear model (GLM), Gaussian process regression (GPR), regression tree, and support-vector regression (SVR).

Linear regression describes a continuous response variable as a linear function of one or more predictor variables. Linear regression could be used when you need an algorithm that is easy to interpret and fast to fit. It is often the first model to be fitted to a new data set and could be used as a baseline for evaluating other, more complex, regression models.

Nonlinear regression describes nonlinear relationships in data. It can be used when data has nonlinear trends and cannot be easily transformed into a linear space.

GLM is a special nonlinear model that uses linear methods. It fits a linear combination of the input to a nonlinear function of the output. It could be used when the response variables have non-normal distributions.

GPR is a nonparametric model used to predict the value of a continuous response variable. It can be used, for example, to interpolate spatial data, as a surrogate model to optimize complex designs such as automotive engines, or to forecast mortality rates.

Regression trees are similar to decision trees for classification, but they are modified to predict continuous responses. They could be used when predictors are categorical (discrete) or behave nonlinearly.

SVM regression algorithms (SVR) work like SVM classification algorithms but are modified to predict a continuous response. Instead of finding a hyperplane that separates the data, SVR algorithms fit a function whose decision boundaries (a margin around the function) enclose as many data points as possible. SVR can be useful with high-dimensional data.

Unsupervised Learning

Unsupervised learning is machine learning with unlabeled data, that is, data with an unknown target. Its goal is to find something useful in the data: unsupervised learning finds hidden patterns or intrinsic structures in the input data.

Clustering is one of the most common methods for unsupervised learning. It is used for exploratory data analysis to find hidden patterns or groupings in data. There are typically two ways of clustering: hard and soft. In hard clustering, each data point belongs to only one cluster, whereas in soft clustering, each data point can belong to more than one cluster.

In Figure 1-3, you can see data points, and in Figure 1-4, you see how they have been clustered into two clusters: green and blue. The idea of clustering is that you tell the algorithm that you want to break the data into two groups, and it finds things that are common to the data points and things that are different. Using that information, the algorithm decides which group (cluster) a particular data point belongs to.
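One way to break data into two groups is k-means, a hard clustering algorithm; it is used here only as an illustrative assumption, since the text does not name the algorithm behind the figures. The data points and the choice of the first two points as starting centroids are hypothetical.

```python
import math

def k_means(points, centroids, iterations=10):
    """Hard clustering: assign each point to its nearest centroid, then move the centroids."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for point in points:
            nearest = min(range(len(centroids)), key=lambda i: math.dist(point, centroids[i]))
            clusters[nearest].append(point)
        # Move each centroid to the mean of its cluster (keep it if the cluster is empty).
        centroids = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster)) if cluster else centroid
            for cluster, centroid in zip(clusters, centroids)
        ]
    return centroids, clusters

# Hypothetical unlabeled data with two visible groups.
points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9), (8.5, 9.5)]

# Ask for two clusters by supplying two starting centroids.
centroids, clusters = k_means(points, centroids=points[:2])
print(clusters[0])  # [(1, 1), (1.5, 2), (1, 0.5)]
print(clusters[1])  # [(8, 8), (9, 9), (8.5, 9.5)]
```

No labels are involved at any point: the groups emerge only from the distances between the data points.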

