Practical Deep Reinforcement Learning with Python: Concise Implementation of Algorithms, Simplified Maths, and Effective Use of TensorFlow and PyTorch (English Edition)
By Ivan Gridin
About this ebook
This book introduces readers to reinforcement learning from a pragmatic point of view. It does involve mathematics, but it does not attempt to overburden a reader who is a beginner in the field of reinforcement learning.
The book brings many practical methods to the reader's attention, including Monte Carlo, Deep Q-Learning, Policy Gradient, and Actor-Critic methods. Along with explaining these techniques in detail, it provides real implementations of them using the power of TensorFlow and PyTorch. The book also covers several enticing projects that show the power of reinforcement learning, with everything kept concise, up-to-date, and visually explained.
After finishing this book, the reader will have a thorough, intuitive understanding of modern reinforcement learning and its applications, which will tremendously aid them in delving into the interesting field of reinforcement learning.
Book preview
Practical Deep Reinforcement Learning with Python - Ivan Gridin
Part - I
The first part of the book is devoted to classical reinforcement learning methods. It covers the theoretical foundations of reinforcement learning problems and the primary techniques for solving them. One of the main concepts of the first part is the Q-Learning method. Described in Chapter 6, Escaping Maze With Q-Learning, it is the cornerstone of most reinforcement learning solutions. The book's first part can be considered an introduction to reinforcement learning.
CHAPTER 1
Introducing Reinforcement Learning
Reinforcement learning (RL) is one of the most active research areas in machine learning. Many researchers think that RL will take us closer to achieving artificial general intelligence. In the past few years, RL has evolved rapidly and has been used in complex applications ranging from stock trading to self-driving cars. The main reason for this growth is deep reinforcement learning, a combination of deep learning and reinforcement learning. It is this promising area of machine learning that we will study in this book.
Structure
In this chapter, we will discuss the following topics:
What is reinforcement learning?
Reinforcement learning mechanism
Reinforcement learning vs. supervised learning
Applications of reinforcement learning
Objectives
After completing this chapter, you will have a basic understanding of reinforcement learning and its key definitions. You will also have learned how reinforcement learning works and how it differs from other machine learning approaches.
What is reinforcement learning?
Reinforcement learning is defined as a machine learning technique concerned with how agents should take actions in a surrounding environment depending on their current state. RL is the part of machine learning that helps an agent maximize the cumulative reward collected over some sequence of actions. In RL, agents act in a known or unknown environment, constantly adapting and learning from collected experience. The feedback of an environment might be positive, also known as a reward, or negative, also called a punishment. At this point, all these definitions may seem too abstract and unclear, but we will elaborate on them throughout this chapter.
The following figure represents the key concept of RL:
Figure 1.1: Reinforcement learning
Here, the agent starts in some initial state in some environment. Then, the agent decides to take some action. The environment reacts to the agent's action, returns the agent some reward for that action, and transfers it to another state.
The most commonly used reinforcement learning terms are as follows:
Agent is a decision-maker who defines what action to take.
Examples: Self-driving car, chess player, stock trading robot
Action is a concrete act in a surrounding environment that is taken by the agent.
Examples: Turn car left, move chess pawn one cell forward, sell all assets
Environment is a problem context that the agent interacts with.
Examples: Car track, chess board, stock market
State is a position of the agent in the environment.
Examples: Car coordinates on the track and its speed, arrangement of pieces on the chessboard, price of assets
Reward is a numerical value returned by an environment as the reaction to the agent's action.
Examples: Reach the finish line without an accident, win a chess game, earn more money
RL is learning what to do, that is, how to map situations to actions so as to maximize reward. The agent is not told which actions to take but must discover which actions produce the most reward by trying them. An action may affect not only the immediate reward but also the next situation and, through it, all subsequent rewards. This means the agent should consider not just the immediate reward but the reward in the long term.
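The interaction loop in Figure 1.1 can be sketched in a few lines of Python. The toy LineWorld environment and the random agent below are invented purely for illustration (they are not from the book): the agent wanders along five cells and receives a reward of 1 when it reaches the rightmost one.

```python
import random

class LineWorld:
    """A toy environment: the agent walks on cells 0..4 and is
    rewarded only for reaching the rightmost cell."""
    def __init__(self):
        self.state = 0

    def step(self, action):
        # action is -1 (move left) or +1 (move right), clipped to the board
        self.state = max(0, min(4, self.state + action))
        reward = 1.0 if self.state == 4 else 0.0
        done = self.state == 4
        return self.state, reward, done

env = LineWorld()
total_reward = 0.0
state, done = env.state, False
while not done:
    action = random.choice([-1, +1])        # a random (untrained) agent
    state, reward, done = env.step(action)  # environment reacts: new state + reward
    total_reward += reward
print(total_reward)  # 1.0 (the single reward earned at the goal cell)
```

Even this untrained agent eventually stumbles into the goal; the whole point of RL is to learn a policy that gets there deliberately, maximizing the reward collected along the way.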
Reinforcement learning mechanism
In life, we usually try to maximize our rewards. This does not mean we are always thinking about money or material things. For example, when we read a new book to learn new skills, we understand that it is better to read carefully, without hurrying. The way we read the book is our strategy, and the skills we gain are our reward. When we negotiate with other people, we try to be polite, and the feedback we get is our reward.
The purpose of the reward is to tell our agent how well it has behaved. The main goal of RL is to find a strategy that maximizes the reward over some number of actions. Let's look at some simple examples that illustrate the reinforcement learning mechanism.
Consider the following scientifically factual scenario. A robot has arrived on our planet. This robot is very good at designing posters but does not know how to negotiate with people. Its goal is to get a job and make a lot of money within 5 years. A good plan, why not? Every day, the robot makes a particular decision about how it will act that day. At the end of the day, it checks its bank account and reviews its standing in the company.
Let's consider the first scenario. On its first working day, the robot decides to steal a computer from the office and sell it. This may seem like a pretty good decision because it increases the robot's balance significantly. But of course, a decision like this can be made only once, and the robot's profits will stop there.
The following figure illustrates the first scenario:
Figure 1.2: First strategy
Now, let's consider the second scenario. Every day, the robot works hard and learns new things. In this case, its strategy is long-term. It may be inferior to other strategies in the short term, but it will be significantly more profitable in the long term.
Figure 1.3: Second strategy
Of course, in real life, everything is much more complicated. But this example illustrates the principle when it is necessary to think several steps ahead. A solution that has a quick effect can be fatal in the long run. Reinforcement learning aims to find long-term strategies that maximize the agent's reward.
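The two strategies can be compared with back-of-the-envelope arithmetic. All numbers below (the one-time payoff of stealing, the daily wage) are made-up assumptions for illustration only:

```python
# Cumulative reward of the robot's two strategies over 30 days.
STEAL_PAYOFF = 500   # assumed one-time gain from stealing the computer
DAILY_WAGE = 100     # assumed steady income from honest work

def cumulative_reward(days, strategy):
    if strategy == "steal":
        # one big reward on day 1, then fired: no further income
        return STEAL_PAYOFF
    # "work": a smaller reward every day, accumulating over time
    return DAILY_WAGE * days

print(cumulative_reward(30, "steal"))  # 500
print(cumulative_reward(30, "work"))   # 3000
```

Stealing looks better on day one, but after 30 days the long-term strategy has accumulated six times more reward, which is exactly the kind of trade-off an RL agent must learn.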
Here are some essential characteristics of reinforcement learning:
There is no supervisor; the agent only receives a reward signal
Decision making is sequential
The agent's actions determine the subsequent data it receives
The term reinforcement comes from the fact that a reward received by an agent should reinforce its behavior in a positive or negative direction. A local reward indicates the success of the agent's most recent action, not the overall success achieved by the agent so far. Of course, getting a large reward for some action doesn't mean that you won't face dramatic consequences later due to your previous decisions. Remember our example with the robot that decides to steal a computer: it could look like a brilliant idea until you think about the next day.
A problem can be considered an RL problem if we can define the following:
Agent: Define the subject, which takes some actions.
Environment: Define the system that receives an agent's actions.
Set of states: Define the set of states that an agent can receive. This set can be infinite.
Set of actions: Define the set of actions an agent can take. This set can be infinite.
Reward: Define what the agent's primary goal is and how it can be achieved with some reward system.
If all the above definitions can be obtained, you are most likely dealing with a reinforcement learning problem.
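The five components above can be written down as a simple checklist structure. The RLProblem dataclass and the chess example below are illustrative sketches, not an API from the book:

```python
from dataclasses import dataclass

@dataclass
class RLProblem:
    """Checklist for deciding whether a problem fits the RL framing:
    if you can fill in all five fields, it is likely an RL problem."""
    agent: str        # the decision-maker
    environment: str  # the system that receives the agent's actions
    states: str       # the (possibly infinite) set of states
    actions: str      # the (possibly infinite) set of actions
    reward: str       # how the agent's goal maps to a reward signal

chess = RLProblem(
    agent="chess player",
    environment="chess board and the opponent",
    states="arrangements of pieces on the board",
    actions="legal chess moves",
    reward="+1 for a win, 0 for a draw, -1 for a loss",
)
print(chess.agent)  # chess player
```

Filling this checklist out for stock trading or NAS, as the tables below do, is a quick way to confirm each problem really is an RL problem.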
Reinforcement learning vs. supervised learning
Now that we have an intuitive understanding of reinforcement learning, we can examine how it differs from traditional supervised learning. A good rule of thumb is to treat reinforcement learning as a dynamic model and supervised learning as a static model. Let's elaborate on this.
We can think of a supervised learning model as a statistical model that extracts correlations and patterns from data and uses them to make predictions without being explicitly programmed. Generally speaking, a supervised model performs only one action: it takes an input and returns an output. Its primary goal is to provide you with an automatically built function F that maps some input X into some output Y:
Figure 1.4: Supervised learning
Reinforcement learning, in contrast, builds an agent that interacts with an environment and produces a whole sequence of actions:
Figure 1.5: Reinforcement learning
Let's summarize all distinctions between reinforcement learning and supervised learning in the following table:
Table 1.1: Reinforcement learning vs. supervised learning
It is important to understand the difference between reinforcement learning and supervised learning. This knowledge will help you use each of these methods correctly.
Examples of reinforcement learning
In this section, we will see some popular examples of RL problems. In all these problems, we have the following: agent, environment, set of states, set of actions, and the reward.
Stock trading
This type of activity aims to make a profit by buying and selling shares of different companies. Traders tend to buy the stock of a company when it is cheap and sell it when the price is high:
Table 1.2: Stock trading as RL problem
Chess
Chess is one of the oldest games. This game has many different styles and approaches. However, chess is also a reinforcement learning problem:
Table 1.3: Chess as RL problem
Neural Architecture Search (NAS)
RL has been successfully applied to the domain of Neural Architecture Search (NAS). The goal is to get the best performance on some dataset by selecting the number of layers or their parameters, adding extra connections, or making other changes to the architecture. The reward, in this case, is the performance of the resulting neural network architecture:
Table 1.4: NAS as RL problem
As you can see, many practical problems can be solved using the reinforcement learning approach.
Conclusion
Reinforcement learning is a machine learning approach that aims to find optimal decision-making strategies. It differs from other machine learning approaches in its emphasis on an agent learning from direct interaction with its environment. It requires neither traditional supervision nor a complete computational model of the environment. Reinforcement learning aims to find an appropriate long-term strategy that allows an agent to collect the maximum reward. In the next chapter, we will study the theory of Markov decision processes, which forms the base of the entire reinforcement learning approach.
Points to remember
A solution that has a quick effect can be fatal in the long run.
RL doesn't assume any supervisor; the agent only receives a reward signal.
RL produces a sequential decision-making strategy.
Reinforcement learning is a dynamic model, and supervised learning is a static model.
Multiple choice questions
Let's consider a popular and simple computer game called Tetris, which has relatively simple mechanics. When the player builds one or more completed rows, the completed rows disappear, and the player gains some points. The game's goal is to prevent the blocks from stacking up to the top of the screen and collect as many points as possible.
Figure 1.6: Reinforcement learning
What do you think? Can Tetris be considered as an RL problem?
Yes
No
Considering Tetris as an RL problem, define an agent.
Score
Player
Number of disappeared lines
Considering Tetris as an RL problem, define a state.
Score
Arrangement of bricks and score
Arrangement of bricks, score, and the next element
Answers
a
b
c
Key terms
Agent: A decision-maker who defines what action to take.
Action: A concrete act in a surrounding environment that is taken by the agent.
Environment: A problem context that the agent interacts with.
State: A position of an agent in the environment.
Reward: A numerical value returned by an environment as the reaction to the agent's action.
CHAPTER 2
Playing Monopoly and Markov Decision Process
In the last chapter, you got a general introduction to reinforcement learning (RL). We saw examples of different problems and highlighted the main characteristics of reinforcement learning. But before we start solving practical problems, we will formally describe how you can solve them using the RL approach. One of the cornerstones of RL is the Markov decision process (MDP). This concept is the foundation of the whole theory of reinforcement learning. We will dedicate this chapter to explaining what the Markov decision process is with the help of Monopoly game examples. We'll discuss MDPs in greater detail as we walk through the chapter. Markov chains and Markov decision processes are extensively used in many areas of engineering and statistics, so reading this chapter will be useful for understanding not only reinforcement learning but a much wider range of topics. If you're already familiar with MDPs, you can skim this chapter, focusing on the terminology definitions that will be used later in the book.
Structure
In this chapter, we will discuss the following topics:
What is the best strategy for playing Monopoly?
Markov chain
Markov reward process
Markov decision process
Policy
Monopoly as Markov decision process
Objectives
The primary goal of this chapter is to introduce the fundamental concepts of reinforcement learning: the Markov chain, the Markov reward process, the Markov decision process, and the policy. We will look at simple, straightforward examples that reveal what lies at the heart of these concepts. This chapter will give you a clear understanding of the tasks that reinforcement learning deals with.
Choosing the best strategy for playing Monopoly
The formal mathematical definition of the Markov decision process often confuses readers, although the concept is not as complicated as it might seem. In this chapter, we will explore what a Markov decision process is by playing the popular game of Monopoly.
Let's create a simplified version of the Monopoly game.
We will consider only simplified rules here; there is no need to go through the complete official rulebook in this chapter.
List of rules
Our custom simplified Monopoly game will follow the given set of rules:
Two players are playing. For the sake of simplicity, we will consider a game for two players only. We will denote the players by a square and a triangle:
Figure 2.1: Monopoly players
Each player rolls the dice and moves forward a certain number of cells:
Figure 2.2: Player 1 moves four steps forward
Each cell can be purchased for the price indicated on it. When a player lands on a free cell, they have two options:
Buy a cell
Do not buy a cell
It is not obligatory to buy a free cell:
Figure 2.3: Cell prices
If a player lands on someone else's cell, they must pay the other player 20% of the cost of the cell.
Figure 2.4: Player 1 has to pay $2 to Player 2
Each player starts the game with $100.
There are surprise cells on the board. They randomly give three results:
Player gets $10 from the bank
Player gives $5 to the bank
Player skips one turn
A player loses when they run out of money.
Let's take a look at the entire board:
Figure 2.5: Monopoly playing board
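The rules above can be captured in a few helper functions. This is a minimal, hypothetical sketch of the game's mechanics (the function names are my own, not from the book):

```python
import random

STARTING_MONEY = 100     # each player starts with $100
RENT_RATE = 0.2          # pay 20% of a cell's cost to its owner

def roll_dice():
    """Roll a standard six-sided die to move forward."""
    return random.randint(1, 6)

def rent(cell_cost):
    """Rent due when landing on another player's cell."""
    return RENT_RATE * cell_cost

def surprise():
    """A surprise cell gives one of three random outcomes."""
    return random.choice(["+$10 from bank", "-$5 to bank", "skip turn"])

print(rent(10))  # 2.0 -- landing on a $10 cell costs $2, as in Figure 2.4
```

Note how every rule is either deterministic (rent) or random (dice, surprise cells); this mix of chance and choice is what makes the game a good model for the Markov decision processes discussed in this chapter.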
Now that we have defined the rules, we come to a more interesting question: what strategy should we choose for the game? It would seem that there is a reasonable and straightforward strategy: buy everything you can! Indeed, the more cells a player buys, the more rent they will receive when the other player lands on their cells. But it is not so simple. Let's take a look at the example in Figure 2.6:
Figure 2.6: To buy or not to buy?
Suppose Player 1 has only $40 left and has just landed on a cell that costs $40. Should they buy it? If Player 1 buys it, the probability of losing on the next move is extremely high, because Player 1 will have no money left and may land on cells that have already been bought by Player 2:
Figure 2.7: Player 1 can lose on the next turn if they buy the cell
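A rough way to quantify the dilemma in Figure 2.6 is to compare the chance of going broke under each choice. Apart from the 20% rent rule and the $40 figures from the text, every number below (the landing probability, the opponent's cell cost) is an invented assumption for illustration:

```python
# Risk of being unable to pay rent on the next turn, buy vs. skip.
cash = 40
cell_cost = 40
p_hit_opponent = 0.5                  # assumed chance of landing on Player 2's cell
opponent_cell_cost = 30               # assumed cost of that cell
rent_due = 0.2 * opponent_cell_cost   # 20% rent rule -> $6

cash_if_buy = cash - cell_cost        # $0 left after buying
cash_if_skip = cash                   # $40 left if we skip

# Broke next turn only if we land on the opponent's cell and can't pay rent
p_broke_if_buy = p_hit_opponent if cash_if_buy < rent_due else 0.0
p_broke_if_skip = p_hit_opponent if cash_if_skip < rent_due else 0.0
print(p_broke_if_buy, p_broke_if_skip)  # 0.5 0.0
```

Under these assumptions, buying turns a safe position into a coin flip for survival, which is why the naive "buy everything" strategy breaks down.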
As we can see, there is no primitive strategy in this game. A more advanced approach