Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing

Ebook524 pages4 hours

Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing

Name: Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing
ISBN: 9789815079210

By Gyanendra Verma and Rajesh Doriya

Rating: 0 out of 5 stars

()

Read preview

About this ebook

This book is a detailed reference guide on deep learning and its applications. It aims to provide a basic understanding of deep learning and its different architectures that are applied to process images, speech, and natural language. It explains basic concepts and many modern use cases through fifteen chapters contributed by computer science academics and researchers. By the end of the book, the reader will become familiar with different deep learning approaches and models, and understand how to implement various deep learning algorithms using multiple frameworks and libraries.

This book is divided into three parts. The first part explains the basic operating understanding, history, evolution, and challenges associated with deep learning. The basic concepts of mathematics and the hardware requirements for deep learning implementation, and some of its popular frameworks for medical applications are also covered.

The second part is dedicated to sentiment analysis using deep learning and machine learning techniques. This book section covers the experimentation and application of deep learning techniques and architectures in real-world applications. It details the salient approaches, issues, and challenges in building ethically aligned machines. An approach inspired by traditional Eastern thought and wisdom is also presented.

The final part covers artificial intelligence approaches used to explain the machine learning models that enhance transparency for the benefit of users. A review and detailed description of the use of knowledge graphs in generating explanations for black-box recommender systems and a review of ethical system design and a model for sustainable education is included in this section. An additional chapter demonstrates how a semi-supervised machine learning technique can be used for cryptocurrency portfolio management.

The book is a timely reference for academicians, professionals, researchers and students at engineering and medical institutions working on artificial intelligence applications.

Skip carousel

Intelligence (AI) & Semantics

LanguageEnglish

PublisherBentham Science Publishers

Release dateFeb 6, 2000

ISBN9789815079210

Related to Deep Learning

Related ebooks

Skip carousel

Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing
Ebook
Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing
byGyanendra Verma
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence and Multimedia Data Engineering: Volume 1
Ebook
Artificial Intelligence and Multimedia Data Engineering: Volume 1
bySuman Kumar Swarnkar
Rating: 0 out of 5 stars
0 ratings
Video Data Analytics for Smart City Applications: Methods and Trends
Ebook
Video Data Analytics for Smart City Applications: Methods and Trends
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Recent Developments in Artificial Intelligence and Communication Technologies
Ebook
Recent Developments in Artificial Intelligence and Communication Technologies
byVikash Yadav
Rating: 0 out of 5 stars
0 ratings
Deep Learning for Healthcare Services
Ebook
Deep Learning for Healthcare Services
byParma Nand
Rating: 0 out of 5 stars
0 ratings
Disease Prediction using Machine Learning, Deep Learning and Data Analytics
Ebook
Disease Prediction using Machine Learning, Deep Learning and Data Analytics
byGeeta Rani
Rating: 0 out of 5 stars
0 ratings
Artificial Neural Systems: Principle and Practice
Ebook
Artificial Neural Systems: Principle and Practice
byPierre Lorrentz
Rating: 0 out of 5 stars
0 ratings
Human-Computer Interaction and Beyond: Advances Towards Smart and Interconnected Environments (Part I)
Ebook
Human-Computer Interaction and Beyond: Advances Towards Smart and Interconnected Environments (Part I)
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence: Models, Algorithms and Applications
Ebook
Artificial Intelligence: Models, Algorithms and Applications
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Dominant Algorithms to Evaluate Artificial Intelligence:From the View of Throughput Model
Ebook
Dominant Algorithms to Evaluate Artificial Intelligence:From the View of Throughput Model
byWaymond Rodgers
Rating: 0 out of 5 stars
0 ratings
Introduction to Sensors in IoT and Cloud Computing Applications
Ebook
Introduction to Sensors in IoT and Cloud Computing Applications
byAmbika Nagaraj
Rating: 0 out of 5 stars
0 ratings
The Role of AI in Enhancing IoT-Cloud Applications
Ebook
The Role of AI in Enhancing IoT-Cloud Applications
byAmbika Nagaraj
Rating: 0 out of 5 stars
0 ratings
Quick Guideline for Computational Drug Design (Revised Edition)
Ebook
Quick Guideline for Computational Drug Design (Revised Edition)
bySheikh Arslan Sehgal
Rating: 0 out of 5 stars
0 ratings
Trends in Future Informatics and Emerging Technologies
Ebook
Trends in Future Informatics and Emerging Technologies
byDeepak Kumar
Rating: 0 out of 5 stars
0 ratings
Future Farming: Advancing Agriculture with Artificial Intelligence
Ebook
Future Farming: Advancing Agriculture with Artificial Intelligence
byPraveen Kumar Shukla
Rating: 0 out of 5 stars
0 ratings
Modern Intelligent Instruments - Theory and Application
Ebook
Modern Intelligent Instruments - Theory and Application
byChangjian Deng
Rating: 0 out of 5 stars
0 ratings
Introduction to Machine Learning with Python
Ebook
Introduction to Machine Learning with Python
byDeepti Chopra
Rating: 0 out of 5 stars
0 ratings
Handbook of Mobile Application Development: A Guide to Selecting the Right Engineering and Quality Features
Ebook
Handbook of Mobile Application Development: A Guide to Selecting the Right Engineering and Quality Features
byMohamed Sarrab
Rating: 0 out of 5 stars
0 ratings
Computational Intelligence for Sustainable Transportation and Mobility: Volume 1
Ebook
Computational Intelligence for Sustainable Transportation and Mobility: Volume 1
byDeepak Gupt
Rating: 0 out of 5 stars
0 ratings
Data Science for Agricultural Innovation and Productivity
Ebook
Data Science for Agricultural Innovation and Productivity
byS. Gowrishankar
Rating: 0 out of 5 stars
0 ratings
Fractal Antenna Design using Bio-inspired Computing Algorithms
Ebook
Fractal Antenna Design using Bio-inspired Computing Algorithms
byBalwinder S. Dhaliwal
Rating: 0 out of 5 stars
0 ratings
IoT-enabled Sensor Networks: Architecture, Methodologies, Security, and Futuristic Applications
Ebook
IoT-enabled Sensor Networks: Architecture, Methodologies, Security, and Futuristic Applications
bySamayveer Singh
Rating: 0 out of 5 stars
0 ratings
Quick Guideline for Computational Drug Design
Ebook
Quick Guideline for Computational Drug Design
byA. Hammad Mirza
Rating: 0 out of 5 stars
0 ratings
Changing Humanities and Smart Application of Digital Technologies
Ebook
Changing Humanities and Smart Application of Digital Technologies
byKuo Hung Huang
Rating: 0 out of 5 stars
0 ratings
Smart Antennas: Recent Trends in Design and Applications
Ebook
Smart Antennas: Recent Trends in Design and Applications
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Cross-Industry Blockchain Technology: Opportunities and Challenges in Industry 4.0
Ebook
Cross-Industry Blockchain Technology: Opportunities and Challenges in Industry 4.0
byRajesh Singh
Rating: 0 out of 5 stars
0 ratings
Mobile Computing Solutions for Healthcare Systems
Ebook
Mobile Computing Solutions for Healthcare Systems
byPublishDrive
Rating: 0 out of 5 stars
0 ratings
Multi-Objective Optimization in Theory and Practice II: Metaheuristic Algorithms
Ebook
Multi-Objective Optimization in Theory and Practice II: Metaheuristic Algorithms
byAndré A. Keller
Rating: 0 out of 5 stars
0 ratings
Advances in Time Series Forecasting: Volume 2
Ebook
Advances in Time Series Forecasting: Volume 2
byCagdas Hakan Aladag
Rating: 0 out of 5 stars
0 ratings
Recent Advances in Analytical Techniques: Volume 1
Ebook
Recent Advances in Analytical Techniques: Volume 1
byAtta-ur Rahman
Rating: 0 out of 5 stars
0 ratings

Intelligence (AI) & Semantics For You

Skip carousel

101 Midjourney Prompt Secrets
Ebook
101 Midjourney Prompt Secrets
byMarcus Byrne
Rating: 3 out of 5 stars
3/5
Midjourney Mastery - The Ultimate Handbook of Prompts
Ebook
Midjourney Mastery - The Ultimate Handbook of Prompts
byAndreea Todinca
Rating: 5 out of 5 stars
5/5
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
Ebook
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
byUtpal Chakraborty
Rating: 0 out of 5 stars
0 ratings
Killer ChatGPT Prompts: Harness the Power of AI for Success and Profit
Ebook
Killer ChatGPT Prompts: Harness the Power of AI for Success and Profit
byGuy Hart-Davis
Rating: 2 out of 5 stars
2/5
ChatGPT
Ebook
ChatGPT
byGary Stevens
Rating: 3 out of 5 stars
3/5
AI for Educators: AI for Educators
Ebook
AI for Educators: AI for Educators
byMatt Miller
Rating: 5 out of 5 stars
5/5
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
Ebook
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
byTJ Books
Rating: 3 out of 5 stars
3/5
How To Become A Data Scientist With ChatGPT: A Beginner's Guide to ChatGPT-Assisted Programming
Ebook
How To Become A Data Scientist With ChatGPT: A Beginner's Guide to ChatGPT-Assisted Programming
byRafiq Muhammad
Rating: 5 out of 5 stars
5/5
ChatGPT For Dummies
Ebook
ChatGPT For Dummies
byPam Baker
Rating: 0 out of 5 stars
0 ratings
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
byCea West
Rating: 4 out of 5 stars
4/5
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
byCea West
Rating: 5 out of 5 stars
5/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
ChatGPT Side Hustles 2024 - Unlock the Digital Goldmine and Get AI Working for You Fast with More Than 85 Side Hustle Ideas to Boost Passive Income, Create New Cash Flow, and Get Ahead of the Curve
Ebook
ChatGPT Side Hustles 2024 - Unlock the Digital Goldmine and Get AI Working for You Fast with More Than 85 Side Hustle Ideas to Boost Passive Income, Create New Cash Flow, and Get Ahead of the Curve
byAlec Rowe
Rating: 0 out of 5 stars
0 ratings
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
Ebook
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
byHadelin de Ponteves
Rating: 0 out of 5 stars
0 ratings
Artificial Intelligence: A Guide for Thinking Humans
Ebook
Artificial Intelligence: A Guide for Thinking Humans
byMelanie Mitchell
Rating: 4 out of 5 stars
4/5
Python Machine Learning - Third Edition: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd Edition
Ebook
Python Machine Learning - Third Edition: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd Edition
bySebastian Raschka
Rating: 5 out of 5 stars
5/5
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
Ebook
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
byAlec Rowe
Rating: 0 out of 5 stars
0 ratings
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
Ebook
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
byThe Passive Income Strategist
Rating: 4 out of 5 stars
4/5
TensorFlow in 1 Day: Make your own Neural Network
Ebook
TensorFlow in 1 Day: Make your own Neural Network
byKrishna Rungta
Rating: 4 out of 5 stars
4/5
ChatGPT For Fiction Writing: AI for Authors
Ebook
ChatGPT For Fiction Writing: AI for Authors
byNova Leigh
Rating: 5 out of 5 stars
5/5
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
Ebook
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
byMatthew Hayes
Rating: 0 out of 5 stars
0 ratings
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
Ebook
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
byMaximus Wilson
Rating: 0 out of 5 stars
0 ratings
Make Money with ChatGPT: Your Guide to Making Passive Income Online with Ease using AI: AI Wealth Mastery
Ebook
Make Money with ChatGPT: Your Guide to Making Passive Income Online with Ease using AI: AI Wealth Mastery
byBen Preston
Rating: 0 out of 5 stars
0 ratings
The Secrets of ChatGPT Prompt Engineering for Non-Developers
Ebook
The Secrets of ChatGPT Prompt Engineering for Non-Developers
byCea West
Rating: 5 out of 5 stars
5/5
A Quickstart Guide To Becoming A ChatGPT Millionaire: The ChatGPT Book For Beginners (Lazy Money Series®)
Ebook
A Quickstart Guide To Becoming A ChatGPT Millionaire: The ChatGPT Book For Beginners (Lazy Money Series®)
byS M Howard
Rating: 4 out of 5 stars
4/5
Neural Networks: A Practical Guide for Understanding and Programming Neural Networks and Useful Insights for Inspiring Reinvention
Ebook
Neural Networks: A Practical Guide for Understanding and Programming Neural Networks and Useful Insights for Inspiring Reinvention
bySteven Cooper
Rating: 4 out of 5 stars
4/5
Enterprise AI For Dummies
Ebook
Enterprise AI For Dummies
byZachary Jarvinen
Rating: 3 out of 5 stars
3/5
Dark Aeon: Transhumanism and the War Against Humanity
Ebook
Dark Aeon: Transhumanism and the War Against Humanity
byJoe Allen
Rating: 5 out of 5 stars
5/5
Summary of Super-Intelligence From Nick Bostrom
Ebook
Summary of Super-Intelligence From Nick Bostrom
bySummary Station
Rating: 5 out of 5 stars
5/5
ChatGPT: The Future of Intelligent Conversation
Ebook
ChatGPT: The Future of Intelligent Conversation
byCea West
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Putting the “Fun” in Functional with Frank Chen: Almost everyone is using Slack, and a lot of that is because of the work of those like Frank Chen, Slack’s Senior Staff Software Engineer. Frank is here to tell us how Slack keeps us all angrily typing. But equally as important is his own trajectory which
Podcast episode
Putting the “Fun” in Functional with Frank Chen: Almost everyone is using Slack, and a lot of that is because of the work of those like Frank Chen, Slack’s Senior Staff Software Engineer. Frank is here to tell us how Slack keeps us all angrily typing. But equally as important is his own trajectory which
byScreaming in the Cloud
0 ratings
0% found this document useful
How chaos engineering preps developers for the ultimate game day: On this sponsored episode, our fourth in the series with Intuit, Ben and Ryan chat with Deepthi Panthula, Senior Product Manager, and Shan Anwar, Principal Software Engineer, both of Intuit about how use self-serve chaos engineering tools to control the blast radius of failures, how game day tests and drills keep their systems resilient, and how their investment in open-source software powers their program.
Podcast episode
How chaos engineering preps developers for the ultimate game day: On this sponsored episode, our fourth in the series with Intuit, Ben and Ryan chat with Deepthi Panthula, Senior Product Manager, and Shan Anwar, Principal Software Engineer, both of Intuit about how use self-serve chaos engineering tools to control the blast radius of failures, how game day tests and drills keep their systems resilient, and how their investment in open-source software powers their program.
byThe Stack Overflow Podcast
0 ratings
0% found this document useful
Ep. 33 - Code dependencies are the devil: Have you built your app on someone else's code? And beyond that, does the "secret sauce" of your product depend on external libraries or frameworks? While it's tempting to use the latest and greatest tech as soon as it comes out, that's not always a...
Podcast episode
Ep. 33 - Code dependencies are the devil: Have you built your app on someone else's code? And beyond that, does the "secret sauce" of your product depend on external libraries or frameworks? While it's tempting to use the latest and greatest tech as soon as it comes out, that's not always a...
byfreeCodeCamp Podcast
0 ratings
0% found this document useful
381 Programming Framework: Which Ones To Learn? - Simple Programmer Podcast: If you're a software developer I doubt you'll ever be able to learn everything that software developer has to offer. Every day new programming languages come out, technology changes and the process is updated. All this amount of information makes it...
Podcast episode
381 Programming Framework: Which Ones To Learn? - Simple Programmer Podcast: If you're a software developer I doubt you'll ever be able to learn everything that software developer has to offer. Every day new programming languages come out, technology changes and the process is updated. All this amount of information makes it...
bySimple Programmer Podcast
0 ratings
0% found this document useful
Episode 109 - Honest Security with Jason Meller: In this episode of Hacker Valley Studio podcast, Ron and Chris are joined by Jason Meller, Founder, and CEO of Kolide. Jason has over 10 years of experience in managing and leading security organizations. Jason’s interest in technology and cybersecur...
Podcast episode
Episode 109 - Honest Security with Jason Meller: In this episode of Hacker Valley Studio podcast, Ron and Chris are joined by Jason Meller, Founder, and CEO of Kolide. Jason has over 10 years of experience in managing and leading security organizations. Jason’s interest in technology and cybersecur...
byHacker Valley Studio
0 ratings
0% found this document useful
? #074: Andrew Rodgers on what "open" really means
Podcast episode
? #074: Andrew Rodgers on what "open" really means
byThe Nexus Podcast
0 ratings
0% found this document useful
Deep Learning: Did you know that the concept of deep learning goes way back to the 1950s? However, it is only in recent years that this technology has created a tremendous amount of buzz (and for good reason!). A subset of machine learning, deep learning is inspired...
Podcast episode
Deep Learning: Did you know that the concept of deep learning goes way back to the 1950s? However, it is only in recent years that this technology has created a tremendous amount of buzz (and for good reason!). A subset of machine learning, deep learning is inspired...
byOracle University Podcast
0 ratings
0% found this document useful
Hybrid CTO of Hardware and Software with Ahn Nguyen
Podcast episode
Hybrid CTO of Hardware and Software with Ahn Nguyen
byCTO Podcast
0 ratings
0% found this document useful
Intel Open Source Software
Podcast episode
Intel Open Source Software
byThe Cloudcast
0 ratings
0% found this document useful
Macular Degeneration Meets its Match: Multiple Use Eye technologies with Ocutrx Technologies: Place a fist in front of your eyes: that’s what someone with macular degeneration sees. But Mike Freeman and his brother have created a new medical technology to bring the center to the periphery through an augmented reality headset. Even better,...
Podcast episode
Macular Degeneration Meets its Match: Multiple Use Eye technologies with Ocutrx Technologies: Place a fist in front of your eyes: that’s what someone with macular degeneration sees. But Mike Freeman and his brother have created a new medical technology to bring the center to the periphery through an augmented reality headset. Even better,...
byFinding Genius Podcast
0 ratings
0% found this document useful
Potluck — VSCode × Vercel vs Netlify × Models × Mutations × Multi-Vendor Platforms × Websites vs Web Apps × More!: It’s another potluck! In this episode, Scott and Wes answer your questions about VSCode, Vercel vs Netlify, staying up to date with dev concepts, models and mutations, websites vs seb apps, adaptive vs responsive design, and more! Freshbooks -...
Podcast episode
Potluck — VSCode × Vercel vs Netlify × Models × Mutations × Multi-Vendor Platforms × Websites vs Web Apps × More!: It’s another potluck! In this episode, Scott and Wes answer your questions about VSCode, Vercel vs Netlify, staying up to date with dev concepts, models and mutations, websites vs seb apps, adaptive vs responsive design, and more! Freshbooks -...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Perpetual Licences vs Subscription Models.
Podcast episode
Perpetual Licences vs Subscription Models.
byProduction Expert Podcast
0 ratings
0% found this document useful
Reflection 14: /about
Podcast episode
Reflection 14: /about
byFuture of Coding
0 ratings
0% found this document useful
From search trees to neural nets, a deep dive into natural language processing: Today's episode is sponsored by Rev. We explore the history of automatic speech recognition and computer systems that can understand human commands. From there, we explain the machine learning revolution that has powered recent advancements in speech to text systems like the one employed by Rev. Finally, we look to the future, and imagine the features and services that the next generation of this AI could produce.
Podcast episode
From search trees to neural nets, a deep dive into natural language processing: Today's episode is sponsored by Rev. We explore the history of automatic speech recognition and computer systems that can understand human commands. From there, we explain the machine learning revolution that has powered recent advancements in speech to text systems like the one employed by Rev. Finally, we look to the future, and imagine the features and services that the next generation of this AI could produce.
byThe Stack Overflow Podcast
0 ratings
0% found this document useful
A murder mystery: who killed our user experience?: On this sponsored episode of the Stack Overflow Podcast, we talk with Greg Leffler of Splunk about the keys to instrumenting an observable system and how the OpenTelemetry standard makes observability easier, even if you aren’t using Splunk’s product.
Podcast episode
A murder mystery: who killed our user experience?: On this sponsored episode of the Stack Overflow Podcast, we talk with Greg Leffler of Splunk about the keys to instrumenting an observable system and how the OpenTelemetry standard makes observability easier, even if you aren’t using Splunk’s product.
byThe Stack Overflow Podcast
0 ratings
0% found this document useful
Micro Services vs Monoliths With Jan Machacek: I don't know a lot about micro services. Like how to design them and what the various caveats and anti-patterns are. I'm currently working on a project that involves decomposing a monolithic application into separate parts, integrated...
Podcast episode
Micro Services vs Monoliths With Jan Machacek: I don't know a lot about micro services. Like how to design them and what the various caveats and anti-patterns are. I'm currently working on a project that involves decomposing a monolithic application into separate parts, integrated...
byCoRecursive: Coding Stories
0 ratings
0% found this document useful
#88 - Observability Engineering - Liz Fong-Jones
Podcast episode
#88 - Observability Engineering - Liz Fong-Jones
byTech Lead Journal
0 ratings
0% found this document useful
288: Turing Complete Sed: Software will never fix Spectre-type bugs, a proof that sed is Turing complete, managed jails using Bastille, new version of netdata, using grep with /dev/null, using GMail with mutt, and more.
Podcast episode
288: Turing Complete Sed: Software will never fix Spectre-type bugs, a proof that sed is Turing complete, managed jails using Bastille, new version of netdata, using grep with /dev/null, using GMail with mutt, and more.
byBSD Now
0 ratings
0% found this document useful
Build Your Second Brain One Piece At A Time: Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use. In this episode he explains the data collection and preparation process, the collection of model types and sizes that work together to power the experience, and how to incorporate it into your workflow to act as a second brain.
Podcast episode
Build Your Second Brain One Piece At A Time: Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful collection of tools that complements the tools that developers already use. In this episode he explains the data collection and preparation process, the collection of model types and sizes that work together to power the experience, and how to incorporate it into your workflow to act as a second brain.
byData Engineering Podcast
0 ratings
0% found this document useful
Episode 155: Testing PoW Consensus Algorithm Security with Ren Zhang from Nervos: In this week’s episode, we revisit the topic of Consensus Algorithms with Ren Zhang, a researchers at Nervos and previously at imec-COSIC (KU Leuven). We chat about an earlier work he did on evaluating PoW consensus protocols security and explore his more recent work on NC-Max - a consensus protocol that breaks the throughput limit and enables the full utilization of the nodes’ bandwidth in confirming transactions
Podcast episode
Episode 155: Testing PoW Consensus Algorithm Security with Ren Zhang from Nervos: In this week’s episode, we revisit the topic of Consensus Algorithms with Ren Zhang, a researchers at Nervos and previously at imec-COSIC (KU Leuven). We chat about an earlier work he did on evaluating PoW consensus protocols security and explore his more recent work on NC-Max - a consensus protocol that breaks the throughput limit and enables the full utilization of the nodes’ bandwidth in confirming transactions
byZero Knowledge
0 ratings
0% found this document useful
Data Observability Out Of The Box With Metaplane: An interview with Kevin Hu about his work on Metaplane to make implementing data observability practices as low friction as possible for data teams and organizations.
Podcast episode
Data Observability Out Of The Box With Metaplane: An interview with Kevin Hu about his work on Metaplane to make implementing data observability practices as low friction as possible for data teams and organizations.
byData Engineering Podcast
0 ratings
0% found this document useful
Building with ATF up front with Harshdeep Garg and Raghuveer Moorthy
Podcast episode
Building with ATF up front with Harshdeep Garg and Raghuveer Moorthy
byBreak Point
0 ratings
0% found this document useful
Building with ATF up front with Harshdeep Garg and Raghuveer Moorthy
Podcast episode
Building with ATF up front with Harshdeep Garg and Raghuveer Moorthy
byServiceNow Podcasts
0 ratings
0% found this document useful
Potluck - Web components × Gear × Docker × Web Dev Frameworks × Golden Handcuffs × Browser Testing × SSR React × Code Prediction × More!: It’s another Potluck! In this episode, Scott and Wes answer your questions about web components, gear, Docker, web dev frameworks, golden handcuffs, browser testing, SSR React, code prediction, and more! Sanity - Sponsor is a real-time...
Podcast episode
Potluck - Web components × Gear × Docker × Web Dev Frameworks × Golden Handcuffs × Browser Testing × SSR React × Code Prediction × More!: It’s another Potluck! In this episode, Scott and Wes answer your questions about web components, gear, Docker, web dev frameworks, golden handcuffs, browser testing, SSR React, code prediction, and more! Sanity - Sponsor is a real-time...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Mastering Back-End Functionalities and Development with AWS Amplify - JSJ 619
Podcast episode
Mastering Back-End Functionalities and Development with AWS Amplify - JSJ 619
byJavaScript Jabber
0 ratings
0% found this document useful
Encore Episode: Deep Learning: Did you know that the concept of deep learning goes way back to the 1950s? However, it is only in recent years that this technology has created a tremendous amount of buzz (and for good reason!). A subset of machine learning, deep learning is inspired...
Podcast episode
Encore Episode: Deep Learning: Did you know that the concept of deep learning goes way back to the 1950s? However, it is only in recent years that this technology has created a tremendous amount of buzz (and for good reason!). A subset of machine learning, deep learning is inspired...
byOracle University Podcast
0 ratings
0% found this document useful
Impact of New US National Cybersecurity Strategy on Organizations Building With OSS - Donald Fischer - ESW #312: Overall increase in government regulations. EU as well. Shift in liability from consumers to organizations.How to take advantage of safe harbor protections and reduce organizational risk and liability. NIST SSD Framework - how do you understand the...
Podcast episode
Impact of New US National Cybersecurity Strategy on Organizations Building With OSS - Donald Fischer - ESW #312: Overall increase in government regulations. EU as well. Shift in liability from consumers to organizations.How to take advantage of safe harbor protections and reduce organizational risk and liability. NIST SSD Framework - how do you understand the...
bySecurity Weekly Podcast Network (Video)
0 ratings
0% found this document useful
Impact of New US National Cybersecurity Strategy on Organizations Building With OSS - Donald Fischer - ESW #312: Overall increase in government regulations. EU as well. Shift in liability from consumers to organizations.How to take advantage of safe harbor protections and reduce organizational risk and liability. NIST SSD Framework - how do you understand the...
Podcast episode
Impact of New US National Cybersecurity Strategy on Organizations Building With OSS - Donald Fischer - ESW #312: Overall increase in government regulations. EU as well. Shift in liability from consumers to organizations.How to take advantage of safe harbor protections and reduce organizational risk and liability. NIST SSD Framework - how do you understand the...
byEnterprise Security Weekly (Video)
0 ratings
0% found this document useful
Episode 398: JSJ 393: Why You Should Be Using Web Workers with Surma
Podcast episode
Episode 398: JSJ 393: Why You Should Be Using Web Workers with Surma
byJavaScript Jabber
0 ratings
0% found this document useful
BEST-OF-BRAD: Using Top Tier Solutions to Build Hybrid Cloud Ecosystems with Brad Feakes: Today on What the Duck?!, we’re ducking around with Brad Feakes, an expert in operations, supply chain management, and information technology. Brad sits down with Host, Sarah Scudder, to discuss the use of best-of-breed solutions to build hybrid Cloud ecosystems that support ERP customer needs. Brad shares his personal and professional journey, including his education, career choices, and his experience working with ERP systems in manufacturing companies. They also touch upon Brad's role as a business analyst and his involvement in implementing Epicor as company-wide ERP system and his current role at EstesGroup.
Podcast episode
BEST-OF-BRAD: Using Top Tier Solutions to Build Hybrid Cloud Ecosystems with Brad Feakes: Today on What the Duck?!, we’re ducking around with Brad Feakes, an expert in operations, supply chain management, and information technology. Brad sits down with Host, Sarah Scudder, to discuss the use of best-of-breed solutions to build hybrid Cloud ecosystems that support ERP customer needs. Brad shares his personal and professional journey, including his education, career choices, and his experience working with ERP systems in manufacturing companies. They also touch upon Brad's role as a business analyst and his involvement in implementing Epicor as company-wide ERP system and his current role at EstesGroup.
byWhat the Duck - Another Supply Chain Podcast
0 ratings
0% found this document useful

Skip carousel

The Deep Learning Revolution For Artificial Intelligence
Facility Management
Article
The Deep Learning Revolution For Artificial Intelligence
Mar 28, 2019
3 min read
“There’s A Big Difference Between Research Work And The Risk You’re Likely To Be Exposed To”
PC Pro Magazine
Article
“There’s A Big Difference Between Research Work And The Risk You’re Likely To Be Exposed To”
Aug 7, 2022
Most cyber-scare stories have more in common with horror fiction than practical reality, and I’m not talking purely about the hyped-up cyber-warfare stuff that appears online. Me being me, I’m focused on the hacking threat stuff. Regular readers of m
6 min read
The Razor’s Edge
Linux Format
Article
The Razor’s Edge
Mar 10, 2020
10 min read
Remote Support Software 2023
PC Pro Magazine
Article
Remote Support Software 2023
Sep 7, 2023
3 min read
Opinion: Why Brain Decoding Is Not Mind Reading — And Why That Matters
STAT
Article
Opinion: Why Brain Decoding Is Not Mind Reading — And Why That Matters
Jun 8, 2023
1 min read
Algorithm Cuts 3D Printing Time Ii Half
Futurity
Article
Algorithm Cuts 3D Printing Time Ii Half
Nov 10, 2017
2 min read
Remote AI
Residential Tech Today
Article
Remote AI
Jun 28, 2019
Artificial Intelligence (AI) is changing our world at a dizzying pace, promising to improve lives and make us all better, faster, and stronger (or unemployed!). I spend a considerable amount of time studying where AI might impact the smart home, part
4 min read
Tool Finds Software Update Bugs In Hours, Not Days
Futurity
Article
Tool Finds Software Update Bugs In Hours, Not Days
Feb 13, 2020
2 min read
The Best Privacy And Security Apps For Android
Android Advisor
Article
The Best Privacy And Security Apps For Android
May 3, 2023
10 min read
Remote Support Software 2022
PC Pro Magazine
Article
Remote Support Software 2022
Sep 11, 2022
4 min read
Network-monitoring software 2024
PC Pro Magazine
Article
Network-monitoring software 2024
Feb 8, 2024
4 min read
8 Network Security For Your Home And Office
Techfastly
Article
8 Network Security For Your Home And Office
Nov 30, 2020
7 min read
Remote-support software 2021
PC Pro Magazine
Article
Remote-support software 2021
Sep 9, 2021
3 min read
Little Snitch 4 Review: Mac App Excels at Monitoring and Controlling Network Activity
MacWorld
Article
Little Snitch 4 Review: Mac App Excels at Monitoring and Controlling Network Activity
Oct 13, 2017
6 min read
Edit Video Like It’s Text With This Algorithm
Futurity
Article
Edit Video Like It’s Text With This Algorithm
Jun 13, 2019
3 min read
Algorithm Predicts Epileptic Seizures in Real-Time
Futurity
Article
Algorithm Predicts Epileptic Seizures in Real-Time
Apr 25, 2017
Engineering students have created a system designed to prevent seizures caused by epilepsy, a neurological disorder affecting millions. First, the team needed to develop a seizure-prediction algorithm. The students created a machine-learning algorith
2 min read
The Security Dilemma Of Iot Devices And Potential Consequences
HWM Singapore
Article
The Security Dilemma Of Iot Devices And Potential Consequences
Jan 10, 2021
3 min read
Remote Support Software 2020
PC Pro Magazine
Article
Remote Support Software 2020
Aug 13, 2020
3 min read
Circuit Programs Human Cells to Add and Subtract
Futurity
Article
Circuit Programs Human Cells to Add and Subtract
Apr 15, 2017
A new platform offers a fast and more efficient way to target and program mammalian cells as genetic circuits, even complex ones. “The problem synthetic biologists are trying to solve is how we ask cells to make decisions and try to design a strategy
2 min read
Supply Chain Attacks
TechLife
Article
Supply Chain Attacks
Aug 23, 2021
4 min read
Why Did Obama Just Honor Bug-free Software?
Nautilus
Article
Why Did Obama Just Honor Bug-free Software?
Dec 21, 2016
6 min read
How To Stay Secure When All Your Devices Rely On The Internet Of Things
Chicago Tribune
Article
How To Stay Secure When All Your Devices Rely On The Internet Of Things
Aug 16, 2018
3 min read
Secure Mobile Comms
RECOIL OFFGRID
Article
Secure Mobile Comms
Dec 6, 2022
9 min read
Contributing For Non - Coders
Linux Format
Article
Contributing For Non - Coders
Jan 10, 2023
9 min read
For More Trustworthy AI, We May Need an ‘Interpreter’
Futurity
Article
For More Trustworthy AI, We May Need an ‘Interpreter’
Jul 6, 2017
A team of researchers is working to build trust between humans and artificial intelligence (AI) by creating an “interpreter” that can explain how an AI arrived at the answer to a specific question. In an age of self-driving cars and autonomous drones
4 min read
Exclusive Downloads
APC
Article
Exclusive Downloads
Nov 6, 2023
Download these APC exclusives free from: www.apcmag.com/exclusives AUTO-CROP AND EDIT OBJECTS AND SWAP BACKGROUNDS. Ashampoo Background Remover crops persons and objects automatically thanks to smart object detection – no manual editing required in m
2 min read
Build A Club On The Next-gen Web
Linux Format
Article
Build A Club On The Next-gen Web
Aug 23, 2022
OUR EXPERT Onthe current web there are a few, enormous companies that dominate your activities and collect your data. For many of us this is a worrying development that we need to do something about. One technical solution is to develop a new versio
9 min read
Device Gauges Hand Gestures From Arm Signals
Futurity
Article
Device Gauges Hand Gestures From Arm Signals
Dec 29, 2020
3 min read
Sophos Intercept X Advanced
PC Pro Magazine
Article
Sophos Intercept X Advanced
Nov 11, 2021
2 min read
Prevent Drive-by Hacking
Maximum PC
Article
Prevent Drive-by Hacking
Jun 23, 2020
3 min read

Related categories

Skip carousel

Reviews for Deep Learning

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Deep Learning - Gyanendra Verma

Deep Learning: History and Evolution

Jaykumar Suraj Lachure¹, *, Gyanendra Verma¹, Rajesh Doriya¹

¹ National Institute of Technology Raipur, Raipur, India

Abstract

Recently, deep learning (DL) computing has become more popular in the machine learning (ML) community. In the field of ML, the most widely used computational approach is DL. It can solve many complex problems, cognitive tasks, and matching problems without any human performance or interface. ML cannot handle large amounts of data and DL can easily handle it. In the last few years, the field of DL has witnessed success in a range of applications. DL outperformed in many application domains, e.g., robotics, bioinformatics, agriculture, cybersecurity, natural language processing (NLP), medical information processing, etc. Despite various reviews on the state of the art in DL, they all concentrated on a single aspect of it, resulting in a general lack of understanding. There is a need to provide a better beginning point for comprehending DL. This paper aims to provide a more comprehensive overview of DL, including current advancements. This paper discusses the importance of DL and introduces DL approaches and networks. It then explains convolutional neural networks (CNNs), the most widely used DL network type and subsequent evolved model starting with LeNET, AlexNet with the Letnet-5, AlexNet, GoogleNet, and ResNet networks, and ending with the High-Resolution network. This paper also discusses the difficulties and solutions to help researchers recognize research gaps for DL applications.

Keywords: Convolution neural network, Deep learning applications, Deep Learning, Image classification, Machine Learning, Medical image analysis.Natural Language Processing.

* Corresponding author Jaykumar Suraj Lachure: National Institute of Technology Raipur, India; E-mail: jaykuamrlachure@gmail.com

INTRODUCTION

In the last decade, machine learning (ML) models [1-3] have been widely used in every field and have been applied in versatile applications like classification, image/video retrieval, text mining, multimedia, anomaly detection, attack detection, video recommendation, image classification, etc. Nowadays, deep learning (DL) is frequently employed in comparison to other machine learning methods. DL stands for representative learning. The unpredictable expansion of

DL and distributed learning necessitates ongoing study. Deep and distributed learning studies are continuing to emerge as a result of unanticipated advances in data availability and huge advancements in hardware technologies such as High-Performance Computing (HPC). DL is a Neural Network (NN) that outperforms its predecessors. DL also employs transformations and graph technology to create multi-layer learning models. In fields such as Natural Language Processing (NLP), data processing, visual data processing, and audio and speech processing, the most recent DL techniques have achieved extraordinary performance. The representation of input data is often what determines the success of an ML approach. A proper data representation outperforms a poor data representation. Thus, for many years, feature engineering has been a prominent study topic in ML. This method helps to build features from raw data. It also involves a lot of human effort and is quite field-specific. These are the scale-invariant feature transform (SIFT), histogram of oriented gradients (HOG), and bag of words (BoW).

The DL algorithms automatically extract features, and this helps researchers extract discriminative features with minimal human effort and field knowledge. A multi-layer data representation architecture extracts low-level features at the first layer, while the last layer extracts high-level features. Artificial Intelligence (AI) is the basis of all technology, including ML, DL, and NLP, etc., which processes data for particular applications, much like in the human brain's basic sensory regions. The human brain can automatically derive data representation using different scenes. This procedure's output is the classified objects, while the input is the incoming scene information. This mimics the human brain's workings. Thus, it accentuates DL's key advantage.

Due to its significant success, DL is presently one of the most important research fashions in ML. Architectures, issues, computational tools, the evolution matrix, and applications are all significant elements in DL. In DL networks, convolutional neural networks (CNN) are widely employed. CNN automatically finds key features, making it the most widely used. Therefore, we delved deep into CNN by showing its core elements. From the AlexNet network to the GoogleNet with high-resolution network, each uses the most prevalent CNN topologies.

Several deep learning models have solely dealt with one application or issue in recent years, such as examining CNN architectures or deep learning. There are different applications like autonomous machines, deep learning for plant disease detection and classification, deep learning for security and malicious attack detection, and so on. Table 1 shown below provides a few domains and applications of DL. Prior to diving into DL applications, it is important to grasp the concepts, problems, and benefits of DL. Learning DL to address research gaps and applications takes a lot of time and research. Our proposal is to conduct an extensive review of DL to provide a better starting point for a comprehensive grasp of DL.

Table 1 Different Domains of DL and Applications.

For our review, we focused on open challenges, computational tools, and applications. This review can also be a springboard for further DL discussions.

The review helps individuals learn more about recent breakthroughs in DL research, which will help them grow in the field. In order to deliver precise alternatives to the field, researchers would be given greater autonomy. Here are our contributions:

This review aids researchers and students in gaining comprehensive knowledge about DL.

We will describe the historical overview of neural networks.

We discuss deep learning approaches using Deep Feedforward Neural Networks, Deep Backward Neural Networks, and CNN, as well as their concepts, theories, and current architectures.

We describe the different CNN architectures like AlexNet, GoogleNet, and ResNet.

We describe deep learning models that use auto-encoders, long short-term memory, and a deep belief network architecture.

The rest of the paper is organized as follows: A description of neural networks and its fundamental structure is given in Section 2. Section 3 provides the different neural network architectures. Section 4 discusses the detailed study of CNN and its components, with different architectures of CNN models. Section 5 discusses the different DL models with a time-series base and a deep belief network. Section 6 concludes with the discussion of DL.

OVERVIEW OF THE NEURAL NETWORK

Over the years, many people have contributed to the development of neural networks [2, 4, 5]. Given the current spike in interest in DL, it's not surprising that credit for substantial advancements is being contested. The following is an overview of the most significant contributions in an objective manner. McCulloch and Pitts developed the first mathematical neuron model in 1943. However, this model does not attempt to replicate the biophysical mechanism of an actual neuron. Intriguingly, this model omitted education. Hebb developed the concept of physiologically driven learning in neural networks in 1949. Hebbian learning is an unsupervised neural network learning technique. Rosenblatt introduced the Perceptron in 1957. A perceptron is a single-layer based neural network that can be used to classify a perceptron. It uses the Heaviside activation function in the current ANN language. Widrow and Hoff introduced the delta-learning rule for learning a perceptron. To update the neurons' weights, the delta-learning rule uses gradient descent. It is a back propagation algorithm variation. To train neural networks, Ivakhnenko invented the Group Method of Data Handling (GMDH) in 1968. These networks were the first feedforward multilayer perceptron deep learning networks. In 1971, the first 8-layer deep GMDH net was used with the number of layers. Each level contains units per layer that could be learned rather than predetermined.

A perceptron cannot learn XOR since it is not linearly separable. In 1974, the error back propagation (BP) algorithm was proposed for weighted learning in a supervised manner. Fukushima introduced the Neocognitron in 1980. The Neocognitron is viewed as a deep neural network in the same vein as the deep GMDH networks (DNN). The D-FFNNs (Deep Feedforward Neural Networks) are the ancestors of this network, and it has a similar design. In 1982, Hopfield developed the Hopfield Network, which is also known as a content-addressable memory neural network. Recurrent neural networks are similar to Hopfield networks. In the given example, backpropagation resurfaced in 1986, and this learning technique can build meaningful internal representations for broad neural network learning tasks.

Terry Sejnowski created NETtalk in 1987. That programme improved over time in pronouncing English words. In 1989, the back propagation (CNN) first did handwritten digit learning. Hochreiter studied a basic issue in 1991 when training a deep learning network via backpropagation. According to his research, backpropagation signals either drop or rise without limits. In the event of a decline, the network depth is proportionate. also called the vanishing or bursting gradient issue. Pre-training Recurrent Neural Network (RNN) unsupervised to speed up future supervised learning was suggested in 1992 as a partial solution. The RNN investigated contained over 1000 layers. In 1995, Wang and Terman introduced oscillatory neural networks.

Image and audio segmentation, as well as time series production, are examples of applications. In 1997, Long Short-Term Memory (LSTM) was proposed by Hochreiter and Schmidhuber, which is a supervised model for learning recurrent neural networks (RNNs). LSTM networks avoid decaying error signals between layers.

It was integrated with backpropagation to improve learning at CNN in 1998. It was therefore created to classify handwritten numbers on checks using LeNet-5, which typically contains a 7-level convolutional network. The greedy layer-wise approach was used to train the model and was demonstrated by Hinton et al. in 2006. The third wave of neural networks popularised the phrase deep learning.

In 2012, CNN, with a GPU, AlexNet, beat LeNet5 to win the ImageNet Large Scale Visual Recognition Challenge. In 2014, Goodfellow et al. introduced generative adversarial networks. Two neural networks battle in the fashion of a game mode. Overall, this creates a generative model that can produce fresh data. This is the evolution of the Hopfield network to CNN and other CNN architectures that have been replaced over the years.coolest machine learning idea in 20 years, according to Yann LeCun. With deep neural networks, Yoshua Bengio, Yann LeCun, and Geoffrey Hinton won the Turing Award in 2019.

The Neural Network's Basic Structure

Artificial Neural Networks (ANNs) are basic mathematical models based on how the brain works [6]. However, the models discussed below are not biologically realistic. Instead, these models analyse the data. The different neural models are explained as follows:

Artificial Neuron Model with FFNN

Any neural network starts with a neuron model (Fig. 1) depicts an artificial neuron model. In a neuron model, the basic input, x, is feed with weighted w and bias b to summarized [7]. Assume that the input vector Rn and the weight vector w are both vectors, with n equal to the input dimension N. The bias term is not always existing and might be remove. They are added together to create the an activation function argument, giving the neuron model's output:(z)=wTx+b. Only the argument of provides a linear discriminant function. The activation function is identified as transfer or unit function or transforms z nonlinearly.

The ReLU activation function is termed as a rectifier and most widely used in DNNs. The softmax function:

Fig. (1))

Artificial Neuron Model.

The softmax maps an n-dimensional x to an n-dimensional y. Therefore, y represents the probability for each of the n elements. It is sometimes used as the last layer in a network. The activation function uses the Heaviside step function in the perceptron model. The neurons must be connected in NN. A feedforward arrangement in its simplest form is shown in Fig. (2) and Fig. (3)., which illustrate the shallow and deep architecture of NN.

Fig. (2))

Shallow Architecture of NN.

Fig. (3))

Deep Architecture of NN.

Generalized deepness of a network in NN is the sum of non-linear revolutions between the layers that are separated, whereas hidden layer width is the number of hidden neurons. Fig. (2) has a single hidden layer, whereas Fig. (3) has a three number of hidden layers. The depths for the shallow and deep architectures of NN are two and four. Debatable, however, topologies with two layers are called shallow and those with more than two hidden layers are typically called deep in Feedforward Neural Networks (FFNN).

The activation functions of a feedforward neural network (FNN) might be linear or non-linear. The NN lacks any cycles that would permit direct input. How an MLP gets its output from its input.

Equation (3) illustrates the neural network's discriminant function. An optimization method to find the optimal parameters for training data sets with a cost function or an error function is being developed.

Recurrent Neural Networks: The RNN family has 2 subclasses that are able to be identified by their characteristics of signal processing [8]. The first type is composed of Finite Recurrent Networks (FRN), whereas the second type is composed of Infinite Impulse Recurrent Networks (IIRN). However, an FRN comes under a directed acyclic graph (DAG) type that may be unrolled and replaced by a FNN, whereas an IIRN comes under a directed cyclic graph (DCG) that cannot be unrolled.

Hopfield Network: A Hopfield Network is an example of a FRN. It is a network of McCulloch-Pitts neurons that is entirely connected. For a

McCulloch-Pitts neuron, the activation function is as:

The activation neuron of the function is as:

xiis updated synchronously or asynchronously with the xj.wijis updated weight for updating the xi value for sign value.

Boltzmann Machine: It uses a noisy Hopfield network with a probabilistic-based activation function. From Eq. 7, it is shown that probability is updated with an update from Eq. 5. This model is significant as it was one of the first to use hidden units. The contrastive-divergence algorithm is used to train Boltzmann Machines.

Boltzmann Machines are two-layered neural networks with visible and hidden layers.

The edges between the two layers are undirected within the graph, which implies information could flow in both directions. The network is completely connected, which means every neuron is connected to another through undirected edges Fig. (4) shows how to transform the Boltzmann machine into an RBM [9]. RBM is a basic structure used in many applications and for creating different networks. (Table 2) provides the usage of models and their working nature, not the comparison. Each model in the table performs differently for different domains.

Fig. (4))

Conversion of Boltzmann Machine to Restricted Boltzmann machine (RBM).

Table 2 Deep Learning Models and its Learning Algorithms.

DEEP LEARNING NEURAL NETWORK

The neural network consists of deep layers of neurons [10]. The neurons must constantly learn to tackle tasks or to apply in different ways to produce better results. It learns every time based on new updated information. A deep neural network uses multiple layers of nodes to extract high-level functions from incoming data [1, 4]. It means changing data into something more creative and abstract. The Deep Forward Neural Networks (DFNN) are explained as below:

A Deep Forward Neural Network

A FNN contains a set of neurons and a hidden layer for any continuous function. The reason for adopting an FFNN with multiple hidden layers is that it uses the universal approximation theorem, which does not explain how to learn such a network. A related concern is that the network's diameter can grow exponentially. Unexpectedly, the universal approximation theorem holds for FFNN with a limited number of hidden neurons and numerous hidden layers. So DFFNNs are employed instead of shallow FFNNs for learnability. Approximating an unknown function f* is:

Here, f is a function with a specific family that is reliant on the parameters θ, and ɸ is a non-linear activation function with a single layer. For deep hidden layers, ɸ has the form is as below:

In place of assuming the precise family functions from f, D-FFNNs learn Eq. 9 function by approximating it withɸ, which is approached by the n separate hidden layers.

CNN Architecture and its Components

A CNN [4, 11-13] is a special type of FFNN that uses a combination of convolution layers, ReLU, and pooling layers. These layers are usually combined with several layers of FNN. In traditional ANN, each neuron in a layer is linked to all the neurons in the next layer. Each connection is a parameter in the network, and each connection is how the network works. In CNN, there could be different variables that are not fully connected layers. This significance cuts down on the number of parameters and reduces the operations in the network. All the connections between neurons and local receptive fields use a set of weights, and we call this set of weights a kernel, or core.

Kernel: All the neurons that attach to their local receptive fields will share the same kernel. The neurons' calculations results will be stored in a matrix called the activation map. Weight sharing refers to the fact that CNNs can share their weight. Consequently, different kernels will produce different activation maps, and hyper-parameters can be used to change the number of kernels in the map. The number of weights in a network is proportional to the kernel i.e. to the size of the local receptive field. Fig. (5) shows the typical CNN architecture with 3-channel input. Each channel was connected with a convolution layer, pooling, and then again, convolution, pooling, and merge. The merge layer connects with the fully connected layer (FC) to provide the decision using the softmax function.

Fig. (5))

Typical CNN with 3-Channel input.

The softmax equation is given in eq. 10, where it is calculated to provide the classification based on their threshold values.

The different layers in CNN models are explained as follows:

Convolution layer: A convolution layer is a critical component of a convolutional neural network's architecture. A convolutional layer, like a hidden layer in a conventional neural network, seeks to convert the input to a higher level of abstraction. On the other hand, the convolutional layer, rather than relying on total connectivity to perform calculations between the input and hidden neurons, takes advantage of local connectivity. A convolutional layer slides at least one kernel across the input, convoluting each region. The results are stored in activation maps, which are the outputs of the convolutional layer.

Pooling layer: It is frequently sandwiched between two layers of convolution. By retaining as much information as possible, pooling layers attempt to minimise the input dimension. Additionally, a pooling layer can impart spatial invariance to the network, hence increasing generality. The zero padding, stride, pooling window size, and hyperparameters of a pooling layer. The pooling layer, like the kernel of a convolutional layer, scans the whole input using the specified pooling window size. By pooling with a stride of 2, a window size of 2, and zero padding, the input dimension is halved. Min-pooling, averaging, and more sophisticated methods such as stochastic pooling and fractional max-pooling are examples of pooling procedures. Max pooling is the most commonly used pooling technique, as it efficiently captures picture invariance. Max-pooling is used to get the extreme value from each sub-window.

Fully connected layer: The smallest unit in FFNN is a completely connected layer. Between the penultimate and output layers of a normal CNN, a fully connected layer is frequently added to represent non-linear interactions between input features. However, the numerous criteria given have been questioned recently, posing the possibility of overfitting. It has been used in some CNN architectures instead of linear layers.

DIFFERENT CNN ARCHITECTURE

CNN is a common FFNN model that was designed to recognise visual patterns directly from group or pixel images with minimal preprocessing [11, 14]. An image database, ImageNet, was proposed for object recognition research. An annual software challenge called the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) tests software's ability to detect and classify objects and scenes. Below, we discuss the CNN architectures of ILSVRC's main competitors.

LeNet-5

In 1998, LeNet-5 used a 7-level convolutional network developed by LeCun et al. to classify digits. For processing higher resolution images, it requires a large number of convolutional layers; therefore, processing resources are restricted to computing in Fig. (6).

Fig. (6))

LetNet-5 Architecture.

AlexNet: In 2012, AlexNet surpassed all previous opponents, by cutting the topmost-5 errors from 26% to 15.3%. The AlexNet network was deeper, featured more filters per layer, and stacked convolutional layers were used than in LeNet5.

Enjoying the preview?

Page 1 of 1

Deep Learning: Theory, Architectures and Applications in Speech, Image and Language Processing

About this ebook

Related to Deep Learning

Related ebooks

Intelligence (AI) & Semantics For You

Related podcast episodes

Related articles

Related categories

Reviews for Deep Learning

What did you think?

Book preview

Deep Learning - Gyanendra Verma

Abstract

INTRODUCTION

OVERVIEW OF THE NEURAL NETWORK

The Neural Network's Basic Structure

Artificial Neuron Model with FFNN

DEEP LEARNING NEURAL NETWORK

A Deep Forward Neural Network

CNN Architecture and its Components

DIFFERENT CNN ARCHITECTURE

LeNet-5