Introducing the HTML5 Web Speech API: Your Practical Introduction to Adding Browser-Based Speech Capabilities to your Websites and Online Applications
Ebook · 445 pages · 3 hours


About this ebook

Leverage the power of HTML5 Web Speech API to quickly add voice capabilities to your websites. This project-oriented book simplifies the process of setting up and manipulating the API in the browser using little more than a text editor or free software. 

You'll be presented with a starting toolset that you can use to develop future projects, incorporate into your existing workflow, and take your websites to the next level – reducing the reliance on entering choices through a keyboard and making the overall experience easier for customers.

This excellent resource is perfect for getting acquainted with creating and manipulating browser-based APIs. You don't have to convert your whole work process immediately; you can incorporate as little or as much of the API as you want, and build on this as your skills develop. We live in an age where speed and simplicity are of the essence – this book provides a perfect way to add speech capabilities to your websites, directly in the browser and with the minimum of fuss.

Introducing the HTML5 Web Speech API is the right choice for developers who want to focus on simplicity to produce properly optimized content in modern browsers using tools already in their possession.

What You'll Learn

  • Implement the Web Speech API in a project
  • Explore some of the options for personalizing the APIs for a project
  • Gain an appreciation of pointers around user experience and how this affects the API
  • Understand how to manage issues and security when using the API
  • Work through some example projects, from standalone demos to implementing with other tools or libraries

Who This Book Is For

Website developers who are already familiar with JavaScript, and are keen to learn how to leverage the Web Speech API to quickly add voice-enabled capabilities to a website, using little more than a text editor. It’s ideal for those in agile development teams, where time is of the essence, and the pressure is on to deliver results quickly. 

Language: English
Publisher: Apress
Release date: Apr 7, 2020
ISBN: 9781484257357

    Book preview

    Introducing the HTML5 Web Speech API - Alex Libby

    © Alex Libby 2020

    A. Libby, Introducing the HTML5 Web Speech API, https://doi.org/10.1007/978-1-4842-5735-7_1

    1. Getting Started

    Alex Libby¹ 

    (1) Rugby, UK

    Introducing the APIs

    Hey Alexa, what time is it…?

    In an age of smart assistant (SA) devices, I'll bet that those words are uttered dozens of times a day worldwide – it doesn't matter where; smart assistants have become immensely popular. Indeed, Juniper Research has forecast that the number of smart assistants in use will triple from 2.5 billion at the end of 2018 to 8 billion by 2023. Just imagine – changing TV channels by voice (already possible, and a use expected to grow 120% over the next five years) or simply doing mundane tasks like reordering goods from the likes of Amazon.

    But I digress. Smart assistants are great, but what if we could use the same technology to control functionality in our own websites or online applications? How? I hear you ask. Well, let me introduce the HTML5 Speech API; it uses the same principle as smart assistants, turning speech into text and vice versa. It's available now for use in the browser, albeit still somewhat experimental.

    Initially created back in 2012, but only really coming into full use now, this experimental API can be used to perform all manner of different tasks by the power of voice. How about using it to add products to a shopping cart and pay for them – all done remotely by voice? Adding in speech capabilities opens up some real possibilities for us. Over the course of this book, we’ll explore a handful of these in detail, to show you how we can put this API to good use. Before we do so, there is a little housekeeping we must take care of first. Let’s cover this off now, before we continue our journey through this API.

    If you would like to get into the detail of how this API has been constructed and the standards that browser vendors must follow, then take a look at the W3C guidelines for this API at https://wicg.github.io/speech-api/. Beware – it makes for very dry reading!

    Setting up our development environment

    I’m pretty sure that no one likes admin, but in this instance, there are a couple of tasks we have to perform before we can use the API.

    Don’t worry – they are straightforward. This is what we need to do:

    The API only works in a secure HTTPS environment (yes, don’t even try running it under HTTP – it doesn’t work) – this means we need to have some secure web space we can use for the purposes of our demos. There are several ways to achieve this:

    The simplest is to use CodePen (https://www.codepen.io) – you will need to create an account to save work, but signing up is free if you don't already have an account you can use.

    Do you have any web space available for another project, which could be used temporarily? As long as it can be secured under HTTPS, then this will work for our demos.

    If you happen to be a developer who uses an MS tech stack, you can create an ASP.NET Core web application with Configure for HTTPS selected, and click OK when prompted to trust the self-signed certificate upon running the application. This will work fine for the demos throughout this book.

    You can always try running a local web server – there are dozens available online. My personal favorite is MAMP PRO, available from https://www.mamp.info. It's a paid-for option that runs on Windows and Mac; it makes generating the SSL certificates we need to use a cinch. Alternatively, if you have the likes of Node.js installed, you can use one such as local-web-server (https://github.com/lwsjs/local-web-server), or create your own if you prefer. You will need to create a certificate for it and add it to your certificate store – a handy method for creating the certificate is outlined at https://bit.ly/30RjAD0.

    The next important task is to avail yourself of a suitable microphone – after all, we clearly won’t get very far without one! You may already have one; if not, pretty much any microphone will work fine. My personal preference is to use a microphone/headset combo, as you might for talking over Skype. You should be able to pick up a relatively inexpensive one via Amazon or your local audio store.

    A word of note: if you are a laptop user, you can use any microphone that is built into your laptop. The drawback is that reception won't be as good – you might find yourself having to lean forward an awful lot for the best reception!

    For all of our demos, we’ll use a central project folder – for the purposes of this book, I’ll assume you’ve created one called speech and that it is stored at the root of your C: drive. The exact location is not critical; if you’ve chosen a different location, then you will need to adjust the location accordingly when we come to complete the demos.

    Excellent! With that admin now out of the way, we can focus on the interesting stuff! The HTML5 Speech API (or the API) comes in two parts: the first part is the SpeechSynthesis API, which takes care of reciting back any given text as speech. Second, in comparison – and to coin a phrase – the SpeechRecognition API does pretty much what it says in the name. We can say a phrase, and provided it matches preconfigured text it can recognize, it will perform any number of tasks that we assign on receipt of that phrase.
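    Before we look at each in turn, it can help to see how little code a feature check needs. The sketch below is my own illustration (not code from this book); it tests for both halves of the API, allowing for the webkit prefix that Chrome uses for recognition:

```javascript
// Hypothetical helper: report which halves of the Speech API a
// window-like object exposes. Chrome ships recognition under a
// webkit prefix, so both names are checked.
function detectSpeechSupport(win) {
  return {
    synthesis: 'speechSynthesis' in win,
    recognition: 'SpeechRecognition' in win || 'webkitSpeechRecognition' in win
  };
}
```

    In a browser, you would call detectSpeechSupport(window) and only wire up a demo when the relevant flag is true.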

    We could dive into how they work, but I know you’re itching to get stuck in, right? Absolutely. So without further ado, let’s run through two quick demos, so you get a flavor for how the API works before we use it in projects later in this book.

    Don’t worry though about what it all means – we will absolutely explore the code in detail after each exercise! We’ll look at both in turn, starting first with the SpeechSynthesis API.

    Implementing our first examples

    Although both APIs require a bit of configuration to make them work, they are relatively easy to set up; neither requires the use of any specific frameworks or external libraries for basic operation.

    To see what I mean, I’ve put together two quick demos using CodePen – they demonstrate the basics of what is needed to get started and will form code that we will use in projects later in this book. Let’s take a look at each, in turn, starting with reading back text as speech, using the SpeechSynthesis API.

    Reading back text as speech

    Our first exercise will keep things simple and make use of CodePen to host our code; for this, you will need to create an account if you want to save your work for future reference. If you’ve not used CodePen before, then don’t worry – it’s free to sign up! It’s a great way to get started with the API. We will move to using something more local in subsequent demos.

    All of the code used in examples throughout this book is available in the code download that accompanies this book. We will use a mix of ECMAScript 2015 and vanilla JavaScript in most demos; you may need to adjust if you want to use a newer version of ECMAScript.

    Reading Back Text

    Assuming you’ve signed up and now have a CodePen account you can use, let’s make a start on creating our first example:

    1.

    First, go ahead and fire up your browser, then navigate to https://codepen.io, and sign in with your account details. Once done, click Pen on the left to create our demo.

    2.

    We need to add in the markup for this demo – for this, go ahead and add the following code into the HTML window:

    <link href="https://fonts.googleapis.com/css?family=Open+Sans&display=swap" rel="stylesheet">

    <div id="page-wrapper">

      <h2>Introducing HTML5 Speech API: Reading Text back as Speech</h2>

      <p id="msg"></p>

      <input type="text" name="speech-msg" id="speech-msg">

      <div class="option">

        <label for="voice">Voice</label>

        <select name="voice" id="voice"></select>

        <button id="speak">Speak</button>

      </div>

    </div>

    3.

    Our demo will look very ordinary if we run it now – let alone the fact that it won’t actually work as expected! We can easily fix this. Let’s first add in some rudimentary styles to make our demo more presentable. There are a few styles to add in, so we will do it block by block. Leave a line between each block, when you add it into the demo:

    *, *:before, *:after { box-sizing: border-box; }

    html { font-family: 'Open Sans', sans-serif; font-size: 100%; }

    #page-wrapper { width: 640px; background: #ffffff; padding: 16px; margin: 32px auto; border-top: 5px solid #9d9d9d; box-shadow: 0 2px 10px rgba(0,0,0,0.8); }

    h2 { margin-top: 0; }

    4.

    We need to add in some styles to indicate whether our browser supports the API:

    #msg { font-size: 14px; line-height: 22px; }

    #msg.not-supported strong { color: #cc0000; }

    #msg > span { font-size: 24px; vertical-align: bottom; }

    #msg > span.ok { color: #00ff00; }

    #msg > span.notok { color: #ff0000; }

    5.

    Next up are the styles for the voice drop-down:

    #voice { margin: 0 70px 0 -70px; vertical-align: super; }

    6.

    For the API to have something it can convert to speech, we need to have a way to enter text. For this, add in the following style rules:

    input[type=text] { width: 100%; padding: 8px; font-size: 19px;

    border-radius: 3px; border: 1px solid #d9d9d9; box-shadow: 0 2px 3px rgba(0,0,0,0.1) inset; }

    label { display: inline-block; float: left; width: 150px; }

    .option { margin: 16px 0; }

    7.

    The last element to style is the Speak button at the bottom-right corner of our demo:

    button { display: inline-block; border-radius: 3px; border: none; font-size: 14px; padding: 8px 12px; background: #dcdcdc;

    border-bottom: 2px solid #9d9d9d; color: #000000; -webkit-font-smoothing: antialiased; font-weight: bold; margin: 0; width: 20%; text-align: center; }

    button:hover, button:focus { opacity: 0.75; cursor: pointer; }

    button:active { opacity: 1; box-shadow: 0 -3px 10px rgba(0, 0, 0, 0.1) inset; }

    8.

    With the styles in place, we can now turn our attention to adding the glue to make this work. I don’t mean that literally, but in a figurative sense! All of the code we need to add goes in the JS window of our CodePen; we start with a check to see if our browser supports the API:

    var supportMsg = document.getElementById('msg');

    if ('speechSynthesis' in window) {

      supportMsg.innerHTML = '<span class="ok">☑</span> Your browser supports speech synthesis.';

    } else {

      supportMsg.innerHTML = '<span class="notok">☒</span> Sorry, your browser does not support speech synthesis.';

      supportMsg.classList.add('not-supported');

    }

    9.

    Next up, we define three variables to store references to elements in our demo:

    var button = document.getElementById('speak');

    var speechMsgInput = document.getElementById('speech-msg');

    var voiceSelect = document.getElementById('voice');

    10.

    When using the API, we can relay speech back using a variety of different voices – we need to load these into our demo before we can use them. For this, go ahead and drop in the following lines:

    function loadVoices() {

      var voices = speechSynthesis.getVoices();

      voices.forEach(function(voice, i) {

        var option = document.createElement('option');

        option.value = voice.name;

        option.innerHTML = voice.name;

        voiceSelect.appendChild(option);

      });

    }

    loadVoices();

    window.speechSynthesis.onvoiceschanged = function(e) {

      loadVoices();

    };

    11.

    We come to the real meat of our demo – this is where we see the text we add be turned into speech! For this, leave a line after the previous block, and add in the following code:

    function speak(text) {

      var msg = new SpeechSynthesisUtterance();

      msg.text = text;

      if (voiceSelect.value) {

        msg.voice = speechSynthesis.getVoices()

    .filter(function(voice) {

          return voice.name == voiceSelect.value;

          })[0];

      }

      window.speechSynthesis.speak(msg);

    }

    12.

    We’re almost there. The last step is to add in an event handler that fires off the conversion from text to speech when we hit the Speak button:

    button.addEventListener('click', function(e) {

      if (speechMsgInput.value.length > 0) {

        speak(speechMsgInput.value);

      }

    });

    13.

    Go ahead and save your work. If all is well, we should see something akin to the screenshot shown in Figure 1-1.


    Figure 1-1

    Our completed text-to-speech demo

    Then try typing in some text and hitting the Speak button. If all is working as expected, you will hear your words recited back to you. If you choose a voice from the drop-down, you will hear your words spoken back with an accent; depending on what you type, you may get some very interesting results!

    A completed version of this demo can be found in the code download that accompanies this book – it’s in the readingback folder.

    At this stage, we now have the basic setup in place to allow our browser to read back any text we desire – granted it might still sound a little robotic. However, this is to be expected when working with an API that is still somewhat experimental!

    This aside, I'll bet there are two questions on your mind: How does this API function? And – more to the point – is it safe to use, even though it is still technically an unofficial API? Don't worry – the answers to these questions and more will be revealed later in this chapter. Let's start by exploring how our demo works in more detail.

    Understanding what happened

    If we take a closer look at our code, you might be forgiven for thinking it looks a little complex – in reality though, it is very straightforward.

    We start with some simple HTML markup and styling, to display an input box on the screen for the content to be replayed. We also have a drop-down which we will use to list the available voices. The real magic happens in the script that we’ve used – this starts by performing a check to see if our browser supports the API and displays a suitable message.

    Assuming your browser does support the API (and most browsers from the last 3–4 years will), we then define a number of placeholder variables for various elements on the page. We then (through the loadVoices() function) iterate through the available voices before populating the drop-down with the results. Of particular note is the second call to loadVoices() ; this is necessary as Chrome loads them asynchronously.

    It’s important to note that the extra voices (which start with Chrome…) are added as part of the API interacting with Google and so only appear in Chrome.
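    Because of this asynchronous quirk, you may find it cleaner to wrap the two-step load in a promise. The helper below is a sketch of my own (not the book's code), covering both the synchronous case and Chrome's onvoiceschanged event:

```javascript
// Hypothetical helper: resolve with the voice list once it is available.
// Some browsers return voices synchronously; Chrome populates the list
// asynchronously and fires onvoiceschanged when it is ready.
function voicesReady(synth) {
  return new Promise(function(resolve) {
    var voices = synth.getVoices();
    if (voices.length > 0) {
      resolve(voices);
      return;
    }
    synth.onvoiceschanged = function() {
      resolve(synth.getVoices());
    };
  });
}
```

    In the browser, voicesReady(window.speechSynthesis).then(...) would replace the double call to loadVoices().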

    If we then jump to the end of the demo for a moment, we can see an event handler for the button element; this calls the speak() function, which creates a new instance of the SpeechSynthesisUtterance object that acts as a request to speak. It then checks to see whether we've selected a voice, which we do using the speechSynthesis.getVoices() function. If a voice is selected, the API queues the utterance and renders it as audio via your PC's speakers.
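    The filter used to match the chosen voice is worth pulling out into a small pure function. As a sketch (my own refactoring, not the book's code):

```javascript
// Hypothetical helper mirroring the filter inside speak(): find a
// voice whose name matches the drop-down value, or null if none does
// (in which case the browser falls back to its default voice).
function findVoiceByName(voices, name) {
  var matches = voices.filter(function(voice) {
    return voice.name === name;
  });
  return matches.length > 0 ? matches[0] : null;
}
```

    Inside speak(), this would read: msg.voice = findVoiceByName(speechSynthesis.getVoices(), voiceSelect.value).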

    Okay, let’s move on. We’ve explored the basics of how to render text as speech. This is only half of the story though. What about converting verbal content into text? This we can do by using the SpeechRecognition API – it requires a little more effort, so let’s dive into the second of our two demos to see what’s involved in making our laptops talk.

    Converting speech to text

    The ability to vocalize content through our PC's speakers (or even headphones) is certainly useful, but a little limiting. What if we could ask the browser to perform something using the power of our voice? Well, we can do that using the second of the two Speech APIs. Let me introduce the SpeechRecognition API!

    This sister API allows us to speak into any microphone connected to our PC, for our browser to perform any manner of preconfigured tasks, from something as simple as transcribing tasks to searching for the nearest restaurant to your given location. We’ll explore some examples of how to use this API in projects later in this book, but for now, let’s implement a simple demo so you can see how the API works in action.

    I would not recommend using Firefox when working with demos that use the Speech Recognition API; although documentation on the Mozilla Developer Network (MDN) site suggests it is supported, this isn’t the case, and you will likely end up with a SpeechRecognition is not a constructor error in your console log.

    What Did I Say?

    Let’s crack on with our next exercise:

    1.

    We’ll start by browsing to https://www.codepen.io and then clicking Pen. Make sure you’ve logged in with the account you created back in the first exercise.

    2.

    Our demo makes use of Font Awesome for the microphone icon that you will see in use shortly – for this, we need to add in references for two CSS libraries. Go ahead and click Settings ➤ CSS. Then add in the following links into the spare slots at the bottom of the dialog box:

    https://use.fontawesome.com/releases/v5.0.8/css/fontawesome.css

    https://use.fontawesome.com/releases/v5.0.8/css/solid.css

    3.

    Next, switch to the HTML pane and add the following markup which will form the basis for our demo:

    <link href="https://fonts.googleapis.com/css?family=Open+Sans&display=swap" rel="stylesheet">

    <div id="page-wrapper">

      <h2>Introducing HTML5 Speech API: Converting Speech to Text</h2>

      <button><i class="fa fa-microphone"></i> Click and talk to me!</button>

      <div class="response">

        <span class="output_log"></span>

      </div>

      <div class="output">You said: <span class="output_result"></span></div>

      <div class="voice">Spoken voice: US English</div>

    </div>

    4.

    On its own, our markup certainly won’t win any awards for style! To fix this, we need to add in a few styles to make our demo look presentable. For this, add the following rules into the CSS pane, starting with some basic rules to style the container for our demo:

    *, *:before, *:after { box-sizing: border-box; }

    html { font-family: 'Open Sans', sans-serif; font-size: 100%; }

    #page-wrapper { width: 640px; background: #ffffff; padding: 16px; margin: 32px auto; border-top: 5px solid #9d9d9d; box-shadow: 0 2px 10px rgba(0,0,0,0.8); }

    h2 { margin-top: 0; }

    5.

    Next come the rules we need to style our talk button:

    button { color: #000000; background: #dcdcdc; border-radius: 6px; text-shadow: 0 1px 1px rgba(0, 0, 0, 0.2); font-size: 19px; padding: 8px 16px; margin-right: 15px; }

    button:focus { outline: 0; }

    input[type=text] { border-radius: 6px; font-size: 19px; padding: 8px; box-shadow: inset 0 0 5px #666; width: 300px; margin-bottom: 8px; }

    6.

    Our next rule makes use of Font Awesome to display a suitable microphone icon on the talk button:

    .fa-microphone:before { content: "\f130"; }

    7.

    This next set of rules will style the output once it has been transcribed, along with the confidence level and the voice characterization used:

    .output_log { font-family: monospace; font-size: 24px; color: #999; display: inline-block; }

    .output { height: 50px; font-size: 19px; color: #000000; margin-top: 30px; }

    .response { padding-left: 260px; margin-top: -35px; height: 50px; }

    .voice { float: right; margin-top: -20px; }

    8.

    Okay, so we have our markup in place, and it looks reasonably OK. What’s missing? Ah yes, the script to make it all work! For this, go ahead and add in the following code to the JS pane. We have a good chunk of code, so let’s break it down block by block, starting with some variable declarations:

    'use strict';

    const log = document.querySelector('.output_log');

    const output = document.querySelector('.output_result');

    const SpeechRecognition = window.SpeechRecognition || window.webkitSpeechRecognition;

    const recognition = new SpeechRecognition();

    recognition.interimResults = true;

    recognition.maxAlternatives = 1;

    9.

    Next up is an event handler that triggers the microphone. Leave a blank line and then add the following code:

    document.querySelector('button').addEventListener('click', () => {

      let recogLang = 'en-US';

      recognition.lang = recogLang;

      recognition.start();

    });

    10.

    When using the Speech Recognition API, we trigger a number of events to which we must respond; the first one recognizes when we start talking. Go ahead and add the following lines into the JS pane of our CodePen demo:

    recognition.addEventListener('speechstart', () => {

      log.textContent = 'Speech has been detected.';

    });

    11.

    Leave a blank line and then add in these lines – this event handler takes care of recognizing and transcribing anything we say into the microphone, as well as calculating a confidence level for accuracy:

    recognition.addEventListener('result', (e) => {

      log.textContent = 'Result has been detected.';

      let last = e.results.length - 1;

      let text = e.results[last][0].transcript;

      output.textContent = text;

      log.textContent = 'Confidence: ' + (e.results[0][0].confidence * 100).toFixed(2) + '%';

    });

    12.

    We’re almost done, but have two more event handlers to add in – these take care of switching off the Recognition API when we’re done and also displaying any errors on screen if any should appear. Leave a line and then drop in the following code:

    recognition.addEventListener('speechend', () => {

      recognition.stop();

    });

    recognition.addEventListener('error', (e) => {

      output.textContent = 'Error: ' + e.error;

    });

    13.

    At this point, we’re done with editing code. Go ahead and hit the Save button to save our work.

    A completed version of this demo can be found in the code download that accompanies this book – it’s in the whatdidIsay folder.

    At this point, we should be good to run our demo, but if you were to do so, it’s likely that you won’t get any response. How come? The simple reason is that we have to grant permission to use our PC’s microphone from within the browser. It is possible to activate it via the Settings entry in the site’s certificate details, but it’s not the cleanest method. There is a better way to prompt for access, which I will demonstrate in the next exercise.
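    As a rough illustration of what such a prompt involves (a sketch assuming the standard navigator.mediaDevices API, not the exercise code from this book), the request boils down to a guarded call to getUserMedia:

```javascript
// Hypothetical helper: true when the browser exposes the standard
// getUserMedia entry point needed for microphone access.
function canRequestMicrophone(nav) {
  return !!(nav.mediaDevices && typeof nav.mediaDevices.getUserMedia === 'function');
}

// Calling getUserMedia in a browser triggers the permission prompt; the
// promise resolves with an audio stream once the user allows access.
function requestMicrophone(nav) {
  if (!canRequestMicrophone(nav)) {
    return Promise.reject(new Error('getUserMedia is not supported'));
  }
  return nav.mediaDevices.getUserMedia({ audio: true });
}
```

    In the browser, you might chain this before starting recognition, for example: requestMicrophone(navigator).then(function() { recognition.start(); }).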

    Allowing access to the microphone

    When using the Speech API , there is one thing we must bear in mind – access to the microphone will by default be disabled for security reasons; we must explicitly enable it before we can put it to use.

    This is easy to do, although the exact steps will vary between browsers – it involves adding a couple of lines of code to our demo to request access to the microphone and changing a setting once prompted. We’ll
