Introduction to Audio Analysis: A MATLAB® Approach

Ebook453 pages4 hours

Introduction to Audio Analysis: A MATLAB® Approach

Name: Introduction to Audio Analysis: A MATLAB® Approach
Brand: Academic Press
Rating: 4.5 (2 reviews)

By Theodoros Giannakopoulos and Aggelos Pikrakis

Rating: 4.5 out of 5 stars

4.5/5

()

Read preview

About this ebook

Introduction to Audio Analysis serves as a standalone introduction to audio analysis, providing theoretical background to many state-of-the-art techniques. It covers the essential theory necessary to develop audio engineering applications, but also uses programming techniques, notably MATLAB®, to take a more applied approach to the topic. Basic theory and reproducible experiments are combined to demonstrate theoretical concepts from a practical point of view and provide a solid foundation in the field of audio analysis.

Audio feature extraction, audio classification, audio segmentation, and music information retrieval are all addressed in detail, along with material on basic audio processing and frequency domain representations and filtering. Throughout the text, reproducible MATLAB® examples are accompanied by theoretical descriptions, illustrating how concepts and equations can be applied to the development of audio analysis systems and components. A blend of reproducible MATLAB® code and essential theory provides enable the reader to delve into the world of audio signals and develop real-world audio applications in various domains.

Practical approach to signal processing: The first book to focus on audio analysis from a signal processing perspective, demonstrating practical implementation alongside theoretical concepts
Bridge the gap between theory and practice: The authors demonstrate how to apply equations to real-life code examples and resources, giving you the technical skills to develop real-world applications
Library of MATLAB code: The book is accompanied by a well-documented library of MATLAB functions and reproducible experiments

Skip carousel

LanguageEnglish

PublisherAcademic Press

Release dateFeb 15, 2014

ISBN9780080993898

Author

Theodoros Giannakopoulos

Theodoros Giannakopoulos is a Research Associate in the Institute of Informatics and Telecommunications, National Center for Scientific Research DEMOKRITOS, Greece and in the Department of Informatics & Telecommunications of the University of Athens (UOA). He received his Ph.D. degree in Audio Analysis from UOA, in 2009. His main research interests are pattern recognition, data mining, and multimedia analysis.

Related authors

Skip carousel

Related to Introduction to Audio Analysis

Related ebooks

Skip carousel

Digital Signal Processing 101: Everything You Need to Know to Get Started
Ebook
Digital Signal Processing 101: Everything You Need to Know to Get Started
byMichael Parker
Rating: 3 out of 5 stars
3/5
DSP for Embedded and Real-Time Systems
Ebook
DSP for Embedded and Real-Time Systems
byRobert Oshana
Rating: 5 out of 5 stars
5/5
Introduction to Digital Signal Processing
Ebook
Introduction to Digital Signal Processing
byRobert Meddins
Rating: 3 out of 5 stars
3/5
Digital Signal Processing for Audio Applications: Volume 2 - Code
Ebook
Digital Signal Processing for Audio Applications: Volume 2 - Code
byAnton R Kamenov
Rating: 5 out of 5 stars
5/5
Filter Handbook: A Practical Design Guide
Ebook
Filter Handbook: A Practical Design Guide
byStefan Niewiadomski
Rating: 5 out of 5 stars
5/5
Pro Tools HD: Advanced Techniques and Workflows
Ebook
Pro Tools HD: Advanced Techniques and Workflows
byEdouard Camou
Rating: 4 out of 5 stars
4/5
Multimedia Programming Using Max/MSP and TouchDesigner
Ebook
Multimedia Programming Using Max/MSP and TouchDesigner
byPatrik Lechner
Rating: 5 out of 5 stars
5/5
Analog Electronics: Circuits, Systems and Signal Processing
Ebook
Analog Electronics: Circuits, Systems and Signal Processing
byDavid Crecraft
Rating: 0 out of 5 stars
0 ratings
Noise and Vibration Analysis: Signal Analysis and Experimental Procedures
Ebook
Noise and Vibration Analysis: Signal Analysis and Experimental Procedures
byAnders Brandt
Rating: 5 out of 5 stars
5/5
Master Handbook of Acoustics, Seventh Edition
Ebook
Master Handbook of Acoustics, Seventh Edition
byF. Alton Everest
Rating: 0 out of 5 stars
0 ratings
Physical and Applied Acoustics: An Introduction
Ebook
Physical and Applied Acoustics: An Introduction
byErwin Meyer
Rating: 0 out of 5 stars
0 ratings
Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography
Ebook
Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography
byEarl G. Williams
Rating: 0 out of 5 stars
0 ratings
Acoustic Signals and Hearing: A Time-Envelope and Phase Spectral Approach
Ebook
Acoustic Signals and Hearing: A Time-Envelope and Phase Spectral Approach
byMikio Tohyama
Rating: 0 out of 5 stars
0 ratings
Music, Physics and Engineering
Ebook
Music, Physics and Engineering
byHarry F. Olson
Rating: 4 out of 5 stars
4/5
Digital Signal Processing: A Practical Guide for Engineers and Scientists
Ebook
Digital Signal Processing: A Practical Guide for Engineers and Scientists
bySteven Smith
Rating: 5 out of 5 stars
5/5
Digital Signal Processing: Mathematical and Computational Methods, Software Development and Applications
Ebook
Digital Signal Processing: Mathematical and Computational Methods, Software Development and Applications
byJonathan M Blackledge
Rating: 5 out of 5 stars
5/5
An Introduction to Acoustics
Ebook
An Introduction to Acoustics
byRobert H. Randall
Rating: 1 out of 5 stars
1/5
Practical Digital Signal Processing
Ebook
Practical Digital Signal Processing
byEdmund Lai
Rating: 0 out of 5 stars
0 ratings
Back to Basics Audio
Ebook
Back to Basics Audio
byJulian Nathan
Rating: 3 out of 5 stars
3/5
Physics and Music: The Science of Musical Sound
Ebook
Physics and Music: The Science of Musical Sound
byHarvey E. White
Rating: 5 out of 5 stars
5/5
Acoustics: Sound Fields, Transducers and Vibration
Ebook
Acoustics: Sound Fields, Transducers and Vibration
byLeo Beranek
Rating: 4 out of 5 stars
4/5
Applied Digital Signal Processing and Applications
Ebook
Applied Digital Signal Processing and Applications
byOthman Omran Khalifa
Rating: 0 out of 5 stars
0 ratings
JBL Audio Engineering for Sound Reinforcement
Ebook
JBL Audio Engineering for Sound Reinforcement
byJohn M. Eargle
Rating: 5 out of 5 stars
5/5
Audio Engineering: Know It All
Ebook
Audio Engineering: Know It All
byDouglas Self
Rating: 5 out of 5 stars
5/5
Digital Signal Processing Demystified
Ebook
Digital Signal Processing Demystified
byJames D. Broesch
Rating: 5 out of 5 stars
5/5
Sound Foundations Audio Engineering Guide: 20-20 Audio Engineering Reference Guide Late 2019 TROONATNOOR Edition
Ebook
Sound Foundations Audio Engineering Guide: 20-20 Audio Engineering Reference Guide Late 2019 TROONATNOOR Edition
byMarkus Heinrich Rehbach
Rating: 0 out of 5 stars
0 ratings
Newnes Know It All
Ebook series
Newnes Know It All
byClive Maxfield
Acoustics: Sound Fields and Transducers
Ebook
Acoustics: Sound Fields and Transducers
byTim Mellow
Rating: 4 out of 5 stars
4/5
Digital Filters
Ebook
Digital Filters
byRichard W. Hamming
Rating: 4 out of 5 stars
4/5
Analog and Digital Filter Design
Ebook
Analog and Digital Filter Design
bySteve Winder
Rating: 0 out of 5 stars
0 ratings

Technology & Engineering For You

Skip carousel

The Systems Thinker: Essential Thinking Skills For Solving Problems, Managing Chaos,
Ebook
The Systems Thinker: Essential Thinking Skills For Solving Problems, Managing Chaos,
byAlbert Rutherford
Rating: 4 out of 5 stars
4/5
Sneaky Uses for Everyday Things: How to Turn a Penny into a Radio, Make a Flood Alarm with an Aspirin, Change Milk into Plastic, Extract Water and Electricity from Thin Air, Turn on a TV with your Ring, and Other Amazing Feats
Ebook
Sneaky Uses for Everyday Things: How to Turn a Penny into a Radio, Make a Flood Alarm with an Aspirin, Change Milk into Plastic, Extract Water and Electricity from Thin Air, Turn on a TV with your Ring, and Other Amazing Feats
byCy Tymony
Rating: 3 out of 5 stars
3/5
The Art of War
Ebook
The Art of War
bySun Tzu
Rating: 4 out of 5 stars
4/5
The Art of War
Ebook
The Art of War
bySun Tsu
Rating: 4 out of 5 stars
4/5
A Night to Remember: The Sinking of the Titanic
Ebook
A Night to Remember: The Sinking of the Titanic
byWalter Lord
Rating: 4 out of 5 stars
4/5
The Right Stuff
Ebook
The Right Stuff
byTom Wolfe
Rating: 4 out of 5 stars
4/5
The 48 Laws of Power in Practice: The 3 Most Powerful Laws & The 4 Indispensable Power Principles
Ebook
The 48 Laws of Power in Practice: The 3 Most Powerful Laws & The 4 Indispensable Power Principles
byJon Waterlow
Rating: 5 out of 5 stars
5/5
Longitude: The True Story of a Lone Genius Who Solved the Greatest Scientific Problem of His Time
Ebook
Longitude: The True Story of a Lone Genius Who Solved the Greatest Scientific Problem of His Time
byDava Sobel
Rating: 4 out of 5 stars
4/5
The Big Book of Hacks: 264 Amazing DIY Tech Projects
Ebook
The Big Book of Hacks: 264 Amazing DIY Tech Projects
byDoug Cantor
Rating: 4 out of 5 stars
4/5
How to Disappear and Live Off the Grid: A CIA Insider's Guide
Ebook
How to Disappear and Live Off the Grid: A CIA Insider's Guide
byJohn Kiriakou
Rating: 0 out of 5 stars
0 ratings
Vanderbilt: The Rise and Fall of an American Dynasty
Ebook
Vanderbilt: The Rise and Fall of an American Dynasty
byAnderson Cooper
Rating: 4 out of 5 stars
4/5
Death in Mud Lick: A Coal Country Fight against the Drug Companies That Delivered the Opioid Epidemic
Ebook
Death in Mud Lick: A Coal Country Fight against the Drug Companies That Delivered the Opioid Epidemic
byEric Eyre
Rating: 4 out of 5 stars
4/5
The Big Book of Maker Skills: Tools & Techniques for Building Great Tech Projects
Ebook
The Big Book of Maker Skills: Tools & Techniques for Building Great Tech Projects
byChris Hackett
Rating: 4 out of 5 stars
4/5
The Invisible Rainbow: A History of Electricity and Life
Ebook
The Invisible Rainbow: A History of Electricity and Life
byArthur Firstenberg
Rating: 4 out of 5 stars
4/5
Digital Minimalism - Summarized for Busy People: Choosing a Focused Life in a Noisy World: Based on the Book by Cal Newport
Ebook
Digital Minimalism - Summarized for Busy People: Choosing a Focused Life in a Noisy World: Based on the Book by Cal Newport
byGoldmine Reads
Rating: 4 out of 5 stars
4/5
Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career
Ebook
Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career
byScott H. Young
Rating: 4 out of 5 stars
4/5
80/20 Principle: The Secret to Working Less and Making More
Ebook
80/20 Principle: The Secret to Working Less and Making More
byPaul J. Stanley
Rating: 5 out of 5 stars
5/5
Electrical Engineering 101: Everything You Should Have Learned in School...but Probably Didn't
Ebook
Electrical Engineering 101: Everything You Should Have Learned in School...but Probably Didn't
byDarren Ashby
Rating: 5 out of 5 stars
5/5
The Fast Track to Your Technician Class Ham Radio License: For Exams July 1, 2022 - June 30, 2026
Ebook
The Fast Track to Your Technician Class Ham Radio License: For Exams July 1, 2022 - June 30, 2026
byMichael Burnette, AF7KB
Rating: 5 out of 5 stars
5/5
Summary of Nicolas Cole's The Art and Business of Online Writing
Ebook
Summary of Nicolas Cole's The Art and Business of Online Writing
byIRB Media
Rating: 4 out of 5 stars
4/5
Logic Pro X For Dummies
Ebook
Logic Pro X For Dummies
byGraham English
Rating: 0 out of 5 stars
0 ratings
The Basics of Bitcoins and Blockchains: An Introduction to Cryptocurrencies and the Technology that Powers Them (Cryptography, Derivatives Investments, Futures Trading, Digital Assets, NFT)
Ebook
The Basics of Bitcoins and Blockchains: An Introduction to Cryptocurrencies and the Technology that Powers Them (Cryptography, Derivatives Investments, Futures Trading, Digital Assets, NFT)
byAntony Lewis
Rating: 4 out of 5 stars
4/5
Selfie: How We Became So Self-Obsessed and What It's Doing to Us
Ebook
Selfie: How We Became So Self-Obsessed and What It's Doing to Us
byWill Storr
Rating: 4 out of 5 stars
4/5
The CIA Lockpicking Manual
Ebook
The CIA Lockpicking Manual
byCentral Intelligence Agency
Rating: 5 out of 5 stars
5/5
Understanding Media: The Extensions of Man
Ebook
Understanding Media: The Extensions of Man
byMarshall McLuhan
Rating: 4 out of 5 stars
4/5
My Inventions: The Autobiography of Nikola Tesla
Ebook
My Inventions: The Autobiography of Nikola Tesla
byNikola Tesla
Rating: 4 out of 5 stars
4/5
Summary of Empire of Pain: by Patrick Radden Keefe - The Secret History of the Sackler Dynasty - A Comprehensive Summary
Ebook
Summary of Empire of Pain: by Patrick Radden Keefe - The Secret History of the Sackler Dynasty - A Comprehensive Summary
byAlexander Cooper
Rating: 3 out of 5 stars
3/5
Artificial Intelligence: A Guide for Thinking Humans
Ebook
Artificial Intelligence: A Guide for Thinking Humans
byMelanie Mitchell
Rating: 4 out of 5 stars
4/5
The Wuhan Cover-Up: And the Terrifying Bioweapons Arms Race
Ebook
The Wuhan Cover-Up: And the Terrifying Bioweapons Arms Race
byRobert F. Kennedy, Jr.
Rating: 0 out of 5 stars
0 ratings
Rust: The Longest War
Ebook
Rust: The Longest War
byJonathan Waldman
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Podcast 270: Maurizio Giri: Talking Max, MFL and Composition with Maurizio Giri
Podcast episode
Podcast 270: Maurizio Giri: Talking Max, MFL and Composition with Maurizio Giri
byArt + Music + Technology
0 ratings
0% found this document useful
#43: Three Vocal Delay Tricks to Improve Your Mix
Podcast episode
#43: Three Vocal Delay Tricks to Improve Your Mix
byInside The Mix | Music Production and Mixing Tips for Music Producers and Artists
0 ratings
0% found this document useful
#1: Lessons From My Journey
Podcast episode
#1: Lessons From My Journey
byMusic Production Podcast
0 ratings
0% found this document useful
Modern Day Music Theory with Ryan Miyakawa
Podcast episode
Modern Day Music Theory with Ryan Miyakawa
byModern Musician
0 ratings
0% found this document useful
Tim Anderson, “Popular Music in a Digital Music Economy” (Routledge, 2014): Since the 1990s, the music industry has been going through a massive transformation. After World War II, the primary way audiences participated in the music business in the period between 1945 and 1990 was by purchasing records and attending concerts.
Podcast episode
Tim Anderson, “Popular Music in a Digital Music Economy” (Routledge, 2014): Since the 1990s, the music industry has been going through a massive transformation. After World War II, the primary way audiences participated in the music business in the period between 1945 and 1990 was by purchasing records and attending concerts.
byNew Books in Economics
0 ratings
0% found this document useful
Harnessing Python for Research: Scientific Applications of Python with Michael Kennedy: Still scrabbling with Excel? Consider Python language uses, says programmer and podcaster Michael Kennedy. A general programming language that is easy to use in multiple environments, Python programming is limitless and has numerous open source...
Podcast episode
Harnessing Python for Research: Scientific Applications of Python with Michael Kennedy: Still scrabbling with Excel? Consider Python language uses, says programmer and podcaster Michael Kennedy. A general programming language that is easy to use in multiple environments, Python programming is limitless and has numerous open source...
byFinding Genius Podcast
0 ratings
0% found this document useful
Doing Software Engineering in Academia - Johanna Bayer
Podcast episode
Doing Software Engineering in Academia - Johanna Bayer
byDataTalks.Club
0 ratings
0% found this document useful
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
Podcast episode
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
byData Engineering Podcast
0 ratings
0% found this document useful
Kara Cotter: Creating Self-Paced Training for Communication Partners (Part 2): This week, we present Part 2 of Chris’s interview with Kara Cotter, a school-based AAC/AT Specialist who contacted Chris to ask about improving buy in, moving to the coaching model, making AAC more inclusive, and more! Before the interview, Chris shar...
Podcast episode
Kara Cotter: Creating Self-Paced Training for Communication Partners (Part 2): This week, we present Part 2 of Chris’s interview with Kara Cotter, a school-based AAC/AT Specialist who contacted Chris to ask about improving buy in, moving to the coaching model, making AAC more inclusive, and more! Before the interview, Chris shar...
byTalking With Tech AAC Podcast
0 ratings
0% found this document useful
The Possibilities of Acoustics: Demystifying how acoustics work in the built environment
Podcast episode
The Possibilities of Acoustics: Demystifying how acoustics work in the built environment
byThe Learning Objective
0 ratings
0% found this document useful
367: Tech Tools You Can Use to Streamline Your Life: How you can organize your life with the help of tech tools, so you can use your time more efficiently
Podcast episode
367: Tech Tools You Can Use to Streamline Your Life: How you can organize your life with the help of tech tools, so you can use your time more efficiently
byThe Law School Toolbox Podcast: Tools for Law Students from 1L to the Bar Exam, and Beyond
0 ratings
0% found this document useful
Automating Complex Internal Processes w/ AI with Alexander Chukovski - TWiML Talk #161: In this episode, I'm joined by Alexander Chukovski, Director of Data Services at Munich, Germany based career platform, Experteer. In our conversation, we explore Alex’s journey to implement machine learning at Experteer. Alex and I discuss the...
Podcast episode
Automating Complex Internal Processes w/ AI with Alexander Chukovski - TWiML Talk #161: In this episode, I'm joined by Alexander Chukovski, Director of Data Services at Munich, Germany based career platform, Experteer. In our conversation, we explore Alex’s journey to implement machine learning at Experteer. Alex and I discuss the...
byThe TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
0 ratings
0% found this document useful
Podcast Ep. #18 – Prof. Wenbin Yu on the Structure Genome: On this episode I am speaking to Wenbin Yu, who is a professor at the School of Aeronautics and Astronautics of Purdue University and CTO of AnalySwift, a provider of simulation software for composites. Wenbin has achieved many accolades in both the ac...
Podcast episode
Podcast Ep. #18 – Prof. Wenbin Yu on the Structure Genome: On this episode I am speaking to Wenbin Yu, who is a professor at the School of Aeronautics and Astronautics of Purdue University and CTO of AnalySwift, a provider of simulation software for composites. Wenbin has achieved many accolades in both the ac...
byAerospace Engineering Podcast
0 ratings
0% found this document useful
What's real and what's hype? - Decades of ML with Eugene Dubossarsky - 012: What does a person tell you who has decades of experience in ML? Learn statistics.
Podcast episode
What's real and what's hype? - Decades of ML with Eugene Dubossarsky - 012: What does a person tell you who has decades of experience in ML? Learn statistics.
byMachine Learning Cafe
0 ratings
0% found this document useful
A "AI & ML" Look Ahead for 2020
Podcast episode
A "AI & ML" Look Ahead for 2020
byThe Cloudcast
0 ratings
0% found this document useful
Autonomous Database on Serverless Infrastructure: Want to quickly provision your autonomous database? Then look no further than Oracle Autonomous Database Serverless, one of the two deployment choices offered by Oracle Autonomous Database. Autonomous Database Serverless delegates all...
Podcast episode
Autonomous Database on Serverless Infrastructure: Want to quickly provision your autonomous database? Then look no further than Oracle Autonomous Database Serverless, one of the two deployment choices offered by Oracle Autonomous Database. Autonomous Database Serverless delegates all...
byOracle University Podcast
0 ratings
0% found this document useful
Growing And Supporting The Data Science Community At Anaconda: An interview with Kevin Goldsmith, CTO of Anaconda, about the challenges that data scientists are faced with, how the role is continuing to evolve, and the tools and educational resources that they are building to support the community
Podcast episode
Growing And Supporting The Data Science Community At Anaconda: An interview with Kevin Goldsmith, CTO of Anaconda, about the challenges that data scientists are faced with, how the role is continuing to evolve, and the tools and educational resources that they are building to support the community
byThe Python Podcast.__init__
0 ratings
0% found this document useful
37. Sean Knapp - The brave new world of data engineering
Podcast episode
37. Sean Knapp - The brave new world of data engineering
byTowards Data Science
0 ratings
0% found this document useful
Autonomous Database Tools: In this episode, hosts Lois Houston and Nikita Abraham speak with Oracle Database experts about the various tools you can use with Autonomous Database, including Oracle Application Express (APEX), Oracle Machine Learning, and more. Oracle...
Podcast episode
Autonomous Database Tools: In this episode, hosts Lois Houston and Nikita Abraham speak with Oracle Database experts about the various tools you can use with Autonomous Database, including Oracle Application Express (APEX), Oracle Machine Learning, and more. Oracle...
byOracle University Podcast
0 ratings
0% found this document useful
Mastering Algorithms and Data Structures - Marcello La Rocca
Podcast episode
Mastering Algorithms and Data Structures - Marcello La Rocca
byDataTalks.Club
0 ratings
0% found this document useful
Practical MLOps // Noah Gift // MLOps Coffee Sessions #27
Podcast episode
Practical MLOps // Noah Gift // MLOps Coffee Sessions #27
byMLOps.community
0 ratings
0% found this document useful
SE4ML - Software Engineering for Machine Learning - Nadia Nahar
Podcast episode
SE4ML - Software Engineering for Machine Learning - Nadia Nahar
byDataTalks.Club
0 ratings
0% found this document useful
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel: Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As the sophistication increases, so does the complexity, leading to challenges for user experience. Jignesh Patel has been researching these areas for several years in his work as a professor at Carnegie Mellon University. In this episode he illuminates the landscape of problems that we are faced with and how his research is aimed at helping to solve these problems.
Podcast episode
Pushing The Limits Of Scalability And User Experience For Data Processing WIth Jignesh Patel: Data processing technologies have dramatically improved in their sophistication and raw throughput. Unfortunately, the volumes of data that are being generated continue to double, requiring further advancements in the platform capabilities to keep up. As the sophistication increases, so does the complexity, leading to challenges for user experience. Jignesh Patel has been researching these areas for several years in his work as a professor at Carnegie Mellon University. In this episode he illuminates the landscape of problems that we are faced with and how his research is aimed at helping to solve these problems.
byData Engineering Podcast
0 ratings
0% found this document useful
Recast: AAC Modeling Roundtable: In this “Recast” episode of Talking with Tech, we share a remastered episode that was previously aired on the podcast. This episode, Chris Bugaj, Rachel Madel, and Lucas Stuber have a roundtable discussion about the key components of aided language s...
Podcast episode
Recast: AAC Modeling Roundtable: In this “Recast” episode of Talking with Tech, we share a remastered episode that was previously aired on the podcast. This episode, Chris Bugaj, Rachel Madel, and Lucas Stuber have a roundtable discussion about the key components of aided language s...
byTalking With Tech AAC Podcast
0 ratings
0% found this document useful
ThursdAI Aug 10 - Deepfakes get real, OSS Embeddings heating up, Wizard 70B tops tops the charts and more!
Podcast episode
ThursdAI Aug 10 - Deepfakes get real, OSS Embeddings heating up, Wizard 70B tops tops the charts and more!
byThursdAI - The top AI news from the past week
0 ratings
0% found this document useful
Making Email Better With AI At Shortwave: Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he realized that he had even more potential for a transformative experience. In this episode he shares the technical challenges that he and his team have overcome in integrating AI into their product, as well as the benefits and features that it provides to their customers.
Podcast episode
Making Email Better With AI At Shortwave: Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he realized that he had even more potential for a transformative experience. In this episode he shares the technical challenges that he and his team have overcome in integrating AI into their product, as well as the benefits and features that it provides to their customers.
byData Engineering Podcast
0 ratings
0% found this document useful
Lexical with Elena Bukareva and Acy Watson: In this episode, we talk about Lexical, the new extensible text editor framework from Meta, with Elena Bukareva, software engineering manager at Meta, and Acy Watson, software engineer for the Lexical core team.
Podcast episode
Lexical with Elena Bukareva and Acy Watson: In this episode, we talk about Lexical, the new extensible text editor framework from Meta, with Elena Bukareva, software engineering manager at Meta, and Acy Watson, software engineer for the Lexical core team.
byPodRocket - A web development podcast from LogRocket
0 ratings
0% found this document useful
050 - Hearing in 3D with Dr. Ivan Tashev
Podcast episode
050 - Hearing in 3D with Dr. Ivan Tashev
byMicrosoft Research Podcast
0 ratings
0% found this document useful
Episode 68: Profiler: Tor, Esteban and Chet in the Studio.In this episode, Chet and Tor talk with Esteban de la Canal about the new profiling tools in Android Studio 3.0. Join us to hear about the CPU profiler, the memory profiler, the network profiler, allocation tracking, he
Podcast episode
Episode 68: Profiler: Tor, Esteban and Chet in the Studio.In this episode, Chet and Tor talk with Esteban de la Canal about the new profiling tools in Android Studio 3.0. Join us to hear about the CPU profiler, the memory profiler, the network profiler, allocation tracking, he
byAndroid Developers Backstage
0 ratings
0% found this document useful
Brian Whitmer: Supporting Open & Free AAC Symbols, Communication Boards, & More: This week on TWT, Chris interviews Brian Whitmer, CEO of CoughDrop, about OpenAAC.org and the “open” movement supporting AAC users and their right to move communication boards from one system to another, access a free set of symbols, open 3rd party apps ...
Podcast episode
Brian Whitmer: Supporting Open & Free AAC Symbols, Communication Boards, & More: This week on TWT, Chris interviews Brian Whitmer, CEO of CoughDrop, about OpenAAC.org and the “open” movement supporting AAC users and their right to move communication boards from one system to another, access a free set of symbols, open 3rd party apps ...
byTalking With Tech AAC Podcast
0 ratings
0% found this document useful

Skip carousel

Practical Tip from Mastering Experts: Robert Babicz and Dominik de León
Beat English
Article
Practical Tip from Mastering Experts: Robert Babicz and Dominik de León
Mar 3, 2021
As a veteran of the Acid Techno scene, Robert Babicz aka Rob Acid certainly needs no introduction. In Bergisch Gladbach, the versatile artist, producer, sound designer, performer and photographer runs a renowned Mastering Studio with its very own sou
10 min read
Modern Mastering
Beat English
Article
Modern Mastering
Mar 3, 2021
Mastering is essential to give your productions a professional finishing touch. While a few years ago this required wickedly expensive analog hardware, today you can also achieve high-quality results with plug-ins and the appropriate know-how. In our
8 min read
Sound Design The Creative Guide
Music Tech Focus
Article
Sound Design The Creative Guide
Oct 5, 2018
12 min read
Three Top Full Versions
Beat English
Article
Three Top Full Versions
Sep 2, 2020
3 min read
Fm Synthesis
Music Tech Magazine
Article
Fm Synthesis
Jul 18, 2019
9 min read
Go Granular!
Electronic Musician
Article
Go Granular!
Feb 25, 2020
10 min read
Using Push As A Creative Tool With Ableton Live
Music Tech Focus
Article
Using Push As A Creative Tool With Ableton Live
Sep 7, 2017
5 min read
SIX OF THE BEST Software mastering tools
Music Tech Magazine
Article
SIX OF THE BEST Software mastering tools
Jul 16, 2020
3 min read
Using Audio Effects In Ableton Live
Music Tech Magazine
Article
Using Audio Effects In Ableton Live
May 21, 2020
Audio effects are a key part of Ableton Live, no matter what you’re doing with it, and Live 10 especially boasts some fantastic in-built options. Eventually you’re bound to reach for some third-party plug-ins to flesh out your library – no DAW does i
3 min read
Labours Of Love First-born Loudspeakers
Sound and Image
Article
Labours Of Love First-born Loudspeakers
Apr 27, 2020
12 min read
HOUSE & TECHNO MASTERCLASS
Computer Music
Article
HOUSE & TECHNO MASTERCLASS
Jul 10, 2019
From inner-city USA to the sunny terraces of Ibiza via the fields of rural England, house and techno culture has been the lifeblood of underground dance music since wannabe musicians began bashing away on old step sequencers and discarded drum machin
9 min read
Ableton Wavetable
Future Music
Article
Ableton Wavetable
Aug 25, 2020
10 min read
Jean-Michel Jarre
Future Music
Article
Jean-Michel Jarre
Sep 21, 2021
8 min read
Mixing Synth Pop Synths
Audio Technology
Article
Mixing Synth Pop Synths
Mar 18, 2020
8 min read
Mastering Pro Tips
Music Tech Focus
Article
Mastering Pro Tips
Sep 7, 2017
3 min read
Analogue Sequencing
Future Music
Article
Analogue Sequencing
Nov 16, 2021
4 min read
Dolby Atmos Is Just The Start. Next-gen Audio Format Ac-4 Will Offer Far More…
T3
Article
Dolby Atmos Is Just The Start. Next-gen Audio Format Ac-4 Will Offer Far More…
Jul 6, 2018
So it seems that Dolby Atmos is only the beginning. Dolby has a new next-gen audio format waiting in the wings, which evolves the concept of object-based audio for broadcasters. AC-4 builds on the core idea behind Atmos, but offers an even more versa
1 min read
Artists At The Forefront Of AI
Computer Music
Article
Artists At The Forefront Of AI
May 18, 2022
Despite the numerous AI platforms which serve up routes to auto-generate functional music, many artists who have overtly worked with AI have approached the concept via more individual means. Take Holly Herndon, the Berlin-based composer and musicolog
3 min read
Mix Master
Audio Technology
Article
Mix Master
Mar 12, 2018
11 min read
Active Audio Filter Design
CQ Amateur Radio
Article
Active Audio Filter Design
Jul 1, 2019
4 min read
Feature 100 Pro Tips
Music Tech Focus
Article
Feature 100 Pro Tips
Apr 2, 2020
29 min read
8 Tips For Better A Mixdown
Future Music
Article
8 Tips For Better A Mixdown
Feb 8, 2022
2 min read
How to Record Pop Vocals, Pt III — Mixing
Audio Technology
Article
How to Record Pop Vocals, Pt III — Mixing
Dec 20, 2018
Here we are, at the final stage of the vocal process — mixing. Previously, we’ve gone over recording and editing the perfect vocal take, but before we move on, let’s add some doubles and harmonies to the pie. A ‘standard’ approach would be to have yo
7 min read
The Budget Studio Guide
Music Tech Magazine
Article
The Budget Studio Guide
Jul 16, 2020
11 min read
the 6 WAYS TO TURN INTO SONGS
Music Tech Magazine
Article
the 6 WAYS TO TURN INTO SONGS
Sep 19, 2019
3 min read
22 Workflow Assistants
Beat English
Article
22 Workflow Assistants
Jul 1, 2020
7 min read
8 Ways To Compose And Arrange A Track
Music Tech Focus
Article
8 Ways To Compose And Arrange A Track
Jun 2, 2016
5 min read
Circuit Rhythm
Electronic Musician
Article
Circuit Rhythm
Jul 20, 2021
6 min read
Max For Live
Computer Music
Article
Max For Live
Jun 16, 2021
Like Reaktor, Max by Cycling ’74 is a visual programming tool but, unlike Reaktor, it is not in-and-of-itself dedicated to creating audio processors, and has capabilities that extend into many different fields: image and video processing, hardware co
3 min read
What Is Wavetable Synthesis?
Future Music
Article
What Is Wavetable Synthesis?
Jul 27, 2021
Wavetable synthesis is nothing new; it first appeared in the hardware realm in the early ’80s with the launch of Wolfgang Palm’s PPG Wave, later followed by a string of influential hardware synths created by PPG successors Waldorf. It has really come
2 min read

Related categories

Skip carousel

Reviews for Introduction to Audio Analysis

Rating: 4.5 out of 5 stars

4.5/5

2 ratings0 reviews

Book preview

Introduction to Audio Analysis - Theodoros Giannakopoulos

Introduction to Audio Analysis

A MATLAB Approach

First Edition

Theodoros Giannakopoulos and Aggelos Pikrakis

Cover image

Title page

Copyright

Preface

Acknowledgments

List of Tables

List of figures

1: Basic Concepts, Representations and Feature Extraction

1: Introduction

1.1 The MATLAB Audio Analysis Library

1.2 Outline of Chapters

1.3 A Note on Exercises

2: Getting Familiar with Audio Signals

2.1 Sampling

2.2 Playback

2.3 Mono and Stereo Audio Signals

2.4 Reading and Writing Audio Files

2.5 Reading Audio Files in Blocks

2.6 Recording Audio Data

2.7 Short-term Audio Processing

2.8 Exercises

3: Signal Transforms and Filtering Essentials

3.1 The Discrete Fourier Transform

3.2 The Short-Time Fourier Transform

3.3 Aliasing in More Detail

3.4 The Discrete Cosine Transform

3.5 The Discrete-Time Wavelet Transform

3.6 Digital Filtering Essentials

3.7 Digital Filters in MATLAB

3.8 Exercises

4: Audio Features

4.1 Short-Term and Mid-Term Processing

4.2 Class Definitions

4.3 Time-Domain Audio Features

4.4 Frequency-Domain Audio Features

4.5 Periodicity Estimation and Harmonic Ratio

4.6 Exercises

2: Audio Content Characterization

5: Audio Classification

5.1 Classification Fundamentals

5.2 Popular Classifiers

5.3 Implementation-Related Issues

5.4 Evaluation

5.5 Case Studies

5.6 Exercises

6: Audio Segmentation

6.1 Segmentation with Embedded Classification

6.2 Segmentation Without Classification

6.3 Exercises

7: Audio Alignment and Temporal Modeling

7.1 Audio Sequence Alignment

7.2 Hidden Markov Modeling

7.3 The Viterbi Algorithm

7.4 The Baum-Welch Algorithm

7.5 HMM Training

7.6 Exercises

3: Other Issues

8: Music Information Retrieval

8.1 Music Thumbnailing

8.2 Music Meter and Tempo Induction

8.3 Music Content Visualization

8.4 Exercises

Appendix A: The Matlab Audio Analysis Library

1 Supplementary data

2 Supplementary data

Appendix B: Audio-Related Libraries and Software

B.1 MATLAB

B.2 Python

B.3 C/C++

Appendix C: Audio Datasets

Bibliography

Index

Copyright

Academic Press is an imprint of Elsevier

The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, UK

225 Wyman Street, Waltham, MA 02451, USA

525 B Street, Suite 1800, San Diego, CA 92101-4495, USA

First edition 2014

MATLAB® is a registered trademarks of The MathWorks, Inc.

For MATLAB and Simulink product information, please contact:

The MathWorks, Inc.

3 Apple Hill Drive

Natick, MA, 01760-2098 USA

Tel: 508-647-7000

Fax: 508-647-7001

E-mail:

Web:

No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means electronic, mechanical, photocopying, recording or otherwise without the prior written permission of the publisher.

Permissions may be sought directly from Elsevier’s Science & Technology Rights Department in Oxford, UK: phone (+44) (0) 1865 843830; fax (+44) (0) 1865 853333; email , and selecting Obtaining permission to use Elsevier material.

Notice

No responsibility is assumed by the publisher for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions or ideas contained in the material herein. Because of rapid advances in the medical sciences, in particular, independent verification of diagnoses and drug dosages should be made.

British Library Cataloguing in Publication Data

A catalogue record for this book is available from the British Library

Library of Congress Cataloging-in-Publication Data

A catalog record for this book is available from the Library of Congress

ISBN: 978-0-08-099388-1

For information on all Academic Press publications visit our web site at

Printed and bound in United States of America

14 15 16 17 18 10 9 8 7 6 5 4 3 2 1

Preface

This book attempts to provide a gentle introduction to the field of audio analysis using the MATLAB programming environment as the vehicle of presentation. Audio analysis is a multidisciplinary field, which requires the reader to be familiar with concepts from diverse research disciplines, including digital signal processing and machine learning. As a result, it is a great challenge to write a book that can provide sufficient coverage of the important concepts in the field of audio analysis and, at the same time, be accessible to readers who do not necessarily possess the required scientific background.

Our main goal has been to provide a standalone introduction, involving a balanced presentation of theoretical descriptions and reproducible MATLAB examples. Our philosophy is that readers with diverse scientific backgrounds can gain an understanding of the field of audio analysis, if they are provided with basic theory, in conjunction with reproducible experiments that can help them deal with the theory from a more practical perspective. In addition, this type of approach allows the reader to acquire certain technical skills that are useful in the context of developing real-world audio analysis applications. To this end, we also provide an accompanying software library which can be downloaded from the companion site and includes the MATLAB functions and related data files that have been used throughout the text.

We believe that this book is suitable for students, researchers, and professionals alike, who need to develop practical skills, along with a basic understanding of the field. The book does not assume previous knowledge of digital signal processing and machine learning concepts, as it provides introductory material for the necessary topics for both disciplines. We expect that, after reading this book, the reader will feel comfortable with various key processing stages of the audio analysis chain, including audio content creation, representation, feature extraction, classification, segmentation, sequence alignment and temporal modeling. Furthermore, we believe that the study of the presented case studies will provide further insight into the development of real-world applications.

This book is the product of several years of teaching and research and reflects our teaching philosophy, which has been shaped via our interaction with our students and colleagues, and to whom we are both grateful. We hope that the will prove useful to all readers who are making their first steps in the field of audio analysis. Although we have made an effort to eliminate errors during the writing stage, we encourage the reader to contact us with any comments and suggestions for improvement, in either the text or the accompanying software library.

Theodoros Giannakopoulos and Aggelos Pikrakis

Athens, 2013

For access to the software library and other supporting materials, please visit the companion website at:

Acknowledgments

This book has improved thanks to the support of a number of colleagues, students, and friends, who have provided generous feedback and constructive comments, during the writing process. Above all, T. Giannakopoulos would like to thank his wife, Maria, and his daughter, Eleni, for always being cheerful and supportive. A. Pikrakis would like to thank his family for their patience and generous support and dedicates this book to all the teachers who have shaped his life.

List of Tables

List of Figures

Part 1: Basic Concepts, Representations and Feature Extraction

Outline

Introduction

Getting Familiar with Audio Signals

Signal Transforms and Filtering Essentials

Audio Features

1

Introduction

Abstract

This chapter has an introductory purpose. A chapter outline is provided, along with general notes on the book’s exercises and the companion software. Before we proceed, it is important to note that, although in this book the term audio does not exclude the speech signal, we are not focusing on traditional speech-related problems that have been studied by the research community for decades, e.g., speech recognition and coding.

Keywords

Audio analysis

MATLAB

During recent years we have witnessed the increasing availability of audio content via numerous distribution channels both for commercial and non-profit purposes. The resulting wealth of data has inevitably highlighted the need for systems that are capable of analyzing the audio content in order to extract useful knowledge that can be consumed by users or subsequently exploited by other processing systems.

Before we proceed, it is important to note that, although in this book the term ‘audio’ does not exclude the speech signal, we are not focusing on traditional speech-related problems that have been studied by the research community for decades, e.g. speech recognition and coding. It is our intention to provide analysis methods that can be used to study various audio modalities and their relationships in mixed audio streams. Consider, for example, the task of segmenting a radio broadcast into homogeneous parts that contain either speech, music, or silence. The development of a solution for such a task demands that we are familiar with various audio modalities and how they affect the performance of segmentation algorithms in audio streams. In other words, we are not interested in providing solutions that are well tailored to specific audio types (e.g. the speech signal) but are not applicable to other modalities.

As with several other types of media, the automatic analysis of audio signals has been gaining increasing interest during the past decade. Depending on the storage/distribution format, the respective audio content classes, the co-existence of other media types (e.g. moving image), the user requirements, the data volume, the application context, and numerous other parameters, a diversity of applications and research trends have emerged to deal with various audio analysis tasks. The following list includes both speech and non-speech tasks so as to provide a general idea of the trends in several popular areas of speech/audio processing:

• Speech recognition: this is the task of ‘translating’ a speech signal to text using computational tools. Speech recognition is the oldest domain of audio analysis, but it is beyond the purpose of this book to provide a detailed study on speech recognition. We only present generic dynamic time warping and temporal modeling techniques that can also be applied on other audio signals.

• Speaker identification, verification and diarization: These speaker-related tasks focus on designing methods that discriminate between different speakers. Speaker identification and verification can be useful in the development of secure systems and speaker diarization, being able to answer the question ‘who spoke when?’, can be used in conversation summarization systems.

• Music information retrieval (MIR): due to the huge increase in the amount of available digital music data during the past few years, there has been an increasing need for the automatic analysis of this type of data. MIR focuses on automatically extracting information from the music signal for the purposes of content tagging, intelligent indexing; retrieval; browsing of music tracks; recommendation of new tracks based on music content (possibly combined with user preferences and collaborative knowledge); segmentation of music tracks, generation of summaries; extraction of automated music transcriptions, etc.

• Audio event detection: this is the task of detecting audio events in audio streams. There can be numerous related applications, like audio-based surveillance, violence detection, and intrusion detection, to name but a few.

• Speech emotion recognition: this is the task of predicting the speaker’s emotional state (anger, sadness, etc.) using speech analysis techniques. Emotion recognition has been gaining increasing interest during the last decade. The audio stream is either used independently, or in collaboration with visual cues (e.g. facial features). Emotion recognition is expected to play an important role in the next-generation human-computer interaction systems, but it can be also be used to enhance the functionality of other systems that perform retrieval and multimedia content characterization tasks.

• Multimodal analysis of the movie content: this task aims to automatically recognize events and classes in movies based on audio, visual, and textual information. The audio cues can contain rich information regarding events like the existence of music, speech, sound effects (gunshots, human fights), emotions, etc. The resulting metadata can serve indexing and fast browsing purposes in the context of next-generation multimedia systems.

The purpose of this book is to serve as a standalone introduction to audio signal analysis by providing a sufficient theoretical background for many state-of-the-art techniques, along with a large number of reproducible MATLAB examples. It is important to note that it is not our intention to demand that the reader be familiar with concepts from a variety of disciplines, such as signal processing and machine learning, although, of course, knowledge improves the reading experience. However, in each chapter, we focus on providing a smooth transition from introductory issues to more advanced ones, assuming that the reader is a beginner in the field. For example, we present the classification of audio segments but instead of assuming that the reader has knowledge of the respective pattern recognition concepts, we provide an introduction to the subject, ensuring that we: (a) complement the description with MATLAB examples and (b) evaluate the audio analysis domain (e.g. discuss a binary classifier via a speech-music discrimination example). Furthermore, the first chapters of the book introduce basic signal processing concepts like sampling and frequency representations.

1.1 The MATLAB Audio Analysis Library

Further to the necessary theoretical background, we also provide a complete set of MATLAB files that constitute the MATLAB Audio Analysis Library of this book. Where we find it useful from a pedagogical perspective, parts of the code are listed in the book. However, in most cases, the complete MATLAB code is omitted. We prefer to describe how to ‘call specific functions,’ to report on what to expect, to present and discuss the results, and so on.

The accompanying library is an important companion to the book that is aimed at helping the reader to understand the related theory and experiment with their own audio analysis solutions. A list of the available MATLAB functions, along with brief descriptions, is given in the Appendix of this book.

1.2 Outline of Chapters

Chapter 2 provides information and techniques for the basic issues related to the creation, representation, playback, recording, and storing of audio signals in MATLAB. Although the focus of the chapter is on practical issues, we also describe the basic theory of content creation. At the end of the chapter, we describe the process of breaking an audio signal into short-term windows to enable audio analysis on a short-term basis. This is in preparation for the next two chapters, as frequency representations and feature extraction both require the short-term processing stage of the signal.

In Chapter 3 we present methods for representing audio signals in the frequency domain, mostly focusing on the discrete Fourier transform. In addition, we provide a basic description of filtering techniques by Means of MATLAB examples.

Chapter 4 presents a wide range of features from the time and frequency domains, that have been widely

Enjoying the preview?

Page 1 of 1

Introduction to Audio Analysis: A MATLAB® Approach

About this ebook

Theodoros Giannakopoulos

Related authors

Related to Introduction to Audio Analysis

Related ebooks

Technology & Engineering For You

Related podcast episodes

Related articles

Related categories

Reviews for Introduction to Audio Analysis

What did you think?

Book preview

Introduction to Audio Analysis - Theodoros Giannakopoulos

Table of Contents

1: Basic Concepts, Representations and Feature Extraction

2: Audio Content Characterization

3: Other Issues

Copyright

Preface

Acknowledgments

1

Introduction

Abstract

Keywords

1.1 The MATLAB Audio Analysis Library

1.2 Outline of Chapters