Speech Enhancement: A Signal Subspace Perspective
()
About this ebook
Speech enhancement is a classical problem in signal processing, yet still largely unsolved. Two of the conventional approaches for solving this problem are linear filtering, like the classical Wiener filter, and subspace methods. These approaches have traditionally been treated as different classes of methods and have been introduced in somewhat different contexts. Linear filtering methods originate in stochastic processes, while subspace methods have largely been based on developments in numerical linear algebra and matrix approximation theory.
This book bridges the gap between these two classes of methods by showing how the ideas behind subspace methods can be incorporated into traditional linear filtering. In the context of subspace methods, the enhancement problem can then be seen as a classical linear filter design problem. This means that various solutions can more easily be compared and their performance bounded and assessed in terms of noise reduction and speech distortion. The book shows how various filter designs can be obtained in this framework, including the maximum SNR, Wiener, LCMV, and MVDR filters, and how these can be applied in various contexts, like in single-channel and multichannel speech enhancement, and in both the time and frequency domains.
- First short book treating subspace approaches in a unified way for time and frequency domains, single-channel, multichannel, as well as binaural, speech enhancement
- Bridges the gap between optimal filtering methods and subspace approaches
- Includes original presentation of subspace methods from different perspectives
Jacob Benesty
Jacob Benesty received a Master degree in microwaves from Pierre & Marie Curie University, France, in 1987, and a Ph.D. degree in control and signal processing from Orsay University, France, in April 1991. During his Ph.D. (from Nov. 1989 to Apr. 1991), he worked on adaptive filters and fast algorithms at the Centre National d’Etudes des Telecomunications (CNET), Paris, France. From January 1994 to July 1995, he worked at Telecom Paris University on multichannel adaptive filters and acoustic echo cancellation. From October 1995 to May 2003, he was first a Consultant and then a Member of the Technical Staff at Bell Laboratories, Murray Hill, NJ, USA. In May 2003, he joined the University of Quebec, INRS-EMT, in Montreal, Quebec, Canada, as a Professor. His research interests are in signal processing, acoustic signal processing, and multimedia communications. He is the inventor of many important technologies. In particular, he was the lead researcher at Bell Labs who conceived and designed the world-first real-time hands-free full-duplex stereophonic teleconferencing system. Also, he conceived and designed the world-first PC-based multi-party hands-free full-duplex stereo conferencing system over IP networks. He was the co-chair of the 1999 International Workshop on Acoustic Echo and Noise Control and the general co-chair of the 2009 IEEEWorkshop on Applications of Signal Processing to Audio and Acoustics. He is the recipient, with Morgan and Sondhi, of the IEEE Signal Processing Society 2001 Best Paper Award. He is the recipient, with Chen, Huang, and Doclo, of the IEEE Signal Processing Society 2008 Best Paper Award. He is also the co-author of a paper for which Huang received the IEEE Signal Processing Society 2002 Young Author Best Paper Award. In 2010, he received the “Gheorghe Cartianu Award from the Romanian Academy. In 2011, he received the Best Paper Award from the IEEE WASPAA for a paper that he co-authored with Jingdong Chen.
Related to Speech Enhancement
Related ebooks
Advances in Independent Component Analysis and Learning Machines Rating: 0 out of 5 stars0 ratingsFunctions of a Complex Variable and Some of Their Applications Rating: 0 out of 5 stars0 ratingsOptimal Control of Differential and Functional Equations Rating: 0 out of 5 stars0 ratingsOperator Methods in Quantum Mechanics Rating: 0 out of 5 stars0 ratingsInfinite Abelian Groups Rating: 0 out of 5 stars0 ratingsInvariant Variational Principles Rating: 5 out of 5 stars5/5Submodular Functions and Optimization Rating: 0 out of 5 stars0 ratingsMeasure and Integration: A Concise Introduction to Real Analysis Rating: 0 out of 5 stars0 ratingsConvexity and Graph Theory Rating: 0 out of 5 stars0 ratingsTopological Theory of Dynamical Systems: Recent Advances Rating: 0 out of 5 stars0 ratingsHarmonic Analysis and Special Functions on Symmetric Spaces Rating: 0 out of 5 stars0 ratingsFinite Dimensional Vector Spaces. (AM-7), Volume 7 Rating: 4 out of 5 stars4/5Classical Recursion Theory: The Theory of Functions and Sets of Natural Numbers Rating: 0 out of 5 stars0 ratingsSingular Optimal Control Problems Rating: 0 out of 5 stars0 ratingsTopology and Its Applications Rating: 3 out of 5 stars3/5Vector-valued function and distribution spaces on the torus Rating: 0 out of 5 stars0 ratingsRepresentations of Finite Groups Rating: 0 out of 5 stars0 ratingsDelay and Functional Differential Equations and Their Applications Rating: 0 out of 5 stars0 ratingsMathematical Methods in Computer Aided Geometric Design II Rating: 0 out of 5 stars0 ratingsVLSI in Medicine Rating: 5 out of 5 stars5/5Automata Studies. (AM-34), Volume 34 Rating: 5 out of 5 stars5/5Finite Automata: Behavior and Synthesis Rating: 5 out of 5 stars5/5Applied Automata Theory Rating: 0 out of 5 stars0 ratingsStability of Linear Systems: Some Aspects of Kinematic Similarity Rating: 0 out of 5 stars0 ratingsSpheroidal Wave Functions Rating: 0 out of 5 stars0 ratingsLie Theory and Special Functions Rating: 0 out of 5 stars0 ratingsLie Algebras Rating: 0 out of 5 stars0 ratingsStatistical Inferences for Stochasic Processes: Theory and Methods Rating: 0 out of 5 stars0 ratingsAcoustic Wave Sensors: Theory, Design and Physico-Chemical Applications Rating: 0 out of 5 stars0 ratings
Technology & Engineering For You
The Big Book of Hacks: 264 Amazing DIY Tech Projects Rating: 4 out of 5 stars4/5The Big Book of Maker Skills: Tools & Techniques for Building Great Tech Projects Rating: 4 out of 5 stars4/5Artificial Intelligence: A Guide for Thinking Humans Rating: 4 out of 5 stars4/5The Art of War Rating: 4 out of 5 stars4/5The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology Rating: 0 out of 5 stars0 ratings80/20 Principle: The Secret to Working Less and Making More Rating: 5 out of 5 stars5/5Logic Pro X For Dummies Rating: 0 out of 5 stars0 ratingsThe CIA Lockpicking Manual Rating: 5 out of 5 stars5/5The Fast Track to Your Technician Class Ham Radio License: For Exams July 1, 2022 - June 30, 2026 Rating: 5 out of 5 stars5/5Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career Rating: 4 out of 5 stars4/5The 48 Laws of Power in Practice: The 3 Most Powerful Laws & The 4 Indispensable Power Principles Rating: 5 out of 5 stars5/5The Total Inventor's Manual: Transform Your Idea into a Top-Selling Product Rating: 1 out of 5 stars1/5The Total Motorcycling Manual: 291 Essential Skills Rating: 5 out of 5 stars5/5The Art of Tinkering: Meet 150+ Makers Working at the Intersection of Art, Science & Technology Rating: 4 out of 5 stars4/5Electrical Engineering 101: Everything You Should Have Learned in School...but Probably Didn't Rating: 5 out of 5 stars5/5The Systems Thinker: Essential Thinking Skills For Solving Problems, Managing Chaos, Rating: 4 out of 5 stars4/5Broken Money: Why Our Financial System is Failing Us and How We Can Make it Better Rating: 5 out of 5 stars5/5My Inventions: The Autobiography of Nikola Tesla Rating: 4 out of 5 stars4/5The Art of War Rating: 4 out of 5 stars4/5How to Disappear and Live Off the Grid: A CIA Insider's Guide Rating: 0 out of 5 stars0 ratingsNo Nonsense Technician Class License Study Guide: for Tests Given Between July 2018 and June 2022 Rating: 5 out of 5 stars5/5The Invisible Rainbow: A History of Electricity and Life Rating: 4 out of 5 stars4/5The Wuhan Cover-Up: And the Terrifying Bioweapons Arms Race Rating: 0 out of 5 stars0 ratingsUnderstanding Media: The Extensions of Man Rating: 4 out of 5 stars4/5Smart Phone Dumb Phone: Free Yourself from Digital Addiction Rating: 0 out of 5 stars0 ratingsVanderbilt: The Rise and Fall of an American Dynasty Rating: 4 out of 5 stars4/5The Complete Titanic Chronicles: A Night to Remember and The Night Lives On Rating: 4 out of 5 stars4/5
Reviews for Speech Enhancement
0 ratings0 reviews
Book preview
Speech Enhancement - Jacob Benesty
1
Introduction
Abstract
The presence of background noise is problematic for humans and computers alike, and the problem of dealing with it (called speech enhancement or noise reduction) is an important and long-standing problem in signal processing. The search for new and better methods continues today. Speech enhancement algorithms are important components in many systems where speech plays a part, including telephony, hearing aids, voice over IP, and automatic speech recognizers. Speech enhancement is generally concerned with the problem of enhancing the quality of speech signals. This can, of course, mean many things, but it is often associated with the specific problem of reducing the impact of additive noise, which is also what we are concerned with in the present book.
Keywords
Noise reduction; Speech enhancement; Subspace methods; Reduced-rank signal processing
In verbal communication, the presence of background noise, such as the sound of a passing car or an air vent, can impact the quality of the speech signal in a detrimental way, something that affects the listener and thus also the communication in several negative ways. Not only may the perceived quality of the speech be harmed, but also its intelligibility may be degraded. Even if only the perceived quality of the speech is affected, this may have a severe impact on the ability of the users to communicate, as exposure to noisy signals may cause listener fatigue. The presence of noise in signals is, though, not only a problem for humans. In speech processing systems, background noise causes additional problems, as such systems often comprise components that are designed under the assumption that only one, clean speech signal is present at any given time. This is, for example, the case for automatic speech recognizers and speech coders. This is typically done to simplify the design of these components, as the underlying statistical models then do not have to account for all possible noise types. Not only does this simplify the training of such models, it also, generally speaking, leads to faster algorithms; but it also renders these components vulnerable to noise.
As we have argued, the presence of background noise is problematic for humans and computers alike, and the problem of dealing with it, which is called speech enhancement or noise reduction, is an important and long-standing problem in signal processing (see, e.g., [1] and [2] for recent surveys), and the search for new and better methods continues today. Speech enhancement algorithms are important components in many systems, where speech plays a part, including telephony, hearing aids, voice over IP, and automatic speech recognizers. Speech enhancement is generally concerned with the problem of enhancing the quality of speech signals. This can, of course, mean many things, but it is often associated with the specific problem of reducing the impact of additive noise, which is also what we are concerned with in the present book. Additive noise occurs naturally in acoustic environments when multiple sources are present, and examples of common noise types are street, car, and babble. Moreover, it can also be caused by intrinsic noise in the sensor system, i.e., from the electrical components. To be more precise, the purpose of speech enhancement is to minimize the impact of the background noise while preserving the speech signal. Hence, there are two performance measures by which the efficiency of speech enhancement methods is compared: speech distortion and noise reduction [3]. These two measures are often conflicting, meaning that if we want to achieve the highest possible noise reduction, then we must accept speech distortion and, similarly, that if we cannot accept any speech distortion, then our ability to perform noise reduction will be hampered. An extreme example of this is the maximum signal-to-noise ratio filter [1] which achieves the highest possible noise reduction but at the cost of severe speech