English Deutsch Français 简体中文 繁體中文
Book123, Download eBooks for Free - Anytime! Submit your article

Categories

Share With Friends



Like Book123?! Give us +1

Archive by Date

Search Tag

Newest

Science/Engineering Statistical Mechanics, Third Edition
Science/Engineering Essentials of Toxic Chemical Risk: Science and Society
Science/Engineering Telefoncoaching: So machen Sie aus Ihren Mitarbeitern Telefonprofis
Science/Engineering Wireless Communications (Wiley - IEEE)
Science/Engineering Posttraumatische Belastungsstörungen (German Edition)
Science/Engineering Lernplattformen in Schulen: Ansätze für E-Learning und Blended Learning in Präsenzklassen (1 Auflage)
Science/Engineering Stochastik für Einsteiger: Eine Einführung in die faszinierende Welt des Zufalls. Mit über 220 Übungsaufgaben und Lösungen {Repost}
Science/Engineering Testtheorie und Fragebogenkonstruktion (Springer-Lehrbuch)
Science/Engineering Centrifugal Pumps, 2nd Edition
Science/Engineering Computational Intelligence for Modelling and Prediction (Studies in Computational Intelligence) 1 edition {Repost}
Science/Engineering Networks, Crowds, and Markets: Reasoning About a Highly Connected World {repost}
Science/Engineering Introduction to Biophotonics (repost)
Science/Engineering The Art and Science of Psychotherapy (repost)
Science/Engineering Advances in Chemical Physics - Volume 15: Stochastic Processes in Chemical Physics
Science/Engineering "Emulsion Science: Basic Principles" (repost)
Science/Engineering Elementary Principles of Chemical Processes 3rd edition
Science/Engineering Boundary Element Analysis (repost)
Science/Engineering Collection of books on physics 2
Science/Engineering A Practical Handbook of Preparative HPLC by Donald A. Wellings (Repost)
Science/Engineering Reviews of Environmental Contamination and Toxicology 184 by George W. Ware

Useful Links


Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment

Posted on 2010-03-16




Name:Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment
ASIN/ISBN:0387688358
Language:English
File size:4.4 Mb
Publish Date: 2009
ISBN: 0387688358
Pages: 228 pages
File Type: PDF
File Size: 4,4 MB
Other Info: Springer
   Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment



More

Julien Bourgeois, Wolfgang Minker, ""

Speech is a natural and therefore privileged communication modality. Safety and convenience issues require hands-free, eyes-free speech-based human-computer interfaces to manipulate complex functionalities and devices. For example, in cars, applications include entertainment, telephony as well as more advanced functions such as automatic spoken language dialog systems for in-vehicle navigation. With a seamless speech input, such interfaces bring an increased comfort but have to face several issues: degradation of the signal-to-noise ratio (SNR) at the microphone, reverberated speech signal, and, above all, the presence of interferences. The interferences, such as speech from the co-driver, can greatly hamper the performance of the speech recognition component, which is crucial for dialog applications. Especially for overlaid speech, the separation of the target speaker from the interferer represent a particular challenge.

Time-domain Beamforming and Convolutive Blind Source Separation addresses the problem of separating spontaneous multi-party speech by way of microphone arrays (beamformers) and adaptive signal processing techniques.

While existing techniques requires a Double-Talk Detector (DTD) that interrupts the adaptation when the target is active, the described method addresses the separation problem using continuous, uninterrupted adaptive algorithms. The advantage of such an approach is twofold: Firstly, the algorithm development is much simpler since no detection mechanism needs to be designed and no threshold to be tuned. Secondly, the performance can be improved due to the adaptation during periods of double-talk.

The book is organized in three parts, roughly described as follows:

The first line of attack, termed implicit beamforming, is built upon the classical supervised beamforming, i.e. it requires the position of the target speaker to be known. Using a time-varying pseudo-optimal step-size that takes over the adaptation control, a continuous adaptive algorithm is obtained. Experimentally, the performance of this algorithm appears to be sufficient if the microphones are oriented adequately. However, in general, more sophisticated Blind Source Separation (BSS) techniques are required.

In the second part, the time-domain BSS method (Buchner et al., 2005) exploiting second-order statistics of the source signals is considered. This method is based on the natural gradient and limited to square systems with an equal number of sources and microphones. Introducing the concept of partial separation, a novel approach is proposed to remove this restriction of the natural gradient. The Sylvester-based representation of the separation system allows a very concise derivation of second-order BSS algorithms in the time-domain but cannot be directly implemented. Revisiting the natural gradient in the z-domain, this implementation issue is clarified. Furthermore, the convergence and stability of BSS is discussed from a theoretical point of view, and its properties are compared to those of supervised beamforming.

Finally, combinations of beamforming and BSS are presented leading to already known, but also novel algorithms. The underlying idea is the following: if the position of the target speaker (the driver) is known in advance, a purely blind approach, which does not exploit this information, seems sub-optimal. Therefore, an emphasis is placed on the development of an algorithm that combines the benefits of both approaches. It outperforms BSS and removes the need for a DTD and allows for a continuous adaptation, even during double-talk.

The book is written is a concise manner and an effort has been made such that all presented algorithms can be straightforwardly implemented by the reader. All experimental results have been obtained with real in-car microphone recordings involving simultaneous speech of the driver and the co-driver, as opposed to computer-generated simulations. Experiments with background noise have been carried out in order to assess the robustness of the considered methods in noisy conditions.

Buy Book at Lowest Price on Amazon

Rating:

2.5 out of 5 by

 
Download Links
  ServerStatus
  Direct Download Link 1Alive
  Direct Download Link 2Alive
  uploading.comAlive
  depositfiles.comAlive
  mirrorAlive


Buy This Book at Best Price >>

Like this article?! Give us +1:

Related Articles


Technical Speech Separation By Humans and Machines

Technical Speech Separation By Humans and Machines

Science/Engineering Blind Speech Separation

Science/Engineering Blind Speech Separation

Blind Speech SeparationPublisher: Springer | Pages: 432 | 2007-09-20 | ISBN 1402064780 | PDF | 14 MBThis is the first book to provide a cutting edge reference to the fascinating topic of blind source separation (BSS) for convolved speech ...

Science/Engineering Interactive Speech Technology: Human Factors Issues In The Application Of Speech Input/Output To Computers

Science/Engineering Interactive Speech Technology: Human Factors Issues In The Application Of Speech Input/Output To Computers

Interactive Speech Technology: Human Factors Issues In The Application Of Speech Input/Output To ComputersBy Christopher Baber, J Noyes Publisher CRC | ISBN: 074840127X | edition 1993 | PDF | 212 pages | 1,51 mb Focusing on human compute ...

Science/Engineering Handbook of Statistics Volume 5 : Time Series in the Time Domain

Science/Engineering Handbook of Statistics Volume 5 : Time Series in the Time Domain

Handbook of Statistics Volume 5 : Time Series in the Time DomainElsevier Science Pub Co | 1985-08-01 | ISBN: 0444876294 | 482 Pages | PDF-RAR | DF, FF & RS | 20.80MBBook Description: In this volume prominent workers in the field discuss ...

Analysis And Control Of Nonlinear Systems With Stationary Sets: Time-Domain and Frequency-Domain Methods

Analysis And Control Of Nonlinear Systems With Stationary Sets: Time-Domain and Frequency-Domain Methods

Jinzhi Wang, Zhisheng Duan, Ying Yang, Lin Huang "Analysis And Control Of Nonlinear Systems With Stationary Sets: Time-Domain and Frequency-Domain Methods"World Scientific Publishing Company | English | 2009-03-12 | ISBN: 9812814698 | 336 p ...

Programming Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation

Programming Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation

Andrzej Cichocki, Rafal Zdunek, Anh Huy Phan, Shun-ichi Amari, "Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation" Wiley | 2009 | ISBN: 0470746661 | 500 pages | PDF ...

Share this page with your friends now!
Text link
Forum (BBCode)
Website (HTML)
Tags:
Separation   Environment   Speech   Input  
 

DISCLAIMER:

This site does not store Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment on its server. We only index and link to Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment provided by other sites. Please contact the content providers to delete Science/Engineering Time-Domain Beamforming and Blind Source Separation: Speech Input in the Car Environment if any and email us, we'll remove relevant links or contents immediately.

Comments (0) All

Verify: Verify

    Sign In   Not yet a member?

Sign In | Not yet a member?