Cargando…

Contemporary Methods for Speech Parameterization

Contemporary Methods for Speech Parameterization offers a general view of short-time cepstrum-based speech parameterization and provides a common ground for further in-depth studies on the subject. Specifically, it offers a comprehensive description, comparative analysis, and empirical performance e...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Ganchev, Todor (Autor)
Autor Corporativo: SpringerLink (Online service)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: New York, NY : Springer New York : Imprint: Springer, 2011.
Edición:1st ed. 2011.
Colección:SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning,
Temas:
Acceso en línea:Texto Completo

MARC

LEADER 00000nam a22000005i 4500
001 978-1-4419-8447-0
003 DE-He213
005 20220116002751.0
007 cr nn 008mamaa
008 110806s2011 xxu| s |||| 0|eng d
020 |a 9781441984470  |9 978-1-4419-8447-0 
024 7 |a 10.1007/978-1-4419-8447-0  |2 doi 
050 4 |a TK5102.9 
072 7 |a TJF  |2 bicssc 
072 7 |a UYS  |2 bicssc 
072 7 |a TEC008000  |2 bisacsh 
072 7 |a TJF  |2 thema 
072 7 |a UYS  |2 thema 
082 0 4 |a 621.382  |2 23 
100 1 |a Ganchev, Todor.  |e author.  |4 aut  |4 http://id.loc.gov/vocabulary/relators/aut 
245 1 0 |a Contemporary Methods for Speech Parameterization  |h [electronic resource] /  |c by Todor Ganchev. 
250 |a 1st ed. 2011. 
264 1 |a New York, NY :  |b Springer New York :  |b Imprint: Springer,  |c 2011. 
300 |a X, 114 p. 32 illus., 23 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning,  |x 2191-7388 
505 0 |a Basic Concepts and Applicability of Speech Parameterization -- Survey on speech parameterization -- Fourier transform based methods -- Wavelet packets based methods -- Evaluation on the speech recognition task -- Evaluation on the speaker recognition task -- Practical considerations -- Links to code and further sources of information. 
520 |a Contemporary Methods for Speech Parameterization offers a general view of short-time cepstrum-based speech parameterization and provides a common ground for further in-depth studies on the subject. Specifically, it offers a comprehensive description, comparative analysis, and empirical performance evaluation of eleven contemporary speech parameterization methods, which compute short-time cepstrum-based speech features. Among these are five discrete wavelet packet transform (DWPT)-based, six discrete Fourier transform (DFT)-based speech features and some of their variants which have been used on the speech recognition, speaker recognition, and other related speech processing tasks. The main similarities and differences in their computation are discussed and empirical results from performance evaluation in common experimental conditions are presented. The recognition accuracy obtained on the monophone recognition, continuous speech recognition and speaker recognition tasks is contrasted against the one obtained for the well-known and widely used Mel Frequency Cepstral Coefficients (MFCC). It is shown that many of these methods lead to speech features that do offer competitive performance on a certain speech processing setup when compared to the venerable MFCC. The last does not target the promotion of certain speech features but instead aims to enhance the common understanding about the advantages and disadvantages of the various speech parameterization techniques available today and to provide the basis for selection of an appropriate speech parameterization in each particular case. 
650 0 |a Signal processing. 
650 0 |a Natural language processing (Computer science). 
650 0 |a User interfaces (Computer systems). 
650 0 |a Human-computer interaction. 
650 1 4 |a Signal, Speech and Image Processing . 
650 2 4 |a Natural Language Processing (NLP). 
650 2 4 |a User Interfaces and Human Computer Interaction. 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer Nature eBook 
776 0 8 |i Printed edition:  |z 9781441984463 
776 0 8 |i Printed edition:  |z 9781441984487 
830 0 |a SpringerBriefs in Speech Technology, Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning,  |x 2191-7388 
856 4 0 |u https://doi.uam.elogim.com/10.1007/978-1-4419-8447-0  |z Texto Completo 
912 |a ZDB-2-ENG 
912 |a ZDB-2-SXE 
950 |a Engineering (SpringerNature-11647) 
950 |a Engineering (R0) (SpringerNature-43712)