Speech and Audio Processing for Coding, Enhancement and Recognition

This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research...

Bibliographic Details
Classification:	Electronic Book
Corporate Author:	SpringerLink (Online service)
Other Authors:	Ogunfunmi, Tokunbo (Editor), Togneri, Roberto (Editor), Narasimha, Madihally (Sim) (Editor)
Format:	Electronic eBook
Language:	English
Published:	New York, NY : Springer New York : Imprint: Springer, 2015.
Edition:	1st ed. 2015.
Subjects:	Signal processing. User interfaces (Computer systems). Human-computer interaction. Multimedia systems. Signal, Speech and Image Processing. User Interfaces and Human Computer Interaction. Multimedia Information Systems.
Online Access:	Full Text

MARC


LEADER	00000nam a22000005i 4500
001	978-1-4939-1456-2
003	DE-He213
005	20220113021320.0
007	cr nn 008mamaa
008	141014s2015 xxu\| s \|\|\|\| 0\|eng d
020			\|a 9781493914562 \|9 978-1-4939-1456-2
024	7		\|a 10.1007/978-1-4939-1456-2 \|2 doi
050		4	\|a TK5102.9
072		7	\|a TJF \|2 bicssc
072		7	\|a UYS \|2 bicssc
072		7	\|a TEC008000 \|2 bisacsh
072		7	\|a TJF \|2 thema
072		7	\|a UYS \|2 thema
082	0	4	\|a 621.382 \|2 23
245	1	0	\|a Speech and Audio Processing for Coding, Enhancement and Recognition \|h [electronic resource] / \|c edited by Tokunbo Ogunfunmi, Roberto Togneri, Madihally (Sim) Narasimha.
250			\|a 1st ed. 2015.
264		1	\|a New York, NY : \|b Springer New York : \|b Imprint: Springer, \|c 2015.
300			\|a X, 345 p. 79 illus., 32 illus. in color. \|b online resource.
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
347			\|a text file \|b PDF \|2 rda
505	0		\|a From 'Harmonic Telegraph' to Cellular Phones -- Challenges in Speech Coding Research -- Recent Speech Coding Technologies and Standards -- Ensemble Learning Approaches in Speech Recognition -- Dynamic and Deep Networks For Speech Modeling and Recognition -- Speech Based Emotion Recognition -- Speaker Diarization: Challenges and Emerging Research -- Maximum a posteriori spectral estimation with source log-spectral priors for multichannel speech enhancement -- Modulation Processing for Speech Enhancement.
520			\|a This book describes the basic principles underlying the generation, coding, transmission and enhancement of speech and audio signals, including advanced statistical and machine learning techniques for speech and speaker recognition with an overview of the key innovations in these areas. Key research undertaken in speech coding, speech enhancement, speech recognition, emotion recognition and speaker diarization is also presented, along with recent advances and new paradigms in these areas. · Offers readers a single-source reference on the significant applications of speech and audio processing to speech coding, speech enhancement and speech/speaker recognition. Enables readers involved in algorithm development and implementation issues for speech coding to understand the historical development and future challenges in speech coding research; · Discusses speech coding methods yielding bit-streams that are multi-rate and scalable for Voice-over-IP (VoIP) Networks; · Presents an overview of recent developments in conversational speech coding technologies, important new algorithmic advances, and recent standardization activities in ITU-T, 3GPP, 3GPP2, MPEG and IETF that offer a significantly improved user experience during voice calls on existing and future communication systems; · Presents an overview of ensemble learning efforts based on different machine learning techniques that have emerged in automatic speech recognition in recent years; · Emphasizes signal processing for efficient time-domain and spectral-domain representations, reduction of noise, channel and session variabilities, extraction of temporal and spectral features for recognition and modeling; · Informs readers of the latest research and developments in advanced statistical estimation and deep neural networks for speech recognition; · Presents readers with the architectural framework and key approaches involved in the "hot" research areas of emotion recognition and speaker diarization systems; · Provides readers with a more enriching view of state-of-the-art research in speech enhancement arising from novel multi-microphone and time-frequency solutions.
650		0	\|a Signal processing.
650		0	\|a User interfaces (Computer systems).
650		0	\|a Human-computer interaction.
650		0	\|a Multimedia systems.
650	1	4	\|a Signal, Speech and Image Processing.
650	2	4	\|a User Interfaces and Human Computer Interaction.
650	2	4	\|a Multimedia Information Systems.
700	1		\|a Ogunfunmi, Tokunbo. \|e editor. \|4 edt \|4 http://id.loc.gov/vocabulary/relators/edt
700	1		\|a Togneri, Roberto. \|e editor. \|4 edt \|4 http://id.loc.gov/vocabulary/relators/edt
700	1		\|a Narasimha, Madihally (Sim). \|e editor. \|4 edt \|4 http://id.loc.gov/vocabulary/relators/edt
710	2		\|a SpringerLink (Online service)
773	0		\|t Springer Nature eBook
776	0	8	\|i Printed edition: \|z 9781493914555
776	0	8	\|i Printed edition: \|z 9781493914579
776	0	8	\|i Printed edition: \|z 9781493948048
856	4	0	\|u https://doi.uam.elogim.com/10.1007/978-1-4939-1456-2 \|z Full Text
912			\|a ZDB-2-ENG
912			\|a ZDB-2-SXE
950			\|a Engineering (SpringerNature-11647)
950			\|a Engineering (R0) (SpringerNature-43712)
