Cargando…

Spoken Corpora and Linguistic Studies /

This article aims to show how a corpus driven theory that analyses speech through information units can better account for Discourse Markers (DM) identification and analysis. We propose that the speech flow can only be properly analyzed if segmented into utterances and tone units through prosodic pa...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Otros Autores: Raso, Tommaso, Mello, Heliana
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Amsterdam ; Philadelphia : John Benjamins Publishing Company, 2014.
Colección:Studies in corpus linguistics ; v. 61.
Temas:
Acceso en línea:Texto completo
Tabla de Contenidos:
  • Spoken Corpora and Linguistic Studies; Editorial page; Title page; LCC data; Table of contents; Acknowledgements; Introduction. Spoken corpora and linguistic studies: Problems and perspectives; 1. Why this book ; 2. Some important issues ; 2.1 What do we intend when we consider speech as the natural modality of language? ; 2.2 Prosody ; 2.3 Illocution, attitude, emotion ; 2.4 Information structure ; 3. The content of the book ; 3.1 Experiences and requirements of spoken corpora compilation ; 3.2 Multilevel corpus annotation ; 3.3 Prosody and its functional levels.
  • 3.4 Syntax and information structure References ; Section I. Experiences and requirements of spoken corpora compilation; 1. Methodological issues for spontaneous speech corpora compilation; 1. Introduction ; 2. Speech corpora: State of the art and spontaneous speech ; 2.1 Speech corpora types ; 2.2 Spontaneous speech corpora ; 2.3 Spoken corpora and linguistic diasystem ; 3. Architecture of spontaneous speech corpora and the importance of diaphasy ; 3.1 The architecture ; 3.2 Spontaneous speech corpus branching ; 3.3 Interactional typologies ; 3.4 Diaphasic variation ; 3.5 Metadata.
  • 4. Data collection and treatment 4.1 Recordings ; 4.2 Speech representation and transcription ; 5. Usability ; 5.1 Alignment ; 5.2 Informational annotation ; 6. Conclusion ; References ; 2. A multilingual speech corpus of North-Germanic languages; 1. Introduction ; 2. Challenges in the corpus design and development ; 2.1 Methodology for collecting speech ; 2.2 Transcription and tagging ; 2.3 Metadata ; 2.4 Multilingual search ; 2.5 Links to audio and video ; 2.6 Results presented on maps ; 3. Results from research on the Nordic Dialect Corpus ; 4. Conclusion ; Acknowledgements ; References.
  • Links 3. Methodological considerations for the development and use of sign language acquisition corpora; 1. Introduction ; 2. Metadata ; 3. Designing annotation patterns
  • ; 4. Sign IDs ; 5. Conclusion ; Acknowledgements ; References ; Section II. Multilevel corpus annotation; 4. The grammatical annotation of speech corpora: Techniques and perspectives; 1. Introduction ; 2. The corpora ; 3. Constraint Grammar ; 4. Parser architecture ; 5. CG adaptations for orality features in speech-like corpora ; 6. Cross-corpus parser evaluation ; 7. Comparing orality markers ; 7.1 General comparison.
  • 7.2 Pronouns 7.3 Emoticons ; 7.4 Non-standard syntax in TV news jargon: Examples and solutions ; 8. CG-Annotation of linguistically transcribed ordinary speech ; 9. C-ORAL-Brasil methodology
  • speech-specific adaptations ; 9.1 Text flow normalization ; 9.2 Tokenization ; 9.3 Lexical and orthographic normalization ; 9.4 Syntactic segmentation ; 10. Evaluating the PALAVRAS speech tagger ; 11. Conclusions and outlook ; References ; 5. The IPIC resource and a cross-linguistic analysis of information structure; 1. Introduction ; 2. Theoretical background.