Building and Using Comparable Corpora

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native...

Bibliographic Details
Classification: Electronic Book
Corporate Author: SpringerLink (Online service)
Other Authors: Sharoff, Serge (Editor), Rapp, Reinhard (Editor), Zweigenbaum, Pierre (Editor), Fung, Pascale (Editor)
Format: Electronic eBook
Language: English
Published: Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2013.
Edition: 1st ed. 2013.
Subjects:
Online Access: Full Text

MARC

LEADER 00000nam a22000005i 4500
001 978-3-642-20128-8
003 DE-He213
005 20220118111523.0
007 cr nn 008mamaa
008 131213s2013 gw | s |||| 0|eng d
020 |a 9783642201288  |9 978-3-642-20128-8 
024 7 |a 10.1007/978-3-642-20128-8  |2 doi 
050 4 |a QA76.9.N38 
072 7 |a UYQL  |2 bicssc 
072 7 |a COM073000  |2 bisacsh 
072 7 |a UYQL  |2 thema 
082 0 4 |a 006.35  |2 23 
245 1 0 |a Building and Using Comparable Corpora  |h [electronic resource] /  |c edited by Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung. 
250 |a 1st ed. 2013. 
264 1 |a Berlin, Heidelberg :  |b Springer Berlin Heidelberg :  |b Imprint: Springer,  |c 2013. 
300 |a XII, 335 p. 70 illus., 14 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
505 0 |a Preface - Building and Using Comparable Corpora. S.Sharoff, R.Rapp, P.Zweigenbaum -- Overviewing Important Aspects of the Last 20 Years of Research in Comparable Corpora -- S.Sharoff, R.Rapp, P.Zweigenbaum -- Part I: Compiling and Measuring Comparable Corpora -- Multilingual Corpus Collection. S.Shi, P.Fung -- Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making. A.Gurrutxaga, I.Leturia, I.San Vicente, X.Saralegi -- Statistical Comparability: Methodological Caveats. R.Köhler -- Methods for Collection and Evaluation of Comparable Documents. M.Lestari Paramita, D.Guthrie, E.Kanoulas, R.Gaizauskas, P.Clough and M.Sanderson -- Measuring the Distance between Comparable Corpora between Languages. S.Sharoff -- Exploiting Comparable Corpora for Lexicon Extraction: Measuring and Improving Corpus Quality. B.Li, E.Gaussier -- Statistical Corpus and Language Comparison on Comparable Corpora. T.Eckart, U.Quasthoff -- Comparable Multilingual Patents as Large-scale Parallel Corpora. B.Lu and B.Tsou -- Part II: Using Comparable Corpora -- Extracting Parallel Phrases from Comparable Data. S.Hewavitharana, S.Vogel -- Exploiting Comparable Corpora. D.S.Munteanu, D.Marcu -- Paraphrase Detection in Comparable Monolingual Corpora. L.Deleger, B.Cartoni, P.Zweigenbaum -- Information Network Construction and Alignment from Automatically Acquired Comparable Corpora. H.Ji, W.-P.Lin -- Bilingual Terminology Mining from Comparable Corpora. B.Daille, E.Morin -- The Place of Comparable Corpora in Providing Terminological Reference Information to Online Translators: A Strategic Framework. K.Kageura, T.Abekawa -- Old Needs, New Solutions: Comparable Corpora for Language Professionals. S.Bernardini, A.Ferraresi -- Exploiting the Incomparability of Comparable Corpora for Contrastive Linguistics and Translation Studies. S.Neumann, S.Hansen-Schirra. 
520 |a The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation. 
650 0 |a Natural language processing (Computer science). 
650 0 |a Computational linguistics. 
650 0 |a Application software. 
650 1 4 |a Natural Language Processing (NLP). 
650 2 4 |a Computational Linguistics. 
650 2 4 |a Computer and Information Systems Applications. 
700 1 |a Sharoff, Serge.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Rapp, Reinhard.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Zweigenbaum, Pierre.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Fung, Pascale.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer Nature eBook 
776 0 8 |i Printed edition:  |z 9783642201295 
776 0 8 |i Printed edition:  |z 9783642201271 
776 0 8 |i Printed edition:  |z 9783662520062 
856 4 0 |u https://doi.uam.elogim.com/10.1007/978-3-642-20128-8  |z Full Text 
912 |a ZDB-2-SCS 
912 |a ZDB-2-SXCS 
950 |a Computer Science (SpringerNature-11645) 
950 |a Computer Science (R0) (SpringerNature-43710)