Cargando…

Compiling and Annotating a Learner Corpus for a Morphologically Rich Language

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Rosen, Alexandr
Otros Autores: Hana, Jiří, Vidová Hladká, Barbora
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Prague : Karolinum Press, 2020.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000Mu 4500
001 EBOOKCENTRAL_on1231608795
003 OCoLC
005 20240329122006.0
006 m o d
007 cr |n|---|||||
008 210116s2020 xr o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d N$T  |d OCLCO  |d EBLCP  |d OCLCF  |d OCLCO  |d OCLCQ  |d OCLCO  |d OCLCL 
019 |a 1232480437 
020 |a 9788024647654 
020 |a 8024647656 
020 |z 9788024647593 
029 1 |a AU@  |b 000068723447 
035 |a (OCoLC)1231608795  |z (OCoLC)1232480437 
050 4 |a P128.C68 
082 0 4 |a 410.188  |2 23 
049 |a UAMI 
100 1 |a Rosen, Alexandr. 
245 1 0 |a Compiling and Annotating a Learner Corpus for a Morphologically Rich Language 
260 |a Prague :  |b Karolinum Press,  |c 2020. 
300 |a 1 online resource (281 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover -- Contents -- List of abbreviations -- Introduction -- About this book -- Reasons to study non-native Czech -- Some properties of non-native Czech -- Morphology -- Syntax -- Word segmentation -- Learner corpus -- Roadmap -- Learner corpora -- Terminology -- Various types of learner corpora -- The choice of texts -- Annotation -- Textual annotation -- Linguistic annotation -- Error annotation -- correction -- Error annotation -- categorization -- Annotation scheme -- Data access -- Some learner corpora -- ASK -- CLC -- COPLE2 -- CroLTeC -- Falko -- ICLE -- MERLIN -- RLC -- SweLL 
505 8 |a Relationships of CzeSL with other learner corpora -- Introducing the CzeSL project -- Specifications of CzeSL -- Intended usage -- AKCES -- the umbrella project -- Procurement of texts -- Text collection -- Transcription -- Anonymization -- Metadata -- Error annotation -- Errors and learner language -- More than one way to annotate errors in CzeSL -- A wishlist for error annotation -- Interference and other types of explanation -- Interpretation in terms of TH -- Word order -- Style -- Communication goal -- The two-tier annotation scheme -- Annotation scheme as a compromise -- Why multiple tiers 
505 8 |a How many tiers -- Multiple tiers in a tabular format -- Content of the tiers -- A sample text with T1 vs. T2 corrections -- Links between tiers -- Error tags -- Morphosyntactic references -- Follow-up corrections -- Alternative target hypotheses -- Error tagset -- Based on linguistic categories -- Grammar-based vs. formal errors -- Extent of the annotated unit -- Grammar-based tags -- Errors at T1 -- Errors at T2 -- Coarse-grained -- An example of complex annotation -- Evaluation of the manual tiered error annotation -- Inter-annotator agreement (IAA) -- A pilot annotation 
505 8 |a IAA on all doubly-annotated texts -- Error tags depend on target hypothesis -- Possible causes of the annotators' disagreements -- Formal tags -- Automatic extension and modification of error annotation -- Automatic detection of formal errors on T1 -- Formal orthographic errors -- Formal errors sometimes influencing pronunciation -- Formal errors influencing pronunciation -- Other types of errors -- Automatic classification of word-boundary errors -- Implicit error annotation -- Multi-dimensional error annotation (MD) -- Focus on morphology -- All annotation applied to the source text 
505 8 |a Extent of the annotated unit -- Alternative error domains -- Source text, target hypothesis, annotated strings -- Domains and features -- Linguistic annotation -- Annotation with tools for Standard Czech -- Annotation of target hypothesis -- Annotation of T1 -- Annotation of source texts -- Annotation of interlanguage in UD -- Tokenization -- Part-of-speech and morphology -- Lemmata -- Syntactic Structure -- Evaluation -- Annotation process -- Overview of the annotation process -- Transcription and anonymization of manuscripts -- Tiered error annotation -- Manual error annotation 
500 |a Automatic annotation checking. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
650 0 |a Corpora (Linguistics) 
650 0 |a Czech language. 
650 6 |a Corpus (Linguistique) 
650 6 |a Tchèque (Langue) 
650 7 |a Corpora (Linguistics)  |2 fast 
650 7 |a Czech language  |2 fast 
700 1 |a Hana, Jiří. 
700 1 |a Vidová Hladká, Barbora. 
758 |i has work:  |a Compiling and annotating a learner corpus for a morphologically rich language (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCGw8VTG4p34KcrtcjvyQVP  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |a Rosen, Alexandr.  |t Compiling and Annotating a Learner Corpus for a Morphologically Rich Language.  |d Prague : Karolinum Press, ©2020  |z 9788024647593 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=6456005  |z Texto completo 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6456005 
938 |a EBSCOhost  |b EBSC  |n 2729804 
994 |a 92  |b IZTAP