Loading…

Compiling and Annotating a Learner Corpus for a Morphologically Rich Language

Bibliographic Details
Call Number:Libro Electrónico
Main Author: Rosen, Alexandr
Other Authors: Hana, Jiří, Vidová Hladká, Barbora
Format: Electronic eBook
Language:Inglés
Published: Prague : Karolinum Press, 2020.
Subjects:
Online Access:Texto completo

MARC

LEADER 00000cam a2200000Mu 4500
001 EBSCO_on1231608795
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|---|||||
008 210116s2020 xr o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d N$T  |d OCLCO  |d EBLCP  |d OCLCF  |d OCLCO  |d OCLCQ 
019 |a 1232480437 
020 |a 9788024647654 
020 |a 8024647656 
020 |z 9788024647593 
029 1 |a AU@  |b 000068723447 
035 |a (OCoLC)1231608795  |z (OCoLC)1232480437 
050 4 |a P128.C68 
082 0 4 |a 410.188  |2 23 
049 |a UAMI 
100 1 |a Rosen, Alexandr. 
245 1 0 |a Compiling and Annotating a Learner Corpus for a Morphologically Rich Language 
260 |a Prague :  |b Karolinum Press,  |c 2020. 
300 |a 1 online resource (281 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover -- Contents -- List of abbreviations -- Introduction -- About this book -- Reasons to study non-native Czech -- Some properties of non-native Czech -- Morphology -- Syntax -- Word segmentation -- Learner corpus -- Roadmap -- Learner corpora -- Terminology -- Various types of learner corpora -- The choice of texts -- Annotation -- Textual annotation -- Linguistic annotation -- Error annotation -- correction -- Error annotation -- categorization -- Annotation scheme -- Data access -- Some learner corpora -- ASK -- CLC -- COPLE2 -- CroLTeC -- Falko -- ICLE -- MERLIN -- RLC -- SweLL 
505 8 |a Relationships of CzeSL with other learner corpora -- Introducing the CzeSL project -- Specifications of CzeSL -- Intended usage -- AKCES -- the umbrella project -- Procurement of texts -- Text collection -- Transcription -- Anonymization -- Metadata -- Error annotation -- Errors and learner language -- More than one way to annotate errors in CzeSL -- A wishlist for error annotation -- Interference and other types of explanation -- Interpretation in terms of TH -- Word order -- Style -- Communication goal -- The two-tier annotation scheme -- Annotation scheme as a compromise -- Why multiple tiers 
505 8 |a How many tiers -- Multiple tiers in a tabular format -- Content of the tiers -- A sample text with T1 vs. T2 corrections -- Links between tiers -- Error tags -- Morphosyntactic references -- Follow-up corrections -- Alternative target hypotheses -- Error tagset -- Based on linguistic categories -- Grammar-based vs. formal errors -- Extent of the annotated unit -- Grammar-based tags -- Errors at T1 -- Errors at T2 -- Coarse-grained -- An example of complex annotation -- Evaluation of the manual tiered error annotation -- Inter-annotator agreement (IAA) -- A pilot annotation 
505 8 |a IAA on all doubly-annotated texts -- Error tags depend on target hypothesis -- Possible causes of the annotators' disagreements -- Formal tags -- Automatic extension and modification of error annotation -- Automatic detection of formal errors on T1 -- Formal orthographic errors -- Formal errors sometimes influencing pronunciation -- Formal errors influencing pronunciation -- Other types of errors -- Automatic classification of word-boundary errors -- Implicit error annotation -- Multi-dimensional error annotation (MD) -- Focus on morphology -- All annotation applied to the source text 
505 8 |a Extent of the annotated unit -- Alternative error domains -- Source text, target hypothesis, annotated strings -- Domains and features -- Linguistic annotation -- Annotation with tools for Standard Czech -- Annotation of target hypothesis -- Annotation of T1 -- Annotation of source texts -- Annotation of interlanguage in UD -- Tokenization -- Part-of-speech and morphology -- Lemmata -- Syntactic Structure -- Evaluation -- Annotation process -- Overview of the annotation process -- Transcription and anonymization of manuscripts -- Tiered error annotation -- Manual error annotation 
500 |a Automatic annotation checking. 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
650 0 |a Corpora (Linguistics) 
650 0 |a Czech language. 
650 6 |a Corpus (Linguistique) 
650 6 |a Tchèque (Langue) 
650 7 |a Corpora (Linguistics)  |2 fast  |0 (OCoLC)fst01740921 
650 7 |a Czech language.  |2 fast  |0 (OCoLC)fst00886348 
700 1 |a Hana, Jiří. 
700 1 |a Vidová Hladká, Barbora. 
776 0 8 |i Print version:  |a Rosen, Alexandr.  |t Compiling and Annotating a Learner Corpus for a Morphologically Rich Language.  |d Prague : Karolinum Press, ©2020  |z 9788024647593 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2729804  |z Texto completo 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6456005 
938 |a EBSCOhost  |b EBSC  |n 2729804 
994 |a 92  |b IZTAP