
Coreference : Annotation, Resolution and Evaluation in Polish.

The book presents work on coreference understanding, annotation and resolution for a Slavic language, which can be applied to natural language processing in computers and software using English and other languages. By presenting the steps of building a coreference-related component of the NLP toolset,...


Bibliographic Details
Classification: Electronic Book
Main Author: Savary, Agata
Other Authors: Ogrodniczuk, Maciej, Zawislawska, Magdalena, Glowinska, Katarzyna, Kopec, Mateusz
Format: Electronic eBook
Language: English
Published: Berlin/Boston : De Gruyter, 2015.
Online Access: Full text
Table of Contents:
  • Preface; Part I Introduction; 1 Reference, anaphora, coreference; 1.1 The concept of reference; 1.2 The typology of reference; 1.3 The real world versus the mental world; 1.4 Reference, coreference and anaphora; 1.5 Coreference and identity; 2 Polish coreference-related studies; 2.1 Terminology and characteristics of the problem; 2.1.1 Klemensiewicz; 2.1.2 Topolinska; 2.1.3 Pasek; 2.1.4 Fall; 2.2 Text coherence, cohesion and intra-document linking; 2.2.1 Pisarkowa; 2.2.2 Bellert; 2.2.3 Wajszczuk; 2.2.4 Marciszewski; 2.2.5 Stroinska, Szkudlarek and Trofimiec; 2.2.6 Pisarek.
  • 2.3 Reference and anaphora in text understanding; 2.3.1 Grzegorczykowa; 2.3.2 Gajda; 2.4 Text genres and language stylistics; 2.4.1 Szwedek and Duszak; 2.4.2 Honowska; 2.4.3 Fontanski; 2.4.4 Dobrzynska; 2.5 Anaphora and first-order logic; 2.5.1 Dunin-Keplicz; 2.5.2 Studnicki, Polanowska, Fall and Puczylowski; 2.6 Application of formal binding theories to Polish; 2.6.1 Reinders-Machowska; 2.6.2 Kupsc and Marciniak; 2.6.3 Trawinski; Part II Coreference Annotation; 3 Related work; 3.1 Coreference annotation abroad; 3.2 Polish anaphora and coreference annotation; 3.2.1 LUNA-related work.
  • 3.2.2 Pronominal anaphora annotation for information extraction; 3.2.3 Anaphora representation in the KPWr corpus; 4 Annotation models; 4.1 Annotation requirements; 4.1.1 Requirement 1: Linguistic compatibility; 4.1.2 Requirement 2: Offline mode; 4.1.3 Requirement 3: Standard, open, stand-off annotation format; 4.1.4 Requirement 4: Simple, user-centered design; 4.1.5 Requirement 5: Reliability; 4.1.6 Requirement 6: Extensibility, adaptability and open source availability; 4.2 Annotation tools review; 4.2.1 Available tools; 4.2.2 PALinkA; 4.2.3 MMAX2; 4.2.4 Conclusion; 5 Annotation guidelines.
  • 5.1 Types of coreference; 5.1.1 Identity of reference, identity of sense; 5.1.2 Identity, quasi-identity; 5.1.3 Dominant expression; 5.2 Scope of annotation and typology of coreferential constructions; 5.2.1 Anaphoric expressions in grammatical and lexical form; 5.2.2 The borders of the nominal phrase; 5.3 Particular annotation problems; 5.3.1 Embedded phrases; 5.3.2 Discontinuous phrases; 5.3.3 Annotation of zero subject; 5.3.4 Idioms; 5.3.5 Indirect speech, direct speech and free direct speech; 5.3.6 Gerunds; 5.3.7 Definiteness and indefiniteness; 5.3.8 Elective phrases.
  • 5.3.9 Extended nominal phrases; 5.3.10 Enumerations; 6 Annotation methodology; 6.1 Initial annotation experiment; 6.1.1 The SemEval-based format; 6.1.2 Annotation data; 6.1.3 Annotation procedure; 6.1.4 First visualisation attempts; 6.1.5 Findings from the process; 6.2 Series versus parallel annotation; 6.3 Annotation workflow; 6.3.1 Design decisions; 6.3.2 Corpus creation procedure; 6.3.3 Text selection; 6.3.4 Text preparation; 6.3.5 Text distribution and acquisition; 6.3.6 Manual text annotation; 6.3.7 Corpus publication process; 7 Annotation tools; 7.1 Tools used in the CORE project.