A resource-light approach to morpho-syntactic tagging /
General Material Designation
[Book]
First Statement of Responsibility
Anna Feldman and Jirka Hana.
.PUBLICATION, DISTRIBUTION, ETC
Place of Publication, Distribution, etc.
New York, NY :
Name of Publisher, Distributor, etc.
Rodopi,
Date of Publication, Distribution, etc.
2010.
PHYSICAL DESCRIPTION
Specific Material Designation and Extent of Item
1 online resource (xiv, 185 pages) :
Other Physical Details
illustrations.
SERIES
Series Title
Language and computers : studies in practical linguistics ;
Volume Designation
no. 70
INTERNAL BIBLIOGRAPHIES/INDEXES NOTE
Text of Note
Includes bibliographical references (pages 133-148) and index.
CONTENTS NOTE
Text of Note
Preliminary Material -- Introduction -- Common tagging techniques -- Previous resource-light approaches to NLP -- Languages, corpora and tagsets -- Quantifying language properties -- Resource-light morphological analysis -- Cross-language morphological tagging -- Summary and further work -- Bibliography -- Tagsets we use -- Corpora -- Language properties -- Citation Index.
00
SUMMARY OR ABSTRACT
Text of Note
While supervised corpus-based methods are highly accurate for different NLP tasks, including morphological tagging, they are difficult to port to other languages because they require resources that are expensive to create. As a result, many languages have no realistic prospect for morpho-syntactic annotation in the foreseeable future. The method presented in this book aims to overcome this problem by significantly limiting the necessary data and instead extrapolating the relevant information from another, related language. The approach has been tested on Catalan, Portuguese, and Russian. Although these languages are only relatively resource-poor, the same method can be in principle applied to any inflected language, as long as there is an annotated corpus of a related language available. Time needed for adjusting the system to a new language constitutes a fraction of the time needed for systems with extensive, manually created resources: days instead of years. This book touches upon a number of topics: typology, morphology, corpus linguistics, contrastive linguistics, linguistic annotation, computational linguistics and Natural Language Processing (NLP). Researchers and students who are interested in these scientific areas as well as in cross-lingual studies and applications will greatly benefit from this work. Scholars and practitioners in computer science and linguistics are the prospective readers of this book.
OTHER EDITION IN ANOTHER MEDIUM
Title
Resource-light approach to morpho-syntactic tagging.
International Standard Book Number
9789042027688
TOPICAL NAME USED AS SUBJECT
Computational linguistics.
Cross-language information retrieval.
Grammar, Comparative and general-- Morphosyntax.
Language transfer (Language learning)
Computational linguistics.
Cross-language information retrieval.
Grammar, Comparative and general-- Morphosyntax.
LANGUAGE ARTS & DISCIPLINES-- Grammar & Punctuation.
LANGUAGE ARTS & DISCIPLINES-- Linguistics-- Syntax.