Digital speech corpora -- Language corpora: Indian scenario -- Issues in corpus generation -- Process of corpus -- Corpus sanitation and pre-editing -- Statistical studies on corpus -- Corpus text processing -- Corpus as primary resource for ELT -- Corpus as secondary resource for ELT -- Corpus and lexicography -- Corpus and dialect study -- Corpus and word sense disambiguation -- Corpus and language technology -- Corpus and other branches of linguistics -- Corpora: future Indian needs.
Untitled
0
Summary or abstract notes
Note text
This book discusses some of the basic issues relating to corpus generation and the methods normally used to generate a corpus. Since corpus-related research goes beyond corpus generation, the book also addresses other major topics connected with the use and application of language corpora, namely, corpus readiness in the context of corpus sanitation and pre-editing of corpus texts; the application of statistical methods; and various text processing techniques. Importantly, it explores how corpora can be used as a primary or secondary resource in English language teaching, in creating dictionaries, in word sense disambiguation, in various language technologies, and in other branches of linguistics. Lastly, the book sheds light on the status quo of corpus generation in Indian languages and identifies current and future needs. Discussing various technical issues in the field in a lucid manner, providing extensive new diagrams and charts for easy comprehension, and using simplified English, the book is an ideal resource for non-native English readers. Written by academics with many years of experience teaching and researching corpus linguistics, its focus on Indian languages and on English corpora makes it applicable to graduate and postgraduate students of applied linguistics, computational linguistics and language processing in South Asia and across countries where English is spoken as a first or second language.--
Acquisition notes
Source of acquisition / subscription address
Springer Nature
Stock number
com.springer.onix.9789811318016
Other edition of the work in another medium
Title
Utility and application of language corpora.
International Standard Book and Music Number
9789811318009
Subject (topical name or common noun phrase)
Uncontrolled subject term
Corpora (Linguistics)
Uncontrolled subject term
Linguistics.
Uncontrolled subject term
Translators (Computer programs)
Uncontrolled subject term
Computers-- Natural Language Processing.
Uncontrolled subject term
Corpora (Linguistics)
Uncontrolled subject term
Education-- Educational Psychology.
Uncontrolled subject term
Language Arts & Disciplines-- Linguistics-- General.
Uncontrolled subject term
Linguistics.
Uncontrolled subject term
Linguistics.
Uncontrolled subject term
Natural language & machine translation.
Uncontrolled subject term
Teaching skills & techniques.
Uncontrolled subject term
Translators (Computer programs)
Subject category
Uncontrolled subject term
CF
Uncontrolled subject term
CF
Uncontrolled subject term
LAN009000
Dewey classification
Number
410.188
Library of Congress classification
Class number
P128.C68
Personal name as main entry - (primary intellectual responsibility)