Synthesis lectures on data mining and knowledge discovery,
Volume designation
#15
Series ISSN
2151-0075 ;
Notes on bibliographies, glossaries, and indexes within the work
Note text
Includes bibliographical references (pages 167-181).
Contents notes
Note text
Introduction -- Background -- Literature review -- Entity recognition and typing with knowledge bases -- Fine-grained entity typing with knowledge bases -- Synonym discovery from large corpus / Meng Qu -- Joint extraction of typed entities and relationships -- Pattern-enhanced embedding learning for relation extraction / Meng Qu -- Heterogeneous supervision for relation extraction / Liyuan Liu -- Indirect supervision: leveraging knowledge from auxiliary tasks / Zeqiu Wu -- Mining entity attribute values with meta patterns / Meng Jiang -- Open information extraction with global structure cohesiveness / Qi Zhu -- Applications -- Conclusion -- Vision and future work.
Untitled
0
Summary or abstract notes
Note text
Real-world data, though massive, is largely unstructured and takes the form of natural-language text. It is challenging but highly desirable to mine structures from massive text data without extensive human annotation and labeling. In this book, we investigate the principles and methodologies of mining structures of factual knowledge (e.g., entities and their relationships) from massive, unstructured text corpora. Departing from many existing structure extraction methods that rely heavily on human-annotated data for model training, our effort-light approach leverages human-curated facts stored in external knowledge bases as distant supervision and exploits rich data redundancy in large text corpora for context understanding. This effort-light mining approach leads to a series of new principles and powerful methodologies for structuring text corpora, including: (1) entity recognition, typing, and synonym discovery; (2) entity relation extraction; and (3) open-domain attribute-value mining and information extraction. This book introduces this new research frontier and points out some promising research directions.
Other edition of the work in another medium
Title
MINING STRUCTURES OF FACTUAL KNOWLEDGE FROM TEXT.
International Standard Book and Music Number
1681733943
Subject (topical name or general noun phrase)
Uncontrolled subject term
Data mining.
Uncontrolled subject term
Data structures (Computer science)
Uncontrolled subject term
Electronic information resource searching.
Uncontrolled subject term
Natural language processing (Computer science)
Uncontrolled subject term
COMPUTERS-- General.
Subject category
Uncontrolled subject term
COM-- 000000
Dewey classification
Number
006.312
Edition
23
Library of Congress classification
Class number
QA76.9.D343
Book number
R455 2018
Personal name as main entry (primary intellectual responsibility)