1 Textual Statistics: Scope and Applications -- 2 The Units of Textual Statistics -- 3 Correspondence Analysis of Lexical Tables -- 4 Cluster Analysis of Words and Texts -- 5 Visualization of Textual Data -- 6 Characteristic Textual Units, Modal Responses and Modal Texts -- 7 Longitudinal Partitions, Textual Time Series -- 8 Textual Discriminant Analysis -- Appendix 1: Singular value decomposition and correspondence analysis -- Appendix 2: Clustering techniques -- Appendix 3: More details about the nonparametric estimation model -- Appendix 4: Search for repeated segments in a corpus -- References -- Author Index -- Symbols.
0
Researchers in a number of disciplines deal with large text sets requiring both text management and text analysis. Faced with a large amount of textual data collected in marketing surveys, literary investigations, historical archives and documentary data bases, these researchers require assistance with organizing, describing and comparing texts. Exploring Textual Data demonstrates how exploratory multivariate statistical methods such as correspondence analysis and cluster analysis can be used to help investigate, assimilate and evaluate textual data. The main text does not contain any strictly mathematical demonstrations, making it accessible to a large audience. This book is very user-friendly with proofs abstracted in the appendices. Full definitions of concepts, implementations of procedures and rules for reading and interpreting results are fully explored. A succession of examples is intended to allow the reader to appreciate the variety of actual and potential applications and the complementary processing methods. A glossary of terms is provided.