Michael W. Berry, Survey of Text Mining I: Clustering, Classification, and Retrieval
ISBN: 0387955631 | edition 2003 | PDF | 262 pages | 5 mb
Extracting content from text continues to be an important research problem for information processing and management. Approaches to capture the semantics of text-based document collections may be based on Bayesian models, probability theory, vector space models, statistical models, or even graph theory. As the volume of digitized textual media continues to grow, so does the need for designing robust, scalable indexing and search strategies (software) to meet a variety of user needs.