Buch, Englisch, 240 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1200 g
Clustering, Classification, and Retrieval
Buch, Englisch, 240 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1200 g
ISBN: 978-1-84800-045-2
Verlag: Springer
The development of techniques for mining unstructured, semi-structured, and fully structured textual data has become critical in both academia and industry. This Second Edition brings readers thoroughly up to date with the emerging field of text mining, the application of techniques of machine learning in conjunction with natural language processing, information extraction, and algebraic/mathematical approaches to computational information retrieval. The book explores a broad range of issues, ranging from the development of new learning approaches to the parallelization of existing algorithms. In addition, it describes new application problems in areas such as email surveillance and anomaly detection. Presenting a comprehensive selection of topics within the field, this book is an indispensable resource for researchers, practitioners, and professionals involved in information retrieval, computational statistics, and datamining.
Zielgruppe
Research
Autoren/Hrsg.
Fachgebiete
- Mathematik | Informatik Mathematik Numerik und Wissenschaftliches Rechnen Angewandte Mathematik, Mathematische Modelle
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Datenkompression, Dokumentaustauschformate
- Mathematik | Informatik EDV | Informatik Programmierung | Softwareentwicklung Grafikprogrammierung
- Mathematik | Informatik EDV | Informatik Technische Informatik Systemverwaltung & Management
Weitere Infos & Material
Clustering.- Cluster-Preserving Dimension Reduction Methods for Document Classification.- Automatic Discovery of SimilarWords.- Principal Direction Divisive Partitioning with Kernels and k-Means Steering.- Hybrid Clustering with Divergences.- Text Clustering with Local Semantic Kernels.- Document Retrieval and Representation.- Vector Space Models for Search and Cluster Mining.- Applications of Semidefinite Programming in XML Document Classification.- Email Surveillance and Filtering.- Discussion Tracking in Enron Email Using PARAFAC.- Spam Filtering Based on Latent Semantic Indexing.- Anomaly Detection.- A Probabilistic Model for Fast and Confident Categorization of Textual Documents.- Anomaly Detection Using Nonnegative Matrix Factorization.- Document Representation and Quality of Text: An Analysis.