Buch, Englisch, Band 608, 416 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1780 g
Reihe: The Springer International Series in Engineering and Computer Science
Buch, Englisch, Band 608, 416 Seiten, Format (B × H): 160 mm x 241 mm, Gewicht: 1780 g
Reihe: The Springer International Series in Engineering and Computer Science
ISBN: 978-0-7923-7209-7
Verlag: Springer US
One of the major means of instance selection is sampling whereby a sample is selected for testing and analysis, and randomness is a key element in the process. Instance selection also covers methods that require search. Examples can be found in density estimation (finding the representative instances - data points - for a cluster); boundary hunting (finding the critical instances to form boundaries to differentiate data points of different classes); and data squashing (producing weighted new data with equivalent sufficient statistics). Other important issues related to instance selection extend to unwanted precision, focusing, concept drifts, noise/outlier removal, data smoothing, etc.
brings researchers and practitioners together to report new developments and applications, to share hard-learned experiences in order to avoid similar pitfalls, and to shed light on the future development of instance selection. This volume serves as a comprehensive reference for graduate students, practitioners and researchers in KDD.
Zielgruppe
Research
Autoren/Hrsg.
Fachgebiete
- Technische Wissenschaften Elektronik | Nachrichtentechnik Elektronik Robotik
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Informationstheorie, Kodierungstheorie
- Mathematik | Informatik EDV | Informatik Informatik Künstliche Intelligenz Wissensbasierte Systeme, Expertensysteme
- Interdisziplinäres Wissenschaften Wissenschaften: Forschung und Information Informationstheorie, Kodierungstheorie
- Interdisziplinäres Wissenschaften Wissenschaften: Forschung und Information Datenanalyse, Datenverarbeitung
- Mathematik | Informatik Mathematik Stochastik Mathematische Statistik
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Zeichen- und Zahlendarstellungen
- Mathematik | Informatik EDV | Informatik Daten / Datenbanken Datenkompression, Dokumentaustauschformate
Weitere Infos & Material
1 Data Reduction via Instance Selection.- 2 Sampling: Knowing Whole from Its Part.- 3 A Unifying View on Instance Selection.- 4 Competence Guided Instance Selection for Case-Based Reasoning.- 5 Identifying Competence-Critical Instances for Instance-Based Learners.- 6 Genetic-Algorithm-Based Instance and Feature Selection.- 7 The Landmark Model: An Instance Selection Method for Time Series Data.- 8 Adaptive Sampling Methods for Scaling Up Knowledge Discovery Algorithms.- 9 Progressive Sampling.- 10 Sampling Strategy for Building Decision Trees from Very Large Databases Comprising Many Continuous Attributes.- 11 Incremental Classification Using Tree-Based Sampling for Large Data.- 12 Instance Construction via Likelihood-Based Data Squashing.- 13 Learning via Prototype Generation and Filtering.- 14 Instance Selection Based on Hypertuples.- 15 KBIS: Using Domain Knowledge to Guide Instance Selection.- 16 Instance Sampling for Boosted and Standalone Nearest Neighbor Classifiers.- 17 Prototype Selection Using Boosted Nearest-Neighbors.- 18 DAGGER: Instance Selection for Combining Multiple Models Learnt from Disjoint Subsets.- 19 Using Genetic Algorithms for Training Data Selection in RBF Networks.- 20 An Active Learning Formulation for Instance Selection with Applications to Object Detection.- 21 Filtering Noisy Instances and Outliers.- 22 Instance Selection Based on Support Vector Machine.- Appendix: Meningoencepalitis Data Set.