This book explains the principal techniques of data mining: for classification, generation of association rules and clustering. It is written for readers without a strong background in mathematics or statistics and focuses on detailed examples and explanations of the algorithms given. This will benefit readers of all levels, from those who use data mining via commercial packages, right through to academic researchers. The book aims to help the general reader develop the necessary understanding to use commercial data mining packages, and to enable advanced readers to understand or contribute to future technical advances. Includes exercises and glossary.
Les mer
This book explores the principal techniques of data mining: for classification, generation of association rules and clustering. It is written for readers without a strong background in mathematics or statistics and focuses on detailed examples and explanations of the algorithms given.
Les mer
Introduction to Data Mining.- Data for Data Mining.- Introduction to Classification: Naive Bayes and Nearest Neighbour.- Using Decision Trees for Classification.- Decision Tree Induction: Using Entropy for Attribute Selection.- Decision Tree Induction: Using Frequency Tables for Attribute Selection.- Estimating the Predictive Accuracy of a Classifier.- Continuous Attributes.- Avoiding Overfitting of Decision Trees.- More about Entropy.- Inducing Modular Rules for Classification.- Measuring the Performance of a Classifier.- Association Rule Mining I.- Association Rule Mining II.- Clustering.- Text Mining.- References.- Appendix A: Essential Mathematics.- Appendix B: Datasets.- Appendix C: Sources of Further Information.- Appendix D: Glossary and Notation.- Appendix E: Solutions to Self-assessment Exercises.- Index.
Les mer

Produktdetaljer

ISBN
9781846287657
Publisert
2007-04-26
Utgiver
Vendor
Springer London Ltd
Vekt
1250 gr
Høyde
244 mm
Bredde
170 mm
Dybde
18 mm
Aldersnivå
P, 06
Språk
Product language
Engelsk
Format
Product format
Heftet
Antall sider
354

Forfatter