Mining Very Large Databases with Parallel Processing addresses the problem of large-scale data mining. It is an interdisciplinary text, describing advances in the integration of three computer science areas, namely `intelligent' (machine learning-based) data mining techniques, relational databases and parallel processing. The basic idea is to use concepts and techniques of the latter two areas - particularly parallel processing - to speed up and scale up data mining algorithms.
The book is divided into three parts. The first part presents a comprehensive review of intelligent data mining techniques such as rule induction, instance-based learning, neural networks and genetic algorithms. Likewise, the second part presents a comprehensive review of parallel processing and parallel databases. Each of these parts includes an overview of commercially-available, state-of-the-art tools. The third part deals with the application of parallel processing to data mining. The emphasis is on finding generic, cost-effective solutions for realistic data volumes. Two parallel computational environments are discussed, the first excluding the use of commercial-strength DBMS, and the second using parallel DBMS servers.
It is assumed that the reader has a knowledge roughly equivalent to a first degree (BSc) in accurate sciences, so that (s)he is reasonably familiar with basic concepts of statistics and computer science.
The primary audience for Mining Very Large Databases with Parallel Processing is industry data miners and practitioners in general, who would like to apply intelligent data mining techniques to large amounts of data. The book will also be of interest to academic researchers and postgraduate students, particularly database researchers, interested in advanced, intelligent database applications, and artificial intelligence researchers interested in industrial, real-world applications of machine learning.
Les mer
<em>Mining Very Large Databases with Parallel Processing</em> addresses the problem of large-scale data mining.
The Motivation for Data Mining and Knowledge Discovery.- The Inter-disciplinary Nature of Knowledge Discovery in Databases (KDD).- The Challenge of Efficient Knowledge Discovery in Large Databases and Data Warehouses.- Organization of the Book.- I Knowledge Discovery and Data Mining.- 1 Knowledge Discovery Tasks.- 2 Knowledge Discovery Paradigms.- 3 The Knowledge Discovery Process.- 4 Data Mining.- 5 Data Mining Tools.- II Parallel Database Systems.- 6 Basic Concepts on Parallel Processing.- 7 Data Parallelism, Control Parallelism and Related Issues.- 8 Parallel Database Servers.- III Parallel Data Mining.- 9 Approaches to Speed up Data Mining.- 10 Parallel Data Mining without Dbms Facilities.- 11 Parallel Data Mining with Database Facilities.- 12 Summary and Some Open Problems.- References.
Les mer
Springer Book Archives
GPSR Compliance The European Union's (EU) General Product Safety Regulation (GPSR) is a set of rules that requires consumer products to be safe and our obligations to ensure this. If you have any concerns about our products you can contact us on ProductSafety@springernature.com. In case Publisher is established outside the EU, the EU authorized representative is: Springer Nature Customer Service Center GmbH Europaplatz 3 69115 Heidelberg, Germany ProductSafety@springernature.com
Les mer

Produktdetaljer

ISBN
9780792380481
Publisert
1997-11-30
Utgiver
Vendor
Springer
Høyde
235 mm
Bredde
155 mm
Aldersnivå
Research, UU, UP, P, 05, 06
Språk
Product language
Engelsk
Format
Product format
Innbundet