Mining Imperfect Data

Mining Imperfect Data
Author :
Publisher : SIAM
Total Pages : 492
Release :
ISBN-10 : 9781611976274
ISBN-13 : 1611976278
Rating : 4/5 (278 Downloads)

Book Synopsis Mining Imperfect Data by : Ronald K. Pearson

Download or read book Mining Imperfect Data written by Ronald K. Pearson and published by SIAM. This book was released on 2020-09-10 with total page 492 pages. Available in PDF, EPUB and Kindle. Book excerpt: It has been estimated that as much as 80% of the total effort in a typical data analysis project is taken up with data preparation, including reconciling and merging data from different sources, identifying and interpreting various data anomalies, and selecting and implementing appropriate treatment strategies for the anomalies that are found. This book focuses on the identification and treatment of data anomalies, including examples that highlight different types of anomalies, their potential consequences if left undetected and untreated, and options for dealing with them. As both data sources and free, open-source data analysis software environments proliferate, more people and organizations are motivated to extract useful insights and information from data of many different kinds (e.g., numerical, categorical, and text). The book emphasizes the range of open-source tools available for identifying and treating data anomalies, mostly in R but also with several examples in Python. Mining Imperfect Data: With Examples in R and Python, Second Edition presents a unified coverage of 10 different types of data anomalies (outliers, missing data, inliers, metadata errors, misalignment errors, thin levels in categorical variables, noninformative variables, duplicated records, coarsening of numerical data, and target leakage). It includes an in-depth treatment of time-series outliers and simple nonlinear digital filtering strategies for dealing with them, and it provides a detailed introduction to several useful mathematical characteristics of important data characterizations that do not appear to be widely known among practitioners, such as functional equations and key inequalities. While this book is primarily for data scientists, researchers in a variety of fields—namely statistics, machine learning, physics, engineering, medicine, social sciences, economics, and business—will also find it useful.


Mining Imperfect Data Related Books

Mining Imperfect Data
Language: en
Pages: 309
Authors: Ronald K. Pearson
Categories: Computers
Type: BOOK - Published: 2005-04-01 - Publisher: SIAM

DOWNLOAD EBOOK

This book discusses the problems that can occur in data mining, including their sources, consequences, detection and treatment.
Mining Imperfect Data
Language: en
Pages:
Authors: Ronald K. Pearson
Categories: Data mining
Type: BOOK - Published: 2020 - Publisher:

DOWNLOAD EBOOK

"This second edition of Mining Imperfect Data reflects changes in the size and nature of the datasets commonly encountered for analysis, and the evolution of th
Mining Imperfect Data
Language: en
Pages: 492
Authors: Ronald K. Pearson
Categories: Computers
Type: BOOK - Published: 2020-09-10 - Publisher: SIAM

DOWNLOAD EBOOK

It has been estimated that as much as 80% of the total effort in a typical data analysis project is taken up with data preparation, including reconciling and me
Soft Computing for Data Mining Applications
Language: en
Pages: 354
Authors: K. R. Venugopal
Categories: Computers
Type: BOOK - Published: 2009-02-24 - Publisher: Springer

DOWNLOAD EBOOK

The authors have consolidated their research work in this volume titled Soft Computing for Data Mining Applications. The monograph gives an insight into the res
Principles of Data Mining
Language: en
Pages: 594
Authors: David J. Hand
Categories: Computers
Type: BOOK - Published: 2001-08-17 - Publisher: MIT Press

DOWNLOAD EBOOK

The first truly interdisciplinary text on data mining, blending the contributions of information science, computer science, and statistics. The growing interest