
Автор: Mark van der Loo, Edwin de Jonge
Издательство: Wiley
ISBN: 1118897153
Год: 2018
Страниц: 320
Язык: английский
Формат: pdf, rtf, djvu
Размер: 10.5 MB
A comprehensive guide to automated statistical data cleaning. The production of clean data is a complex and time-consuming process that requires both technical know-how and statistical expertise. Statistical Data Cleaning brings together a wide range of techniques for cleaning textual, numeric or categorical data. This book examines technical data cleaning methods relating to data representation and data structure. A prominent role is given to statistical data validation, data cleaning based on predefined restrictions, and data cleaning strategy.