Автор: AI Publishing
Издательство: AI Publishing
Год: 2020
Формат: pdf/azw3
Страниц: 250
Размер: 10.2 Mb
Язык: English
The book follows a very simple approach. It is divided into nine chapters. Chapter 1 introduces the basic concept of data preparation, along with the installation steps for the software that we will need to perform data preparation in this book. Chapter 1 also contains a crash course on Python. A brief overview of different data types is given in Chapter 2. Chapter 3 explains how to handle missing values in the data, while the categorical encoding of numeric data is explained in Chapter 4. Data discretization is presented in Chapter 5. Chapter 6 explains the process of handline outliers, while Chapter 7 explains how to scale features in the dataset. Handling of mixed and DateTime data type is explained in Chapter 8, while data balancing and resampling has been explained in Chapter 9. A full data preparation final project is also available at the end of the book.
In each chapter, different types of data preparation techniques have been explained theoretically, followed by practical examples. Each chapter also contains an exercise that students can use to evaluate their understanding of the concepts explained in the chapter.