Автор: Oswald Campesato
Издательство: Mercury Learning and Information
Год: 2023
Страниц: 275
Язык: английский
Формат: pdf (true)
Размер: 13.0 MB
This book contains a fast-paced introduction to as much relevant information about managing data that can be reasonably included in a book of this size. However, you will be exposed to a variety of features of NumPy and Pandas, how to create databases and tables in MySQL, and how to perform many data cleaning tasks and data wrangling.
Some topics are presented in a cursory manner, which is for two main reasons. First, it’s important that you be exposed to these concepts. In some cases, you will find topics that might pique your interest, and hence motivate you to learn more about them through self-study; in other cases, you will probably be satisfied with a brief introduction. In other words, you decide whether to delve into more detail regarding the topics in this book. Second, a full treatment of all the topics that are covered in this book would significantly increase its size, and few people have the time to read technical tomes. Chapter 7 covers many data wrangling tasks using Python scripts and awk-based shell scripts. Companion files with code are available for downloading from the publisher.
The Target Audience:
This book is intended primarily for people who plan to become data scientists as well as anyone who needs to perform data cleaning tasks. This book is also intended to reach an international audience of readers with highly diverse backgrounds in various age groups. Hence, this book uses standard English rather than colloquial expressions that might be confusing to those readers. People learn by different types of imitation, which includes reading, writing, or hearing new material. This book takes these points into consideration to provide a comfortable and meaningful learning experience for the intended readers.
What do i need to know for this book?
Current knowledge of Python 3.x is the most helpful skill. Knowledge of other programming languages (such as Java) can also be helpful because of the exposure to programming concepts and constructs. The less technical knowledge that you have, the more diligence will be required to understand the various topics that are covered.
Features:
+Provides the reader with basic Python 3, Java, and Pandas programming concepts, and an introduction to awk
+Includes a chapter on RDBMs and SQL
+Companion files with code (available with Amazon proof of purchase by writing to the publisher at info@merclearning.com)
Table of Contents:
1: Introduction to Python. 2: Working with Data. 3: Introduction to Pandas. 4: RDBMs and SQL. 5: Java, JSON, and XML. 6: Data Cleaning Tasks. 7: Data Wrangling. Appendix: Working with awk. Index.
Скачать Data Wrangling Using Pandas, SQL, and Java