Автор: Mark Smart
Издательство: Amazon Digital Services LLC
ASIN: B07KB2PF8X
Год: 2018
Страниц: 102
Язык: английский
Формат: epub, pdf (conv)
Размер: 10.17 MB
This book is a guide for you on how to use Pandas and Numpy in Python programming language for data analysis. The author begins by helping you familiarize yourself with the basics of data science, Numpy and Pandas. You are guided on how to work with Numpy arrays and how to manipulate them. The various operations that you can perform on your data via the Pandas library have been discussed. You will also know how to create various data structures in Pandas for data storage.
Data from the environment is dirty. The process of cleaning such data has been discussed. This involves handling outliers, missing values etc. The author guides you on how to work with data in various types of storage formats. Examples include MS Excel, CSV files, JSON, etc. You are also guided on how to calculate various measures for your data. The process of visualizing data has been explored in detail.
About this book:
- Getting Started with Python for Data Science
- Working with Numpy
- Working with Pandas
- Cleansing Data
- Working with CSV Data
- Working with XLS Data
- Data Wrangling
- Measures of Central Tendency
- Calculating Variance
- Normal Distribution
- Working with JSON Data
- Data Visualization
Скачать Introduction to Data Science with Python: Basics of Numpy and Pandas