
Author: Mahmoud Parsian
Publisher: O’Reilly Media
Year: 2020
Pages: 166
Language: English
Format: pdf, epub
Size: 10.1 MB
Apache Spark’s speed, ease of use, sophisticated analytics, and multilanguage support make practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples for this framework using PySpark.