
Authors: Jules Damji, Denny Lee, Brooke Wenig, Tathagata Das
Publisher: O'Reilly Media
Year: 2019
Pages: 107
Language: English
Format: PDF
Size: 10.8 MB
Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you process such varied data workloads efficiently? Enter Apache Spark. Updated to emphasize new features in Spark 2.x, this second edition shows data engineers and data scientists why structure and unification in Spark matter. Specifically, this book explains how to perform simple and complex data analytics and employ machine-learning algorithms.