Автор: Ramcharan Kakarla, Sundar Krishnan, Balaji Dhamodharan, Venkata Gunnu
Издательство: Apress
Год: 2024
Страниц: 447
Язык: английский
Формат: pdf
Размер: 18.0 MB
This comprehensive guide with hand-picked examples of daily use cases will walk you through the end-to-end predictive model-building cycle with the latest techniques and tricks of the trade.
In Chapters 1, 2 & 3, we will get started with setting up the environment, and the basics of PySpark focusing on data manipulations. In Chapter 4, we will dive into the art of Variable Selection where we demonstrate various selection techniques available in PySpark. In Chapters 5, 6 & 7, we take you on the journey of machine learning algorithms, implementations and fine-tuning techniques. Chapters 8 and 9 will walk you through machine learning pipelines, and various methods available to operationalize the model and serve it through docker/API. Chapter 10 will demonstrate how can you unlock the power of predictive models when used in coherence to create a meaningful impact on your business. Chapter 11 will introduce you to some of the most used and powerful modelling frameworks to unlock real value from data.
In this new edition, you will learn predictive modelling frameworks that can quantify customer lifetime values and estimate the return of your predictive modelling investments. This edition also contains methods to measure engagement and identify actionable populations for churn treatments effectively. In addition, a dedicated chapter for experimentation design including steps to efficiently design, conduct, test and measure the results of your models is added. All the codes will be refreshed as needed to reflect the latest stable version of Spark.
You will:
Learn the overview of end to end predictive model building
Understand Multiple variable selection techniques & implementations
Work with Operationalizing models
Perform Data science experimentations & tips
Who This Book is For:
Data scientists and machine learning and deep learning engineers who want to learn and use PySpark for real-time analysis of streaming data.
Скачать Applied Data Science Using Pyspark: Learn the End-to-end Predictive Model-building Cycle, 2nd Edition