Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud

Автор: literator от 11-06-2020, 13:34, Коментариев: 0

Категория: КНИГИ » ПРОГРАММИРОВАНИЕ

Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the CloudНазвание: Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud
Автор: Robert Ilijason
Издательство: Apress
Год: 2020
Страниц: 281
Язык: английский
Формат: pdf (true), epub
Размер: 10.2 MB

Analyze vast amounts of data in record time using Apache Spark with Databricks in the Cloud. Learn the fundamentals, and more, of running analytics on large clusters in Azure and AWS, using Apache Spark with Databricks on top. Discover how to squeeze the most value out of your data at a mere fraction of what classical analytics solutions cost, while at the same time getting the results you need, incrementally faster.

This book explains how the confluence of these pivotal technologies gives you enormous power, and cheaply, when it comes to huge datasets. You will begin by learning how cloud infrastructure makes it possible to scale your code to large amounts of processing units, without having to pay for the machinery in advance. From there you will learn how Apache Spark, an open source framework, can enable all those CPUs for data analytics use. Finally, you will see how services such as Databricks provide the power of Apache Spark, without you having to know anything about configuring hardware or software. By removing the need for expensive experts and hardware, your resources can instead be allocated to actually finding business value in the data.

Once you know how to use the tool, it’s time to start coding. You’ll familiarize yourself with (Structured Query Language) SQL and Python, the two main languages for data analysis work. It doesn’t stop there, we’ll follow it up by digging deeper into advanced data wrangling techniques where we’ll see a lot of the if’s and but’s you’ll come across when you work with data in reality.

This book guides you through some advanced topics such as analytics in the cloud, data lakes, data ingestion, architecture, machine learning, and tools, including Apache Spark, Apache Hadoop, Apache Hive, Python, and SQL. Valuable exercises help reinforce what you have learned.

What You Will Learn:

Discover the value of big data analytics that leverage the power of the cloud
Get started with Databricks using SQL and Python in either Microsoft Azure or AWS
Understand the underlying technology, and how the cloud and Apache Spark fit into the bigger picture
See how these tools are used in the real world
Run basic analytics, including Machine Learning, on billions of rows at a fraction of a cost or free

Скачать Beginning Apache Spark Using Azure Databricks: Unleashing Large Cluster Analytics in the Cloud




ОТСУТСТВУЕТ ССЫЛКА/ НЕ РАБОЧАЯ ССЫЛКА ЕСТЬ РЕШЕНИЕ, ПИШИМ СЮДА!


Нашел ошибку? Есть жалоба? Жми!
Пожаловаться администрации
Уважаемый посетитель, Вы зашли на сайт как незарегистрированный пользователь.
Мы рекомендуем Вам зарегистрироваться либо войти на сайт под своим именем.
Информация
Посетители, находящиеся в группе Гости, не могут оставлять комментарии к данной публикации.