Advanced Analytics with Spark

By Sandy Ryza, Uri Laserson, Sean Owen , Josh Wills

Book cover of Advanced Analytics with Spark: Patterns for Learning from Data at Scale

Book description

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You'll start with an introduction to Spark and…

When you buy books, we may earn a commission that helps keep our lights on (or join the rebellion as a member).

Why read it?

1 author picked Advanced Analytics with Spark as one of their favorite books. Why do they recommend it?

Apache Spark has a very high point of entry for newcomers to the Big Data ecosystem.

However, it is a key tool that almost everyone is using for running distributed processing. I recommend everyone to read this book before delving into production solutions based on Apache Spark.

This book will allow you to alleviate many spark problems, such as serialization, memory utilization, and parallelization of processing.

From Tomasz's list on big data processing ecosystem.

Want books like Advanced Analytics with Spark?

Our community of 12,000+ authors has personally recommended 56 books like Advanced Analytics with Spark.

Browse books like Advanced Analytics with Spark

Book cover of Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Book cover of Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e: Concepts, Tools, and Techniques to Build Intelligent Systems
Book cover of Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale

Share your top 3 reads of 2024!

And get a beautiful page showing off your 3 favorite reads.

1,187

readers submitted
so far, will you?

5 book lists we think you will like!

Interested in data mining, big data, and machine learning?

Data Mining 13 books
Big Data 29 books
Machine Learning 53 books