100 handpicked books like Kafka (picked by fans)

Designing Data-Intensive Applications

From my list on practical, hands-on books on DevOps and software delivery.

By Yevgeniy Brikman Author

Why did Yevgeniy love this book?

This is the best overview of data storage and distributed systems—two key concepts for building almost any piece of software today—that I've seen anywhere. Martin does a wonderful job of taking a massive body of research and distilling complicated concepts and difficult trade-offs down to a level anyone can understand.

I learned a lot about replication, partitioning, linearizability, locking, write skew, phantoms, transactions, event logs, and more. I'm also a big fan of the final chapter, The Future of Data Systems, which covers ideas such as "unbundling the database", end-to-end event streams, and an important discussion on ethics in programming and data systems.

Designing Data-Intensive Applications

By Martin Kleppmann,

Why should I read it?

2 authors picked Designing Data-Intensive Applications as one of their favorite books, and they share why you should read it.

What is this book about?

Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords? In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain…

Explore

Topics

Big data

Genres

Design

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e

By Géron Aurélien,

From my list on big data processing ecosystem.

By Tomasz Lelek Author

Why did Tomasz love this book?

The Hands-on Machine Learning book presents an end-to-end approach to many problems that can be solved with machine learning.

Every concept and topic is backed up with a running code that you can experiment with and adapt to your real-world problems.

Thanks to this book, you will be able to understand the state of the art of today's machine learning and feel comfortable using the most up-to-date ML methods.

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e

By Géron Aurélien,

Why should I read it?

1 author picked Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e as one of their favorite books, and they share why you should read it.

What is this book about?

Through a recent series of breakthroughs, deep learning has boosted the entire field of machine learning. Now, even programmers who know close to nothing about this technology can use simple, efficient tools to implement programs capable of learning from data. This best-selling book uses concrete examples, minimal theory, and production-ready Python frameworks--scikit-learn, Keras, and TensorFlow--to help you gain an intuitive understanding of the concepts and tools for building intelligent systems.

With this updated third edition, author Aurelien Geron explores a range of techniques, starting with simple linear regression and progressing to deep neural networks. Numerous code examples and exercises throughout…

Explore

Topics

Genres

Coming soon!

Advanced Analytics with Spark

By Sandy Ryza, Uri Laserson, Sean Owen , Josh Wills

From my list on big data processing ecosystem.

By Tomasz Lelek Author

Why did Tomasz love this book?

Apache Spark has a very high point of entry for newcomers to the Big Data ecosystem.

However, it is a key tool that almost everyone is using for running distributed processing. I recommend everyone to read this book before delving into production solutions based on Apache Spark.

This book will allow you to alleviate many spark problems, such as serialization, memory utilization, and parallelization of processing.

Advanced Analytics with Spark

By Sandy Ryza, Uri Laserson, Sean Owen , Josh Wills

Why should I read it?

1 author picked Advanced Analytics with Spark as one of their favorite books, and they share why you should read it.

What is this book about?

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-classification, collaborative filtering, and anomaly detection among others-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for…

Explore

Topics

Genres

Coming soon!

Database Internals

By Alex Petrov,

From my list on big data processing ecosystem.

By Tomasz Lelek Author

Why did Tomasz love this book?

The Database Internals will allow you to go one step further in your understanding of how distributed databases work.

The author has a lot of experience with one of the most successful distributed databases - Apache Cassandra and shares his knowledge about low-level details and internals of distributed databases.

Database Internals

By Alex Petrov,

Why should I read it?

1 author picked Database Internals as one of their favorite books, and they share why you should read it.

What is this book about?

When it comes to choosing, using, and maintaining a database, understanding its internals is essential. But with so many distributed databases and tools available today, it's often difficult to understand what each one offers and how they differ. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals.

Throughout the book, you'll explore relevant material gleaned from numerous books, papers, blog posts, and the source code of several open source databases. These resources are listed at the end of parts one and two. You'll discover that the most significant distinctions among many…

Explore

Topics

Big data

Genres

Coming soon!

Predict and Surveil

By Sarah Brayne,

From my list on the cluster-f*ck we call policing.

By Luke Hunt Author

Why did Luke love this book?

I love this book because it reminds us of the many ways that technology can affect justice.

It is tempting to think sophisticated tactics such as “predictive policing” can solve all problems relating to human bias. However, Brayne shows that data and algorithms do not eliminate bias and discretion. Instead, high-tech police tools simply make bias less overt and visible, which erodes the public’s ability to hold the police accountable.

I especially enjoyed how the book flips the script, considering diverse ways to use these tools to help the public. For example, how can municipalities use technology to analyze the underlying factors that contribute to policing problems in the first place?

Predict and Surveil

By Sarah Brayne,

Why should I read it?

1 author picked Predict and Surveil as one of their favorite books, and they share why you should read it.

What is this book about?

The scope of criminal justice surveillance, from the police to the prisons, has expanded rapidly in recent decades. At the same time, the use of big data has spread across a range of fields, including finance, politics, health, and marketing. While law enforcement's use of big data is hotly contested, very little is known about how the police actually use it in daily operations and with what consequences.

In Predict and Surveil, Sarah Brayne offers an unprecedented, inside look at how police use big data and new surveillance technologies, leveraging on-the-ground fieldwork with one of the most technologically advanced law…

Explore

Topics

Genres

Coming soon!

Privacy Is Power

By Carissa Veliz,

From my list on how technology affects your human rights.

By Susie Alegre Author

Why did Susie love this book?

Privacy Is Power gets to the heart of why we should all be worried about encroachments on our privacy.

Carissa Veliz is a philosopher and a talented writer who brings complex and profound ideas to life on the page. Some writing about technology can feel dry and detached, but Veliz makes you understand viscerally how the impact of technology is a human, not a technological issue.

Privacy Is Power

By Carissa Veliz,

Why should I read it?

2 authors picked Privacy Is Power as one of their favorite books, and they share why you should read it.

What is this book about?

An Economist BEST BOOK OF THE YEAR

As the data economy grows in power, Carissa Veliz exposes how our privacy is eroded by big tech and governments, why that matters and what we can do about it.

The moment you check your phone in the morning you are giving away your data. Before you've even switched off your alarm, a whole host of organisations have been alerted to when you woke up, where you slept, and with whom. As you check the weather, scroll through your 'suggested friends' on Facebook, you continually compromise your privacy.

Without your permission, or even…

Explore

Topics

Genres

Information is Beautiful

By David McCandless,

From my list on what big data is and how it impacts us.

By Roger Highfield Author

Why did Roger love this book?

Big data can be beautiful and visualisations make for a wonderful coffee-table book. In Information is Beautiful, David McCandless turns dry-as-dust data into pop art to show the kind of world we live in, linking politics to life expectancy, women’s education to GDP growth, and more. Through colourful graphics, we get vivid and novel perspectives on current obsessions, from maps of cliches to the most fashionable colours. A testament to how the power of big data comes from being able to distill information to reveal hidden patterns and discern trends.

Information is Beautiful

By David McCandless,

Why should I read it?

1 author picked Information is Beautiful as one of their favorite books, and they share why you should read it.

What is this book about?

A visual guide to the way the world really works

Every day, every hour, every minute we are bombarded by information - from television, from newspapers, from the internet, we're steeped in it, maybe even lost in it. We need a new way to relate to it, to discover the beauty and the fun of information for information's sake.
No dry facts, theories or statistics. Instead, Information is Beautiful contains visually stunning displays of information that blend the facts with their connections, their context and their relationships - making information meaningful, entertaining and beautiful.
This is information like you have…

Explore

Topics

Big data
God

Genres

Coming soon!

An Ugly Truth

By Sheera Frenkel, Cecilia Kang,

From my list on what big data is and how it impacts us.

By Roger Highfield Author

Why did Roger love this book?

‘They trust me….dumb f*cks.’ This telling exchange from the Harvard days of Facebook co-founder and CEO, Mark Zuckerberg appears in An Ugly Truth, which shines a harsh light on the tech behemoth that, ultimately, is built on the data of billions of people. As Meta, Zuckerberg’s new business incarnation, wafts into the virtual worlds of the metaverse, the story of Facebook is far from over, which makes this engaging book a tad unsatisfying. Nonetheless, it is a vivid example of how with Big Data comes Big Responsibility.

An Ugly Truth

By Sheera Frenkel, Cecilia Kang,

Why should I read it?

1 author picked An Ugly Truth as one of their favorite books, and they share why you should read it.

What is this book about?

'An explosive new book' Daily Mail

'[A] careful, comprehensive interrogation of every major Facebook scandal. An Ugly Truth provides the kind of satisfaction you might get if you hired a private investigator to track a cheating spouse: it confirms your worst suspicions and then gives you all the dates and details you need to cut through the company's spin' New York Times

__________________________________________

Award-winning New York Times reporters Sheera Frenkel and Cecilia Kang unveil the tech story of our times in this riveting, behind-the-scenes expose that offers the definitive account of Facebook's fall from grace. Once one of Silicon Valley's…

Explore

Topics

Genres

Forewarned

By Paul Goodwin,

From my list on getting an insight into forecasting.

By David F. Hendry Author

Why did David love this book?

When can we trust a forecast? Given how often forecasts end up being very wide of the mark, a degree of scepticism might well be warranted. Paul Goodwin provides an entertaining account of forecasting, arguing that intuition may serve us well in some settings, but that computer-based analysis of big data might be expected to prevail in others.

Forewarned

By Paul Goodwin,

Why should I read it?

1 author picked Forewarned as one of their favorite books, and they share why you should read it.

What is this book about?

Whether it's an unforeseen financial crash, a shock election result or a washout summer that threatens to ruin a holiday in the sun, forecasts are part and parcel of our everyday lives. We rely wholeheartedly on them, and become outraged when things don't go exactly to plan.

But should we really put so much trust in predictions? Perhaps gut instincts can trump years of methodically compiled expert knowledge? And when exactly is a forecast not a forecast? Forewarned will answer all of these intriguing questions, and many more.

Packed with fun anecdotes and startling facts, Forewarned is a myth-busting guide…

Explore

Topics

Genres

Business

Uncanny Valley

By Anna Wiener,

From my list on the importance of expecting less from your workplace.

By David Buckmaster Author

Why did David love this book?

Changing careers from publishing to tech is a path not often traveled. Wiener made this jump from a world legendary for its light pay compensated by romanticism, to an industry best known for generous “perks that landed somewhere between the collegiate and the feudal.” Wiener’s experience makes for one of the most entertaining books I’ve read in years—she is a gifted writer and unafraid to call out the over-seriousness of the tech bro mentality as an ultimately “dreary” worldview.

Uncanny Valley

By Anna Wiener,

Why should I read it?

3 authors picked Uncanny Valley as one of their favorite books, and they share why you should read it.

What is this book about?

A NEW YORK TIMES BESTSELLER. ONE OF THE NEW YORK TIMES'S 10 BEST BOOKS OF 2020.

Named one of the Best Books of 2020 by The Washington Post, The Atlantic, NPR, the Los Angeles Times, ELLE, Esquire, Parade, Teen Vogue, The Boston Globe, Forbes, The Times (UK), Fortune, Chicago Tribune, Glamour, The A.V. Club, Vox, Jezebel, Town & Country, OneZero, Apartment Therapy, Good Housekeeping, PopMatters, Electric Literature, Self, The Week (UK) and BookPage.A New York Times Book Review Editors' Choice and a January 2020 IndieNext Pick.

"A definitive document of a world in transition: I won't be alone in returning…

Explore

Topics

Genres

70 books like Kafka

Designing Data-Intensive Applications

Why did Yevgeniy love this book?

Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e

Why did Tomasz love this book?

Advanced Analytics with Spark

Why did Tomasz love this book?

Database Internals

Why did Tomasz love this book?

Predict and Surveil

Why did Luke love this book?

Privacy Is Power

Why did Susie love this book?

Information is Beautiful

Why did Roger love this book?

An Ugly Truth

Why did Roger love this book?

Forewarned

Why did David love this book?

Uncanny Valley

Why did David love this book?

5 book lists we think you will like!

Interested in big data, the ecosystem, and data processing?

70 books like Kafka

Why am I passionate about this?

Yevgeniy's book list on practical, hands-on books on DevOps and software delivery

Why did Yevgeniy love this book?

Why am I passionate about this?

Tomasz's book list on big data processing ecosystem

Why did Tomasz love this book?

Why am I passionate about this?

Tomasz's book list on big data processing ecosystem

Why did Tomasz love this book?

Why am I passionate about this?

Tomasz's book list on big data processing ecosystem

Why did Tomasz love this book?

Why am I passionate about this?

Luke's book list on the cluster-f*ck we call policing

Why did Luke love this book?

Why am I passionate about this?

Susie's book list on how technology affects your human rights

Why did Susie love this book?

Why am I passionate about this?

Roger's book list on what big data is and how it impacts us

Why did Roger love this book?

Why am I passionate about this?

Roger's book list on what big data is and how it impacts us

Why did Roger love this book?

Why am I passionate about this?

David's book list on getting an insight into forecasting

Why did David love this book?

Why am I passionate about this?

David's book list on the importance of expecting less from your workplace

Why did David love this book?

Share your top 3 reads of 2024!

1,187

5 book lists we think you will like!

Interested in big data, the ecosystem, and data processing?