The best big data books

Who picked these books? Meet our 39 experts.

39 authors created a book list connected to big data, and here are their favorite big data books.
Shepherd is reader supported. When you buy books, we may earn an affiliate commission

What type of big data book?

Loading...
Loading...

Out of the Crisis

By W. Edwards Deming,

Book cover of Out of the Crisis

Steve Fenton Author Of Web Operations Dashboards, Monitoring, & Alerting

From the list on DevOps from before DevOps was invented.

Who am I?

I'm a programmer and technical author at Octopus Deploy and I'm deeply interested in DevOps. Since the 1950s, people have been studying software delivery in search of better ways of working. We’ve seen many revolutions since Lincoln Labs first introduced us to phased delivery, with lightweight methods transforming how we wrote software at the turn of the century. My interest in DevOps goes beyond my enthusiasm for methods in general, because we now have a great body of research that adds to our empirical observations on the ways we work.

Steve's book list on DevOps from before DevOps was invented

Discover why each book is one of Steve's favorite books.

Why did Steve love this book?

Before Agile and Lean had rocked the software development industry, William Deming was busy forging this new world of work.

Out of the Crisis is predominantly a management book, but it’s really the spark that started the lightweight movement in software delivery. A key concept in the book is how to identify the work system's performance, separate from the performance of individuals.

By W. Edwards Deming,

Why should I read it?

3 authors picked Out of the Crisis as one of their favorite books, and they share why you should read it.

What is this book about?

Essential reading for managers and leaders, this is the classic work on management, problem solving, quality control, and more—based on the famous theory, 14 Points for Management

In his classic Out of the Crisis, W. Edwards Deming describes the foundations for a completely new and transformational way to lead and manage people, processes, and resources. Translated into twelve languages and continuously in print since its original publication, it has proved highly influential. Research shows that Deming’s approach has high levels of success and sustainability. Readers today will find Deming’s insights relevant, significant, and effective in business thinking and practice. This…


Kafka

By Neha Narkhede, Gwen Shapira, Todd Palino

Book cover of Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From the list on big data processing ecosystem.

Who am I?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Discover why each book is one of Tomasz's favorite books.

Why did Tomasz love this book?

Apache Kafka is the backbone of almost every streaming-based system today.

The solutions created and implemented in Kafka are the key concepts in every streaming system that you will work with.

This book will allow you to fully understand the Kafka architecture, its internals, and APIs and allow you to become an expert in this technology.

By Neha Narkhede, Gwen Shapira, Todd Palino

Why should I read it?

1 author picked Kafka as one of their favorite books, and they share why you should read it.

What is this book about?

Every enterprise application creates data, whether it's log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you're an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.

Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream-processing applications with this platform. Through detailed examples, you'll learn Kafka's…


Winning the Loser's Game

By Charles Ellis,

Book cover of Winning the Loser's Game: Timeless Strategies for Successful Investing

Stephen R. Foerster Author Of In Pursuit of the Perfect Portfolio: The Stories, Voices, and Key Insights of the Pioneers Who Shaped the Way We Invest

From the list on developing your investment philosophy.

Who am I?

I’ve been interested in investing for over four decades since I started as a finance PhD student at Wharton. Since then my research has focused on understanding the stock market. Early on, I tried applying my research to my investing. For example, I was convinced that a recently listed stock called Google was way overvalued—was I ever wrong! That got me to reflect on my investment philosophy—what did I truly believe about how markets really behaved? That brought me back to understanding and appreciating the contributors to Modern Portfolio Theory, which led to a fun decade-long book project. Currently I enjoy writing about investing through my blog.

Stephen's book list on developing your investment philosophy

Discover why each book is one of Stephen's favorite books.

Why did Stephen love this book?

I had the pleasure of interviewing Charley for our book.

He’s a great storyteller. He was probably the first practitioner to advocate for passive index investing. He’s a tennis enthusiast, and his book was inspired by a book he read aimed at amateur tennis players. Ellis learned that to win at tennis, the best strategy is to simply try to not lose, and to not try to act like professional players.

He realized that the same strategy worked for investors as well. That means that investors shouldn’t try to beat the market.

By Charles Ellis,

Why should I read it?

5 authors picked Winning the Loser's Game as one of their favorite books, and they share why you should read it.

What is this book about?

The definitive guide to long-term investing success-fully updated to address the realities of today's markets

Technology, information overload, and increasing market dominance by expert investors and computers make it harder than ever to produce investing results that overcome operating costs and fees. Winning the Loser's Game reveals everything you need to know to reduce costs, fees, and taxes, and focus on long-term policies that are right for you.

Candid, short, and super easy to read, Winning the Loser's Game walks you through the process of developing and implementing a powerful investing strategy that generates solid profits year after year. In…


Book cover of Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From the list on big data processing ecosystem.

Who am I?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Discover why each book is one of Tomasz's favorite books.

Why did Tomasz love this book?

Designing Data-Intensive Applications is the best book if you want to learn about the main principles behind every system that is able to store and process big amounts of data.

You'll learn about distributed storage systems, their tradeoffs (availability, consistency, fault-tolerance), streaming processing systems, and main algorithms.

Those are the critical concepts behind almost every successful company that needs to create scalable solutions. 

By Martin Kleppmann,

Why should I read it?

1 author picked Designing Data-Intensive Applications as one of their favorite books, and they share why you should read it.

What is this book about?

Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords? In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data. Software keeps changing, but the fundamental principles remain…


Be Data Literate

By Jordan Morrow,

Book cover of Be Data Literate: The Data Literacy Skills Everyone Needs to Succeed

Jeremy Adamson Author Of Minding the Machines: Building and Leading Data Science and Analytics Teams

From the list on for data science and analytics leaders.

Who am I?

I am a leader in analytics and AI strategy, and have a broad range of experience in aviation, energy, financial services, and the public sector.  I have worked with several major organizations to help them establish a leadership position in data science and to unlock real business value using advanced analytics. 

Jeremy's book list on for data science and analytics leaders

Discover why each book is one of Jeremy's favorite books.

Why did Jeremy love this book?

Not everybody needs to be a data scientist, but everybody does need to be data literate. Without an intentional focus on evangelism and building a strong data culture in your organization it will be an uphill battle to make meaningful change. This book helps individuals and leaders to understand what data literacy is, and how we can build it like any other skill.

By Jordan Morrow,

Why should I read it?

1 author picked Be Data Literate as one of their favorite books, and they share why you should read it.

What is this book about?

In the fast moving world of the fourth industrial revolution not everyone needs to be a data scientist but everyone should be data literate, with the ability to read, analyze and communicate with data. It is not enough for a business to have the best data if those using it don't understand the right questions to ask or how to use the information generated to make decisions. Be Data Literate is the essential guide to developing the curiosity, creativity and critical thinking necessary to make anyone data literate, without retraining as a data scientist or statistician. With learnings to show…


Discriminating Data

By Wendy Hui Kyong Chun, Alex Barnett (illustrator),

Book cover of Discriminating Data: Correlation, Neighborhoods, and the New Politics of Recognition

David Theo Goldberg Author Of The Threat of Race: Reflections on Racial Neoliberalism

From the list on spotlighting race and neoliberalization.

Who am I?

I grew up and completed the formative years of my college education in Cape Town, South Africa, while active also in anti-apartheid struggles. My Ph.D. dissertation in the 1980s focused on the elaboration of key racial ideas in the modern history of philosophy. I have published extensively on race and racism in the U.S. and globally, in books, articles, and public media. My interests have especially focused on the transforming logics and expressions of racism over time, and its updating to discipline and constrain its conventional targets anew and new targets more or less conventionally. My interest has always been to understand racism in order to face it down.

David's book list on spotlighting race and neoliberalization

Discover why each book is one of David's favorite books.

Why did David love this book?

Digital technology, like technology generally, is commonly assumed to be value neutral. Wendy Chun reveals that structurally embedded in digital operating systems and data collection are values that reproduce and extend existing modes of discriminating while also originating new ones. In prompting and promoting the grouping together of people who are alike—in habits, culture, looks, and preferences—the logic of the algorithm reproduces and amplifies discriminatory trends. Chun reveals how the logics of the digital reinforce the restructuring of racism by the neoliberal turn that my own book lays out.

By Wendy Hui Kyong Chun, Alex Barnett (illustrator),

Why should I read it?

1 author picked Discriminating Data as one of their favorite books, and they share why you should read it.

What is this book about?

How big data and machine learning encode discrimination and create agitated clusters of comforting rage.

In Discriminating Data, Wendy Hui Kyong Chun reveals how polarization is a goal—not an error—within big data and machine learning. These methods, she argues, encode segregation, eugenics, and identity politics through their default assumptions and conditions. Correlation, which grounds big data’s predictive potential, stems from twentieth-century eugenic attempts to “breed” a better future. Recommender systems foster angry clusters of sameness through homophily. Users are “trained” to become authentically predictable via a politics and technology of recognition. Machine learning and data analytics thus seek to disrupt…


Uncanny Valley

By Anna Wiener,

Book cover of Uncanny Valley

David Buckmaster Author Of Fair Pay: How to Get a Raise, Close the Wage Gap, and Build Stronger Businesses

From the list on the importance of expecting less from your workplace.

Who am I?

I’ve worked with business leaders on pay projects all over the world, at companies like Nike and Starbucks, in places like Brazil, Mexico, Vietnam, Singapore, the UAE, and all over Europe. While many business books are written from a theoretical or academic perspective, I bring an operator’s perspective. I get to work out the ideas in my book, Fair Pay, on a daily basis, and so I wrote the book to be a realistic and practical guide for understanding the perspectives of business leaders, human resources, and the typical employee. 

David's book list on the importance of expecting less from your workplace

Discover why each book is one of David's favorite books.

Why did David love this book?

Changing careers from publishing to tech is a path not often traveled. Wiener made this jump from a world legendary for its light pay compensated by romanticism, to an industry best known for generous “perks that landed somewhere between the collegiate and the feudal.” Wiener’s experience makes for one of the most entertaining books I’ve read in years—she is a gifted writer and unafraid to call out the over-seriousness of the tech bro mentality as an ultimately “dreary” worldview. 

By Anna Wiener,

Why should I read it?

3 authors picked Uncanny Valley as one of their favorite books, and they share why you should read it.

What is this book about?

A NEW YORK TIMES BESTSELLER. ONE OF THE NEW YORK TIMES'S 10 BEST BOOKS OF 2020.

Named one of the Best Books of 2020 by The Washington Post, The Atlantic, NPR, the Los Angeles Times, ELLE, Esquire, Parade, Teen Vogue, The Boston Globe, Forbes, The Times (UK), Fortune, Chicago Tribune, Glamour, The A.V. Club, Vox, Jezebel, Town & Country, OneZero, Apartment Therapy, Good Housekeeping, PopMatters, Electric Literature, Self, The Week (UK) and BookPage.A New York Times Book Review Editors' Choice and a January 2020 IndieNext Pick.

"A definitive document of a world in transition: I won't be alone in returning…


The Art of Statistics

By David Spiegelhalter,

Book cover of The Art of Statistics: How to Learn from Data

Valliappa Lakshmanan Author Of Data Science on the Google Cloud Platform: Implementing End-To-End Real-Time Data Pipelines: From Ingest to Machine Learning

From the list on if you want to become a data scientist.

Who am I?

I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.

Valliappa's book list on if you want to become a data scientist

Discover why each book is one of Valliappa's favorite books.

Why did Valliappa love this book?

What if you are faced with a problem for which a standard approach doesn’t yet exist? In such a case, you will need to be able to figure out the approach from the first principles. This book will help you learn how to derive insights starting from raw data.

By David Spiegelhalter,

Why should I read it?

2 authors picked The Art of Statistics as one of their favorite books, and they share why you should read it.

What is this book about?

'A statistical national treasure' Jeremy Vine, BBC Radio 2

'Required reading for all politicians, journalists, medics and anyone who tries to influence people (or is influenced) by statistics. A tour de force' Popular Science

Do busier hospitals have higher survival rates? How many trees are there on the planet? Why do old men have big ears? David Spiegelhalter reveals the answers to these and many other questions - questions that can only be addressed using statistical science.

Statistics has played a leading role in our scientific understanding of the world for centuries, yet we are all familiar with the way…


Database Internals

By Alex Petrov,

Book cover of Database Internals: A Deep-Dive Into How Distributed Data Systems Work

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From the list on big data processing ecosystem.

Who am I?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Discover why each book is one of Tomasz's favorite books.

Why did Tomasz love this book?

The Database Internals will allow you to go one step further in your understanding of how distributed databases work.

The author has a lot of experience with one of the most successful distributed databases - Apache Cassandra and shares his knowledge about low-level details and internals of distributed databases.

By Alex Petrov,

Why should I read it?

1 author picked Database Internals as one of their favorite books, and they share why you should read it.

What is this book about?

When it comes to choosing, using, and maintaining a database, understanding its internals is essential. But with so many distributed databases and tools available today, it's often difficult to understand what each one offers and how they differ. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals.

Throughout the book, you'll explore relevant material gleaned from numerous books, papers, blog posts, and the source code of several open source databases. These resources are listed at the end of parts one and two. You'll discover that the most significant distinctions among many…


Counting

By Deborah Stone,

Book cover of Counting: How We Use Numbers to Decide What Matters

Carolyn Purnell Author Of The Sensational Past: How the Enlightenment Changed the Way We Use Our Senses

From the list on everyday things we take for granted.

Who am I?

I’m a historian who’s spent far too much time thinking about how the color magenta contributed to climate change and why eighteenth-century humanitarians were obsessed with tobacco enemas. My favorite historical topics—like sensation, color, and truth—don’t initially seem historical, but that’s exactly why they need to be explored. I’ve learned that the things that seem like second nature are where our deepest cultural assumptions and unconscious biases hide. In addition to writing nonfiction, I’ve been lucky enough to grow up on a ranch, live in Paris, work as an interior design writer, teach high school and college, and help stray dogs get adopted.

Carolyn's book list on everyday things we take for granted

Discover why each book is one of Carolyn's favorite books.

Why did Carolyn love this book?

I had never really given much thought to counting until I read this book, but in the very first chapter, Stone made me rethink everything I thought I knew about “one fish, two fish, red fish, blue fish.” She shows that every time we count, we’re making cultural assumptions. For example, what counts as a fish? And what makes the color of the fish more relevant than other features? Counting reveals that while these choices may seem intuitive, basic, and meaningless, they have very real impacts on people’s lives. Especially when we use numbers to measure things like merit, poverty, race, and productivity, those fundamental assumptions matter more than we care to admit.  

By Deborah Stone,

Why should I read it?

1 author picked Counting as one of their favorite books, and they share why you should read it.

What is this book about?

Early in her extraordinary career, Deborah Stone wrote Policy Paradox, a landmark work on politics. Now, in Counting, she revolutionises how we approach numbers and shows how counting shapes the way we see the world. Most of us think of counting as a skill so basic that we see numbers as objective, indisputable facts. Not so, says Stone. In this playful-yet-probing work, Stone reveals the inescapable link between quantifying and classifying, and explains how counting determines almost every facet of our lives-from how we are evaluated at work to how our political opinions are polled to whether we get into…


Small Data

By Martin Lindstrom,

Book cover of Small Data: The Tiny Clues That Uncover Huge Trends

Wayne Moloney Author Of The Wentworth Prospect: A novel guide to success in B2B sales

From the list on B2B salespeople to stay relevant and successful.

Who am I?

Everyone survives by selling something whether we wear the title or not. Selling has been my career, even before I was a salesperson. I started my career in engineering but quickly realised my passion was in developing business, not designing industrial ventilation systems. Helped by a boss who also saw I was better suited to roles other than engineering (he wasn’t so polite) I went on to enjoy a successful career spanning 4 decades working in Australian, Asian, and European markets that embraced all facets of sales and business development. Helped by great mentors and learning from the experience of others, I have endeavoured to give back by mentoring business owners, salespeople, and writing.

Wayne's book list on B2B salespeople to stay relevant and successful

Discover why each book is one of Wayne's favorite books.

Why did Wayne love this book?

The business world is obsessed with big data and artificial intelligence. 

Big data is collected for analysis. Analysis through AI. It lacks to ability to capture emotion and to be successful long-term, businesses need to connect with their markets at an emotional level.

Lindstrom uses real-life case studies to show how looking at the little things; how people use items and why, and observing what others don’t see in a home, a workplace, or a community can help better deliver concepts that big data just can’t see.

By Martin Lindstrom,

Why should I read it?

1 author picked Small Data as one of their favorite books, and they share why you should read it.

What is this book about?

The New York Times Bestseller named one of the "Most Important Books of 2016" by Inc, and a Forbes 2016 "Must Read Business Book"

'If you love 'Bones' and 'CSI', this book is your kind of candy' Paco Underhill, author of Why We Buy

'Martin's best book to date. A personal, intuitive, powerful way to look at making an impact with your work' Seth Godin, author of Purple Cow

Martin Lindstrom, one of Time Magazine's 100 Most Influential People in The World and a modern-day Sherlock Holmes, harnesses the power of "small data" in his quest to discover the next…


The Genome War

By James Shreeve,

Book cover of The Genome War: How Craig Venter Tried to Capture the Code of Life and Save the World

Roger Highfield Author Of The Dance of Life: Symmetry, Cells and How We Become Human

From the list on what big data is and how it impacts us.

Who am I?

I’m the Science Director of the Science Museum Group, based at the Science Museum in London, and visiting professor at the Dunn School, University of Oxford, and Department of Chemistry, University College London. Every time I write a book I swear that it will be my last and yet I'm now working on my ninth, after earlier forays into the physics of Christmas and the love life of Albert Einstein. Working with Peter Coveney of UCL, we're exploring ideas about computation and complexity we tackled in our two earlier books, along with the revolutionary implications of creating digital twins of people from the colossal amount of patient data now flowing from labs worldwide.

Roger's book list on what big data is and how it impacts us

Discover why each book is one of Roger's favorite books.

Why did Roger love this book?

This might not look like a big data book but, for me, the race to read the human genome marks the birth of big data in biology, in the form of a tsunami of DNA sequencing data. I edited Craig Venter’s A Life Decoded, the first genetic autobiography, which explored the implications of becoming the first person to gaze upon all six billion ‘letters’ of their own genetic code. While working on Craig’s extraordinary story I came across The Genome War and thought James Shreeve did a brilliant job in describing the drama, rivalry, and personalities in the race to sequence the very first human genomes between government-backed scientists and Celera, Craig’s company.

By James Shreeve,

Why should I read it?

2 authors picked The Genome War as one of their favorite books, and they share why you should read it.

What is this book about?

The long-awaited story of the science, the business, the politics, the intrigue behind the scenes of the most ferocious competition in the history of modern science—the race to map the human genome.
On May 10, 1998, biologist Craig Venter, director of the Institute for Genomic Research, announced that he was forming a private company that within three years would unravel the complete genetic code of human life—seven years before the projected finish of the U.S. government’s Human Genome Project. Venter hoped that by decoding the genome ahead of schedule, he would speed up the pace of biomedical research and save…


The Deep Learning Revolution

By Terrence J. Sejnowski,

Book cover of The Deep Learning Revolution

Gordon M. Shepherd Author Of Neurogastronomy: How the Brain Creates Flavor and Why It Matters

From the list on understanding the brain and behavior.

Who am I?

I was stimulated by Norbert Wiener’s “Cybernetics” to study circuits in the brain that control behavior. For my graduate studies, I chose the olfactory bulb for its experimental advantages, which led to constructing the first computer models of brain neurons and microcircuits. Then I got interested in how the smell patterns are activated when we eat food, which led to a new field called Neurogastronomy, which is the neuroscience of the circuits that create the perception of food flavor. Finally, because all animals use their brains to find and eat food, the olfactory system has provided new insights into the evolution of the mammalian brain and the basic organization of the cerebral cortex.

Gordon's book list on understanding the brain and behavior

Discover why each book is one of Gordon's favorite books.

Why did Gordon love this book?

The other books in this series are mostly about the real brain. But artificial intelligence promises us a new enhanced brain. What does the future hold? Terrence Sejnowski is a neuroscientist who was one of the first to realize the potential of AI. Since he has been there from the start, in this book he gives the reader an exciting inside story on the people and the advances that are reshaping our lives.

Early attempts at AI were limited, but once computational power took off big computers running multilayer neural nets began proving that they could defeat humans at the most demanding games, enhance human capabilities such as pattern recognition, text recognition, language translation, and driverless vehicles, and work to obtain rewards, just like a human. While these advances are dramatic, it is well to remember that the networks are built not from representations of real neurons, but rather from…

By Terrence J. Sejnowski,

Why should I read it?

1 author picked The Deep Learning Revolution as one of their favorite books, and they share why you should read it.

What is this book about?

How deep learning—from Google Translate to driverless cars to personal cognitive assistants—is changing our lives and transforming every sector of the economy.

The deep learning revolution has brought us driverless cars, the greatly improved Google Translate, fluent conversations with Siri and Alexa, and enormous profits from automated trading on the New York Stock Exchange. Deep learning networks can play poker better than professional poker players and defeat a world champion at Go. In this book, Terry Sejnowski explains how deep learning went from being an arcane academic field to a disruptive technology in the information economy.

Sejnowski played an important…


An Ugly Truth

By Sheera Frenkel, Cecilia Kang,

Book cover of An Ugly Truth: Inside Facebook's Battle for Domination

Roger Highfield Author Of The Dance of Life: Symmetry, Cells and How We Become Human

From the list on what big data is and how it impacts us.

Who am I?

I’m the Science Director of the Science Museum Group, based at the Science Museum in London, and visiting professor at the Dunn School, University of Oxford, and Department of Chemistry, University College London. Every time I write a book I swear that it will be my last and yet I'm now working on my ninth, after earlier forays into the physics of Christmas and the love life of Albert Einstein. Working with Peter Coveney of UCL, we're exploring ideas about computation and complexity we tackled in our two earlier books, along with the revolutionary implications of creating digital twins of people from the colossal amount of patient data now flowing from labs worldwide.

Roger's book list on what big data is and how it impacts us

Discover why each book is one of Roger's favorite books.

Why did Roger love this book?

‘They trust me….dumb f*cks.’ This telling exchange from the Harvard days of Facebook co-founder and CEO, Mark Zuckerberg appears in An Ugly Truth, which shines a harsh light on the tech behemoth that, ultimately, is built on the data of billions of people. As Meta, Zuckerberg’s new business incarnation, wafts into the virtual worlds of the metaverse, the story of Facebook is far from over, which makes this engaging book a tad unsatisfying. Nonetheless, it is a vivid example of how with Big Data comes Big Responsibility.

By Sheera Frenkel, Cecilia Kang,

Why should I read it?

1 author picked An Ugly Truth as one of their favorite books, and they share why you should read it.

What is this book about?

'An explosive new book' Daily Mail

'[A] careful, comprehensive interrogation of every major Facebook scandal. An Ugly Truth provides the kind of satisfaction you might get if you hired a private investigator to track a cheating spouse: it confirms your worst suspicions and then gives you all the dates and details you need to cut through the company's spin' New York Times

__________________________________________

Award-winning New York Times reporters Sheera Frenkel and Cecilia Kang unveil the tech story of our times in this riveting, behind-the-scenes expose that offers the definitive account of Facebook's fall from grace. Once one of Silicon Valley's…


The AI Factor

By Asha Saxena,

Book cover of The AI Factor: How to Apply Artificial Intelligence and Use Big Data to Grow Your Business Exponentially

Tim Vandehey Author Of Swipe: The Science Behind Why We Don't Finish What We Start

From the list on how technology is changing how we live.

Who am I?

I’m a journalist and a tinkerer. I’m fascinated not only by how things work but by how small levers can move mountains. Growing up in the workshop of my grandfather, an old Boston boatwright, I was mesmerized by the idea that a small rudder could maneuver a huge vessel. In college, I fell in love with how a small idea or expression could redirect a course of research or a country. As a self-taught maker of things, I appreciate how technologies empower us. I’ve chosen these books because they’re examples of how small ideas become things, lines of research, or patterns of thinking that shift human progress in unknowable ways.

Tim's book list on how technology is changing how we live

Discover why each book is one of Tim's favorite books.

Why did Tim love this book?

I love gutsy books by outsiders, and Ms. Saxena, as a woman of color working in the Ivy League and the worlds of artificial intelligence and Big Data, is very much an outsider.

That makes her deep knowledge and insights into how AI and Big Data are changing business even that much more interesting. Plus, this is one of the only books I’ve read that explains how artificial intelligence works in a clear, direct way that doesn’t assume the reader already knows about things like machine learning and neural nets. 

By Asha Saxena,

Why should I read it?

1 author picked The AI Factor as one of their favorite books, and they share why you should read it.

What is this book about?

Have you heard about artificial intelligence (AI) and big data but felt they are technologies too big or too complicated for you or your business? Do you imagine AI as a Hollywood science fiction stereotype or something in the far and distant future?

Take heart. AI is none of those things. It's part of our everyday lives, and it has the power to transform your business.

This book will put AI, big data, the cloud, robotics, and smart devices in context. It will reveal how these technologies can dramatically multiply any businesses-including yours-by strategically using your data's latent, transformative potential.…


Book cover of The College Dropout Scandal

Peter Temin Author Of The Vanishing Middle Class: Prejudice and Power in a Dual Economy

From the list on racial and economic inequality in the USA.

Who am I?

Peter Temin is an economist and economic historian, currently a professor at MIT and the former head of the Economics Department. His research interests include macroeconomic history, the Great Depression, industry studies in both the nineteenth and twentieth centuries, and ancient Rome. 

Peter's book list on racial and economic inequality in the USA

Discover why each book is one of Peter's favorite books.

Why did Peter love this book?

This is a positive book that shows how education can help Blacks and other minorities get an education that will help them stay out of mass incarceration. It is good to have a positive program as we attempt to deal with American racism.

By David Kirp,

Why should I read it?

1 author picked The College Dropout Scandal as one of their favorite books, and they share why you should read it.

What is this book about?

Higher education today faces numerous challenges, from quality to cost. But the fact that fewer than sixty percent of college freshmen graduate in six years and fewer than forty percent earn an associate degree in three years turns few heads. The dropout problem is especially acute for black and Latino students, those from poor families, and those who are first in their families to go to college. In short, millions of students are leaving college without a degree,
saddled with debt, and little to show for it.

In The College Dropout Scandal, David Kirp outlines the scale of the problem…


Information is Beautiful

By David McCandless,

Book cover of Information is Beautiful

Roger Highfield Author Of The Dance of Life: Symmetry, Cells and How We Become Human

From the list on what big data is and how it impacts us.

Who am I?

I’m the Science Director of the Science Museum Group, based at the Science Museum in London, and visiting professor at the Dunn School, University of Oxford, and Department of Chemistry, University College London. Every time I write a book I swear that it will be my last and yet I'm now working on my ninth, after earlier forays into the physics of Christmas and the love life of Albert Einstein. Working with Peter Coveney of UCL, we're exploring ideas about computation and complexity we tackled in our two earlier books, along with the revolutionary implications of creating digital twins of people from the colossal amount of patient data now flowing from labs worldwide.

Roger's book list on what big data is and how it impacts us

Discover why each book is one of Roger's favorite books.

Why did Roger love this book?

Big data can be beautiful and visualisations make for a wonderful coffee-table book. In Information is Beautiful, David McCandless turns dry-as-dust data into pop art to show the kind of world we live in, linking politics to life expectancy, women’s education to GDP growth, and more. Through colourful graphics, we get vivid and novel perspectives on current obsessions, from maps of cliches to the most fashionable colours. A testament to how the power of big data comes from being able to distill information to reveal hidden patterns and discern trends. 

By David McCandless,

Why should I read it?

1 author picked Information is Beautiful as one of their favorite books, and they share why you should read it.

What is this book about?

A visual guide to the way the world really works

Every day, every hour, every minute we are bombarded by information - from television, from newspapers, from the internet, we're steeped in it, maybe even lost in it. We need a new way to relate to it, to discover the beauty and the fun of information for information's sake.
No dry facts, theories or statistics. Instead, Information is Beautiful contains visually stunning displays of information that blend the facts with their connections, their context and their relationships - making information meaningful, entertaining and beautiful.
This is information like you have…


Forewarned

By Paul Goodwin,

Book cover of Forewarned: A Sceptic's Guide to Prediction

David F. Hendry Author Of Forecasting: An Essential Introduction

From the list on getting an insight into forecasting.

Who am I?

Accurate and precise forecasting is essential for successful planning and policy from economics to epidemiology. We have been keen to understand why so many forecasts turn out to be highly inaccurate since making dreadful forecasts ourselves, and advising UK government agencies (Treasury, Parliament, Bank of England) during turbulent periods. As simple extrapolation often beats model-based forecasting, we have been developing improved methods that draw on the best aspects of both, and have published more than 60 articles and 6 books attracting more than 6000 citations by other scholars. Our recommended books cover a wide range of forecasting methods—suggesting there is no optimal way to look into the future.

David's book list on getting an insight into forecasting

Discover why each book is one of David's favorite books.

Why did David love this book?

When can we trust a forecast? Given how often forecasts end up being very wide of the mark, a degree of scepticism might well be warranted. Paul Goodwin provides an entertaining account of forecasting, arguing that intuition may serve us well in some settings, but that computer-based analysis of big data might be expected to prevail in others.        

By Paul Goodwin,

Why should I read it?

1 author picked Forewarned as one of their favorite books, and they share why you should read it.

What is this book about?

Whether it's an unforeseen financial crash, a shock election result or a washout summer that threatens to ruin a holiday in the sun, forecasts are part and parcel of our everyday lives. We rely wholeheartedly on them, and become outraged when things don't go exactly to plan.

But should we really put so much trust in predictions? Perhaps gut instincts can trump years of methodically compiled expert knowledge? And when exactly is a forecast not a forecast? Forewarned will answer all of these intriguing questions, and many more.

Packed with fun anecdotes and startling facts, Forewarned is a myth-busting guide…


Book cover of Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e: Concepts, Tools, and Techniques to Build Intelligent Systems

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From the list on big data processing ecosystem.

Who am I?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Discover why each book is one of Tomasz's favorite books.

Why did Tomasz love this book?

The Hands-on Machine Learning book presents an end-to-end approach to many problems that can be solved with machine learning.

Every concept and topic is backed up with a running code that you can experiment with and adapt to your real-world problems.

Thanks to this book, you will be able to understand the state of the art of today's machine learning and feel comfortable using the most up-to-date ML methods.

By Géron Aurélien,

Why should I read it?

1 author picked Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow 3e as one of their favorite books, and they share why you should read it.

What is this book about?

Through a recent series of breakthroughs, deep learning has boosted the entire field of machine learning. Now, even programmers who know close to nothing about this technology can use simple, efficient tools to implement programs capable of learning from data. This best-selling book uses concrete examples, minimal theory, and production-ready Python frameworks--scikit-learn, Keras, and TensorFlow--to help you gain an intuitive understanding of the concepts and tools for building intelligent systems.

With this updated third edition, author Aurelien Geron explores a range of techniques, starting with simple linear regression and progressing to deep neural networks. Numerous code examples and exercises throughout…


Dear Data

By Giorgia Lupi, Stefanie Posavec,

Book cover of Dear Data

Roger Highfield Author Of The Dance of Life: Symmetry, Cells and How We Become Human

From the list on what big data is and how it impacts us.

Who am I?

I’m the Science Director of the Science Museum Group, based at the Science Museum in London, and visiting professor at the Dunn School, University of Oxford, and Department of Chemistry, University College London. Every time I write a book I swear that it will be my last and yet I'm now working on my ninth, after earlier forays into the physics of Christmas and the love life of Albert Einstein. Working with Peter Coveney of UCL, we're exploring ideas about computation and complexity we tackled in our two earlier books, along with the revolutionary implications of creating digital twins of people from the colossal amount of patient data now flowing from labs worldwide.

Roger's book list on what big data is and how it impacts us

Discover why each book is one of Roger's favorite books.

Why did Roger love this book?

Over a single year, Giorgia Lupi, an Italian living in New York, and Stefanie Posavec, an American in London, exchanged hand-drawn postcards to chart the granular details of their lives using clusters, plots, and graphs. We featured the outpourings of these talented “information designers” in a 2016 Science Museum exhibition on big data and these striking images, in turn, paved the way for their book, Dear Data, which provides a remarkable portrait of these artists. An intimate and human take on big data that invites us all to ponder how to represent our own lives.   

By Giorgia Lupi, Stefanie Posavec,

Why should I read it?

2 authors picked Dear Data as one of their favorite books, and they share why you should read it.

What is this book about?

From an award-winning project comes an inspiring, collaborative book that makes data artistic, personal - and open to all

Each week for a year, Giorgia and Stefanie sent each other a postcard describing what had happened to them during that week around a particular theme. But they didn't write it, they drew it: a week of smiling, a week of apologies, a week of desires.

Presenting their fifty-two cards, along with thoughts and ideas about the data-drawing process, Dear Data hopes to inspire you to draw, slow down and make connections with other people, to see the world through a…