The most recommended data science books

Who picked these books? Meet our 19 experts.

19 authors created a book list connected to data science, and here are their favorite data science books.
When you buy books, we may earn a commission that helps keep our lights on (or join the rebellion as a member).

What type of data science book?

Loading...
Loading...

Book cover of Jumpstart Snowflake: A Step-by-Step Guide to Modern Cloud Analytics

Valliappa Lakshmanan Author Of Data Science on the Google Cloud Platform: Implementing End-To-End Real-Time Data Pipelines: From Ingest to Machine Learning

From my list on if you want to become a data scientist.

Why am I passionate about this?

I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.

Valliappa's book list on if you want to become a data scientist

Valliappa Lakshmanan Why did Valliappa love this book?

In industry, your data is very likely to live within a data warehouse such as BigQuery, Redshift, or Snowflake. Therefore, to be an effective data scientist in the industry, you should learn how to use data warehouses effectively. 

Once you learn data warehousing and SQL with any one of these products, it is quite easy to pick up another. So which one do you start with?

You can use Snowflake on all three of the major public clouds. Because it’s a standalone product, it is the most similar to a “traditional” data warehouse and can be picked up easily even if you are not familiar with cloud computing. That makes it a good data warehouse to start with, and is the reason my second book pick is this book on Snowflake.

BigQuery is also available on all three major public clouds, but it works best (and is used most commonly)…

By Dmitry Anoshin, Dmitry Shirokov, Donna Strok

Why should I read it?

1 author picked Jumpstart Snowflake as one of their favorite books, and they share why you should read it.

What is this book about?

Explore the modern market of data analytics platforms and the benefits of using Snowflake computing, the data warehouse built for the cloud.

With the rise of cloud technologies, organizations prefer to deploy their analytics using cloud providers such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform. Cloud vendors are offering modern data platforms for building cloud analytics solutions to collect data and consolidate into single storage solutions that provide insights for business users. The core of any analytics framework is the data warehouse, and previously customers did not have many choices of platform to use.

Snowflake was…


Book cover of The Real Work of Data Science: Turning Data Into Information, Better Decisions, and Stronger Organizations

Roger W. Hoerl Author Of Statistical Thinking: Improving Business Performance

From my list on AI and data science that are actually readable.

Why am I passionate about this?

As a professional statistician, I am naturally interested in AI and data science. However, in our current information age, everyone, in all segments of society, needs to understand the basics of AI and data science. These basics include such things as what these disciplines are, what they can contribute to society, and perhaps most importantly, what can go wrong. However, I have found that much of the literature on these topics is highly technical and beyond the reach of most readers. These books are specifically selected because they are readable by virtually everyone, and yet convey the key concepts needed to be data-literate in the 21st century. Enjoy!

Roger's book list on AI and data science that are actually readable

Roger W. Hoerl Why did Roger love this book?

This book goes beyond the hype of data science, the details of machine learning methods, and the coding so closely associated with data science. Rather, it emphasizes the real types of problems for which data science may help, and explains the practical issues (“the real work”) that often lead to failure in data science projects.

These issues tend to be overlooked in more technical presentations of data science. They include such critical considerations as defining the right problem to begin with, understanding the “pedigree” (background and quality) of any data used, and ensuring that the right people are involved from the start.

By Ron S. Kenett, Thomas C. Redman,

Why should I read it?

1 author picked The Real Work of Data Science as one of their favorite books, and they share why you should read it.

What is this book about?

The essential guide for data scientists and for leaders who must get more from their data science teams

The Economist boldly claims that data are now "the world's most valuable resource." But, as Kenett and Redman so richly describe, unlocking that value requires far more than technical excellence. The Real Work of Data Science explores understanding the problems, dealing with quality issues, building trust with decision makers, putting data science teams in the right organizational spots, and helping companies become data-driven. This is the work that spells the difference between a good data scientist and a great one, between a…


Book cover of R in Action: Data Analysis and Graphics with R

Tilman M. Davies Author Of The Book of R: A First Course in Programming and Statistics

From my list on intro to programming and data science with R.

Why am I passionate about this?

I’m an applied statistician and academic researcher/lecturer at New Zealand’s oldest university – the University of Otago. R facilitates everything I do – research, academic publication, and teaching. It’s the latter part of my job that motivated my own book on R. From first-year statistics students who have never seen R to my own Ph.D. students using R to implement novel and highly complex statistical methods and models, my experience is that all ultimately love the ease with which the R language permits exploration, visualisation, analysis, and inference of one’s data. The ever-growing need in today’s society for skilled statisticians and data scientists means there's never been a better time to learn this essential language.

Tilman's book list on intro to programming and data science with R

Tilman M. Davies Why did Tilman love this book?

This provides a superb balance between technical aspects of R coding and the statistical methods that motivate its use. It's rare to find a book on topics like this that are written with Kabacoff's easygoing yet precise style, which makes it ideal for beginners. From my own experience, it is obvious the author has spent many years teaching this type of content, knowing where things deserve extra explanation up front and where other more technical details can be relegated to more advanced texts.

By Robert I. Kabacoff,

Why should I read it?

1 author picked R in Action as one of their favorite books, and they share why you should read it.

What is this book about?

DESCRIPTION

R is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It runs on all important platforms and provides thousands of useful specialized modules and utilities. This makes R a great way to get meaningful information from mountains of raw data.



R in Action, Second Edition is language tutorial focused on practical problems. Written by a research methodologist, it takes a direct and modular approach to quickly give readers the information they need to produce useful results. Focusing on realistic data analyses and a comprehensive integration of graphics, it follows the steps that…


Book cover of How to Lie with Statistics

Bastiaan C. van Fraassen Author Of Philosophy and Science of Risk: An Introduction

From my list on exploring the meaning of probability and risk.

Why am I passionate about this?

I’ve wanted to be a philosopher since I read Plato’s Phaedo when I was 17, a new immigrant in Canada. Since then, I’ve been fascinated with time, space, and quantum mechanics and involved in the great debates about their mysteries. I saw probability coming into play more and more in curious roles both in the sciences and in practical life. These five books led me on an exciting journey into the history of probability, the meaning of risk, and the use of probability to assess the possibility of harm. I was gripped, entertained, illuminated, and often amazed at what I was discovering. 

Bastiaan's book list on exploring the meaning of probability and risk

Bastiaan C. van Fraassen Why did Bastiaan love this book?

I am laughing out loud, even now that I am rereading this book for the umpteenth time. Fraudsters are so clever, and so is advertising. And then there is sloppy journalism with its “wow” statistics.

I like his book enormously, not least because of its witty illustrations. It is subversive, comic, and provocative, and it makes me wise to seductive, misleading practices–and it does so with a light touch.

By Darrell Huff, Irving Geis (illustrator),

Why should I read it?

3 authors picked How to Lie with Statistics as one of their favorite books, and they share why you should read it.

What is this book about?

From distorted graphs and biased samples to misleading averages, there are countless statistical dodges that lend cover to anyone with an ax to grind or a product to sell. With abundant examples and illustrations, Darrell Huff's lively and engaging primer clarifies the basic principles of statistics and explains how they're used to present information in honest and not-so-honest ways. Now even more indispensable in our data-driven world than it was when first published, How to Lie with Statistics is the book that generations of readers have relied on to keep from being fooled.


Book cover of Be Data Literate: The Data Literacy Skills Everyone Needs to Succeed

Jeremy Adamson Author Of Minding the Machines: Building and Leading Data Science and Analytics Teams

From my list on for data science and analytics leaders.

Why am I passionate about this?

I am a leader in analytics and AI strategy, and have a broad range of experience in aviation, energy, financial services, and the public sector.  I have worked with several major organizations to help them establish a leadership position in data science and to unlock real business value using advanced analytics. 

Jeremy's book list on for data science and analytics leaders

Jeremy Adamson Why did Jeremy love this book?

Not everybody needs to be a data scientist, but everybody does need to be data literate. Without an intentional focus on evangelism and building a strong data culture in your organization it will be an uphill battle to make meaningful change. This book helps individuals and leaders to understand what data literacy is, and how we can build it like any other skill.

By Jordan Morrow,

Why should I read it?

1 author picked Be Data Literate as one of their favorite books, and they share why you should read it.

What is this book about?

In the fast moving world of the fourth industrial revolution not everyone needs to be a data scientist but everyone should be data literate, with the ability to read, analyze and communicate with data. It is not enough for a business to have the best data if those using it don't understand the right questions to ask or how to use the information generated to make decisions. Be Data Literate is the essential guide to developing the curiosity, creativity and critical thinking necessary to make anyone data literate, without retraining as a data scientist or statistician. With learnings to show…


Book cover of Data Sketches

Adam Fortuna

From Adam's 3 favorite reads in 2023.

Why am I passionate about this?

Programmer Community builder Playful Explorer Optimizer

Adam's 3 favorite reads in 2023

Adam Fortuna Why did Adam love this book?

Data visualizations are a cross between art, programming, and storytelling.

I've always been fascinated by the process creators go through to bring something from their imagination into existence. What amazed me was how the journey isn't a clear path from idea to finished product. I loved how Nadieh and Shirley documented their thought process – bringing me along and sharing why they made each decision.

Each chapter is a breakdown of a different data visualization. I laughed at how many of them were nerdy interests I loved: Dance Dance Revolution, Card Captor Sakura, and Lord of the Rings, to name a few. It reminded me that if I have fun, that'll show up in the finished product.

By Shirley Wu, Nadieh Bremer,

Why should I read it?

1 author picked Data Sketches as one of their favorite books, and they share why you should read it.

What is this book about?

In Data Sketches, Nadieh Bremer and Shirley Wu document the deeply creative process behind 24 unique data visualization projects, and they combine this with powerful technical insights which reveal the mindset behind coding creatively. Exploring 12 different themes - from the Olympics to Presidents & Royals and from Movies to Myths & Legends - each pair of visualizations explores different technologies and forms, blurring the boundary between visualization as an exploratory tool and an artform in its own right. This beautiful book provides an intimate, behind-the-scenes account of all 24 projects and shares the authors' personal notes and drafts every…


Book cover of The Practice of Management

Jeremy Adamson Author Of Minding the Machines: Building and Leading Data Science and Analytics Teams

From my list on for data science and analytics leaders.

Why am I passionate about this?

I am a leader in analytics and AI strategy, and have a broad range of experience in aviation, energy, financial services, and the public sector.  I have worked with several major organizations to help them establish a leadership position in data science and to unlock real business value using advanced analytics. 

Jeremy's book list on for data science and analytics leaders

Jeremy Adamson Why did Jeremy love this book?

Management as a skill is typically established and honed by osmosis, mimicry, and corporate crash courses. Data scientists pursuing management roles need to understand management from base principles to create meaningful change and establish productive team conventions. After almost 70 years, Drucker’s book still stands up as a foundational piece of reading.

By Peter F. Drucker,

Why should I read it?

1 author picked The Practice of Management as one of their favorite books, and they share why you should read it.

What is this book about?

A classic since its publication in 1954, The Practice of Management was the first book to look at management as a whole and being a manager as a separate responsibility. The Practice of Management created the discipline of modern management practices. Readable, fundamental, and basic, it remains an essential book for students, aspiring managers, and seasoned professionals.


Book cover of People Skills for Analytical Thinkers

Jeremy Adamson Author Of Minding the Machines: Building and Leading Data Science and Analytics Teams

From my list on for data science and analytics leaders.

Why am I passionate about this?

I am a leader in analytics and AI strategy, and have a broad range of experience in aviation, energy, financial services, and the public sector.  I have worked with several major organizations to help them establish a leadership position in data science and to unlock real business value using advanced analytics. 

Jeremy's book list on for data science and analytics leaders

Jeremy Adamson Why did Jeremy love this book?

Since data science is, at its core, people helping people make decisions, it is essential that we can establish productive relationships with our stakeholders. This is a skill that needs to be given the same level of effort as we give to coding or statistics. Gilbert’s book is a great resource to help technically oriented people to advance their people skills.

By Gilbert Eijkelenboom,

Why should I read it?

1 author picked People Skills for Analytical Thinkers as one of their favorite books, and they share why you should read it.

What is this book about?

"For the engineer, scientist, or technology professional seeking to communicate better in the business world, this is the book you've been craving your entire career!" ”
— Douglas Laney, Innovation Fellow, West Monroe, and best-selling author of "Infonomics"

Your analytical skills are incredibly valuable. However, rational thinking alone isn’t enough.

Have you ever: Presented an idea, but then no one seemed to care? Explained your analysis, only to leave your colleague confused? Struggled to work with people who are less analytical and more emotional?

In these situations, people skills make the difference, and research shows these skills are becoming increasingly…


Book cover of All-in On AI: How Smart Companies Win Big with Artificial Intelligence

Flora Delaney Author Of Retail The Second-Oldest Profession: 7 Timeless Principles to WIN in Retail Today

From Flora's 3 favorite reads in 2024.

Why am I passionate about this?

Author

Flora's 3 favorite reads in 2024

Flora Delaney Why did Flora love this book?

A great way to see how AI is being used by companies and not just the future predictions of how AI could be used. Made me more open to how AI will change my industry (retail) and how people can use it to make better decisions. It kicked off my current journy to become more AI aware. As always, I appreciate anything that Thomas Davenport writes.

By Thomas H. Davenport, Nitin Mittal,

Why should I read it?

2 authors picked All-in On AI as one of their favorite books, and they share why you should read it.

What is this book about?

A Wall Street Journal bestseller

A Publisher's Weekly bestseller

A fascinating look at the trailblazing companies using artificial intelligence to create new competitive advantage, from the author of the business classic, Competing on Analytics, and the head of Deloitte's US AI practice.

Though most organizations are placing modest bets on artificial intelligence, there is a world-class group of companies that are going all-in on the technology and radically transforming their products, processes, strategies, customer relationships, and cultures.

Though these organizations represent less than 1 percent of large companies, they are all high performers in their industries. They have better business…


Book cover of The Golem: What You Should Know about Science

Aubrey Clayton Author Of Bernoulli's Fallacy: Statistical Illogic and the Crisis of Modern Science

From my list on for data scientists trying to be ethical people.

Why am I passionate about this?

I studied statistics and data science for years before anyone ever suggested to me that these topics might have an ethical dimension, or that my numerical tools were products of human beings with motivations specific to their time and place. I’ve since written about the history and philosophy of mathematical probability and statistics, and I’ve come to understand just how important that historical background is and how critically important it is that the next generation of data scientists understand where these ideas come from and their potential to do harm. I hope anyone who reads these books avoids getting blinkered by the ideas that data = objectivity and that science is morally neutral.

Aubrey's book list on for data scientists trying to be ethical people

Aubrey Clayton Why did Aubrey love this book?

The thing you should know about science is that it’s a human enterprise. As a result, it’s dependent on human factors like social consensus and prejudice. In this series of case studies of famously expensive and difficult-to-replicate experiments probing the limits of scientific understanding from biology to theoretical physics, Collins and Pinch show how scientific knowledge gathering is rarely straightforward because there are always alternative explanations available for the data. Was the phenomenon real or was the experiment set up badly? We can never know for sure, but we decide collectively what we believe. Scientists are experts participating in human culture, they argue, not mysterious clergy issuing declarations of absolute truth.

By Harry M. Collins, Trevor Pinch,

Why should I read it?

1 author picked The Golem as one of their favorite books, and they share why you should read it.

What is this book about?

Harry Collins and Trevor Pinch liken science to the Golem, a creature from Jewish mythology, powerful yet potentially dangerous, a gentle, helpful creature that may yet run amok at any moment. Through a series of intriguing case studies the authors debunk the traditional view that science is the straightforward result of competent theorisation, observation and experimentation. The very well-received first edition generated much debate, reflected in a substantial new Afterword in this second edition, which seeks to place the book in what have become known as 'the science wars'.


Book cover of Jumpstart Snowflake: A Step-by-Step Guide to Modern Cloud Analytics
Book cover of The Real Work of Data Science: Turning Data Into Information, Better Decisions, and Stronger Organizations
Book cover of R in Action: Data Analysis and Graphics with R

Share your top 3 reads of 2024!

And get a beautiful page showing off your 3 favorite reads.

1,587

readers submitted
so far, will you?