The most recommended data mining books

Who picked these books? Meet our 11 experts.

Ronny Kohavi Author
Gary Smith Author
Gregg Bernstein Author
Yuxi (Hayden) Liu Author
Jeremy Adamson Author
Peter J. Bentley Author
and 5 more

Eleven authors each created a book list connected to data mining; here are their favorite data mining books.

Calling Bullshit

By Carl T. Bergstrom, Jevin D. West

From Olivier's 3 favorite reads in 2024.

By Olivier Sibony Author

Why did Olivier love this book?

There are lots of books that claim to help you think. There is even a "smart thinking" category in some bookstores, presumably to distinguish the books it contains from the ones that promote dumb thinking. I have read many such books (and written a couple myself). This one is my favorite. It is packed with concrete examples and fun to read. You will genuinely be smarter after reading this book.

Why should I read it?

4 authors picked Calling Bullshit as one of their favorite books, and they share why you should read it.

What is this book about?

Bullshit isn’t what it used to be. Now, two science professors give us the tools to dismantle misinformation and think clearly in a world of fake news and bad data.

“A modern classic . . . a straight-talking survival guide to the mean streets of a dying democracy and a global pandemic.”—Wired

Misinformation, disinformation, and fake news abound and it’s increasingly difficult to know what’s true. Our media environment has become hyperpartisan. Science is conducted by press release. Startup culture elevates bullshit to high art. We are fairly well equipped to spot the sort of old-school bullshit that is based…

Interviewing Users

By Steve Portigal

From my list on understanding user research.

By Gregg Bernstein Author

Why did Gregg love this book?

Listening to users is essential to product design and development, full stop. Interviews allow us to understand who uses our products and the contexts our products fit into, and Steve Portigal demonstrates how to do it like a pro in Interviewing Users. Steve breaks down every angle of the interview process, from planning to conducting to documentation. (I particularly love Steve’s approach to the interview field guide in chapter 3!)

Why should I read it?

1 author picked Interviewing Users as one of their favorite books, and they share why you should read it.

What is this book about?

Interviewing is a foundational user research tool that people assume they already possess. Everyone can ask questions, right? Unfortunately, that's not the case. Interviewing Users provides invaluable interviewing techniques and tools that enable you to conduct informative interviews with anyone. You'll move from simply gathering data to uncovering powerful insights about people.

Programming Collective Intelligence

By Toby Segaran

From my list on machine learning for beginners.

By Yuxi (Hayden) Liu Author

Why did Yuxi love this book?

This was my favorite book when I started my career. It discusses how information is processed intelligently in the internet age. It serves as a tutorial that teaches developers how to code their own ML programs, from online dating services to document analyzers and search engines. The author did an excellent job of explaining abstract ML algorithms with clear examples. His Python coding style reads clearly, which makes the book more beginner-friendly.

Don’t be disappointed to learn that this book is more than a decade old. It was a visionary book in its day, and it is still relevant today.

Why should I read it?

1 author picked Programming Collective Intelligence as one of their favorite books, and they share why you should read it.

What is this book about?

Want to tap the power behind search rankings, product recommendations, social bookmarking, and online matchmaking? This fascinating book demonstrates how you can build Web 2.0 applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it. Programming Collective Intelligence takes you into the world of machine learning and statistics, and explains how to draw conclusions about user experience, marketing,…

Be Data Literate

By Jordan Morrow

From my list for data science and analytics leaders.

By Jeremy Adamson Author

Why did Jeremy love this book?

Not everybody needs to be a data scientist, but everybody does need to be data literate. Without an intentional focus on evangelism and on building a strong data culture in your organization, it will be an uphill battle to make meaningful change. This book helps individuals and leaders understand what data literacy is and how we can build it like any other skill.

Why should I read it?

1 author picked Be Data Literate as one of their favorite books, and they share why you should read it.

What is this book about?

In the fast moving world of the fourth industrial revolution not everyone needs to be a data scientist but everyone should be data literate, with the ability to read, analyze and communicate with data. It is not enough for a business to have the best data if those using it don't understand the right questions to ask or how to use the information generated to make decisions. Be Data Literate is the essential guide to developing the curiosity, creativity and critical thinking necessary to make anyone data literate, without retraining as a data scientist or statistician. With learnings to show…

Rage Inside the Machine

By Robert Elliott Smith

From my list on no hype and no nonsense artificial intelligence.

By Peter J. Bentley Author

Why did Peter love this book?

OK, I’m biased here because Rob is an old friend of mine. We first met at academic conferences and had several heated debates (arguments). But after spending a little time together at a workshop, we realised that each of us probably knew what he was talking about after all. I should make clear that this is Robert Elliott Smith, not the Rob Smith who writes about “Artificial Superintelligence”. Those books definitely do not make this list.

Our Rob is a coherent, grounded scientist with bags of real-world experience, and he brings his knowledge to this title with gusto, telling us about how AI is affecting our lives in ways you never thought possible – and often not in a good way. If you want to understand what can go wrong with AI and what we should be doing to stop it, don’t read about singularities or other such nonsense, read this.

Why should I read it?

1 author picked Rage Inside the Machine as one of their favorite books, and they share why you should read it.

What is this book about?

Shortlisted for the 2020 Business Book Awards

We live in a world increasingly ruled by technology; we seem as governed by technology as we do by laws and regulations. Frighteningly often, the influence of technology in and on our lives goes completely unchallenged by citizens and governments. We comfort ourselves with the soothing refrain that technology has no morals and can display no prejudice, and it's only the users of technology who distort certain aspects of it.

But is this statement actually true? Dr Robert Smith thinks it is dangerously untrue in the modern era.

Having worked in the field…

Information Quality

By Ron S. Kenett, Galit Shmueli

From my list on how numbers turn into information.

By Ron S. Kenett Author

Why did Ron love this book?

A lightly technical introduction to a comprehensive framework for defining and evaluating the quality of information generated by statistical analysis. It expands the role of analytics by including dimensions that affect information quality, such as data resolution, data integration, operationalization, and generalizability of findings. This wide-angle perspective provides a practical checklist that has been found useful in applications. Multiple case studies enable readers to connect with their favorite topic and also to learn from other areas.

Why should I read it?

1 author picked Information Quality as one of their favorite books, and they share why you should read it.

What is this book about?

Provides an important framework for data analysts in assessing the quality of data and its potential to provide meaningful insights through analysis.

Analytics and statistical analysis have become pervasive topics, mainly due to the growing availability of data and analytic tools. Technology, however, fails to deliver insights with added value if the quality of the information it generates is not assured. Information Quality (InfoQ) is a tool developed by the authors to assess the potential of a dataset to achieve a goal of interest, using data analysis. Whether the information quality of a dataset is sufficient is of practical importance…

R for Data Science

By Hadley Wickham, Garrett Grolemund

From my list on intro to programming and data science with R.

By Tilman M. Davies Author

Why did Tilman love this book?

For those intending to use R with an eye on the popular 'Tidyverse' suite of packages – which facilitate the handling, manipulation, and visualisation of data sets – it's hard to go past this book. From the founding contributors of the RStudio/Tidyverse worlds, this is a great way to learn about this dialect of R against the overarching backdrop of statistical data analysis and data science.

Why should I read it?

1 author picked R for Data Science as one of their favorite books, and they share why you should read it.

What is this book about?

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along…

Competing on Analytics

By Thomas H. Davenport, Jeanne G. Harris

From my list for data science and analytics leaders.

By Jeremy Adamson Author

Why did Jeremy love this book?

This is a foundational book on analytics and data science as a business function and helped to shape the development of the practice. It provides a view of the discipline through a business lens and avoids deep technical examinations. Though much has changed in the 15 years since it was originally published, it is still essential reading for a leader in the field. No book since has captured as well the competitive differentiation that analytics provides.

Why should I read it?

1 author picked Competing on Analytics as one of their favorite books, and they share why you should read it.

What is this book about?

You have more information at hand about your business environment than ever before. But are you using it to "out-think" your rivals? If not, you may be missing out on a potent competitive tool. In Competing on Analytics: The New Science of Winning, Thomas H. Davenport and Jeanne G. Harris argue that the frontier for using data to make decisions has shifted dramatically. Certain high-performing enterprises are now building their competitive strategies around data-driven insights that in turn generate impressive business results. Their secret weapon? Analytics: sophisticated quantitative and statistical analysis and predictive modeling. Exemplars of analytics are using new…

Introduction to Machine Learning with Python

By Andreas C. Müller, Sarah Guido

From my list on machine learning for beginners.

By Yuxi (Hayden) Liu Author

Why did Yuxi love this book?

This book is more advanced than the first book I recommended. It presents the theoretical and practical aspects of ML step by step, from the bottom up. Each chapter elaborates at length on a core building block in the ML life cycle. For example, feature engineering, supervised learning, and model evaluation have their own separate chapters, with intuitive discussions of how they work. Most of the concepts are taught through the simple yet powerful Python library scikit-learn, so it won’t overburden you with heavy programming. This book will be perfect for practitioners with some understanding of statistics and linear algebra.

Why should I read it?

1 author picked Introduction to Machine Learning with Python as one of their favorite books, and they share why you should read it.

What is this book about?

Machine learning has become an integral part of many commercial applications and research projects, but this field is not exclusive to large companies with extensive research teams. If you use Python, even as a beginner, this book will teach you practical ways to build your own machine learning solutions. With all the data available today, machine learning applications are limited only by your imagination. You'll learn the steps necessary to create a successful machine-learning application with Python and the scikit-learn library. Authors Andreas Müller and Sarah Guido focus on the practical aspects of using machine learning algorithms, rather than the…

Machine Learning For Absolute Beginners

By Oliver Theobald

From my list on machine learning for beginners.

By Yuxi (Hayden) Liu Author

Why did Yuxi love this book?

This could be the first stop on your brand-new machine learning journey. I personally like how technical concepts are translated into plain English: each chapter starts with a concise, clear, high-level overview of an ML algorithm or methodology, followed by lots of visual examples and real-world scenarios. I can guarantee you won’t get lost halfway. The book focuses on introducing you to ML with minimal math. But if you want to grasp more of the math, the next book I recommend is waiting for you.

Why should I read it?

1 author picked Machine Learning For Absolute Beginners as one of their favorite books, and they share why you should read it.

What is this book about?

NOTICE: To buy the newest edition of this book (2021), please search "Machine Learning Absolute Beginners Third Edition" on Amazon. The product page you are currently viewing is for the 2nd Edition (2017) of this book.

Featured by Tableau as the first of "7 Books About Machine Learning for Beginners."

Ready to spin up a virtual GPU instance and smash through petabytes of data? Want to add 'Machine Learning' to your LinkedIn profile?

Well, hold on there...

Before you embark on your epic journey, there are some high-level theory and statistical principles to weave through first. But rather than spend…
