My favorite books to learn about machine learning and deep neural networks

Why am I passionate about this?

I started my career in neuroscience. I wanted to understand brains. That is still proving difficult, and somewhere along the way, I realized my real motivation was to build things, and I wound up working in AI. I love the elegance of mathematical models of the world. Even the simplest machine learning model has complex implications, and exploring them is a joy.


I wrote...

Understanding Deep Learning

By Simon J.D. Prince,

Book cover of Understanding Deep Learning

What is my book about?

This book is about the ideas that underlie deep learning. The first part of the book introduces deep learning models and discusses how to train them, measure their performance, and improve this performance. The next part considers architectures that are specialized to images, text, and graph data. 

If you know nothing about deep learning, this book will take you close to the frontier of research. If you are teaching deep learning, it has many novel illustrations, problems, and 68 Python notebooks for your class. If you work in or with deep learning, it will fill in the gaps in your knowledge and make you think about things you already know from different viewpoints.

Shepherd is reader supported. When you buy books, we may earn an affiliate commission.

The books I picked & why

Book cover of Probabilistic Machine Learning: An Introduction

Simon J.D. Prince Why did I love this book?

My knees tremble and my heart quakes when I think of how much work must have gone into these two companion volumes. Collectively, they are more than four times the length of my book, covering the whole of machine learning.

It is an essential encyclopedic resource that should be on the desk of anyone serious about machine learning.

By Kevin P. Murphy,

Why should I read it?

1 author picked Probabilistic Machine Learning as one of their favorite books, and they share why you should read it.

What is this book about?

A detailed and up-to-date introduction to machine learning, presented through the unifying lens of probabilistic modeling and Bayesian decision theory.

This book offers a detailed and up-to-date introduction to machine learning (including deep learning) through the unifying lens of probabilistic modeling and Bayesian decision theory. The book covers mathematical background (including linear algebra and optimization), basic supervised learning (including linear and logistic regression and deep neural networks), as well as more advanced topics (including transfer learning and unsupervised learning). End-of-chapter exercises allow students to apply what they have learned, and an appendix covers notation.

Probabilistic Machine Learning grew out of…


Book cover of Dive into Deep Learning

Simon J.D. Prince Why did I love this book?

This is the practical book that best accompanies my book (which is more about the underlying ideas.)

If you want a book that will show you how deep learning systems are built in practice, then this is the best place to start. It’s full of code snippets that translate between theory and building real systems.

By Aston Zhang, Zachary C. Lipton, Mu Li , Alexander J. Smola

Why should I read it?

1 author picked Dive into Deep Learning as one of their favorite books, and they share why you should read it.

What is this book about?

Deep learning has revolutionized pattern recognition, introducing tools that power a wide range of technologies in such diverse fields as computer vision, natural language processing, and automatic speech recognition. Applying deep learning requires you to simultaneously understand how to cast a problem, the basic mathematics of modeling, the algorithms for fitting your models to data, and the engineering techniques to implement it all. This book is a comprehensive resource that makes deep learning approachable, while still providing sufficient technical depth to enable engineers, scientists, and students to use deep learning in their own work. No previous background in machine learning…


Book cover of Information Theory, Inference and Learning Algorithms

Simon J.D. Prince Why did I love this book?

The best parts of this book really represent a gold standard in pedagogical clarity.

Although it’s now twenty years old, there is still much to learn from this rather unconventional book that covers the boundary between machine learning, information theory, and Bayesian methods. There are also odd tangents and curiosities, some of which work better than others but are never dull.

Just writing this review makes me want to go back to it and squeeze more out of it.

By David JC MacKay,

Why should I read it?

2 authors picked Information Theory, Inference and Learning Algorithms as one of their favorite books, and they share why you should read it.

What is this book about?

Information theory and inference, taught together in this exciting textbook, lie at the heart of many important areas of modern technology - communication, signal processing, data mining, machine learning, pattern recognition, computational neuroscience, bioinformatics and cryptography. The book introduces theory in tandem with applications. Information theory is taught alongside practical communication systems such as arithmetic coding for data compression and sparse-graph codes for error-correction. Inference techniques, including message-passing algorithms, Monte Carlo methods and variational approximations, are developed alongside applications to clustering, convolutional codes, independent component analysis, and neural networks. Uniquely, the book covers state-of-the-art error-correcting codes, including low-density-parity-check codes, turbo…


Book cover of The Shortcut: Why Intelligent Machines Do Not Think Like Us

Simon J.D. Prince Why did I love this book?

This is a popular science book, so a little different from the others on this list. It is a beautifully written book that is accessible to people who don’t know much about AI but is simultaneously thought-provoking for experts.

It contains probably the best discussion of "intelligence" that I've read, interesting insights into how Google and other tech giants came to develop their machine learning strategy, and a fascinating chapter that views recommendation engines and their users as parts of a single intelligent organism. It's concise and easy to read.

I've read many popular AI books, which are highly variable in quality, and this criminally underappreciated work is the best by miles. 

By Nello Cristianini,

Why should I read it?

1 author picked The Shortcut as one of their favorite books, and they share why you should read it.

What is this book about?

- The author is one of the most influential AI reseachers of recent decades.

- Written in an accessible language, the book provides a probing account of AI today and proposes a new narrative to connect and make sense of events that happened in the recent tumultuous past and enable us to think soberly about the road ahead.

- The book is divided into ten carefully crafted and easily-digestible chapters, each grapples with an important question for AI, ranging from the scientific concepts that underpin the technology to wider implications for society, using real examples wherever possible.


Book cover of Foundations of Deep Reinforcement Learning: Theory and Practice in Python

Simon J.D. Prince Why did I love this book?

Of course, this is not the obvious book to recommend for reinforcement learning, but if you are a beginner, then it’s a quick and easy place to start. It’s compact and gets straight into the main algorithms.

It has a good balance between theory and code and will get you up and running quickly.

By Laura Graesser, Wah Loon Keng,

Why should I read it?

1 author picked Foundations of Deep Reinforcement Learning as one of their favorite books, and they share why you should read it.

What is this book about?

The Contemporary Introduction to Deep Reinforcement Learning that Combines Theory and Practice

Deep reinforcement learning (deep RL) combines deep learning and reinforcement learning, in which artificial agents learn to solve sequential decision-making problems. In the past decade deep RL has achieved remarkable results on a range of problems, from single and multiplayer games-such as Go, Atari games, and DotA 2-to robotics.

Foundations of Deep Reinforcement Learning is an introduction to deep RL that uniquely combines both theory and implementation. It starts with intuition, then carefully explains the theory of deep RL algorithms, discusses implementations in its companion software library SLM…


You might also like...

American Flygirl

By Susan Tate Ankeny,

Book cover of American Flygirl

Susan Tate Ankeny Author Of The Girl and the Bombardier: A True Story of Resistance and Rescue in Nazi-Occupied France

New book alert!

Why am I passionate about this?

Susan Tate Ankeny left a career in teaching to write the story of her father’s escape from Nazi-occupied France. In 2011, after being led on his path through France by the same Resistance fighters who guided him in 1944, she felt inspired to tell the story of these brave French patriots, especially the 17-year-old- girl who risked her own life to save her father’s. Susan is a member of the 8th Air Force Historical Society, the Air Force Escape and Evasion Society, and the Association des Sauveteurs d’Aviateurs Alliés. 

Susan's book list on women during WW2

What is my book about?

The first and only full-length biography of Hazel Ying Lee, an unrecognized pioneer and unsung World War II hero who fought for a country that actively discriminated against her gender, race, and ambition.

This unique hidden figure defied countless stereotypes to become the first Asian American woman in United States history to earn a pilot's license, and the first female Asian American pilot to fly for the military.

Her achievements, passionate drive, and resistance in the face of oppression as a daughter of Chinese immigrants and a female aviator changed the course of history. Now the remarkable story of a fearless underdog finally surfaces to inspire anyone to reach toward the sky.

American Flygirl

By Susan Tate Ankeny,

What is this book about?

One of WWII’s most uniquely hidden figures, Hazel Ying Lee was the first Asian American woman to earn a pilot’s license, join the WASPs, and fly for the United States military amid widespread anti-Asian sentiment and policies.

Her singular story of patriotism, barrier breaking, and fearless sacrifice is told for the first time in full for readers of The Women with Silver Wings by Katherine Sharp Landdeck, A Woman of No Importance by Sonia Purnell, The Last Boat Out of Shanghai by Helen Zia, Facing the Mountain by Daniel James Brown and all Asian American, women’s and WWII history books.…


5 book lists we think you will like!

Interested in machine learning, deep learning, and modernity?

10,000+ authors have recommended their favorite books and what they love about them. Browse their picks for the best books about machine learning, deep learning, and modernity.

Machine Learning Explore 47 books about machine learning
Deep Learning Explore 16 books about deep learning
Modernity Explore 49 books about modernity