Here are 100 books that Computer Vision fans have personally recommended if you like Computer Vision.
Shepherd is a community of 12,000+ authors and super readers sharing their favorite books with the world.
It’s been fantastic to work in computer vision, especially when it is used to build biometric systems. My 80-odd PhD students and I have pioneered systems that recognise people by the way they walk, by their ears, and many other new things too. To build the systems, we needed computer vision techniques and architectures, both of which work with complex real-world imagery. That’s what computer vision gives you: a capability to ‘see’ using a computer. I think we can still go a lot further: to give blind people sight, to enable better invasive surgery, to autonomise more of our industrial society, and to give us capabilities we never knew we’d have.
David Marr shaped the field of computer vision in its early days. His seminal book laid out a structure for interpreting images, one which is still largely followed. He popularised the notion of the primal sketch, and his work on edge detection led to one of the most sophisticated approaches. His work and influence continue to endure despite his early death: we missed and miss him a lot.
Available again, an influential book that offers a framework for understanding visual perception and considers fundamental questions about the brain and its functions.
David Marr's posthumously published Vision (1982) influenced a generation of brain and cognitive scientists, inspiring many to enter the field. In Vision, Marr describes a general framework for understanding visual perception and touches on broader questions about how the brain and its functions can be studied and understood. Researchers from a range of brain and cognitive sciences have long valued Marr's creativity, intellectual power, and ability to integrate insights and data from neuroscience, psychology, and computation. This…
Adding perspective puzzled artists in the fourteenth century; analysing perspective is integral to applied computer vision. You might have seen Hawk-Eye in action: the principles by which it works are explained superbly within this book. It was the first of its kind to set this analysis in a lucid and compelling format. Richard and Andrew’s text will be on researchers’ bookshelves for many years for its bedrock description of how we analyse three-dimensional scenes.
A basic problem in computer vision is to understand the structure of a real world scene given several images of it. Techniques for solving this problem are taken from projective geometry and photogrammetry. Here, the authors cover the geometric principles and their algebraic representation in terms of camera projection matrices, the fundamental matrix and the trifocal tensor. The theory and methods of computation of these entities are discussed with real examples, as is their use in the reconstruction of scenes from multiple images. The new edition features an extended introduction covering the key ideas in the book (which itself has…
This fine book is about learning the relationships between what is seen in an image, and what is known about the world. It’s a counterpart to our book on feature extraction and it shows you what can be achieved with the features. It’s not for those who shy from maths, as is the case for all of the books here. So that you can build the techniques, Simon’s book also includes a wide variety of algorithms to help you on your way.
This modern treatment of computer vision focuses on learning and inference in probabilistic models as a unifying theme. It shows how to use training data to learn the relationships between the observed image data and the aspects of the world that we wish to estimate, such as the 3D structure or the object class, and how to exploit these relationships to make new inferences about the world from new image data. With minimal prerequisites, the book starts from the basics of probability and model fitting and works up to real examples that the reader can implement and modify to build…
The advances of deep learning have been awesome, and fast. It’s been hard for the textbooks to keep up, so it’s good to include one that describes the advances and the state of the art very well. It seems appropriate that it’s edited by two leading researchers: Roy, who described computer vision system implementations in a long series of excellent books, and Matt, whose work transformed face recognition in the 1990s. This book gives you an image of where we are now in computer vision, and where we are going.
Advanced Methods and Deep Learning in Computer Vision presents advanced computer vision methods, emphasizing machine and deep learning techniques that have emerged during the past 5-10 years. The book provides clear explanations of principles and algorithms supported with applications. Topics covered include machine learning, deep learning networks, generative adversarial networks, deep reinforcement learning, self-supervised learning, extraction of robust features, object detection, semantic segmentation, linguistic descriptions of images, visual search, visual tracking, 3D shape retrieval, image inpainting, novelty and anomaly detection.
This book provides easy learning for researchers and practitioners of advanced computer vision methods, but it is also suitable as…
I’m a historian of Southern Africa who is fascinated by questions of visibility and invisibility. I love probing beneath the surface of the past. For example, why is this person famous and renowned, but that person isn’t? To me, recognition and reputation are interesting to scrutinize as social categories in their own right, rather than as factual statements. I’ve written two books focusing on the history of religious expression in Southern Africa, and my most recent book is a biography of the forgotten South African writer and politician Regina Gelana Twala.
This anthology of African women writers has been my personal lodestar in writing about Regina Twala, a forgotten African writer.
Busby (a pioneering editor and publisher of Ghanaian heritage) was one of the first to recognize that the canon of African writers was much bigger than famous men like Chinua Achebe and Wole Soyinka.
Her work taught me about a longstanding rich female literary tradition on the African continent – some of her earliest examples of women writers date to Ancient Egypt!
Busby recognizes that we can’t always look to the written page for evidence of this, given that many women writers were denied opportunities to publish their work.
So she broadens the focus of her anthology by paying attention to both “words and writing,” thinking about female writers of novels, poetry, plays, non-fiction, and journalism.
Three decades after her pioneering anthology, Daughters of Africa, Margaret Busby curates an extraordinary collection of contemporary writing by 200 women writers of African descent, including Zadie Smith, Bernardine Evaristo and Chimamanda Ngozi Adichie.
A glorious portrayal of the richness and range of African women's voices, this major international book brings together their achievements across a wealth of genres. From Antigua to Zimbabwe and Angola to the USA, overlooked artists of the past join key figures, popular contemporaries and emerging writers in paying tribute to the heritage that unites them, the strong links that endure from generation to generation, and…
My passion for generative AI first ignited in 2016 when I spoke about it at a conference, and ever since then, I can’t stop! I've created an online course, a newsletter and even wrote a book to spread knowledge on this groundbreaking technology. As an instructor, I empower others to explore the boundless potential of generative AI applications. Day in day out, I assist clients in crafting their own generative AI solutions, tailoring them to their unique needs.
Bishop’s book laid the mathematical groundwork for me, making it a solid foundation for anyone venturing into Generative AI.
I love how it covers Bayesian inference, graphical models, and machine learning fundamentals in a clear, approachable way. I also think that reading my book after this one would be a natural progression to understand where AI is heading, building on the core concepts that Bishop established.
Pattern recognition has its origins in engineering, whereas machine learning grew out of computer science. However, these activities can be viewed as two facets of the same field, and together they have undergone substantial development over the past ten years. In particular, Bayesian methods have grown from a specialist niche to become mainstream, while graphical models have emerged as a general framework for describing and applying probabilistic models. Also, the practical applicability of Bayesian methods has been greatly enhanced through the development of a range of approximate inference algorithms such as variational Bayes and expectation propagation. Similarly, new models…
I’m the Head of Trend and Innovation Scouting for Nokia, and I’ve been with the company since the glory days of Nokia mobile phone world dominance. I know first-hand what happens when a company focuses exclusively on the technology, not the humans that use it, and how quickly that can lead to disaster. One of the lessons that I see repeated continuously in the field of innovation is that a huge amount of attention gets paid to the new technology, and not nearly enough to how the technology will interact with our existing systems, beliefs, attitudes, and culture. Learning from these mistakes is the best way to make sure that the future doesn’t repeat them!
While the term “Metaverse” usually makes people think of a fully digital, immersive world, my own feeling is that technologies that bring digital information and entertainment into our physical world are a much more powerful and important arena. This leads us to the transformative and still-developing world of Augmented Reality.
David Rose of the MIT Media Lab has been working with Augmented Reality for more than a decade, and Supersight is an overview of what he's seen and what he’s learned in this time.
What I love about Supersight is that while David is clearly as excited about this topic as I am, he’s also a realist, and openly discusses issues and challenges with Augmented Reality. Perhaps most valuable are the 14 Augmented Reality Design Principles that he outlines – super realistic, super useful.
After reading this, you’ll have a very grounded idea of the capabilities and potential of…
For thousands of years, human vision has been largely unchanged by evolution.
We’re about to get a software update.
Today, Apple, Google, Microsoft, Facebook, Snap, Samsung, and a host of startups are racing to radically change the way we see. The building blocks are already falling into place: cloud computing and 5G networks, AI computer vision algorithms, smart glasses and VR headsets, and mixed reality games like Pokémon GO. But what’s coming next is a fundamental shift in how we experience the world and interact…
I’m a professor of computer science at Oregon State University. My research focus is on programming languages, but I also work on computer science education and outreach. I grew up in Germany and moved to the United States in 2000. Since computer science is a fairly new and not widely understood discipline, I am interested in explaining its core ideas to the general public. I believe that in order to attract a more diverse set of people to the field we should emphasize that coding is only a small part of computer science.
This book provides a brief introduction to the concept of algorithms before discussing the limitations of computation. Specifically, Harel explains undecidable problems (that is, problems for which no algorithm exists) and infeasible problems (that is, problems for which the only known algorithms have exponential runtime). I like this book (and its splendid title) because of its focus on the limitations of computation. Harel does a marvelous job of explaining two difficult topics about computation. Understanding any scientific discipline requires understanding its limits, and the limits of computation are as significant as they are surprising.
Computers are incredible. They are one of the most important inventions of the 20th century, dramatically and irrevocably changing the way we live. That is the good news. The bad news is that there are still major limitations to computers, serious problems that not even the most powerful computers can solve. The consequences of such limitations can be serious. Too often these limits get overlooked, in the quest for bigger, better, and more powerful computers. In Computers Ltd., David Harel, best-selling author of Algorithmics, explains and illustrates one of the most fundamental, yet under-exposed facets of computers - their inherent…
I have been coding for over 30 years. I’ve seen some miserable interfaces, and some large programs that collapse under their own weight. Software was, at one point, notorious for being late, over budget, and unreliable. These books have helped turn the corner on these failings, and I have found each of them very valuable in my day-to-day programming. While you can learn technique and even languages online, the kind of insight found in these books is rare and worth spending time and money on.
This book changed my entire perspective on writing the UI and UX of great software. Even the revised edition is a bit old, but it still has many valuable lessons to teach. Platt established many of the fundamental principles of writing usable and transparent software, and his book should be read not only by designers, but perhaps especially by programmers.
This non-technical book discusses the annoyances and dangers we encounter every day when using computers. Written with delightful wit and humor, as well as the insight of an experienced insider, it rips into the design of software much as Atul Gawande's Complications exposed the practice of medicine. Its basic message to ordinary people having problems learning or using their software is this: It's not your fault! It's not because you're dumb! Aimed primarily at casual users of software, the book tells readers what they should expect from their software and how to make their voices heard so that they receive…
As a kid, I used to do all the math problems in my textbooks just for fun, even if they weren’t part of a homework assignment. My grandchildren cringe when I tell them this. I am a researcher and educator in secure software engineering and have enjoyed a productive career in software development and management, software engineering and software security research, and software and secure software engineering education.
Although, strictly speaking, this book is not about software security, it is so well-known in the field as a general reference that it deserves to be on this list. It discusses the important issues of computer security and can be used as either a textbook or a reference. No doubt many, if not most, students of computer security are familiar with this book.
Today, everyone recognizes the importance of safeguarding computer systems and networks from vulnerability, attack, and compromise. But computer security is neither an easy art nor a simple science: its methodologies and technologies require rigorous study, and a deep grounding in principles that can be applied even as technologies change. Moreover, practitioners must understand how to align concepts with real policies, and then actually implement those policies -- managing inevitable tradeoffs such as "How secure do our devices really need to be, and how much inconvenience can we accept?"
In his extensively updated Computer Security: Art and Science, 2nd Edition, University…