Computer Vision (Deep Learning)

Deep-learning approaches to vision tasks.

foundation tier

Computer Vision (Deep Learning) addresses deep-learning approaches to vision tasks. It sits within AI and Machine Learning and inherits that area’s core questions about correctness, scale, and tractability. This page surveys the conceptual axes of the topic and points to the references that frame ongoing research and teaching. The intent is to be useful both as an entry point for newcomers and as an index for practitioners cross-checking their mental model against the field’s primary sources.

Work on computer vision (deep learning) can be organised around a few interlocking concerns: the formal objects under study, the algorithms or systems that compute over them, the resource trade-offs (time, memory, communication, statistical efficiency), and the empirical or theoretical guarantees that practitioners rely on. The sources cited below approach the topic from a mix of these angles.

Foundational references

Szeliski, Computer Vision: Algorithms and Applications (2022) is a standard reference for this material and is used both as a curriculum anchor and as a long-form survey of techniques.

Historical context

Deep Residual Learning for Image Recognition (He, 2016) situates the topic in its historical trajectory; revisiting it clarifies which ideas in current practice are recent and which trace back to the field’s founding texts. ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky, 2012) situates the topic in its historical trajectory; revisiting it clarifies which ideas in current practice are recent and which trace back to the field’s founding texts.

Open methodological questions in computer vision (deep learning) cluster around how to compose the techniques above under realistic constraints — scale, adversarial inputs, partial observability, and shifting workloads. The cited references give the precise statements, proofs, and empirical evaluations that this overview only sketches; downstream topic pages drill into specific subfields.

Prerequisites

Sources

textbook · primary · 2022

Computer Vision: Algorithms and Applications

szeliski-2022
paper · historical · 2016

Deep Residual Learning for Image Recognition

he-kaiming-2016
paper · historical · 2012

ImageNet Classification with Deep Convolutional Neural Networks

krizhevsky-2012

In context

Where this topic sits in the prerequisite graph. Click any node to jump.

Open in full atlas →

Reviewed by

@lucaderumier field

Explore

Review this topic

This page was drafted by an agent and is waiting on expert review. Spotted a wrong prerequisite, a missing concept, a misattributed source, or a factual slip? Tell us — your review opens a tracked issue maintainers act on.

Computer Vision (Deep Learning)

Foundational references

Historical context

Prerequisites

Sources

In context

Reviewed by

Explore

Image Classification

Neural Scene Representations

Diffusion Priors for 3D Generation

Object Detection

Semantic Segmentation

Vision Transformers

Instance Segmentation

Vision Foundation Models

Panoptic Segmentation

Depth Estimation

Pose Estimation

Action Recognition

Video Understanding

Object Tracking

3D Reconstruction

3D Gaussian Splatting

Image Restoration

Super-Resolution

Image Generation Models

Face Recognition

Medical Image Analysis

Remote Sensing

Document Analysis

Self-Supervised Vision

Adversarial Examples in Vision

Event-Based Vision

Review this topic