Databases
Database systems, query languages, and storage engines.
Databases addresses database systems, query languages, and storage engines. It sits within Data and Information and inherits that area’s core questions about correctness, scale, and tractability. This page surveys the conceptual axes of the topic and points to the references that frame ongoing research and teaching. The intent is to be useful both as an entry point for newcomers and as an index for practitioners cross-checking their mental model against the field’s primary sources.
Work on databases can be organised around a few interlocking concerns: the formal objects under study, the algorithms or systems that compute over them, the resource trade-offs (time, memory, communication, statistical efficiency), and the empirical or theoretical guarantees that practitioners rely on. The sources cited below approach the topic from a mix of these angles.
Foundational references
Silberschatz, Database System Concepts (2019) is a standard reference for this material and is used both as a curriculum anchor and as a long-form survey of techniques.
Supporting and complementary work
Hellerstein, Readings in Database Systems (2015) provides supporting material that complements the primary references — readers comparing approaches will find useful framings, alternative notations, or extensions there.
Historical context
A Relational Model of Data for Large Shared Data Banks (Codd, 1970) situates the topic in its historical trajectory; revisiting it clarifies which ideas in current practice are recent and which trace back to the field’s founding texts.
Open methodological questions in databases cluster around how to compose the techniques above under realistic constraints — scale, adversarial inputs, partial observability, and shifting workloads. The cited references give the precise statements, proofs, and empirical evaluations that this overview only sketches; downstream topic pages drill into specific subfields.
Prerequisites
Sources
- textbook · primary · 2019Database System Conceptssilberschatz-2019
-
-
In context
Where this topic sits in the prerequisite graph. Click any node to jump.
Explore
- 01
Relational Databases
Relational model, SQL, and normalization.
- 02
Query Optimization
Cost-based and rule-based query optimizers.
- 03
Query Execution
Volcano, vectorized, and compiled execution engines.
- 04
Transactions and Concurrency Control
ACID, MVCC, and serializability.
- 05
Storage Engines
B-tree, LSM-tree, and column-store storage engines.
- 06
Distributed Databases
Spanner-style and NewSQL distributed databases.
- 07
NoSQL Databases
Key-value, document, wide-column, and graph databases.
- 08
Graph Databases
Property graphs, RDF, and graph query languages.
- 09
Time-Series Databases
Storage and querying for time-stamped data.
- 10
Streaming Databases
Continuous queries, event-time processing, and materialized views.
- 11
Data Warehousing
OLAP cubes, star schemas, and analytical databases.
- 12
Lakehouse Architectures
Delta Lake, Iceberg, and unified data-lake/warehouse designs.
- 13
Database Recovery
ARIES, write-ahead logging, and crash recovery.
- 14
Data Cleaning and Integration
Entity resolution, schema matching, and ETL.
- 15
Learned Database Systems
ML-augmented indexes, optimizers, and components.
Review this topic
This page was drafted by an agent and is waiting on expert review. Spotted a wrong prerequisite, a missing concept, a misattributed source, or a factual slip? Tell us — your review opens a tracked issue maintainers act on.