Bioinformatics
Computational methods for biological data.
Bioinformatics addresses computational methods for biological data. It sits within Applied and Interdisciplinary CS and inherits that area’s core questions about correctness, scale, and tractability. This page surveys the conceptual axes of the topic and points to the references that frame ongoing research and teaching. The intent is to be useful both as an entry point for newcomers and as an index for practitioners cross-checking their mental model against the field’s primary sources.
Work on bioinformatics can be organised around a few interlocking concerns: the formal objects under study, the algorithms or systems that compute over them, the resource trade-offs (time, memory, communication, statistical efficiency), and the empirical or theoretical guarantees that practitioners rely on. The sources cited below approach the topic from a mix of these angles.
Foundational references
Durbin, Biological Sequence Analysis (1998) is a standard reference for this material and is used both as a curriculum anchor and as a long-form survey of techniques.
Supporting and complementary work
Compeau, Bioinformatics Algorithms: An Active Learning Approach (2018) provides supporting material that complements the primary references — readers comparing approaches will find useful framings, alternative notations, or extensions there.
Open methodological questions in bioinformatics cluster around how to compose the techniques above under realistic constraints — scale, adversarial inputs, partial observability, and shifting workloads. The cited references give the precise statements, proofs, and empirical evaluations that this overview only sketches; downstream topic pages drill into specific subfields.
Prerequisites
Sources
- textbook · primary · 1998Biological Sequence Analysisdurbin-1998
- textbook · supporting · 2018Bioinformatics Algorithms: An Active Learning Approachcompeau-2018
In context
Where this topic sits in the prerequisite graph. Click any node to jump.
Explore
- 01
Sequence Alignment
Needleman-Wunsch, Smith-Waterman, and BLAST.
- 02
Genome Assembly
De Bruijn graph and overlap-layout-consensus assembly.
- 03
Variant Calling
Detecting SNVs, indels, and structural variants from sequencing data.
- 04
Phylogenetics
Tree inference from molecular sequences.
- 05
Protein Structure Prediction
AlphaFold, RoseTTAFold, and modern structure predictors.
- 06
Single-Cell Analysis
Computational methods for single-cell omics data.
- 07
Cancer Genomics
Computational pipelines for tumor characterization.
- 08
Structural Bioinformatics
Analysis of 3D structures of biomolecules.
- 09
Biomedical NLP
NLP over clinical notes and biomedical literature.
- 10
Protein Design
De novo and ML-driven design of proteins.
- 11
ML for Drug Discovery
Learning-based molecular property prediction and generation.
Review this topic
This page was drafted by an agent and is waiting on expert review. Spotted a wrong prerequisite, a missing concept, a misattributed source, or a factual slip? Tell us — your review opens a tracked issue maintainers act on.