IB Biology · Theme A · A3.2 · HL only

Drawing
the tree
of life.

If species are real, we need a way to draw them. Cladistics is biology's family-tree algorithm — shared derived characters all the way down.

9Sub-topics
18Key terms
HL onlyLevel
OrganismsLevel of organisation
CLADOGRAM
Why this topic

What this topic answers.

Every sub-topic below feeds at least one of these questions.

Guiding question 1

What tools are used to classify organisms into taxonomic groups?

Guiding question 2

How do cladistic methods differ from traditional taxonomic methods?

HL extension

Higher Level only.

An extra 9 sub-topics for HL — same syllabus, deeper mechanism.

HL only

Need for classification of organisms

Need for classification of organisms

HL only

Difficulties classifying organisms into the traditional hierarchy of taxa

Difficulties classifying organisms into the traditional hierarchy of taxa

HL only

Advantages of classification corresponding to evolutionary relationships

Advantages of classification corresponding to evolutionary relationships

HL only

Clades as groups of organisms with common ancestry and shared characteristics

Clades as groups of organisms with common ancestry and shared characteristics

HL only

Gradual accumulation of sequence differences as the basis for estimates of when clades diverged from a common ancestor

Gradual accumulation of sequence differences as the basis for estimates of when clades diverged from a common ancestor

HL only

Base sequences of genes or amino acid sequences of proteins as the basis for constructing cladograms

Examples can be simple and based on sample data to illustrate the tool.

HL only

Analysing cladograms

Analysing cladograms

HL only

Using cladistics to investigate whether the classification of groups corresponds to evolutionary relationships

Using cladistics to investigate whether the classification of groups corresponds to evolutionary relationships

HL only

Classification of all organisms into three domains using evidence from rRNA base sequences

Classification of all organisms into three domains using evidence from rRNA base sequences

A3.2.1 · Why classify?

~8.7 million species. Order required.

Classification is biology's filing system. Once you know an organism's group, you know hundreds of its properties without having to study them individually.

The estimated number of species on Earth is about 8.7 million. Classification organises this overwhelming diversity into a nested hierarchy, so that any further study can build on what we already know about the group. Identify a newly discovered mammal — and you instantly know it has hair, mammary glands, three middle-ear bones, a four-chambered heart, and gives live birth (with rare exceptions). That's the leverage classification gives.

A3.2.2 · The trouble with the traditional hierarchy

Linnaeus' arbitrary ranks.

Domain → Kingdom → Phylum → Class → Order → Family → Genus → Species. Memorable but problematic: the ranks don't always reflect evolutionary patterns.

🧠

Mnemonic

"Do Kindly Place Cover On Fresh Green Spring vegetables" — Domain, Kingdom, Phylum, Class, Order, Family, Genus, Species.

"Reptiles" don't exist

A famous example: the traditional class Reptilia (turtles, crocodiles, snakes, lizards) is not a clade. Birds — which Linnaeus put in a separate class — are more closely related to crocodiles than crocodiles are to turtles. To make Reptilia a clade, you'd need to include birds in it. Either reptiles is paraphyletic (excludes some descendants) or it doesn't exist as a proper taxonomic group.

🔄

Nature of Science · paradigm shift

Switching from traditional ranked classification to unranked clades based on common ancestry is a paradigm shift — a fundamental change in how the field organises its knowledge.

A3.2.3 · Why evolutionary classification wins

Groups that predict.

If classification follows evolutionary ancestry, members of a group truly share traits because they inherited them from a common ancestor.

The ideal classification follows real evolutionary history. In an evolution-based classification, every taxonomic group is a clade — a common ancestor plus all its descendants. The advantages:

  • Shared traits are real homologies, not coincidences of appearance.
  • You can predict properties of an organism from its clade — including traits not yet observed.
  • The classification reflects something true about the world, not just a convenient sorting.
A3.2.4 · Clades

Common ancestry plus shared characters.

A clade groups organisms (or viruses) that share a common ancestor and the derived characters inherited from it.

Best evidence

Molecular data

The most objective evidence for grouping organisms into clades comes from base sequences of genes or amino acid sequences of proteins. Sequences accumulate mutations as lineages diverge — fewer differences = more recent common ancestor.

Supporting evidence

Morphology

Shared physical traits can also support clade assignment, but watch for convergent evolution — unrelated species evolving similar features for similar functions. Wings of bats and birds are convergent, not shared from a common ancestor with wings.

A3.2.5 · The molecular clock

Dating divergence by counting mutations.

Mutations accumulate at a roughly constant rate. The more sequence differences between two lineages, the longer ago they shared a common ancestor.

Closely related species (recent common ancestor) have few differences in gene/protein sequences. Distantly related species have many. Calibrate the mutation rate (using fossils of known age) and you can convert sequence differences into divergence dates — millions of years ago when lineages last shared a common ancestor.

Limits of the molecular clock

  • Generation time — species with short generations (bacteria, insects) accumulate mutations faster than species with long generations (elephants, trees).
  • Population size — mutations get fixed in small populations more easily, speeding up the apparent clock.
  • Selective pressure — strong selection can accelerate or slow particular genes' clocks compared with neutral parts of the genome.

The molecular clock is therefore an estimate. To pin dates accurately, biologists calibrate clocks with the fossil record wherever they can.

A3.2.6 · Constructing cladograms

The tree with the fewest changes.

From sequence comparisons, multiple cladograms are mathematically possible. Biologists choose between them using parsimony.

A cladogram is a branching tree showing the most probable sequence of divergences. Nodes represent common ancestors; branches are lineages. Construction steps:

  1. Align sequences (genes or proteins) from multiple species.
  2. Count differences between every pair.
  3. Generate candidate cladograms — many topologies are possible.
  4. Apply parsimony: pick the cladogram that requires the fewest evolutionary changes.
📐

Parsimony principle

"A simple hypothesis with fewer evolutionary changes is more likely than a complex one with many." Doesn't guarantee the truth — but is the most defensible default when multiple histories are consistent with the data.

A3.2.7 · Reading cladograms

Common ancestors, read off the tree.

Three terms you must know — and one rule that controls them all.

Root

The deep ancestor

The bottommost node — represents the theoretical last common ancestor of every species on the cladogram.

Node

A branch point

A point where one lineage splits into two or more descendant lineages. Each node represents a (hypothetical) common ancestor at the moment of divergence.

Terminal branch

A tip

An endpoint — one species or group at the leaves of the tree.

📖

The rule

The further back (closer to the root) the most recent common ancestor of two groups, the more distantly related they are. Two species sharing a node near the tips of the tree are close cousins; two sharing only the root are most distant.

A clade = a node plus all its descendants. Cut any branch at any node and you have a clade. That's why classification by clades scales: every branch is a potentially named group.

A3.2.8 · When morphology lies

The figwort family story.

A classic case: a plant family classified by morphology, then dismantled by molecular evidence. Convergent evolution had fooled the morphologists.

The figwort family (Scrophulariaceae) was named in 1879 by Antoine de Jussieu, based on shared morphology. Two centuries later, cladistic analysis of three chloroplast genes for every member of the family revealed a much messier picture. The reclassification:

  • 50 genera → reclassified into the plantain family (Plantaginaceae)
  • 12 genera → reclassified into the broomrape family (Orobanchaceae)
  • 13 genera → new family Linderniaceae
  • 2 genera → new family Calceolariaceae
  • 2 smaller families merged into Scrophulariaceae

The morphological similarities that grouped these plants were due to convergent evolution — similar floral structures evolved independently in distantly related lineages, probably because the same pollinators favoured the same flower shape.

🔬

Nature of Science · falsification

Scientific theories — including taxonomic classifications — can be falsified by new evidence. The morphological figwort family was falsified by molecular data. Science accepts knowledge claims provisionally, always open to revision.

A3.2.9 · Three domains

The 1977 revolution.

Carl Woese added a new top-level rank — Domain — after rRNA sequencing revealed that prokaryotes are actually two deeply different groups.

Before 1977, organisms were placed into five kingdoms (Plants, Animals, Fungi, Protists, Bacteria). Woese's sequencing of ribosomal RNA (rRNA) — a gene present in every living thing and changing extremely slowly — showed that "bacteria" were actually two distinct domains, as different from each other as either was from eukaryotes. Crucially, Archaea rRNA is more similar to eukaryote rRNA than to bacterial rRNA — meaning archaea and eukaryotes share a more recent common ancestor than either does with bacteria.

Domain 1

Archaea

Prokaryotic. Many extremophiles (thermophiles, halophiles, methanogens). Unique membrane lipids; pseudopeptidoglycan cell walls. Closer to eukaryotes in rRNA.

Domain 2

Eubacteria

Prokaryotic. Peptidoglycan cell walls. The "true bacteria" we are most familiar with — E. coli, Streptococcus, cyanobacteria.

Domain 3

Eukarya

Eukaryotic — membrane-bound nuclei and organelles. Animals, plants, fungi, protists. More closely related to Archaea than to Bacteria.

HL-only key terms

TaxonomyTaxon / TaxaCladisticsCladeCladogramRoot (of a cladogram)NodeTerminal BranchEvolutionMorphologyMolecular ClockParsimony AnalysisCommon AncestorDomainArchaeaEubacteriaEukaryaParsimony analysis
Vocabulary

0 terms to own.

If you can't define one of these in a sentence, that's where to revise next.

IB Linking Questions

“What mechanisms contribute to convergent evolution?”

“To what extent is the natural history of life characterized by increasing complexity or simplicity?”