IB Biology · Theme B · B1.2

The shape
is the
function.

Twenty letters. One folding code. Almost everything cells actually do, done by proteins.

12Sub-topics
25Key terms
SL+HLLevel
MoleculesLevel of organisation
B1.2
Why this topic

What this topic answers.

Every sub-topic below feeds at least one of these questions.

Guiding question 1

What is the relationship between amino acid sequence and the diversity in form and function of proteins?

Guiding question 2

How are protein molecules affected by their chemical and physical environments?

B1.2.1 – B1.2.5 · Standard Level

5 things to lock in.

The required syllabus content for B1.2, in order. Each card is one lesson-sized checkpoint.

B1.2.1

Generalized structure of an amino acid

Generalized structure of an amino acid

B1.2.2

Condensation reactions forming dipeptides and longer chains of amino acids

Condensation reactions forming dipeptides and longer chains of amino acids

B1.2.3

Dietary requirements for amino acids

Dietary requirements for amino acids

B1.2.4

Infinite variety of possible peptide chains

Infinite variety of possible peptide chains

B1.2.5

Effect of pH and temperature on protein structure

Effect of pH and temperature on protein structure

B1.2.1 · Amino acid structure

Twenty letters. One template.

All 20 standard amino acids share the same four-group skeleton — only the R-group changes.

Every amino acid has the same central α-carbon, bonded to:

1

Amine group (-NH₂)

Basic. The same in every amino acid.

2

Carboxylic acid (-COOH)

Acidic. The same in every amino acid.

3

Hydrogen (-H)

A single hydrogen atom on the α-carbon.

4

R-group (side chain)

The variable group. This is what distinguishes the 20 amino acids.

B1.2.2 · Peptide bonds & polypeptides

From amino acids → polypeptides.

A condensation reaction between two amino acids forms a peptide bond and a dipeptide. Repeat thousands of times → a polypeptide.

The reaction joins the carboxylic acid carbon of one amino acid to the amine nitrogen of the next. –OH leaves from the carboxyl, –H from the amine, and they combine to release a water molecule. The remaining C–N bond is the peptide bond.

📝

Word equation (you should be able to write this)

Amino acid + amino acid → dipeptide + water

In a cell, polypeptide chains are built at ribosomes during the process of translation — many condensation reactions in sequence, building the polypeptide one amino acid at a time.

B1.2.3 · Dietary amino acids

9 essential. 11 non-essential.

Humans require all 20 amino acids to build proteins. We can synthesise 11 of them from other compounds. The other 9 — the essential amino acids — must come from food.

Animal protein sources (meat, dairy, eggs) generally provide all 9 essential amino acids in one source — they are "complete proteins". Most individual plant proteins are missing one or more essential amino acids.

🥗

Why vegan diets need variety

Vegans need to consume a mix of plant protein sources to make sure all essential amino acids are obtained. Common combinations: legumes + grains; nuts/seeds + vegetables. Soy and quinoa are unusual plants in providing all 9 essential amino acids in a single source.

B1.2.4 · Infinite variety

The combinatorial protein space.

With 20 possible amino acids at each position and chains thousands of amino acids long, the number of possible polypeptides is effectively infinite.

Amino acid alphabet
20 letters

The set coded for by the standard genetic code.

Shortest human polypeptide
51 AAs

Insulin's two chains: 21 + 30 amino acids.

Longest human polypeptide
34,350 AAs

Titin — the giant elastic protein of muscle.

Possible 100-AA sequences
20¹⁰⁰

≈10¹³⁰ — more than the atoms in the universe.

The genetic code specifies one of 20 amino acids per codon. Amino acids can be in any order. Chains can be any length from a few to tens of thousands. The combinatorial space is effectively unlimited — which is why proteins can carry out almost every function in biology.

B1.2.5 · Denaturation

When proteins lose their shape.

A protein's 3D shape determines its function. Disrupt the shape — by extreme pH, extreme temperature, or harsh chemicals — and the protein denatures: same chain, no function.

Denaturation is a conformational change in a protein that disrupts its 3D structure (without breaking the polypeptide chain). The protein loses its function. Often irreversible.

HL extension

Higher Level only.

An extra 7 sub-topics for HL — same syllabus, deeper mechanism.

HL only

Chemical diversity in the R-groups of amino acids as a basis for the immense diversity in protein form and function

Chemical diversity in the R-groups of amino acids as a basis for the immense diversity in protein form and function

HL only

Impact of primary structure on the conformation of proteins

Impact of primary structure on the conformation of proteins

HL only

Pleating and coiling of secondary structure of proteins

Pleating and coiling of secondary structure of proteins

HL only

Dependence of tertiary structure on hydrogen bonds, ionic bonds, disulfide covalent bonds and hydrophobic interactions

Dependence of tertiary structure on hydrogen bonds, ionic bonds, disulfide covalent bonds and hydrophobic interactions

HL only

Effect of polar and non-polar amino acids on tertiary structure of proteins

Effect of polar and non-polar amino acids on tertiary structure of proteins

HL only

Quaternary structure of non-conjugated and conjugated proteins

Quaternary structure of non-conjugated and conjugated proteins

HL only

Relationship of form and function in globular and fibrous proteins

Relationship of form and function in globular and fibrous proteins

B1.2.6 · R-group diversity

The R-group does the work.

R-groups are where the chemistry lives. Their diversity — polar, non-polar, acidic, basic — is what makes proteins so versatile.

R-groups vary in size, shape, polarity, and charge. They can be:

  • Hydrophobic — non-polar, repelled by water. Cluster in the core of soluble proteins.
  • Hydrophilic — polar — uncharged but with partial charges. Attracted to water.
  • Hydrophilic — charged — fully positive (basic) or fully negative (acidic) at physiological pH. Form ionic bonds.

The chemistry of the R-groups, plus their precise positions in the polypeptide, determines how the chain folds — and therefore the shape, location and function of the finished protein.

B1.2.7 · Primary structure

The sequence determines everything.

A protein's primary structure is the sequence of amino acids in its polypeptide — and this sequence alone, in most cases, determines the final 3D shape.

Primary structure is specified by the gene that codes for the protein. The DNA sequence is transcribed to mRNA, which is translated to a polypeptide with a specific sequence. Change a single base in the gene → change one amino acid → can change the entire protein's shape and function (sickle cell anaemia is one base change away from normal haemoglobin).

The remarkable thing is that proteins have precise, predictable, repeatable shapes — billions of polypeptides folding identically. Predicting structure from sequence is the "protein folding problem", recently solved with great success by AlphaFold.

B1.2.8 · Secondary structure

α-helices and β-pleated sheets.

Polypeptide backbones fold into two recurring local patterns — both stabilised by hydrogen bonds between backbone C=O and N–H groups.

α-helix

A right-handed coil

The polypeptide twists into a regular helix. Each turn contains ~3.6 amino acids. Hydrogen bonds form between every 4th amino acid's C=O and the next residue's N–H, stabilising the spiral.

β-pleated sheet

Side-by-side strands

Polypeptide strands lie next to each other (parallel or antiparallel), with hydrogen bonds between adjacent strands' backbones. Creates a corrugated "sheet" structure.

Secondary structure is essentially independent of R-groups — it's about the chain backbone. The recurring patterns add stability without specifying global shape.

B1.2.9 · Tertiary structure

R-groups find each other.

Tertiary structure is the overall 3D fold of a single polypeptide — driven by four kinds of interaction between R-groups.

1

Hydrogen bonds

Between polar R-groups — many but individually weak. Cumulatively important.

2

Ionic bonds

Between oppositely charged R-groups. Amine and carboxyl groups in R-groups can become positively or negatively charged by gaining or losing H⁺ — then they attract.

3

Disulfide bonds

Covalent S–S bonds between the R-groups of two cysteine amino acids. Strongest of the four. Lock in shape — important in proteins exported from the cell.

4

Hydrophobic interactions

Hydrophobic R-groups cluster together in the protein's core, away from surrounding water. Hydrophilic R-groups face outward.

B1.2.10 · Polar vs non-polar R-groups

Where the protein lives shapes its sequence.

Soluble proteins keep their hydrophobic R-groups buried inside. Membrane proteins flip the logic — they hide their hydrophilic groups and expose hydrophobic ones to the lipid bilayer.

Cytoplasm proteins

Hydrophobic core

Hydrophobic R-groups cluster in the protein's centre, shielded from water. Hydrophilic R-groups face outward toward the aqueous environment. This is how globular proteins (insulin, enzymes) stay soluble.

Membrane proteins

Hydrophobic belt

Integral membrane proteins have a band of hydrophobic R-groups around their middle — matching the hydrophobic tails of the phospholipid bilayer. Hydrophilic R-groups face the cytoplasm and external fluid at either end.

B1.2.11 · Quaternary structure

Multiple chains, sometimes a prosthetic group.

Many proteins consist of more than one polypeptide. Some also bind a non-polypeptide prosthetic group. Together these form the quaternary structure.

Non-conjugated

Insulin · collagen

Insulin: two polypeptide chains (A and B) joined by two disulfide bridges. Globular hormone — soluble in plasma, regulates blood glucose.
Collagen: three polypeptide chains coiled together into a triple helix. Fibrous structural protein found in skin, cartilage, bones.

Conjugated

Haemoglobin

Four polypeptide chains (2α + 2β) plus four haem prosthetic groups (each with an iron atom that binds O₂). The haem is non-polypeptide — protein + prosthetic group = conjugated protein.

🔬

Nature of Science · technology enables observation

Cryogenic electron microscopy (cryo-EM) has revolutionised the imaging of single-protein molecules — including how they interact with other molecules. It earned the 2017 Nobel Prize in Chemistry.

B1.2.12 · Globular vs fibrous

Form, tailored to function.

The big morphological division of proteins. Globular = compact, soluble, functional; fibrous = elongated, insoluble, structural.

Feature Insulin (globular) Collagen (fibrous)
ShapeSpherical, compactLong, narrow fibres
Amino acid sequenceIrregular; hydrophobic AAs buried in coreHighly repetitive
SolubilitySoluble in waterInsoluble in water
FunctionFunctional (hormone) — binds receptors, transported in bloodStructural — flexible, high tensile strength
Polypeptides2 chains, disulfide bridges3 chains, hydrogen bonds → triple helix

HL-only key terms

HydrophobicHydrophilicPolarNonpolarDenaturePrimary Protein StructureSecondary Protein StructureTertiary Protein StructureQuaternary Protein StructureConjugated ProteinProsthetic GroupGlobular ProteinFibrous Protein
Vocabulary

12 terms to own.

If you can't define one of these in a sentence, that's where to revise next.

Amino acidAmine GroupCarboxylic Acid GroupSide Chain (R Group)Condensation ReactionDipeptidePeptide BondPolypeptideEssential Amino AcidsNon-Essential Amino AcidsGenetic CodeDenaturation

IB Linking Questions

“How do abiotic factors influence the form of molecules?”

“What is the relationship between the genome and the proteome of an organism?”