Collider Measurements: Theory → Variable → Result
A collider experiment is a pipeline that turns a QFT prediction into a number with an uncertainty. This doc traces that pipeline end-to-end for the most important measurements, showing for each one:
- What theory predicts (the QFT computation, formula, free parameter(s))
- Which detector-level variable carries the prediction's signal
- How experimentalists reconstruct it from raw detector data
- The published result + dominant systematic uncertainty
The companion observables docs (electroweak.md, standard-model.md, cross-sections.md, decay-rates.md) catalogue what is measured; this doc explains how the measurement is actually done.
1. The Generic Collider-Measurement Pipeline
Every collider measurement follows the same chain. The vocabulary differs by measurement type but the structure does not:
The right end ( — a mass, coupling, cross section, asymmetry) is what gets published; everything to its left is what makes the comparison possible. The five chain steps map to standard software stacks:
| Step | What it does | Typical tools |
|---|---|---|
| Theory | Compute fixed-order amplitudes; convolute with PDFs | MadGraph, NLOJET++, MCFM, FEWZ, NNLOJET, OpenLoops |
| Parton shower + hadronization | Dress partons into hadrons | Pythia, Herwig, Sherpa |
| Detector simulation | Trace particles through magnetic field, calorimeters, trackers | Geant4, Delphes (fast) |
| Reconstruction | Tracks, vertices, jets, leptons, missing | Experiment-specific (CMSSW, Athena) |
| Statistical analysis | Likelihood, profile fits, unfolding | RooFit, RooStats, HistFactory |
Two general design principles:
- Calibrate against a known reference. Every measurement uses another measurement as a calibration anchor. is calibrated against ; cross sections are normalized to a known luminosity (itself measured via van der Meer scans against forward elastic scattering); jet energies are calibrated using photon+jet balance. Nothing is purely ab initio.
- Make a histogram, fit a template. Almost every published result reduces to: build a histogram of some variable ; predict its shape from SM + nuisance parameters; vary the parameter(s) of interest + nuisance parameters until the predicted histogram matches data. The "fit" is a maximum-likelihood / profile-likelihood-ratio procedure with hundreds to thousands of nuisance parameters.
2. Anatomy of a Modern Collider Experiment
Modern colliders (LHC) and earlier colliders (LEP, SLC, BaBar, Belle) all share a common geometry:
- Beam pipe — vacuum, with bunches of accelerated particles crossing at the interaction point (IP) every (LHC).
- Tracker — silicon pixel + strip layers in a solenoidal magnetic field (). Measures charged-particle momenta via curvature; resolves primary (collision) and secondary (displaced, from -decays) vertices.
- Electromagnetic calorimeter (ECAL) — measures energies of by total absorption.
- Hadronic calorimeter (HCAL) — measures energies of hadrons (charged + neutral) by absorption.
- Muon spectrometer — outermost, beyond the calorimeters. Muons are the only charged particles that punch through; identified by hits in muon chambers.
- Trigger — multi-level system reducing the bunch-crossing rate to the that can be written to disk, by online cuts on , energy, isolation.
From these raw signals, physics objects are reconstructed:
| Object | Reconstruction | Resolution at LHC |
|---|---|---|
| Charged tracks | Helix fits to tracker hits | at |
| Electrons | Track + ECAL cluster matched | on energy |
| Photons | ECAL cluster, no track | same |
| Muons | Tracker + muon-spectrometer combined fit | on |
| Jets | Sequential recombination (anti- algorithm) of calorimeter towers / particle-flow objects | on energy |
| -jets | Jets + displaced-vertex / secondary-vertex tagger | efficiency, light-jet mistag |
| Missing () | in | |
| -jets | Narrow jet + 1 or 3 tracks + ID variables | ID, jet mistag |
The neutrinos, the dark matter, and any new weakly-interacting BSM particles leave no signal — they show up as , the vector sum of all visible-transverse momenta with a sign flip.
Every measurement that follows uses these objects as primitives.
3. Case Studies
3.1 Measuring the boson mass
Theory. In near , the cross section is a relativistic Breit–Wigner resonance:
The peak position fixes ; the FWHM fixes ; the peak height (combined with , known from ) fixes the absolute coupling normalization. Three independent quantities from one curve.
Variable. measured at energy points around the peak (a scan). Hadronic channel because — highest statistics.
Reconstruction. events are trivially identified at LEP1 (huge visible energy, charged tracks; has 2 tracks, has 2–6 tracks with displaced vertices — all separable). Counted with efficiency.
The hard part is calibrating :
- LEP measured beam energies via resonant depolarization: transversely polarized beams have a spin-precession frequency ; sweep an RF kicker, find the depolarizing resonance, read to .
- Corrections were applied for: lunar tides distorting the LEP tunnel (1 cm radial deformation ↔ 1 MeV beam-energy shift), train passages on the Geneva–Bellegarde TGV line (RF coupling to the LEP RF system), seasonal water-table changes (Versoix river level), and even the time of day.
Result. — relative uncertainty . Beam-energy calibration is the dominant systematic; statistical uncertainty was negligible after M s.
Pipeline summary.
3.2 Measuring the boson mass
Theory. Unlike the , the cannot be produced via (charge conservation forbids it), and its decay contains a neutrino whose full 4-momentum is not reconstructible — only the transverse component . So never appears as a clean pole in a directly-measured ; it shows up in three cross-section formulas, each carrying differently:
-
Pair-threshold formula ( at LEP2):
Near the cross section rises from zero as ; the rate of rise pins down . LEP2 scanned GeV across threshold and the on-shell region.
-
Single- Breit–Wigner in transverse mass (, used by Tevatron + LHC):
where the transverse mass
plays for the the role plays for the — the variable in which the cross section has a Breit–Wigner pole. For a at rest ; for a recoiling the distribution has a Jacobian peak at . The lepton- spectrum has the analogous Jacobian peak at . The kinematic Jacobian contains all the hadron-collider complexity (PDFs, recoil distribution, QED FSR off the lepton) and is the source of most systematic uncertainty.
-
Propagator factor in -mediated processes (Drell–Yan, deep-inelastic , , etc.):
At the propagator collapses to and gives Fermi's — the source of the historical "weak force is weak". At (deep-inelastic neutrino scattering, -search regions) the full propagator resolves and the cross section is directly sensitive to as an independent parameter. Historically used to constrain from DIS before its 1983 direct discovery.
Modern precision measurements use formula 2 — the transverse-mass Breit–Wigner — fit as a template.
Variable. Three nearly equivalent variables, often combined:
- distribution (most robust, less sensitive to recoil)
- Lepton distribution (most sensitive to at the Jacobian peak)
- distribution (least used because of pileup contamination)
Reconstruction.
- Charged lepton — directly from tracker + muon-system (muons) or tracker + ECAL (electrons). Lepton momentum scale is the leading systematic; calibrated against events whose is known to from LEP1.
- — over the entire detector. Requires modeling the "hadronic recoil": every particle in the event that is not the signal lepton.
- Hadronic recoil response — calibrated using events, where the recoil is fully measurable (no missing ). The same calibration is then transferred to the analysis.
- PDF inputs — proton PDFs (NNPDF, MSHT, CT) determine the rapidity / template; PDF uncertainties contribute to the systematic budget.
The measurement is a template fit: simulate for many values of (with all nuisance parameters profiled — calibration scales, PDFs, QED FSR), find the value of that best matches the data histogram.
Result. Per-experiment values (transverse-mass + combination):
| Experiment | Year | [GeV] | Stat. precision |
|---|---|---|---|
| LEP2 combination | ~2013 | dominated by stats | |
| D0 (Tevatron Run II) | 2013 | calibration-dominated | |
| CDF (Tevatron Run II) | 2022 | calibration-dominated | |
| ATLAS (LHC, 7 TeV) | 2018, 2024 | PDF+calibration | |
| LHCb (LHC, 13 TeV) | 2022 | clean low-pileup | |
| CMS (LHC, 13 TeV) | 2024 | calibration+PDF | |
| World average (excluding CDF) |
The CDF result is above the others. Subsequent ATLAS, LHCb, and CMS measurements all agreed with the non-CDF world average; the CDF anomaly remains unresolved as of 2026 and is the single most active open puzzle in EW precision physics.
Pipeline summary.
3.3 Measuring the Higgs boson mass
Theory. The Higgs is produced (Section 3.5) and decays. Two clean channels give an invariant-mass peak over a smooth background:
- : SM BR , but two photons are cleanly reconstructed with energy resolution.
- (): SM BR , but signal-to-background is enormous and resolution on is .
In either case, theory predicts a Breit–Wigner of width (intrinsic) broadened by detector resolution to .
Variable.
- — diphoton invariant mass.
- — four-lepton invariant mass.
Reconstruction.
- Diphoton. Two ECAL clusters above (or so), each isolated from tracks (to reject jets). Photon energy scale calibrated using (electrons treated as photons after dropping the track requirement). .
- Four-lepton. Four well-identified, isolated leptons; reconstruct closest to , the other pair, then . Lepton momentum scale again calibrated on .
Result. (ATLAS + CMS Run-2 combination). Photon-energy / lepton-momentum scale is the dominant systematic; statistics enter at the same level after Run 2.
Pipeline summary.
3.4 Discovering a new particle: the Higgs in 2012
Theory. Compute expected signal yield for a Higgs of mass in a chosen channel, vs. expected background yield from QCD continuum (estimated from data sidebands or simulation).
Variable. The signal channel's invariant-mass distribution (, , ).
Reconstruction + statistical extraction.
- Reconstruct events passing the analysis selection; histogram them in .
- Define a likelihood ratio: where is the signal-strength modifier (best-fit signal divided by SM expectation) and the likelihood is built from a Poisson product over mass bins.
- The test statistic has a -like distribution under the no-signal null; convert observed to a -value, then to a number of Gaussian-equivalent .
Result. Both ATLAS and CMS reported excesses at on July 4, 2012, combining and (and as supporting evidence). The "5 discovery threshold" corresponds to for a single channel — chosen conservatively to account for the look-elsewhere effect (the fact that "a bump anywhere in the allowed mass range" is more likely than "a bump at this specific mass").
Pipeline summary.
3.5 Measuring a cross section: Higgs production at the LHC
Theory. Factorization theorem (see QCD § Factorization):
For (the dominant channel, ), is computed via a top-quark loop. NNLO QCD K-factors are relative to LO; full N3LO (NNNLO) results are now standard. Predicted total at is for .
Variable. Number of observed events:
with the kinematic acceptance (fraction of events passing fiducial cuts) and the reconstruction efficiency.
Reconstruction.
- Luminosity . Measured by van der Meer scans: vertically and horizontally separate the colliding bunches, measure the rate vs. separation, fit a Gaussian, extract the bunch overlap integral. Cross-checked against forward elastic scattering (TOTEM, LHCf at the LHC). Per-experiment uncertainty: .
- Signal count . Bump-fit (Section 3.4) or counting in a signal region after sideband background subtraction.
- Acceptance and efficiency . From signal Monte Carlo, validated against data control regions.
Result. Typical published format:
equivalently as a signal-strength modifier . All five production modes (ggH, VBF, VH, , ) are now individually measured.
Pipeline summary.
3.6 Measuring a branching ratio:
Theory. A FCNC process forbidden at tree level (GIM), proceeding through a -box + -penguin loop. The SM rate is
computed from CKM elements (), top-quark loop function, and the decay constant from lattice QCD.
Variable. Number of reconstructed candidates relative to a normalization channel (typically ):
The normalization channel cancels luminosity, PDFs, and many systematics; is the fragmentation-fraction ratio (separately measured).
Reconstruction.
- Two opposite-sign muons forming a vertex displaced from the primary vertex by .
- Vertex isolation cuts to reject background from .
- Multivariate classifier (BDT or neural net) trained on simulation, then transferred to data.
Result. LHCb 2022 + ATLAS + CMS combined:
Consistent with SM at . Constrains BSM models with new scalars or modified -penguins.
Pipeline summary.
3.7 Measuring an asymmetry: at the -pole
Theory. In near , the differential cross section in (the direction relative to the beam in the c.m.) is
The asymmetry depends on through the vector and axial couplings. SM predicts at the peak — a small but very clean number.
Variable.
with = events with , = events with .
Reconstruction. Identify events (two opposite-sign isolated muons with near ). Compute from the muon directions. Count.
Result. Combined from LEP1 with millions of events per flavor:
The ratio cancels luminosity, efficiency-symmetric reconstruction effects, and most acceptance corrections — that's the point of asymmetries. Combined across all asymmetries, the -pole data pins down .
Pipeline summary.
3.8 Measuring a coupling: Higgs from
Theory. Higgs couples to fermions via Yukawa . Predicted SM branching for . The coupling modifier
is in the SM by definition.
Variable. Signal-strength modifier , then convert: where depends on the production mode and is the total-width modifier.
Reconstruction.
- Identify -leptons in their decay modes: (one muon), (one electron), (a narrow 1-prong or 3-prong -jet). Six final-state combinations (, ...).
- Reconstruct via the SVfit / collinear-approximation algorithm (the neutrinos add to , which is decomposed back along the directions).
- Multivariate analysis to separate signal from background and from QCD multijet fakes.
Result. ATLAS + CMS combination: — consistent with SM () at .
Similar fits for — all consistent with 1. Predicted couplings to first-generation fermions () are below current LHC sensitivity; HL-LHC and future colliders are needed.
Pipeline summary.
4. Common Threads
The eight case studies above span very different physical observables — masses, cross sections, branching ratios, asymmetries, discovery significance, coupling modifiers — but the measurement structure is essentially the same:
| Element | Universal role |
|---|---|
| Differential prediction | Always either a cross section or a rate; the QFT-side output |
| Histogram in some | The bridge between theory and data; everything fits a histogram in the end |
| Template fit / counting | Find the SM parameter values (or BSM contribution) that match the observed histogram |
| Calibration anchor | Every measurement borrows accuracy from a previously-measured reference: for energy scale, for vertex resolution, for hadronic recoil, van der Meer for luminosity |
| Profile-likelihood fit | The standard statistical machinery; treats systematics as nuisance parameters and integrates them out |
| Quoted uncertainty | (stat) ⊕ (calibration / detector systematics) ⊕ (theory / PDF / Higher-order) |
A cross-cutting consequence: no LHC measurement is purely "data-driven". All of them depend on (a) Monte-Carlo simulation of signal + background, (b) NLO/NNLO QCD predictions, (c) PDF parametrizations, (d) parton-shower modeling, and (e) detector calibration anchors. The error budget always has a non-trivial "theory" component, and progress on the theory side (e.g. better PDFs, higher-order QCD calculations) reduces published uncertainties even without new data.
4.1 Direct vs. inferred observables
A second cross-cutting point: almost nothing in published collider results is "directly observed". The detector records a small universal set of primitives (track hits, calorimeter energy deposits, magnetic-field curvature, timing, polarization), and everything else — cross sections, masses, branching ratios, couplings, , CKM elements — is a fit-extracted parameter obtained by inferring the value that makes a theoretical model match a histogram of those primitives.
The full taxonomy — direct primitives, the per-class list of inferred observables (cross sections, masses, couplings, vertex form factors, oscillation parameters, composite global fits), the inference-distance spectrum (short for and , long for and ), and the renormalization-scheme wrinkle (the "which mass?" question) — lives in direct-vs-inferred.md. The short version of the chain:
The only left-end node is data; everything to its right is model + inference.
5. Where Each Measurement Lives in This Repo
| Quantity | Observable doc | Theory doc |
|---|---|---|
| , , | electroweak.md §1–§2 | electroweak/from-postulates.md §3.2 |
| , Higgs couplings | electroweak.md §5 | electroweak/from-postulates.md §3 |
| CKM elements | electroweak.md §4 | electroweak/from-postulates.md §3.4 |
| FCNC rare decays | electroweak.md §4.3 | standard-model/from-postulates.md §B GIM |
| Cross sections, master formula | cross-sections.md | (general) |
| Decay rates, master formula | decay-rates.md | (general) |
| Global EW fit, UT triangle | standard-model.md | standard-model/from-postulates.md |
6. See Also
- Cross Sections — master formula, flux factor, phase space.
- Decay Rates — master formula, worked muon-lifetime example.
- Electroweak Observables — full inventory of what is measured at EW experiments.
- Standard Model Observables — cross-sector observables (UT triangle, global EW fit, GIM, lepton universality).
- Observables — General Map — the structural classification (Class A/B/C) all of the above instantiate.
- QCD § Factorization — the PDF × partonic-cross-section structure used by all hadron-collider measurements.
- QED/compton.md — the cleanest worked theory calculation in this repo, for comparison with the experimental extraction side covered here.