Modern Foundations — From SR + QM to the QFT Postulates
This is the Wigner–Weinberg derivation of relativistic quantum field theory: starting from special relativity, ordinary quantum mechanics, and the cluster-decomposition principle, derive the framework whose end-product is the QFT postulates. It is the modern counterpart to the historical / canonical-quantization route, and the conceptual prequel to every specific theory built on the resulting postulates — QED, QCD, electroweak theory, the Standard Model, and any other theory obtained by adding a choice of field content, gauge symmetry, and renormalizable Lagrangian.
The companion in the historical direction is QED/historical.md, which goes the other way (single-particle Dirac equation → quantization → field theory). Where the historical route postulates relativistic wave equations and derives gauge invariance later as a consequence, the modern route here postulates relativistic invariance + cluster decomposition and derives the wave-equation structure, the spin–statistics theorem, the field operator framework, and gauge invariance for massless particles all as theorems.
To keep the logical structure transparent we tag every step as one of:
- (Postulate) — a primitive assumption (an empirical input or general principle).
- (Definition) — mathematical notation introduced for use later.
- (Theorem) — a result that follows from previous postulates plus standard mathematics.
- (Heuristic) — physically motivated step whose rigorous justification is deferred.
The modern route rests on three primitive postulates:
| # | Postulate | Role |
|---|---|---|
| M1 | Special relativity: physics is invariant under the proper orthochronous Poincaré group | Geometric backdrop |
| M2 | Quantum mechanics: states are unit rays in a complex separable Hilbert space; observables are self-adjoint operators; probabilities follow the Born rule; time evolution is unitary | Inherited from QM postulates |
| M3 | Cluster decomposition: distant experiments are uncorrelated. Formally, connected -matrix elements between widely separated wavepackets vanish | Locality / no-spooky-distant-correlations |
Everything in this document — fields as operator-valued distributions, microcausality, spin–statistics, , and the connection to gauge invariance — is derived from M1 + M2 + M3 plus standard mathematical machinery (group representation theory, distribution theory, -algebras).
Reference. Weinberg, The Quantum Theory of Fields, Vol. 1 (1995), Chs. 2–5, is the canonical reference. This document is a structural summary of that derivation, with cross-references to where each ingredient lives in this repository.
0. Preliminaries
The mathematical machinery used below — Lorentz / Poincaré groups, Wigner's classification, Lie algebras, irreducible unitary representations, cluster decomposition, -algebras — is collected in QFT/preliminaries.md. This section names the specific ingredients we will rely on, without re-deriving them.
0.1 The Poincaré group and its Lie algebra
The Poincaré group is the semidirect product of spacetime translations and Lorentz transformations. Its Lie algebra is generated by:
- 4 translation generators (energy-momentum),
- 6 Lorentz generators (3 rotations + 3 boosts ).
Commutation relations:
The Casimir invariants are
with the Pauli–Lubanski vector.
0.2 Cluster decomposition principle (M3, restated)
Let denote a multi-particle scattering state with momenta and other quantum numbers . The connected part of the -matrix, , is defined recursively by removing all "factorizable" pieces. Cluster decomposition demands
Equivalently (and more usefully): the full -matrix factorizes when the experiment splits into well-separated, non-overlapping subexperiments. This is the relativistic generalization of "labs in different cities don't influence each other" and is the seed of locality.
0.3 The minimal additional inputs (where empiricism enters)
On top of M1+M2+M3, the framework needs empirical inputs to specify which QFT we live in:
- A list of species (one-particle representations) observed in nature.
- For each, mass and spin (or, if massless, helicity).
- A choice of interaction consistent with M1+M2+M3.
The first two are pinned down by Wigner's classification (§1 below); the third is constrained but not fixed by the framework, and is where QED, QCD, electroweak theory, etc. diverge as different specializations of the same postulates.
1. The State Space
This section builds the state space of a relativistic quantum theory step by step, tagging each step as (Postulate), (Definition), (Theorem), or (Empirical input) so the assumption budget is transparent. The goal is to expose which assumptions feed into the Fock-space construction — several of them are silently invoked in textbook treatments.
1.1 Defining "particle" (Definition + empirical input)
The Wigner-Weinberg framework rests on a specific operational identification:
Definition. A particle in a relativistic quantum theory is an irreducible unitary representation of the proper orthochronous Poincaré group acting on a complex separable Hilbert space.
This is a definition, not a theorem. The next three subsections unpack what each clause means in concrete physical terms; the definitional content and its empirical commitments are summarised in §1.1.4. Two pieces of empirical input are baked in:
- (Empirical-1) Identification of "particle" with "Poincaré irrep". Justified physically: an elementary excitation should be characterized by quantum numbers (mass, spin) that are invariant under choice of inertial frame. Irreducibility = "cannot be split into uncoupled subspecies" = elementarity. Mathematically tight, but the physical claim that nature's elementary objects are organized this way is empirical.
- (Empirical-2) Hilbert space is complex and separable. Inherited from M2 (standard QM). Real-Hilbert-space and quaternionic-Hilbert-space alternatives exist mathematically but are not used in the Standard Model.
1.1.1 What "representation" means here
A unitary representation of on a Hilbert space is a group homomorphism
where each is a unitary operator on and . Equivalently: every Poincaré symmetry — boost, rotation, translation — is realized as a concrete unitary operator moving state vectors around inside . Three adjectives carry the content:
- Unitary, because Poincaré transformations are symmetries of the probabilistic structure of QM: must be invariant under change of inertial frame.
- Irreducible, meaning no proper -invariant subspace exists: there is no non-trivial closed under all . Physically: the representation does not split into uncoupled subsectors that the Poincaré group keeps separate — anything bigger than an irrep would be several species in one Hilbert space.
- Projective allowed, i.e. equality up to a phase, because physical states are rays not vectors. The projective representations of correspond to ordinary representations of its universal cover (see math/group-theory.md § Group actions and representations and § Connectedness and discrete components). This is what allows half-integer-spin particles.
1.1.2 "Particle = irrep" is a statement about the Hilbert space, not a single state
The most common point of confusion. The definition does not say "a particle is a state vector "; it says
"a particle species" the entire one-particle Hilbert space together with the action on it.
The single space contains all possible states of one specimen of that species (any momentum, any spin orientation, any superposition). The Poincaré group shuffles them around — boosts change momentum, rotations rotate spin, translations multiply by a phase . Irreducibility says you can reach every state in from any other by some Poincaré action (plus superposition); that is what makes the species one species, with no leftover "different kind of electron" sub-Hilbert-space.
| Level | Object |
|---|---|
| Species (e.g. "electron") | An irrep — a Hilbert space with a action, labelled by Casimirs or |
| Particular particle in a particular state | A single vector (modulo phase) |
| identical particles | A vector in or (§1.3) |
| Arbitrary number | A vector in Fock space (§1.4) |
So "what is a particle?" has two answers depending on which question you are asking: the kind of particle is an irrep; a particle in a state is a vector in that irrep's Hilbert space.
1.1.3 The natural basis: momentum and spin, not position
Wigner's classification (§1.2) hands you more than the existence of — it hands you a canonical basis labelled by simultaneous eigenstates of the translation generators together with a little-group component :
where runs over the spin -component (massive case) or the helicity (massless case). A general one-particle state is
with the Lorentz-invariant momentum measure. The momentum-space wavefunction is the natural description because momentum and spin component are Casimir-respecting quantum numbers handed to you by the group action itself.
Why momentum and not position? Three reasons, in order of severity:
- Position is not a generator of the Poincaré algebra. The algebra has (translation generators ↔ momentum) and (Lorentz generators ↔ angular momentum / boosts). There is no . Momentum is intrinsic; position would have to be defined on top of , and no canonical choice exists.
- There is no Lorentz-covariant position operator. The best-known attempt — the Newton–Wigner position operator — is self-adjoint on but
- is not Lorentz-covariant: a state localized at in one frame is not localized in any boosted frame;
- has eigenstates with non-local tails of width (the Compton wavelength);
- fails entirely for massless particles of helicity (no localized photon states — the Newton–Wigner / Hegerfeldt obstructions).
- Causality is in tension with localization. Any state strictly localized inside a spatial region at instantaneously develops support everywhere by (Hegerfeldt's theorem). For single-particle states there is no finite-signal-velocity rescue.
The way QFT resolves this is to stop localizing particles and localize fields instead. Field operators (built in §2) live at spacetime points and satisfy microcausality at spacelike separation (§3). The matrix element
looks like a position-space wavefunction but is genuinely the matrix element of an operator-valued distribution between two states — see QFT/preliminaries.md § States vs. Fields. Position-dependence re-enters at the operator level, not the state level. This inversion is exactly what makes relativistic quantum theory a field theory rather than a particle theory: particles are irreps (defined by momentum-space data), and the spacetime point is an argument of the operator, not a label on a state.
1.1.4 Summary
- "Representation" = a unitary action of on . Irreducibility = no invariant sub-Hilbert-space = elementarity.
- A particle is a state only in the loose sense that states of that species live in . The species itself is the entire irrep, labelled by or .
- The natural basis is momentum + spin component, not position. There is no good relativistic position operator, and any attempt at one breaks Lorentz covariance or causality. Position re-enters as the argument of field operators in §2, not as a label of single-particle states.
With this definition fixed, the rest of §1 unfolds with very few additional postulates.
1.2 One-particle states (Theorem — Wigner classification)
Premise: M1 (special relativity) + M2 (QM) + the §1.1 definition of particle.
Wigner's theorem (1939). The irreducible unitary representations of are classified by two Casimirs:
- Mass-squared with .
- For : spin (irrep of the little group ).
- For : helicity (irrep of the little group , with quantized in integer/half-integer units by rotation closure).
Each irrep is realized on a one-particle Hilbert space . The full derivation lives in QFT/preliminaries.md § Wigner's Classification.
Conclusion. is forced, not chosen. Specifying a species means picking or .
Empirical-3 (caveat). Continuous-spin representations of the massless little group exist mathematically but are not observed in nature. Their absence is empirical, not a theorem.
1.3 Identical particles and (anti)symmetrization (Postulate — symmetrization)
Premise: copies of for the same species, plus the empirical fact that identical particles are truly indistinguishable.
The naive -fold tensor product contains many states distinguished by labels on the particles (which particle has which momentum). Indistinguishability requires that the physical state space be only the part of on which the symmetric group acts trivially up to a phase:
(Postulate — symmetrization) Multi-particle states of identical species lie in either or , never in any other subspace. This is an additional assumption beyond M1+M2+M3 in spacetime dimensions. (In d it can be relaxed to give anyons — irreps of the braid group with arbitrary phases.)
Note on bosons vs. fermions. Which of vs each species uses is not postulated here — it is determined later by the spin–statistics theorem (§4) from M1+M2+M3 + the field construction of §2. The symmetrization postulate only fixes the binary structure, not the assignment.
1.4 Variable particle number → Fock space (Empirical input + Definition)
Premise: the empirical observation that physical processes change the number of particles.
In non-relativistic QM, particle number is conserved and the state space is a single for fixed . In relativistic QFT this fails:
- permits pair creation/annihilation when sufficient energy is available.
- Decays ( becomes ).
- Scattering with different in/out particle counts.
(Empirical-4) Variable particle number is an empirical input of relativistic QFT, justified by relativity () and observation. It is not derivable from M1+M2+M3 alone — it is a fact about nature that the formalism must accommodate.
Definition. Given variable , the natural state space is the Fock space direct sum over all sectors:
with the one-dimensional vacuum sector and the appropriately (anti)symmetrized -particle sector from §1.3.
Inputs combined. Constructing thus requires:
- M1 + M2 + §1.1 definition (gives );
- §1.3 symmetrization postulate (gives for each );
- §1.4 variable-number empirical input (motivates ).
The detailed structure of — sectors, vacuum, ladder action, etc. — is in fock-space-inventory.md.
Caveat — free vs. interacting. is the free Fock space, built from one-particle Wigner reps. The interacting Hilbert space of a non-trivial theory is not unitarily equivalent to (Haag's theorem). is the appropriate state space for asymptotic in/out states only — see §2.0.1 and QFT/remarks.md § Haag's theorem.
1.5 Cluster decomposition forces creation/annihilation operators (Theorem)
Premise: Fock space from §1.4 + M3 (cluster decomposition).
Working with -particle states sector by sector is unwieldy — every requires its own . Cluster decomposition singles out a much more economical formalism: build everything from operators that raise or lower particle number by one.
Definition. Creation operators take an -particle state to an -particle state by adjoining a particle of momentum and spin component . Annihilation operators are their adjoints. On Fock space they satisfy
with the bosonic commutator or fermionic anticommutator according to the species statistics. (See QFT/preliminaries.md § Fock Space for the Terminology — ladder operators callout connecting these to the QM harmonic-oscillator .)
Theorem (Weinberg Vol. 1 §4.4). Given from §1.4, any operator whose connected matrix elements satisfy cluster decomposition can be written as a polynomial in and . Conversely, an interaction Hamiltonian not expressible in terms of these operators violates cluster decomposition (its connected part fails to vanish at large separation).
Conclusion. M3 + §1.4 ⇒ ladder-operator formalism is unique among the alternatives. Cluster decomposition does not derive Fock space itself (that came from §1.4); it derives the ladder-operator structure on top of an already-given Fock space.
1.6 Summary of §1: assumption budget
| Step | Status | Premise | Conclusion |
|---|---|---|---|
| §1.1 | Definition + Empirical | "particle" = Poincaré irrep on a complex separable Hilbert space | One-particle space exists |
| §1.2 | Theorem | M1 + M2 + §1.1 | Wigner classification: labelled by or |
| §1.3 | Postulate | identical-particle indistinguishability | Multi-particle sectors are or |
| §1.4 | Empirical + Definition | variable particle number | Fock space |
| §1.5 | Theorem | M3 + §1.4 | Ladder operators are the unique cluster-decomposable formalism |
Three inputs beyond M1+M2+M3 are quietly invoked along the way: the operational definition of particle (§1.1), the symmetrization postulate (§1.3), and the empirical fact of variable particle number (§1.4). All three are universally accepted in standard QFT but are additional assumptions, not consequences of the core M1+M2+M3.
2. The Need for Local Fields
2.0 The S-matrix as the central object (Postulate / framing)
Before constructing fields, it is worth being explicit about what we are constructing them for. In the modern story, the S-matrix is closer to a primitive than the Hamiltonian is. This is the conceptual inversion relative to the historical route, and clarifies the logic of the rest of §2. The next three sub-subsections introduce the asymptotic-state framework that acts on (§2.0.1), define together with its three structural constraints (§2.0.2), and unpack what it means to take as the primary object (§2.0.3).
2.0.1 Asymptotic states (Definition)
Real experiments prepare configurations that are effectively free long before scattering and detect configurations that are effectively free long after. The mathematical objects representing these are asymptotic in-states and asymptotic out-states .
Heuristic content. An asymptotic state is a multi-particle Fock-space configuration of the species classified by Wigner (§1.2) — a definite collection of momenta, spins, and species — whose interactions are negligible in the relevant time limit. The interacting Hilbert space does not literally contain free states (interactions never fully turn off), but it contains states that behave as if free in the limits .
Formal construction (deferred). Two equivalent formal definitions exist; both are technically delicate and we defer derivations to standard references:
- Møller operators. Define (limits taken in a strong-operator-on-wavepackets sense). maps a free Fock state to its dressed in-counterpart, to its dressed out-counterpart:
- LSZ. Asymptotic states are extracted as residues of single-particle poles in the time-ordered correlators of interpolating fields (any local field with non-zero matrix element between the vacuum and a one-particle state). Unlike the Møller construction above, LSZ does not require a Hamiltonian — it takes correlators as input, which can come from a Lagrangian, the lattice, the conformal bootstrap, or any other source. See QFT/preliminaries.md § LSZ Reduction Formula for the formula and properties.
Asymptotic Hilbert spaces. The in-states span ; the out-states span . Each is naturally a Fock space built on one-particle Wigner reps (§1.2–1.4). The two spaces are independent by construction, and may a priori be different from each other and from the full interacting .
The two-part structure of P10. QFT Postulate 10 packages two genuinely separate assumptions, plus a derived consequence:
| Part | Statement | Where it can fail |
|---|---|---|
| (P10a) Existence | The Møller operators exist on a dense set of free states | Long-range potentials (Coulomb), QED soft photons — the interaction does not switch off enough at large times |
| (P10b) Asymptotic completeness | , equivalently — every interacting state is reachable as some asymptotic state | Confinement (QCD): quarks/gluons are not asymptotic states; only hadrons are |
| Consequence: is unitary | is automatically a well-defined unitary operator | (follows from P10a + P10b) |
So " exists as a unitary operator" is not an independent assumption — it is a theorem given (P10a) and (P10b). The substantive content of P10 is exactly (P10a) + (P10b), and the failure modes of in real theories trace back to whichever of these two is violated.
Where these are guaranteed vs. assumed.
- Free theory: (a) and (b) are trivially satisfied, .
- Constructive QFT in : (a) and (b) are theorems (Haag–Ruelle 1962, given Wightman axioms + isolated mass shells).
- Realistic 4D QFT (QED, QCD): (a) and (b) are conjectural — no rigorous construction exists; they are physically motivated assumptions justified retroactively by the agreement of perturbative -matrix elements with experiment.
- Modified asymptotics: When (a) or (b) fails, the framework is modified rather than abandoned — Faddeev–Kulish coherent states for QED soft photons; hadron Fock space for QCD; conformal-bootstrap data for CFTs without a mass gap.
For the rest of this document we take (P10a) + (P10b) for granted, consistent with how Wightman / Weinberg structure the framework.
2.0.2 The S-matrix and its three structural constraints (Definition)
Given asymptotic states from §2.0.1, the S-matrix is the unitary operator linking the in- and out-bases of :
Its matrix elements encode all observable scattering probabilities. Three structural constraints — each a direct consequence of one of the modern postulates M1, M2, M3 (plus P10) — pin down what can look like:
- Unitarity (M2 ⇒ probability conservation): Equivalent to "total probability of some outcome equals 1": . The optical theorem () is the perturbative content of this identity.
- Lorentz invariance (M1 ⇒ frame-independence): with the strongly continuous unitary representation of the Poincaré group on (see QFT/preliminaries.md § Lorentz and Poincaré Groups). This is what forces the spacetime-translation to factor out of every matrix element, leading to the -residue construction (see Computational machinery below).
- Cluster decomposition (M3 ⇒ no spooky long-distance correlations): if a multi-particle process splits into well-separated, non-overlapping subprocesses, the connected -matrix factorizes: This is what §1.5 used to force the ladder-operator structure, and §2.1 will use to force locality of the interaction density.
Together, (unitarity + Lorentz invariance + cluster decomposition) is the modern definition of "a relativistic quantum scattering theory". Specifying which — i.e. which species couple to which, and how — is the empirical content of any given theory (QED, QCD, …).
The transition operator and the invariant amplitude are introduced from exactly as in QED/historical.md §5.3.
2.0.3 The status of as primary (Framing)
What this means structurally. "Specifying a relativistic quantum theory" is operationally the same as "specifying a Lorentz-invariant, cluster-decomposable, unitary -matrix". The Hamiltonian formalism, the Lagrangian formalism, and Feynman-diagram perturbation theory are all means to that end — concrete parameterizations of valid -matrices, none more privileged than the others. (In its purest form, this idea is the S-matrix bootstrap / modern amplitudes program, where one writes down constraints — Lorentz, unitarity, analyticity, locality — directly on and never introduces a Lagrangian at all. This is most powerful for theories where Lagrangian descriptions are awkward or non-existent, e.g. higher-spin, certain CFTs, and the on-shell amplitudes literature.)
Why fields then appear. Demanding that be Lorentz invariant + cluster-decomposable + built from the ladder operators of §1.5 forces the construction in §2.1: the only consistent way to package the ladder operators into a Lorentz-covariant local interaction is via local fields . Fields are construction tools for , not the fundamental objects.
Comparison with the historical route. QED/historical.md §5.0 starts from a Hamiltonian and defines as the interaction-picture evolution operator. There is derived from . Here we run the logic the other way: is postulated to exist (P10), and the Hamiltonian (or Lagrangian) is one of several conventional ways to specify which . Both routes converge at the level of the Dyson + Wick + Feynman-rules computation — the same machinery, motivated differently. The distinction is conceptual: "what is fundamental?" not "what do we actually compute?".
Computational machinery (deferred). Once has been specified by a choice of Lagrangian / Hamiltonian, the entire computational chain
is identical to the one in the historical route, and we do not reproduce it here. The full derivation lives in:
- QED/historical.md §5.0 — Perturbative machinery: S-matrix, Dyson series, Wick's theorem — Dyson series, Wick's theorem, generic Feynman-diagram structure.
- QED/historical.md §5.2 — The QED S-matrix and Feynman rules — momentum-space Feynman-rules table.
- QED/historical.md §5.3 — The Invariant Amplitude — definition of and as the residue after stripping the spacetime-translation .
- QED/historical.md §5.4 — The Squared Amplitude — Casimir's trick, spin/polarization sums, physical interpretation, six-step worked-example pipeline.
- QFT/cross-sections.md — master formula ; the parallel decay-rate formula lives in decay-rates.md.
- QFT/preliminaries.md § Time-Ordering Operator — the convention used in the Dyson series.
These computational results carry over verbatim to the modern story; only their interpretive status differs (e.g. the Hamiltonian whose Dyson series is being expanded is, in the modern view, just a parameterization of rather than the primary object).
Aside — is derived from ? The modern viewpoint elevates above , but does not provide a constructive recipe to extract from . The relationship is more subtle:
- is fully determined by the species content (Wigner classification, §1). For free fields the Hamiltonian is forced:
- is only existentially constructive. Weinberg Vol. 1 Ch. 3 shows that any Lorentz-invariant, cluster-decomposable, unitary admits some interaction-picture Hamiltonian density built from local fields satisfying for . But this is not unique: many different Hamiltonians (differing by field redefinitions) produce the same .
- Field-redefinition equivalence (LSZ). Two 's related by a local field redefinition give the same on-shell -matrix. This is why effective field theories using different operator bases can describe identical physics, and why the choice of "fundamental" field is partly conventional.
- The bootstrap view. Modern programs (on-shell amplitudes, conformal bootstrap, double-copy constructions) compute -matrix elements directly from Lorentz / unitarity / locality / analyticity constraints without writing down any — empirical evidence that is genuinely a parameterization, not a derivation target.
So the modern claim is not " is computed from " but rather " exists, is non-unique, and is one of several equivalent parameterizations of the same ." The historical and modern routes thus differ on which is primary, but agree that they are equally complete descriptions of any given theory.
2.1 Lorentz invariance + cluster decomposition demand a special structure (Theorem)
To build a Lorentz-invariant interaction Hamiltonian , the natural ansatz is
with a scalar density under the Poincaré group. But this Hamiltonian density must be built from creation/annihilation operators (by §1.5, to satisfy cluster decomposition), and must be a Lorentz scalar.
Naive ladder-operator combinations don't transform covariantly. Under a Lorentz transformation, individual transforms via Wigner's little-group rotation , which depends on — not as an ordinary tensor field. Building a Lorentz scalar density out of these is non-trivial.
Solution: package the ladder operators into local fields. Define
with mode functions and chosen so that the combined field
transforms as a finite-dimensional representation of the Lorentz group:
The mode functions are uniquely determined by this requirement (up to normalization and basis choice) for each species; for spin they are the Dirac spinors of the historical route, for spin 1 the polarization vectors, etc.
Two non-trivial consequences emerge automatically:
-
Antiparticles. The piece must contain — creation operators for a different species (the antiparticle) with the same mass and spin and opposite charge. (For neutral self-conjugate fields, particle = antiparticle.) The existence of antiparticles is not postulated separately; it is forced by the requirement that transform locally and covariantly.
-
Microcausality. For the interaction Hamiltonian density to commute with itself at spacelike separation (which is necessary for Lorentz-invariant time-evolution and for cluster decomposition of the -matrix), one finds that the fields themselves must (anti)commute at spacelike separation:
This is microcausality — derived, not postulated.
2.2 Fields are operator-valued distributions (Theorem — heuristic)
The fields defined above are not operators on in the strict sense: has infinite norm (the integrand involves at one spacetime point, with no damping). They are operator-valued tempered distributions: only the smeared objects
are bona fide (unbounded) operators. This is the source of the technical complication that distinguishes QFT from finite-DOF QM.
Why "heuristic"? The above is rigorous for free fields. For interacting fields in , Haag's theorem shows the construction does not survive intact and the fields must be reconstructed via renormalization. See QFT/remarks.md § Haag's Theorem.
3. Microcausality and Locality
§2.1 already established microcausality
as a consequence of M1 + M3 plus the field construction. The (anti)commutator choice is fixed by spin–statistics (§4 below). Three points worth highlighting:
- Microcausality is what makes locality precise. Cluster decomposition (M3) is a statement about scattering at large spatial separation; microcausality is the operator-level consequence in Hilbert space.
- It is necessary for relativistic causality. Two operators that don't (anti)commute at spacelike separation could in principle propagate signals faster than light by repeated measurement.
- The (anti)commutator must vanish, not just be small. This is a much stronger condition than thermodynamic locality and is what fails in (e.g.) string field theory, leading to its non-local behavior on small scales.
The full statement is QFT Postulate 6; the modern derivation just outlined is summarized in §2.1.
4. Spin–Statistics (Theorem)
In §1.3 we left the choice between (bosonic) and (fermionic) open. The spin–statistics theorem fixes it:
Theorem (Pauli 1940; Lüders–Zumino 1958). In a Lorentz-invariant local QFT with positive energy and a unique vacuum, fields of integer spin must be quantized as bosons (commutators) and fields of half-integer spin as fermions (anticommutators). Any other choice produces either negative-norm states, violation of microcausality, or violation of the spectrum condition.
Proof sketch. Construct the field for spin using both choices (commutators vs anticommutators) and compute at spacelike separation. The result has the form for spin , for spin 1, etc., where is the commutator function. For half-integer spin, anticommutators give the spacelike-vanishing combination; commutators give a result that does not vanish (and would violate microcausality). For integer spin it's the reverse.
So spin–statistics is not an independent postulate in the modern derivation — it is forced by M1 + M2 + M3 + the field construction of §2.
Corollary — Pauli exclusion. Anticommutators of fermion creation operators give , forbidding two identical fermions in the same state. The Pauli exclusion principle is a downstream consequence of M1 + M2 + M3, not an independent postulate.
5. Massless Particles and the Origin of Gauge Invariance (Theorem)
Massless particles have a richer little group than massive ones — instead of — with consequences for the construction in §2.1.
5.1 Polarization vectors of massless spin 1 don't transform as 4-vectors
For a massive spin-1 particle, the polarization vectors form a true 4-vector representation of the little group (3 transverse polarizations). For a massless spin-1 particle, the little group acts on with an inhomogeneous piece:
The piece is not removable. So is not a true 4-vector under Lorentz transformations.
5.2 Lorentz invariance forces gauge invariance
For an interaction term to be Lorentz-invariant despite the non-tensorial transformation of , the inhomogeneous piece must drop out. This requires
i.e. the current must be conserved: .
By Noether's theorem, a conserved current corresponds to an internal symmetry. For an external photon line with polarization , the residual transformation must be a symmetry of the interaction — and that residual transformation is exactly gauge invariance:
So gauge invariance of QED is a theorem in the modern derivation: it is what is required to make a massless spin-1 particle interact in a Lorentz-invariant way. The photon's gauge invariance is not a separate postulate; it is a consistency requirement of M1 + M2 + M3 applied to a massless spin-1 species.
Generalization. For multiple massless spin-1 particles forming a non-trivial multiplet under an internal symmetry , the same argument forces the Yang–Mills structure: the connection 1-form, structure constants , and self-interactions are all uniquely determined. See QED for the abelian case (, one massless spin-1) and QCD for the non-abelian realization (, eight massless spin-1 in the adjoint).
What this story does not derive. The existence of multiple massless spin-1 particles forming a -multiplet is an empirical input, not a theorem. Likewise, massive gauge bosons (e.g. ) are permitted by Wigner classification without gauge invariance, but their longitudinal polarizations break perturbative unitarity at high energies — the resolution (the Higgs mechanism, with a scalar VEV breaking the gauge symmetry spontaneously) is a postulate added on top of the modern story, not a derivation. See QFT/remarks.md § Spontaneous Symmetry Breaking and the electroweak theory doc, where the Higgs mechanism is worked out in detail.
6. CPT Theorem (Theorem)
Theorem (Pauli 1955; Lüders 1957; Jost 1957). Every Lorentz-invariant local QFT with positive energy and a unique vacuum is invariant under the combined action (charge conjugation × parity × time reversal), where swaps particles and antiparticles, inverts spatial coordinates, and reverses time.
The theorem is a consequence of M1 + M2 + M3 + the field construction of §2 (specifically, the analytic-continuation properties of the Wightman functions in complexified Minkowski space). , , and individually are not required to be symmetries — and indeed are violated by the weak interaction — but their combined product must be.
For the proof structure see Streater–Wightman PCT, Spin and Statistics, and All That (1964); for the modern statement see Weinberg Vol. 1 §5.8.
7. Arrival at the QFT Postulates
We can now read off the QFT postulates as theorems / definitions / postulates of the modern derivation:
| QFT Postulate | Status in this derivation |
|---|---|
| P1 — Relativistic state space | Theorem (M1 + M2 + Wigner classification, §1) |
| P2 — Spectrum condition | Theorem (Wigner: physical particles have , ; §1.2) |
| P3 — Unique Poincaré-invariant vacuum | Theorem (cluster decomposition + irreducibility; the alternative — degenerate vacua — corresponds to spontaneous symmetry breaking, treated as a separate empirical case) |
| P4 — Field operators (operator-valued distributions) | Theorem (M1 + M3 + Weinberg's theorem on cluster decomposition force ladder operators; Lorentz covariance forces packaging into local fields; §2) |
| P5 — Poincaré covariance of fields | Theorem (built into the construction of §2.1) |
| P6 — Microcausality | Theorem (M3 + scalar interaction density; §3) |
| P7 — Spin–statistics | Theorem (Pauli; §4) |
| P8 — Cyclicity of the vacuum | Theorem (cluster decomposition + irreducibility ⇒ the field algebra acts cyclically on ) |
| P9 — Dynamics from a local action | Definition / convention (Lagrangian field theory is the practical way to specify subject to the constraints derived above) |
| P10 — Asymptotic completeness / S-matrix | Postulate (an additional assumption — that the interacting theory has a well-defined asymptotic limit; equivalent to the LSZ reduction). Not derivable from M1 + M2 + M3 alone. |
Two postulates remain genuinely additional in the modern story:
- P9 is more of a definitional convention — the Lagrangian formalism is a practical way to specify a Lorentz-invariant interaction, but other formalisms (Hamiltonian, path integral, S-matrix bootstrap) would also work.
- P10 is a real assumption about the dynamics: that the in/out spaces exist and equal . Examples (confinement in QCD) where individual asymptotic-particle interpretations break down show this is not automatic.
So the modern story compresses 10 postulates into 3 (M1, M2, M3) + 1 dynamical assumption (P10) + empirical inputs (field content, masses, couplings).
7.1 Reverse mapping: where do §1's non-M assumptions live in the QFT postulates?
The §1 derivation invokes three inputs beyond the core M1+M2+M3 (see §1.6 assumption budget). Tracing each back into QFT/postulates.md shows where they are explicit, where they are absorbed into other postulates, and where they are silently assumed:
| §1 non-M input | Maps to QFT postulate(s) | Status of mapping |
|---|---|---|
| §1.1 — particle = Poincaré irrep on a complex separable Hilbert space | P1 (relativistic state space) + P5 (Poincaré covariance) | Upstream framing. P1 says "states form a complex separable Hilbert space with acting unitarily" — but doesn't say particles = irreps. The "particle = irrep" identification is logically prior to P1; P1 is its formalization. |
| §1.3 — symmetrization postulate (states are sym OR antisym under exchange) | P7 (spin–statistics) | Logically prior, finer-grained. P7 packages two distinct claims: (a) the binary sym/antisym structure (= §1.3) and (b) the assignment "integer spin → bosons, half-integer → fermions" (= the spin–statistics theorem of §4). Most treatments don't separate (a) from (b); the modern derivation makes the split visible. |
| §1.4 — variable particle number (empirical input) | None directly | Hidden empirical input. No QFT postulate explicitly states "particle number is not conserved". P4 (operator-valued field distributions) and P10 (asymptotic completeness with multi-particle in/out) implicitly use variable- Fock space, but the empirical input is silently absorbed rather than stated. This is a gap in the standard axiomatic statement. |
So the 10 P-postulates are not literally a complete restatement of the modern derivation: variable particle number in particular is an unflagged empirical input in postulates.md. For a corresponding note on the postulates side, see QFT/postulates.md § Implicit Empirical Inputs.
8. Comparison with Other Routes
| Aspect | Modern (this doc) | Historical (QED/historical.md) | Wightman-axiomatic (postulates.md) |
|---|---|---|---|
| Foundational input | SR + QM + cluster decomposition + species data | Relativistic single-particle wave equation + minimal coupling + second quantization | All 10 Wightman postulates stated upfront |
| Where fields come from | Theorem (forced by Lorentz + cluster decomposition) | Promoted from wavefunction by hand at second quantization (postulate H3) | Postulated as operator-valued distributions (P4) |
| Spin–statistics | Theorem (Pauli 1940) | Postulated (anticommutators by hand) | Postulated as P7 (with note that it's a theorem) |
| Microcausality | Theorem (consequence of locality of ) | Inherited from canonical quantization | Postulated as P6 |
| Antiparticles | Theorem (forced by Lorentz covariance of ) | Postulated via Dirac sea, then reformulated via vs | Built into the field operator definition |
| Gauge invariance (massless spin 1) | Theorem (forced by Lorentz consistency of polarization vectors) | Derived from minimal coupling postulate | Not a Wightman postulate; added as a specialization |
| Pedagogical accessibility | Hardest (group theory + abstract framework) | Easiest (extends single-particle QM step by step) | Cleanest mathematically, opaque physically |
| Generalization to non-abelian, Higgs | Multi-particle + multiplet gives YM directly; Higgs is a separate empirical postulate | Gauge generalization requires a leap of faith (no obvious route) | Symmetry structure built in by hand |
All three routes converge on the same physical content — the Wightman postulates and the Lagrangians of the Standard Model — but they organize the inputs differently. The modern route is the most economical (fewest primitive postulates) but requires the most mathematical machinery; the historical route is most intuitive but forces gauge invariance to be discovered later as a "lucky accident"; the Wightman axioms are the most rigorous but presuppose what other routes try to motivate.
9. What Comes Next
Given the postulates of §7, the next questions are:
- Specialize to a specific theory. Pick field content, internal symmetry, renormalizable Lagrangian. Worked examples:
- QED — with Dirac matter.
- QCD — with quark matter.
- Electroweak — with chiral fermions, broken to by the Higgs mechanism.
- Standard Model — the union with cross-sector content (anomaly cancellation, generations + GIM, CP problems).
- GUTs, supersymmetric extensions, effective theories (forthcoming).
- Extract observables. S-matrix elements via LSZ + Feynman rules; cross sections via cross-sections.md, decay rates via decay-rates.md; the broader observable inventory is in observables.md.
- Address foundational caveats. Haag's theorem, rigorous construction in , measurement in QFT — see QFT/remarks.md.
- Move beyond. EFT / Wilsonian framing; QFT as the universal IR description of relativistic systems; possible UV completion (string theory, asymptotic safety).