Definitions and Preliminaries
Familiarity with non-relativistic quantum mechanics (Hilbert spaces, Hermitian operators, the Born rule, etc.) is assumed — see the QM notes. This page collects the additional structures specific to QFT.
Minkowski Spacetime
QFT is formulated on Minkowski spacetime with metric
A spacetime point is denoted (with ). The invariant interval classifies separations as timelike (), lightlike (), or spacelike (). Greek indices run over and are raised/lowered with ; repeated indices are summed (Einstein convention).
Group Theory Prerequisites
The Lorentz, Poincaré, and internal symmetry groups (e.g. , , ) used throughout QFT are all instances of Lie groups with associated Lie algebras and representations. The abstract definitions — group axioms, subgroups and quotients, homomorphisms, representations (unitary, irreducible, projective), the standard matrix groups, Lie groups and Lie algebras (generators, structure constants, Casimirs), direct and semidirect products, compactness, connectedness and universal covers, and worked examples ( and ) — are collected separately in math/group-theory.md. The rest of this page assumes that material as background.
Lorentz and Poincaré Groups
The Lorentz and Poincaré groups are the symmetry groups of special relativity, and are the geometric input of every relativistic quantum theory. They are 6- and 10-dimensional Lie groups respectively; this section catalogues their components, generators, algebra, representations, and the Casimir invariants used to label particles.
The Lorentz group
The Lorentz group consists of linear transformations preserving the metric:
It is a 6-dimensional non-compact Lie group with four disconnected components, distinguished by two -valued discrete invariants:
- — proper () vs. improper ().
- — orthochronous (, preserves the direction of time) vs. non-orthochronous ().
The four components are connected to one another by the discrete operations:
| Component | Symbol | Contains | Got from by |
|---|---|---|---|
| Proper, orthochronous | identity, rotations, boosts | (identity component) | |
| Improper, orthochronous | parity-flipped rotations | ||
| Proper, non-orthochronous | combined rotations | ||
| Improper, non-orthochronous | time-flipped rotations |
The proper orthochronous Lorentz group is the identity-component subgroup; the discrete symmetries and are separate (and may or may not be symmetries of a given physical theory — the weak interaction violates both).
The Poincaré group
Spacetime translations commute with each other but not with Lorentz transformations: combining a Lorentz transformation followed by a translation gives , and the order matters. This is captured by the semidirect product structure:
with group multiplication
The four-dimensional translation subgroup is normal in ; the Lorentz subgroup is not (boosting and translating do not commute). is 10-dimensional (6 Lorentz + 4 translations) and has the same four-component structure as the Lorentz group.
The proper orthochronous Poincaré group is the connected identity component. It is the symmetry group of relativistic physics (excluding the discrete operations, which are treated separately).
Lie algebra and generators
The Lie algebra of has 10 generators:
- 4 translation generators (energy–momentum), .
- 6 Lorentz generators , conventionally split into:
- 3 rotation generators (angular momentum),
- 3 boost generators .
The defining commutation relations are
In rotation/boost form these read:
The combinations satisfy two independent algebras:
so the complexified Lorentz algebra factorizes as . Finite-dimensional representations are therefore labelled by two half-integers — see Representations below. (The are not Hermitian on their own, since is anti-Hermitian in unitary representations; the factorization is at the level of the complexified algebra.)
Universal cover
The proper orthochronous Lorentz group is doubly connected: there are loops in it (e.g. rotations) that cannot be continuously contracted to a point, but all such loops contract after being traversed twice (a rotation). Its universal cover is
where the kernel of the covering map is . The Poincaré universal cover is correspondingly .
Why we care. Single-valued representations of are double-valued representations of — i.e. spinor representations. To accommodate spin- particles (electrons, quarks, neutrinos) the relevant covering group is , not . This is the deep reason a rotation flips the sign of a spinor wavefunction: at the level of the fundamental physical group, " rotation" is not the identity but .
Representations
Finite-dimensional irreducible representations of (= projective irreps of ) are labelled by a pair with , of dimension . The most-used cases:
| Dimension | Field type | Examples | |
|---|---|---|---|
| 1 | Lorentz scalar | Higgs, pion | |
| 2 | left-handed Weyl spinor | left-handed neutrino field | |
| 2 | right-handed Weyl spinor | right-handed component of the electron | |
| 4 | Dirac spinor | electron, quark | |
| 4 | 4-vector | photon , | |
| 6 | self-dual + anti-self-dual 2-form | field strength | |
| 12 | Rarita–Schwinger spinor-vector | gravitino | |
| 9 | symmetric traceless tensor | graviton (linearized) |
The spin of a finite-dimensional representation is ; equivalently, under the rotation subgroup , the rep decomposes as .
These are the representations carried by classical fields and by field operators in QFT. They are all non-unitary (because is non-compact), which is why they describe field components, not Hilbert-space states. Unitary representations of the Poincaré group are infinite-dimensional and are labelled differently — see Wigner's Classification below.
Casimir invariants
The Poincaré algebra has two independent Casimir operators (commuting with all generators):
- — the mass-squared invariant. On any irreducible unitary representation it acts as a scalar .
- , where is the Pauli–Lubanski vector. On an irrep with it acts as with the spin; on a massless irrep its eigenvalues are different (see Wigner classification below).
These two scalars are the only Poincaré-invariant labels of single-particle states, and they are the basis of Wigner's classification.
Little groups
For each non-zero momentum , the little group is the subgroup of Lorentz transformations leaving fixed. Wigner's strategy is to classify states first at a standard momentum (a representative of each Lorentz orbit) by their little-group transformation, then propagate to all other momenta by Lorentz boost.
| Orbit of | Standard | Little group | Physical interpretation |
|---|---|---|---|
| , | (rotations) | massive particle, spin | |
| , | (2D Euclidean) | massless particle, helicity | |
| tachyon (unphysical) | |||
| vacuum | |||
| , but limit | — | "continuous-spin" reps (not observed) |
The little group structure is what produces the discrete spin/helicity quantum numbers attached to single-particle states.
Why this matters
Three things in QFT all trace back to the Poincaré group structure laid out above:
- Wigner classification of single-particle states (next subsection) — uses the Casimirs and little groups.
- Field representations — the table above lists which classical/operator-valued field types are available, used in QFT Postulate 5 and in specifying field content for any specific theory (QED, QCD, ...).
- Spin–statistics, , gauge invariance for massless spin-1 — all derive from the Poincaré structure plus locality / cluster decomposition in the modern story (foundations-modern.md).
Wigner's Classification of Particles
A particle is identified with an irreducible unitary representation of the Poincaré group on a complex separable Hilbert space. Wigner's theorem (1939) classifies these irreps. The result is that every physical irrep is labelled by two Casimirs (mass and spin/helicity); the structure of the classification follows from Mackey's induced-representation theorem applied to the semidirect-product structure .
Statement
The irreducible unitary representations of (or, more precisely, of its universal cover) are classified by:
- Mass-squared with , the eigenvalue of the first Casimir.
- A choice of irrep of the little group of a standard momentum on the corresponding mass shell:
- : little group , irreps labelled by spin .
- : little group , irreps labelled by helicity (with helicity quantized in integer/half-integer units by rotation closure of the universal cover; "continuous-spin" reps are mathematically allowed but empirically absent).
- (tachyons): little group — unphysical, excluded by the spectrum condition (P2).
- : trivial rep — the vacuum.
Sketch of derivation (Mackey induction)
The classification proceeds in five steps. Each is mechanical given the Lie-algebra and topology data already in § Lorentz and Poincaré Groups; we sketch the logic and defer technical proofs to references.
Step 1 — Diagonalize translations. is abelian and normal; on any unitary representation it can be simultaneously diagonalized, with eigenvalues labelled by a four-momentum (the spectrum of ). So the Hilbert space decomposes as a direct integral
with labelling extra degrees of freedom at each .
Step 2 — Lorentz orbits. Lorentz transformations move around. Irreducibility forces the support of to be a single Lorentz orbit (otherwise the representation would split into pieces supported on different orbits). The orbits are:
| Orbit | Standard | Sign of |
|---|---|---|
| Massive forward | , | |
| Massless forward | , | |
| Tachyonic | ||
| Trivial |
(Backward-pointing orbits with are excluded by the spectrum condition.)
Step 3 — Pick a standard momentum and identify the little group. For each orbit , pick a representative and define the little group
— the Lorentz transformations that fix . The little groups for each orbit (computed by direct algebra):
| Orbit | Universal cover acting in irreps | |
|---|---|---|
| covered by | ||
The relevant little group is the universal cover (since we want projective reps of , ordinary reps of its universal cover; see math/group-theory.md § Connectedness and discrete components).
Step 4 — Mackey induction: irreps of ↔ irreps of . Mackey's induced-representation theorem (a general result for semidirect products with abelian) states:
The irreducible unitary representations of supported on a Lorentz orbit are in bijective correspondence with the irreducible unitary representations of the little group of any standard momentum .
Concretely, given an irrep of , the induced rep is built on the Hilbert space . A boost acts by , where the Wigner rotation is the little-group-valued rotation that compensates for the -dependent boost (see Weinberg Vol. 1 §2.5 for the explicit construction). Irreducibility on the side ⇔ irreducibility of on the side.
Step 5 — Classify irreps of each little group.
- (massive): irreps are the spin- reps with of dimension . (Standard representation theory; see math/group-theory.md § Worked examples: and .)
- Universal cover of (massless): the abelian subgroup has unitary characters labelled by a vector . Two cases:
- : trivial action, leaving the rotation generator to label irreps by helicity . For the cover, is quantized in .
- : the continuous-spin representations, parameterized by . Allowed mathematically, not observed in nature (their absence is empirical, sometimes posed as an extra "Wigner condition").
- (tachyonic): irreps exist but tachyonic states violate causality — excluded by the spectrum condition (P2).
Conclusion
Combining Steps 1–5: every irreducible unitary representation of supported on a forward orbit (massive or massless, discrete) is labelled by
- mass , and
- spin (massive) or helicity (massless).
This is Wigner's classification. The labels are exactly the two Casimirs and (for ) where is the Pauli–Lubanski vector. The full proof — including the technical analysis of induced representations, projective representations, and continuous-spin exclusion — is laid out in:
- Weinberg, The Quantum Theory of Fields, Vol. 1, Ch. 2 (the standard physics treatment).
- Streater & Wightman, PCT, Spin and Statistics, and All That, Ch. 1 (axiomatic version).
- Tung, Group Theory in Physics, Chs. 9–10 (representation-theoretic emphasis).
- Bargmann–Wigner (1948), the original induced-representation construction.
Classical Fields and Lagrangians
A classical field is a function taking values in some target space carrying a representation of the Lorentz group:
- Scalar field : trivial representation, e.g. the Higgs field.
- Spinor field : spin- representation of , e.g. the Dirac field.
- Vector field : the four-vector representation, e.g. the electromagnetic potential.
- Tensor / spinor-tensor fields: higher-spin generalizations.
Dynamics are encoded in a Lagrangian density , a Lorentz scalar, with action . The classical equations of motion follow from the Euler–Lagrange equations
Worked example: free real scalar field and the Klein–Gordon equation
The simplest non-trivial Lagrangian field theory is a single real scalar with the free scalar Lagrangian
This is essentially uniquely fixed by demanding: Lorentz invariance, polynomial in and , at most two derivatives (for second-order field equations), reflection symmetry , and a kinetic term with conventional sign and normalization. Applying the Euler–Lagrange equation gives
i.e. the Klein–Gordon equation
In this Lagrangian route the Klein–Gordon equation is derived, not postulated; the postulate has moved from "the equation" to "the Lagrangian ".
Three routes to Klein–Gordon
For completeness, the Klein–Gordon equation can be reached three ways, with the genuine input shifting in each:
| Route | KG status | Genuine input |
|---|---|---|
| A. Canonical-substitution heuristic | Postulated by analogy | Take the classical dispersion and apply to a wavefunction. The substitution rule and the choice over are themselves not derived from anything. See QED/historical.md § 1.1 for the historical version. |
| B. Lagrangian field theory | Derived from Euler–Lagrange | The scalar-field Lagrangian . (This subsection.) |
| C. Wigner / Casimir | Theorem | The definition of "spin-0 massive particle" as a Poincaré irrep with first Casimir . On a position-space realization , with translations acting as , this Casimir constraint is . The mass-shell condition from Wigner's Classification automatically forces KG on any scalar interpolating field. |
All three give the same equation. The physical input differs: a substitution rule (A), a choice of Lagrangian (B), or the Casimir-eigenvalue definition of a massive spin-0 species (C). Routes B and C are the modern viewpoint; Route A survives only as a heuristic motivating Dirac's first-order ansatz in QED-historical §1.1.
As a field operator
After quantization (whether canonical or in the Wigner-construction sense), is promoted to an operator-valued distribution on Fock space, still satisfying as an operator equation. Its mode expansion and ladder structure are given below in § Fock Space; its propagator enters Feynman-rule calculations; the Klein–Gordon operator reappears as the external-leg amputation operator in § LSZ Reduction Formula.
The free Dirac field is the spin- analogue (postulating instead, derived in QED/historical.md § 1.4); the free photon field is the massless spin-1 analogue, from in QED/historical.md § 0.2. Each is built by the same Route-B recipe: write down the simplest Lorentz-scalar Lagrangian in the appropriate field, derive the EOM, quantize.
Canonical Structure
The conjugate momentum to a field is
The classical Hamiltonian density is .
Operator-Valued Distributions
In QFT, fields cannot be ordinary operator-valued functions of — products like are too singular. Instead, is an operator-valued tempered distribution: it is well-defined only after smearing against a Schwartz test function ,
yielding an (unbounded) operator on a dense domain .
Fock Space
For a free field of mass and spin , the Hilbert space is the Fock space
or with antisymmetrization for fermions. Here is the one-particle Hilbert space (an irreducible Wigner representation). Fock space is built from a vacuum via creation () and annihilation () operators satisfying canonical (anti)commutation relations:
Terminology — ladder operators. The pair is collectively called ladder operators (or raising/lowering operators): raises the particle number by one (), lowers it (, with ). The name comes from the harmonic oscillator in QM (see QM/heisenberg-picture.md), where the same algebra moves between energy eigenstates on the "ladder" of equally spaced levels. The QFT usage is the same algebra applied per Fourier mode: each momentum mode of a free field is an independent harmonic oscillator, and are its ladder operators. Particle number is the count of excitations across all modes. In QFT-specific contexts "creation/annihilation operators" is more common than "ladder operators", but the terms are interchangeable.
A free scalar field admits the mode expansion
In an interacting theory, no such Fock space exists for the full interacting field (this is the content of Haag's theorem — see Remarks), but Fock spaces remain the appropriate description for asymptotic in/out states.
Vacuum
The vacuum is the lowest-energy, Poincaré-invariant state. For a free theory it is annihilated by all . In an interacting theory the physical vacuum differs from the free (Fock) vacuum and is in general unitarily inequivalent to it.
States vs. Fields: Why QFT Looks "Operator-Heavy"
A reader coming from non-relativistic QM may notice that QFT seems to focus almost entirely on operators (the field operators , , and their correlators) while saying very little about states. This is real — and worth being explicit about.
| Non-relativistic QM | QFT | |
|---|---|---|
| Primary object | State (or wavefunction ) | Field operators |
| Time evolution | State evolves (Schrödinger picture) | Operators evolve (Heisenberg picture) |
| What you compute | , transition amplitudes | , then S-matrix elements |
The state is still primary in principle, but in practice it is usually fixed implicitly to be one of:
- the vacuum — for vacuum correlation functions and most perturbative computations,
- an asymptotic Fock state — for S-matrix calculations,
- a coherent state of a bosonic field — for connecting to classical fields and for IR problems,
- a bound-state wavefunction (positronium, hydrogen) — handled non-perturbatively (Bethe–Salpeter), and rarely written down explicitly,
- a density matrix — for thermal QFT, decoherence, and open systems.
Several reasons drive the operator-centric emphasis:
- Heisenberg picture is manifestly Lorentz-covariant. Putting all spacetime dependence into operators avoids singling out the time slice that the Schrödinger picture requires.
- The state space is Fock space, not . A multi-particle state is a function over arbitrary numbers of particles with arbitrary momenta; there is no useful single "wavefunction in position space" to write down.
- Most observables of interest are scattering amplitudes. Prepare an asymptotic in-state, evolve, take the overlap with an asymptotic out-state — the details of the state during the interaction never appear; only does, computed from operator correlators via LSZ.
- Haag's theorem (see Remarks) says the interacting vacuum and Fock states are unitarily inequivalent to the free ones — there is no concrete Hilbert space on which to write down "the interacting state." So one works with operator correlators and asymptotic states only.
Warning: the Field Operator Is Not a Wavefunction
This is a major terminological collision. In some derivations of QED (notably the historical / Dirac-equation route), starts out as a single-particle relativistic wavefunction — a state. After second quantization, the symbol is reused to denote a field operator on Fock space. From that point on:
- is an operator, not a state.
- is the (improper) state of one particle localized at .
- The matrix element is a Dirac wavefunction, but it's the matrix element of the field operator between two particular states — not the state itself.
Conflating these two meanings of is one of the most common sources of confusion when crossing from QM to QFT.
Time-Ordering Operator
The time-ordering operator is the instruction to permute a product of operators so that those with larger time arguments stand to the left:
where is the permutation that gives . For the two-operator case:
For fermionic operators an extra sign from the permutation is included (so ).
is not an operator on Fock space — it is a notational rule for handling non-commuting operators at different times. It plays a central role in:
- The Dyson series for the S-matrix, (see QED/historical.md §0.6).
- Time-ordered correlators / Green's functions (next subsection), which are the inputs to the LSZ reduction formula.
- Path-integral derivations of operator correlators (see QM/path-integral.md §1.6 for the full discussion in the simpler QM setting).
Notation collision warning. The same symbol is used in QFT for the transition operator — a genuine operator on Fock space — that appears in the splitting . The two meanings are entirely unrelated; context disambiguates: time-ordering always sits in front of a product of operators ( or ); transition sits between a bra and a ket as . See QED/historical.md §0.6 for the corresponding callout in context.
Correlation Functions
The fundamental observables of QFT are vacuum expectation values of products of fields:
- Wightman functions: .
- Time-ordered (Green's) functions: , where orders fields by decreasing (with a sign for fermion exchanges).
Time-ordered correlators are what enter the LSZ reduction formula (next subsection) to compute -matrix elements.
LSZ Reduction Formula
The Lehmann–Symanzik–Zimmermann (LSZ) reduction formula is the bridge between time-ordered correlators of local fields (what perturbation theory and Feynman rules naturally compute) and S-matrix elements between asymptotic states (what observables require).
Setup. Pick any local field with non-zero matrix element to a one-particle state of the species of interest:
Such a is called an interpolating field for that species. Different choices of give the same on-shell -matrix elements (this is the LSZ equivalence of field redefinitions).
The formula for external particles (in the simplest scalar case; spin and species labels suppressed):
In words: each external leg contributes a Klein–Gordon operator (which on the mass shell extracts the residue of the single-particle pole in momentum space) plus a Fourier factor; everything else is the time-ordered -point correlator.
What it actually says. When the external momenta are on-shell (, ), the time-ordered correlator has poles in each external momentum at from the propagation of single-particle intermediate states (the Källén–Lehmann spectral representation). The LSZ recipe extracts the residue at all those poles simultaneously and identifies it with the on-shell -matrix element.
Why this is significant.
- No Hamiltonian required. LSZ takes correlators as input. Correlators can be computed from a Lagrangian via path integrals, but they could equally come from lattice simulations, the conformal bootstrap, integrability, or any other source. So LSZ provides an -free route to -matrix elements (compare the Møller-operator construction in foundations-modern.md §2.0.1, which does require ).
- Field-redefinition equivalence. Replacing for any local function leaves on-shell -matrix elements invariant. This is why effective field theories with different operator bases can describe identical physics, and why the "fundamental field" choice is largely conventional.
- Bridges Feynman rules to observables. Step 6 of any QFT calculation (in QED, QED/historical.md §5.2, and elsewhere) implicitly uses LSZ to convert amputated time-ordered diagrams into -matrix elements: the operators on the external legs cancel against the external propagators of an amputated diagram, leaving exactly (up to wavefunction renormalization factors ).
For the full derivation see Peskin–Schroeder Ch. 7.2 or Weinberg Vol. 1 §10.3. For its role in pinning down asymptotic states without invoking , see foundations-modern.md §2.0.1.
S-Matrix and Cross Sections
The S-matrix maps asymptotic in-states (free particles in the far past) to asymptotic out-states (free particles in the far future):
Writing with , the invariant amplitude determines physical observables: differential cross sections , decay rates , etc. The full master formulas connecting to measurable rates and the necessary kinematic ingredients (flux factor, Lorentz-invariant phase space, units, the optical theorem) are collected separately in Cross Sections and Decay Rates; the broader observable inventory is in Observables.
Symmetries and Noether Currents
A continuous symmetry of the action implies, by Noether's theorem, the existence of a conserved current with and a conserved charge . In the quantum theory, generates the symmetry on operators via .
Symmetries are classified as:
- Global (parameter independent of ) vs. local / gauge (parameter depends on ).
- Internal (acting on field indices) vs. spacetime (Poincaré, conformal, ...).
- Continuous (Lie group) vs. discrete (, , ).
Gauge Fields
A gauge theory has a local internal symmetry . To make the Lagrangian invariant one introduces a gauge field valued in the Lie algebra of , and a covariant derivative , where are generators of in the relevant representation. The field strength
(with structure constants ) generalizes the electromagnetic . Quantization requires gauge fixing to remove redundant degrees of freedom.
Regularization and Renormalization
Naive computations in interacting QFT yield divergent loop integrals. Regularization (cutoff, dimensional, Pauli–Villars, lattice) parametrizes the divergences; renormalization absorbs them into a redefinition of a finite number of parameters (masses, couplings, field normalizations). A theory is renormalizable if this can be done with finitely many counterterms; otherwise it is an effective field theory valid only below some energy scale.
The renormalization group describes how renormalized parameters depend on the chosen energy scale , governed by beta functions .