Definitions and Preliminaries

Familiarity with non-relativistic quantum mechanics (Hilbert spaces, Hermitian operators, the Born rule, etc.) is assumed — see the QM notes. This page collects the additional structures specific to QFT.

Minkowski Spacetime

QFT is formulated on Minkowski spacetime $M^{4} = (R^{4}, η)$ with metric

$η_{μν} = diag (+ 1, - 1, - 1, - 1) .$

A spacetime point is denoted $x = (x^{0}, x) = (t, x)$ (with $c = 1$ ). The invariant interval $x^{2} = η_{μν} x^{μ} x^{ν}$ classifies separations as timelike ( $x^{2} > 0$ ), lightlike ( $x^{2} = 0$ ), or spacelike ( $x^{2} < 0$ ). Greek indices run over $0, 1, 2, 3$ and are raised/lowered with $η$ ; repeated indices are summed (Einstein convention).

Group Theory Prerequisites

The Lorentz, Poincaré, and internal symmetry groups (e.g. $U (1)$ , $S U (2)$ , $S U (3)$ ) used throughout QFT are all instances of Lie groups with associated Lie algebras and representations. The abstract definitions — group axioms, subgroups and quotients, homomorphisms, representations (unitary, irreducible, projective), the standard matrix groups, Lie groups and Lie algebras (generators, structure constants, Casimirs), direct and semidirect products, compactness, connectedness and universal covers, and worked examples ( $SO (2)$ and $S U (2)$ ) — are collected separately in math/group-theory.md. The rest of this page assumes that material as background.

Lorentz and Poincaré Groups

The Lorentz and Poincaré groups are the symmetry groups of special relativity, and are the geometric input of every relativistic quantum theory. They are 6- and 10-dimensional Lie groups respectively; this section catalogues their components, generators, algebra, representations, and the Casimir invariants used to label particles.

The Lorentz group

The Lorentz group $O (1, 3)$ consists of linear transformations $Λ : M^{4} \to M^{4}$ preserving the metric:

$Λ^{μ}_{ρ} Λ^{ν}_{σ} η_{μν} = η_{ρ σ} ⟺ Λ^{T} η Λ = η .$

It is a 6-dimensional non-compact Lie group with four disconnected components, distinguished by two $Z_{2}$ -valued discrete invariants:

$det Λ = \pm 1$ — proper ( $+ 1$ ) vs. improper ( $- 1$ ).
$sign (Λ^{0}_{0}) = \pm 1$ — orthochronous ( $+ 1$ , preserves the direction of time) vs. non-orthochronous ( $- 1$ ).

The four components are connected to one another by the discrete operations:

Component	Symbol	Contains	Got from $L_{+}^{↑}$ by
Proper, orthochronous	$L_{+}^{↑} = S O^{+} (1, 3)$	identity, rotations, boosts	(identity component)
Improper, orthochronous	$L_{-}^{↑}$	parity-flipped rotations	$P = diag (+ 1, - 1, - 1, - 1)$
Proper, non-orthochronous	$L_{+}^{↓}$	combined $PT$ rotations	$PT = - 1$
Improper, non-orthochronous	$L_{-}^{↓}$	time-flipped rotations	$T = diag (- 1, + 1, + 1, + 1)$

The proper orthochronous Lorentz group $L_{+}^{↑}$ is the identity-component subgroup; the discrete symmetries $P$ and $T$ are separate (and may or may not be symmetries of a given physical theory — the weak interaction violates both).

The Poincaré group

Spacetime translations $x^{μ} \to x^{μ} + a^{μ}$ commute with each other but not with Lorentz transformations: combining a Lorentz transformation followed by a translation gives $x^{μ} \to Λ^{μ}_{ν} x^{ν} + a^{μ}$ , and the order matters. This is captured by the semidirect product structure:

$P = R^{1, 3} ⋊ L,$

with group multiplication

$(Λ_{1}, a_{1}) \cdot (Λ_{2}, a_{2}) = (Λ_{1} Λ_{2}, a_{1} + Λ_{1} a_{2}) .$

The four-dimensional translation subgroup $R^{1, 3}$ is normal in $P$ ; the Lorentz subgroup $L$ is not (boosting and translating do not commute). $P$ is 10-dimensional (6 Lorentz + 4 translations) and has the same four-component structure as the Lorentz group.

The proper orthochronous Poincaré group $P_{+}^{↑} = R^{1, 3} ⋊ L_{+}^{↑}$ is the connected identity component. It is the symmetry group of relativistic physics (excluding the discrete $C, P, T$ operations, which are treated separately).

Lie algebra and generators

The Lie algebra $p$ of $P_{+}^{↑}$ has 10 generators:

4 translation generators $P^{μ}$ (energy–momentum), $μ = 0, 1, 2, 3$ .
6 Lorentz generators $J^{μν} = - J^{νμ}$ , conventionally split into:
- 3 rotation generators $J^{i} \equiv \frac{1}{2} ϵ^{ijk} J^{jk}$ (angular momentum),
- 3 boost generators $K^{i} \equiv J^{0 i}$ .

The defining commutation relations are

$[P^{μ}, P^{ν}] = 0, [J^{μν}, P^{ρ}] = i (η^{ν ρ} P^{μ} - η^{μ ρ} P^{ν}),$

$[J^{μν}, J^{ρ σ}] = i (η^{ν ρ} J^{μ σ} - η^{μ ρ} J^{ν σ} - η^{ν σ} J^{μ ρ} + η^{μ σ} J^{ν ρ}) .$

In rotation/boost form these read:

$[J^{i}, J^{j}] = i ϵ^{ijk} J^{k}, [J^{i}, K^{j}] = i ϵ^{ijk} K^{k}, [K^{i}, K^{j}] = - i ϵ^{ijk} J^{k}, [P^{0}, J^{i}] = 0, [P^{0}, K^{i}] = i P^{i} .$

The combinations $J_{\pm}^{i} = \frac{1}{2} (J^{i} \pm i K^{i})$ satisfy two independent $su (2)$ algebras:

$[J_{+}^{i}, J_{+}^{j}] = i ϵ^{ijk} J_{+}^{k}, [J_{-}^{i}, J_{-}^{j}] = i ϵ^{ijk} J_{-}^{k}, [J_{+}^{i}, J_{-}^{j}] = 0,$

so the complexified Lorentz algebra factorizes as $so (1, 3)_{C} ≅ su (2) \oplus su (2)$ . Finite-dimensional representations are therefore labelled by two half-integers $(j_{+}, j_{-})$ — see Representations below. (The $J_{\pm}^{i}$ are not Hermitian on their own, since $K^{i}$ is anti-Hermitian in unitary representations; the factorization is at the level of the complexified algebra.)

Universal cover $S L (2, C)$

The proper orthochronous Lorentz group $L_{+}^{↑} = S O^{+} (1, 3)$ is doubly connected: there are loops in it (e.g. $2 π$ rotations) that cannot be continuously contracted to a point, but all such loops contract after being traversed twice (a $4 π$ rotation). Its universal cover is

$S L (2, C) 2 : 1 L_{+}^{↑},$

where the kernel of the covering map is ${\pm 1} \subset S L (2, C)$ . The Poincaré universal cover is correspondingly $R^{1, 3} ⋊ S L (2, C)$ .

Why we care. Single-valued representations of $S L (2, C)$ are double-valued representations of $S O^{+} (1, 3)$ — i.e. spinor representations. To accommodate spin- $\frac{1}{2}$ particles (electrons, quarks, neutrinos) the relevant covering group is $S L (2, C)$ , not $S O^{+} (1, 3)$ . This is the deep reason a $2 π$ rotation flips the sign of a spinor wavefunction: at the level of the fundamental physical group, " $2 π$ rotation" is not the identity but $- 1$ .

Representations

Finite-dimensional irreducible representations of $S L (2, C)$ (= projective irreps of $L_{+}^{↑}$ ) are labelled by a pair $(j_{+}, j_{-})$ with $j_{\pm} \in \frac{1}{2} Z_{\geq 0}$ , of dimension $(2 j_{+} + 1) (2 j_{-} + 1)$ . The most-used cases:

$(j_{+}, j_{-})$	Dimension	Field type	Examples
$(0, 0)$	1	Lorentz scalar	Higgs, pion
$(\frac{1}{2}, 0)$	2	left-handed Weyl spinor	left-handed neutrino field
$(0, \frac{1}{2})$	2	right-handed Weyl spinor	right-handed component of the electron
$(\frac{1}{2}, 0) \oplus (0, \frac{1}{2})$	4	Dirac spinor	electron, quark
$(\frac{1}{2}, \frac{1}{2})$	4	4-vector	photon $A_{μ}$ , $W^{\pm}, Z^{0}$
$(1, 0) \oplus (0, 1)$	6	self-dual + anti-self-dual 2-form	field strength $F_{μν}$
$(1, \frac{1}{2}) \oplus (\frac{1}{2}, 1)$	12	Rarita–Schwinger spinor-vector	gravitino
$(1, 1)$	9	symmetric traceless tensor	graviton $h_{μν}$ (linearized)

The spin of a finite-dimensional representation is $j_{+} + j_{-}$ ; equivalently, under the rotation subgroup $SO (3) \subset L_{+}^{↑}$ , the rep $(j_{+}, j_{-})$ decomposes as $j_{+} \otimes j_{-} = ∣ j_{+} - j_{-} ∣ \oplus \dots \oplus (j_{+} + j_{-})$ .

These are the representations carried by classical fields and by field operators in QFT. They are all non-unitary (because $S L (2, C)$ is non-compact), which is why they describe field components, not Hilbert-space states. Unitary representations of the Poincaré group are infinite-dimensional and are labelled differently — see Wigner's Classification below.

Casimir invariants

The Poincaré algebra has two independent Casimir operators (commuting with all generators):

$P^{2} \equiv P^{μ} P_{μ}$ — the mass-squared invariant. On any irreducible unitary representation it acts as a scalar $m^{2} \geq 0$ .
$W^{2} \equiv W^{μ} W_{μ}$ , where $W^{μ} \equiv - \frac{1}{2} ϵ^{μν ρ σ} J_{ν ρ} P_{σ}$ is the Pauli–Lubanski vector. On an irrep with $m > 0$ it acts as $- m^{2} s (s + 1)$ with $s$ the spin; on a massless irrep its eigenvalues are different (see Wigner classification below).

These two scalars are the only Poincaré-invariant labels of single-particle states, and they are the basis of Wigner's classification.

Little groups

For each non-zero momentum $p^{μ}$ , the little group $G_{p} \subset L_{+}^{↑}$ is the subgroup of Lorentz transformations leaving $p^{μ}$ fixed. Wigner's strategy is to classify states first at a standard momentum (a representative of each Lorentz orbit) by their little-group transformation, then propagate to all other momenta by Lorentz boost.

Orbit of $p^{μ}$	Standard $p^{μ}$	Little group	Physical interpretation
$p^{2} = m^{2} > 0$ , $p^{0} > 0$	$(m, 0)$	$SO (3)$ (rotations)	massive particle, spin $s$
$p^{2} = 0$ , $p^{0} > 0$	$(ω, 0, 0, ω)$	$I SO (2)$ (2D Euclidean)	massless particle, helicity $h$
$p^{2} = - ∣ p ∣^{2} < 0$	$(0, 0, 0, ∣ p ∣)$	$SO (1, 2)$	tachyon (unphysical)
$p^{μ} = 0$	$0$	$L_{+}^{↑}$	vacuum
$p^{2} = 0$ , $p^{μ} = 0$ but $p^{μ} \to 0$ limit	—	$I SO (2)$	"continuous-spin" reps (not observed)

The little group structure is what produces the discrete spin/helicity quantum numbers attached to single-particle states.

Why this matters

Three things in QFT all trace back to the Poincaré group structure laid out above:

Wigner classification of single-particle states (next subsection) — uses the Casimirs and little groups.
Field representations $(j_{+}, j_{-})$ — the table above lists which classical/operator-valued field types are available, used in QFT Postulate 5 and in specifying field content for any specific theory (QED, QCD, ...).
Spin–statistics, $CPT$ , gauge invariance for massless spin-1 — all derive from the Poincaré structure plus locality / cluster decomposition in the modern story (foundations-modern.md).

Wigner's Classification of Particles

A particle is identified with an irreducible unitary representation of the Poincaré group on a complex separable Hilbert space. Wigner's theorem (1939) classifies these irreps. The result is that every physical irrep is labelled by two Casimirs (mass and spin/helicity); the structure of the classification follows from Mackey's induced-representation theorem applied to the semidirect-product structure $P_{+}^{↑} = R^{1, 3} ⋊ S L (2, C)$ .

Statement

The irreducible unitary representations of $P_{+}^{↑}$ (or, more precisely, of its universal cover) are classified by:

Mass-squared $P^{2} = m^{2}$ with $m \geq 0$ , the eigenvalue of the first Casimir.
A choice of irrep of the little group of a standard momentum on the corresponding mass shell:
- $m > 0$ : little group $SO (3)$ , irreps labelled by spin $j \in {0, \frac{1}{2}, 1, \frac{3}{2}, \dots}$ .
- $m = 0$ : little group $I SO (2)$ , irreps labelled by helicity $h \in \frac{1}{2} Z$ (with helicity quantized in integer/half-integer units by $4 π$ rotation closure of the universal cover; "continuous-spin" reps are mathematically allowed but empirically absent).
- $m^{2} < 0$ (tachyons): little group $SO (1, 2)$ — unphysical, excluded by the spectrum condition (P2).
- $p^{μ} = 0$ : trivial rep — the vacuum.

Sketch of derivation (Mackey induction)

The classification proceeds in five steps. Each is mechanical given the Lie-algebra and topology data already in § Lorentz and Poincaré Groups; we sketch the logic and defer technical proofs to references.

Step 1 — Diagonalize translations. $R^{1, 3} \subset P_{+}^{↑}$ is abelian and normal; on any unitary representation it can be simultaneously diagonalized, with eigenvalues labelled by a four-momentum $p^{μ}$ (the spectrum of $P^{μ}$ ). So the Hilbert space decomposes as a direct integral

$H = \int^{\oplus} d μ (p) H_{p}, P^{μ} ∣ p, σ ⟩ = p^{μ} ∣ p, σ ⟩,$

with $σ$ labelling extra degrees of freedom at each $p$ .

Step 2 — Lorentz orbits. Lorentz transformations move $p$ around. Irreducibility forces the support of $μ$ to be a single Lorentz orbit $O \subset R^{1, 3}$ (otherwise the representation would split into pieces supported on different orbits). The orbits are:

Orbit	Standard $\overset{p}{ˉ}^{μ}$	Sign of $p^{2}$
Massive forward	$(m, 0)$ , $m > 0$	$p^{2} = m^{2} > 0, p^{0} > 0$
Massless forward	$(ω, 0, 0, ω)$ , $ω > 0$	$p^{2} = 0, p^{0} > 0$
Tachyonic	$(0, 0, 0, ∣ p ∣)$	$p^{2} < 0$
Trivial	$0$	$p^{μ} = 0$

(Backward-pointing orbits with $p^{0} < 0$ are excluded by the spectrum condition.)

Step 3 — Pick a standard momentum and identify the little group. For each orbit $O$ , pick a representative $\overset{p}{ˉ}$ and define the little group

$G_{\overset{p}{ˉ}} = {Λ \in S L (2, C) : Λ \overset{p}{ˉ} = \overset{p}{ˉ}}$

— the Lorentz transformations that fix $\overset{p}{ˉ}$ . The little groups for each orbit (computed by direct algebra):

Orbit	$G_{\overset{p}{ˉ}}$	Universal cover acting in irreps
$(m, 0)$	$SO (3)$	$S U (2)$
$(ω, 0, 0, ω)$	$I SO (2)$	covered by $R^{2} ⋊ R$
$(0, 0, 0, ∣ p ∣)$	$SO (1, 2)$	$S U (1, 1)$

The relevant little group is the universal cover (since we want projective reps of $P_{+}^{↑}$ , ordinary reps of its universal cover; see math/group-theory.md § Connectedness and discrete components).

Step 4 — Mackey induction: irreps of $P_{+}^{↑}$ ↔ irreps of $G_{\overset{p}{ˉ}}$ . Mackey's induced-representation theorem (a general result for semidirect products $N ⋊ G$ with $N$ abelian) states:

The irreducible unitary representations of $P_{+}^{↑}$ supported on a Lorentz orbit $O$ are in bijective correspondence with the irreducible unitary representations of the little group $G_{\overset{p}{ˉ}}$ of any standard momentum $\overset{p}{ˉ} \in O$ .

Concretely, given an irrep $σ$ of $G_{\overset{p}{ˉ}}$ , the induced rep $Ind_{G_{\overset{p}{ˉ}}}^{P_{+}^{↑}} (σ)$ is built on the Hilbert space $L^{2} (O) \otimes V_{σ}$ . A boost $Λ$ acts by $Λ : (p, v) \mapsto (Λ p, W (Λ, p) v)$ , where the Wigner rotation $W (Λ, p) \in G_{\overset{p}{ˉ}}$ is the little-group-valued rotation that compensates for the $p$ -dependent boost (see Weinberg Vol. 1 §2.5 for the explicit construction). Irreducibility on the $P_{+}^{↑}$ side ⇔ irreducibility of $σ$ on the $G_{\overset{p}{ˉ}}$ side.

Step 5 — Classify irreps of each little group.

$G_{\overset{p}{ˉ}} = S U (2)$ (massive): irreps are the spin- $j$ reps with $j \in {0, \frac{1}{2}, 1, \frac{3}{2}, \dots}$ of dimension $2 j + 1$ . (Standard $S U (2)$ representation theory; see math/group-theory.md § Worked examples: $SO (2)$ and $S U (2)$ .)
Universal cover of $I SO (2)$ (massless): the abelian subgroup $R^{2}$ has unitary characters labelled by a vector $(a, b) \in R^{2}$ . Two cases:
- $(a, b) = 0$ : trivial action, leaving the rotation generator to label irreps by helicity $h$ . For the cover, $h$ is quantized in $\frac{1}{2} Z$ .
- $(a, b) \neq = 0$ : the continuous-spin representations, parameterized by $ρ = a^{2} + b^{2} > 0$ . Allowed mathematically, not observed in nature (their absence is empirical, sometimes posed as an extra "Wigner condition").
$S U (1, 1)$ (tachyonic): irreps exist but tachyonic states violate causality — excluded by the spectrum condition (P2).

Conclusion

Combining Steps 1–5: every irreducible unitary representation of $P_{+}^{↑}$ supported on a forward orbit (massive or massless, $h$ discrete) is labelled by

mass $m \geq 0$ , and
spin $j \in \frac{1}{2} Z_{\geq 0}$ (massive) or helicity $h \in \frac{1}{2} Z$ (massless).

This is Wigner's classification. The labels are exactly the two Casimirs $P^{2} = m^{2}$ and (for $m > 0$ ) $W^{2} = - m^{2} j (j + 1)$ where $W^{μ}$ is the Pauli–Lubanski vector. The full proof — including the technical analysis of induced representations, projective representations, and continuous-spin exclusion — is laid out in:

Weinberg, The Quantum Theory of Fields, Vol. 1, Ch. 2 (the standard physics treatment).
Streater & Wightman, PCT, Spin and Statistics, and All That, Ch. 1 (axiomatic version).
Tung, Group Theory in Physics, Chs. 9–10 (representation-theoretic emphasis).
Bargmann–Wigner (1948), the original induced-representation construction.

Classical Fields and Lagrangians

A classical field is a function $ϕ : M^{4} \to V$ taking values in some target space $V$ carrying a representation of the Lorentz group:

Scalar field $ϕ (x)$ : trivial representation, e.g. the Higgs field.
Spinor field $ψ_{α} (x)$ : spin- $\frac{1}{2}$ representation of $S L (2, C)$ , e.g. the Dirac field.
Vector field $A_{μ} (x)$ : the four-vector representation, e.g. the electromagnetic potential.
Tensor / spinor-tensor fields: higher-spin generalizations.

Dynamics are encoded in a Lagrangian density $L (ϕ, \partial_{μ} ϕ)$ , a Lorentz scalar, with action $S [ϕ] = \int d^{4} x L$ . The classical equations of motion follow from the Euler–Lagrange equations

$\frac{\partial L}{\partial ϕ} - \partial_{μ} \frac{\partial L}{\partial ( \partial _{μ} ϕ )} = 0.$

Worked example: free real scalar field and the Klein–Gordon equation

The simplest non-trivial Lagrangian field theory is a single real scalar $ϕ (x)$ with the free scalar Lagrangian

$L_{ϕ} = \frac{1}{2} \partial_{μ} ϕ \partial^{μ} ϕ - \frac{1}{2} m^{2} ϕ^{2} .$

This is essentially uniquely fixed by demanding: Lorentz invariance, polynomial in $ϕ$ and $\partial ϕ$ , at most two derivatives (for second-order field equations), reflection symmetry $ϕ \to - ϕ$ , and a kinetic term with conventional sign and normalization. Applying the Euler–Lagrange equation gives

$\partial_{μ} \frac{\partial L _{ϕ}}{\partial ( \partial _{μ} ϕ )} - \frac{\partial L _{ϕ}}{\partial ϕ} = \partial_{μ} \partial^{μ} ϕ + m^{2} ϕ = 0,$

i.e. the Klein–Gordon equation

$(□ + m^{2}) ϕ (x) = 0, □ \equiv \partial^{μ} \partial_{μ} .$

In this Lagrangian route the Klein–Gordon equation is derived, not postulated; the postulate has moved from "the equation" to "the Lagrangian $L_{ϕ}$ ".

Three routes to Klein–Gordon

For completeness, the Klein–Gordon equation can be reached three ways, with the genuine input shifting in each:

Route	KG status	Genuine input
A. Canonical-substitution heuristic	Postulated by analogy	Take the classical dispersion $E^{2} = p^{2} + m^{2}$ and apply $E \to i \partial_{t}, p \to - i \nabla$ to a wavefunction. The substitution rule and the choice $E^{2}$ over $E$ are themselves not derived from anything. See QED/historical.md § 1.1 for the historical version.
B. Lagrangian field theory	Derived from Euler–Lagrange	The scalar-field Lagrangian $L_{ϕ} = \frac{1}{2} (\partial ϕ)^{2} - \frac{1}{2} m^{2} ϕ^{2}$ . (This subsection.)
C. Wigner / Casimir	Theorem	The definition of "spin-0 massive particle" as a Poincaré irrep with first Casimir $P^{μ} P_{μ} = m^{2}$ . On a position-space realization $ϕ (x)$ , with translations acting as $P^{μ} \to - i \partial^{μ}$ , this Casimir constraint is $(□ + m^{2}) ϕ = 0$ . The mass-shell condition $p^{2} = m^{2}$ from Wigner's Classification automatically forces KG on any scalar interpolating field.

All three give the same equation. The physical input differs: a substitution rule (A), a choice of Lagrangian (B), or the Casimir-eigenvalue definition of a massive spin-0 species (C). Routes B and C are the modern viewpoint; Route A survives only as a heuristic motivating Dirac's first-order ansatz in QED-historical §1.1.

As a field operator

After quantization (whether canonical or in the Wigner-construction sense), $ϕ (x)$ is promoted to an operator-valued distribution $\hat{ϕ} (x)$ on Fock space, still satisfying $(□ + m^{2}) \hat{ϕ} = 0$ as an operator equation. Its mode expansion and ladder structure are given below in § Fock Space; its propagator $⟨ 0∣ T \hat{ϕ} (x) \hat{ϕ} (y) ∣0 ⟩$ enters Feynman-rule calculations; the Klein–Gordon operator $(□ + m^{2})$ reappears as the external-leg amputation operator in § LSZ Reduction Formula.

The free Dirac field is the spin- $\frac{1}{2}$ analogue (postulating $L_{ψ} = \overset{ˉ}{ψ} (i γ^{μ} \partial_{μ} - m) ψ$ instead, derived in QED/historical.md § 1.4); the free photon field is the massless spin-1 analogue, from $L_{EM}$ in QED/historical.md § 0.2. Each is built by the same Route-B recipe: write down the simplest Lorentz-scalar Lagrangian in the appropriate field, derive the EOM, quantize.

Canonical Structure

The conjugate momentum to a field $ϕ (x)$ is

$π (x) = \frac{\partial L}{\partial ( \partial _{0} ϕ ( x ))} .$

The classical Hamiltonian density is $H = π \partial_{0} ϕ - L$ .

Operator-Valued Distributions

In QFT, fields cannot be ordinary operator-valued functions of $x$ — products like $ϕ (x)^{2}$ are too singular. Instead, $ϕ (x)$ is an operator-valued tempered distribution: it is well-defined only after smearing against a Schwartz test function $f \in S (M^{4})$ ,

$ϕ (f) = \int d^{4} x f (x) ϕ (x),$

yielding an (unbounded) operator on a dense domain $D \subset H$ .

Fock Space

For a free field of mass $m$ and spin $s$ , the Hilbert space is the Fock space

$F (H_{1}) = n = 0 ⨁ \infty H_{1}^{\otimes_{s} n} (bosonic, symmetrized),$

or with antisymmetrization $\otimes_{a}$ for fermions. Here $H_{1}$ is the one-particle Hilbert space (an irreducible Wigner representation). Fock space is built from a vacuum $∣0 ⟩$ via creation ( $a^{†}$ ) and annihilation ( $a$ ) operators satisfying canonical (anti)commutation relations:

$[a_{p}, a_{q}^{†}] = (2 π)^{3} δ^{3} (p - q) (bosons),$ ${a_{p}, a_{q}^{†}} = (2 π)^{3} δ^{3} (p - q) (fermions) .$

Terminology — ladder operators. The pair $(a, a^{†})$ is collectively called ladder operators (or raising/lowering operators): $a^{†}$ raises the particle number by one ( $H_{n} \to H_{n + 1}$ ), $a$ lowers it ( $H_{n} \to H_{n - 1}$ , with $a ∣0 ⟩ = 0$ ). The name comes from the harmonic oscillator in QM (see QM/heisenberg-picture.md), where the same $\overset{a}{^}, \overset{a}{^}^{†}$ algebra moves between energy eigenstates $∣ n ⟩$ on the "ladder" of equally spaced levels. The QFT usage is the same algebra applied per Fourier mode: each momentum mode $p$ of a free field is an independent harmonic oscillator, and $a_{p}^{†}, a_{p}$ are its ladder operators. Particle number is the count of excitations across all modes. In QFT-specific contexts "creation/annihilation operators" is more common than "ladder operators", but the terms are interchangeable.

A free scalar field admits the mode expansion

$ϕ (x) = \int \frac{d ^{3} p}{( 2 π ) ^{3}} \frac{1}{2 ω _{p}} (a_{p} e^{- i p \cdot x} + a_{p}^{†} e^{+ i p \cdot x}), ω_{p} = p^{2} + m^{2} .$

In an interacting theory, no such Fock space exists for the full interacting field (this is the content of Haag's theorem — see Remarks), but Fock spaces remain the appropriate description for asymptotic in/out states.

Vacuum

The vacuum $∣0 ⟩$ is the lowest-energy, Poincaré-invariant state. For a free theory it is annihilated by all $a_{p}$ . In an interacting theory the physical vacuum differs from the free (Fock) vacuum and is in general unitarily inequivalent to it.

States vs. Fields: Why QFT Looks "Operator-Heavy"

A reader coming from non-relativistic QM may notice that QFT seems to focus almost entirely on operators (the field operators $ϕ (x)$ , $ψ (x)$ , $A_{μ} (x)$ and their correlators) while saying very little about states. This is real — and worth being explicit about.

	Non-relativistic QM	QFT
Primary object	State $∥ ψ (t)⟩$ (or wavefunction $ψ (x, t)$ )	Field operators $\hat{ϕ} (x)$
Time evolution	State evolves (Schrödinger picture)	Operators evolve (Heisenberg picture)
What you compute	$⟨ ψ ∥ \hat{A} ∥ ψ ⟩$ , transition amplitudes	$⟨ 0∥ T \hat{ϕ} (x_{1}) \dots \hat{ϕ} (x_{n}) ∥0 ⟩$ , then S-matrix elements

The state is still primary in principle, but in practice it is usually fixed implicitly to be one of:

the vacuum $∣0 ⟩$ — for vacuum correlation functions and most perturbative computations,
an asymptotic Fock state $∣ p_{1}, s_{1}; p_{2}, s_{2}; \dots ⟩$ — for S-matrix calculations,
a coherent state of a bosonic field — for connecting to classical fields and for IR problems,
a bound-state wavefunction (positronium, hydrogen) — handled non-perturbatively (Bethe–Salpeter), and rarely written down explicitly,
a density matrix — for thermal QFT, decoherence, and open systems.

Several reasons drive the operator-centric emphasis:

Heisenberg picture is manifestly Lorentz-covariant. Putting all spacetime dependence into operators $ϕ (x) = ϕ (x, t)$ avoids singling out the time slice that the Schrödinger picture requires.
The state space is Fock space, not $L^{2} (R^{3})$ . A multi-particle state is a function over arbitrary numbers of particles with arbitrary momenta; there is no useful single "wavefunction in position space" to write down.
Most observables of interest are scattering amplitudes. Prepare an asymptotic in-state, evolve, take the overlap with an asymptotic out-state — the details of the state during the interaction never appear; only $⟨ out ∣ \hat{S} ∣ in ⟩$ does, computed from operator correlators via LSZ.
Haag's theorem (see Remarks) says the interacting vacuum and Fock states are unitarily inequivalent to the free ones — there is no concrete Hilbert space on which to write down "the interacting state." So one works with operator correlators and asymptotic states only.

Warning: the Field Operator $ψ (x)$ Is Not a Wavefunction

This is a major terminological collision. In some derivations of QED (notably the historical / Dirac-equation route), $ψ (x)$ starts out as a single-particle relativistic wavefunction — a state. After second quantization, the symbol $ψ (x)$ is reused to denote a field operator on Fock space. From that point on:

$ψ (x)$ is an operator, not a state.
$ψ (x) ∣0 ⟩$ is the (improper) state of one particle localized at $x$ .
The matrix element $⟨ 0∣ ψ (x) ∣ p, s ⟩ = u (p, s) e^{- i p \cdot x}$ is a Dirac wavefunction, but it's the matrix element of the field operator between two particular states — not the state itself.

Conflating these two meanings of $ψ (x)$ is one of the most common sources of confusion when crossing from QM to QFT.

Time-Ordering Operator

The time-ordering operator $T$ is the instruction to permute a product of operators so that those with larger time arguments stand to the left:

$T [A_{1} (t_{1}) A_{2} (t_{2}) \dots A_{n} (t_{n})] = A_{σ (1)} (t_{σ (1)}) A_{σ (2)} (t_{σ (2)}) \dots A_{σ (n)} (t_{σ (n)}),$

where $σ$ is the permutation that gives $t_{σ (1)} > t_{σ (2)} > \dots > t_{σ (n)}$ . For the two-operator case:

$T [A (t_{1}) B (t_{2})] = θ (t_{1} - t_{2}) A (t_{1}) B (t_{2}) + θ (t_{2} - t_{1}) B (t_{2}) A (t_{1}) .$

For fermionic operators an extra sign $(- 1)^{σ}$ from the permutation is included (so $T [ψ (t_{1}) \overset{ˉ}{ψ} (t_{2})] = - T [\overset{ˉ}{ψ} (t_{2}) ψ (t_{1})]$ ).

$T$ is not an operator on Fock space — it is a notational rule for handling non-commuting operators at different times. It plays a central role in:

The Dyson series for the S-matrix, $S = T exp (- i \int H_{int} d t)$ (see QED/historical.md §0.6).
Time-ordered correlators / Green's functions (next subsection), which are the inputs to the LSZ reduction formula.
Path-integral derivations of operator correlators (see QM/path-integral.md §1.6 for the full discussion in the simpler QM setting).

Notation collision warning. The same symbol $T$ is used in QFT for the transition operator $T = - i (S - 1)$ — a genuine operator on Fock space — that appears in the splitting $S = 1 + i T$ . The two meanings are entirely unrelated; context disambiguates: time-ordering $T$ always sits in front of a product of operators ( $T [\dots]$ or $T ϕ_{1} \dots ϕ_{n}$ ); transition $T$ sits between a bra and a ket as $⟨ f ∣ T ∣ i ⟩$ . See QED/historical.md §0.6 for the corresponding callout in context.

Correlation Functions

The fundamental observables of QFT are vacuum expectation values of products of fields:

Wightman functions: $W_{n} (x_{1}, \dots, x_{n}) = ⟨ 0∣ ϕ (x_{1}) \dots ϕ (x_{n}) ∣0 ⟩$ .
Time-ordered (Green's) functions: $G_{n} (x_{1}, \dots, x_{n}) = ⟨ 0∣ T ϕ (x_{1}) \dots ϕ (x_{n}) ∣0 ⟩$ , where $T$ orders fields by decreasing $x^{0}$ (with a sign for fermion exchanges).

Time-ordered correlators are what enter the LSZ reduction formula (next subsection) to compute $S$ -matrix elements.

LSZ Reduction Formula

The Lehmann–Symanzik–Zimmermann (LSZ) reduction formula is the bridge between time-ordered correlators of local fields (what perturbation theory and Feynman rules naturally compute) and S-matrix elements between asymptotic states (what observables require).

Setup. Pick any local field $ϕ (x)$ with non-zero matrix element to a one-particle state of the species of interest:

$⟨ 0∣ ϕ (x) ∣ p ⟩ \neq = 0.$

Such a $ϕ$ is called an interpolating field for that species. Different choices of $ϕ$ give the same on-shell $S$ -matrix elements (this is the LSZ equivalence of field redefinitions).

The formula for $n + m$ external particles (in the simplest scalar case; spin and species labels suppressed):

$⟨ p_{1}, \dots, p_{n}; out ∣ q_{1}, \dots, q_{m}; in ⟩ = i = 1 \prod n [i \int d^{4} x_{i} e^{i p_{i} \cdot x_{i}} (□_{x_{i}} + m^{2})] j = 1 \prod m [i \int d^{4} y_{j} e^{- i q_{j} \cdot y_{j}} (□_{y_{j}} + m^{2})] ⟨ 0∣ T ϕ (x_{1}) \dots ϕ (x_{n}) ϕ (y_{1}) \dots ϕ (y_{m}) ∣0 ⟩ .$

In words: each external leg contributes a Klein–Gordon operator $(□ + m^{2})$ (which on the mass shell extracts the residue of the single-particle pole in momentum space) plus a Fourier factor; everything else is the time-ordered $(n + m)$ -point correlator.

What it actually says. When the external momenta are on-shell ( $p_{i}^{2} = m^{2}$ , $p_{i}^{0} > 0$ ), the time-ordered correlator $⟨ 0∣ T ϕ \dots ϕ ∣0 ⟩$ has poles in each external momentum at $p^{2} = m^{2}$ from the propagation of single-particle intermediate states (the Källén–Lehmann spectral representation). The LSZ recipe extracts the residue at all those poles simultaneously and identifies it with the on-shell $S$ -matrix element.

Why this is significant.

No Hamiltonian required. LSZ takes correlators as input. Correlators can be computed from a Lagrangian via path integrals, but they could equally come from lattice simulations, the conformal bootstrap, integrability, or any other source. So LSZ provides an $H$ -free route to $S$ -matrix elements (compare the Møller-operator construction in foundations-modern.md §2.0.1, which does require $H = H_{0} + H_{int}$ ).
Field-redefinition equivalence. Replacing $ϕ \to ϕ + g [ϕ]$ for any local function $g$ leaves on-shell $S$ -matrix elements invariant. This is why effective field theories with different operator bases can describe identical physics, and why the "fundamental field" choice is largely conventional.
Bridges Feynman rules to observables. Step 6 of any QFT calculation (in QED, QED/historical.md §5.2, and elsewhere) implicitly uses LSZ to convert amputated time-ordered diagrams into $S$ -matrix elements: the $(□ + m^{2})$ operators on the external legs cancel against the external propagators of an amputated diagram, leaving exactly $i M$ (up to wavefunction renormalization factors $Z$ ).

For the full derivation see Peskin–Schroeder Ch. 7.2 or Weinberg Vol. 1 §10.3. For its role in pinning down asymptotic states without invoking $H$ , see foundations-modern.md §2.0.1.

S-Matrix and Cross Sections

The S-matrix $S$ maps asymptotic in-states (free particles in the far past) to asymptotic out-states (free particles in the far future):

$⟨ f, out ∣ i, in ⟩ = ⟨ f ∣ S ∣ i ⟩ .$

Writing $S = 1 + i T$ with $⟨ f ∣ T ∣ i ⟩ = (2 π)^{4} δ^{4} (p_{f} - p_{i}) M_{f i}$ , the invariant amplitude $M$ determines physical observables: differential cross sections $d σ$ , decay rates $d Γ$ , etc. The full master formulas connecting $M$ to measurable rates and the necessary kinematic ingredients (flux factor, Lorentz-invariant phase space, units, the optical theorem) are collected separately in Cross Sections and Decay Rates; the broader observable inventory is in Observables.

Symmetries and Noether Currents

A continuous symmetry of the action implies, by Noether's theorem, the existence of a conserved current $j^{μ} (x)$ with $\partial_{μ} j^{μ} = 0$ and a conserved charge $Q = \int d^{3} x j^{0}$ . In the quantum theory, $Q$ generates the symmetry on operators via $[Q, ϕ] = i δ ϕ$ .

Symmetries are classified as:

Global (parameter independent of $x$ ) vs. local / gauge (parameter depends on $x$ ).
Internal (acting on field indices) vs. spacetime (Poincaré, conformal, ...).
Continuous (Lie group) vs. discrete ( $C$ , $P$ , $T$ ).

Gauge Fields

A gauge theory has a local internal symmetry $G$ . To make the Lagrangian invariant one introduces a gauge field $A_{μ}^{a}$ valued in the Lie algebra of $G$ , and a covariant derivative $D_{μ} = \partial_{μ} - i g A_{μ}^{a} T^{a}$ , where $T^{a}$ are generators of $G$ in the relevant representation. The field strength

$F_{μν}^{a} = \partial_{μ} A_{ν}^{a} - \partial_{ν} A_{μ}^{a} + g f^{ab c} A_{μ}^{b} A_{ν}^{c}$

(with structure constants $f^{ab c}$ ) generalizes the electromagnetic $F_{μν}$ . Quantization requires gauge fixing to remove redundant degrees of freedom.

Regularization and Renormalization

Naive computations in interacting QFT yield divergent loop integrals. Regularization (cutoff, dimensional, Pauli–Villars, lattice) parametrizes the divergences; renormalization absorbs them into a redefinition of a finite number of parameters (masses, couplings, field normalizations). A theory is renormalizable if this can be done with finitely many counterterms; otherwise it is an effective field theory valid only below some energy scale.

The renormalization group describes how renormalized parameters depend on the chosen energy scale $μ$ , governed by beta functions $β (g) = μ \partial g / \partial μ$ .

Keyboard shortcuts

youyuanwu