From Scholarpedia - Reading time: 11 minThe Mermin-Wagner theorem exemplifies the crucial influence of both the dynamical symmetry and the spatial dimensionality on thermal phase transitions in many-body systems. It says: At finite temperatures, the quantum spin-\(S\) Heisenberg model with isotropic and finite-range exchange interactions on one- or two-dimensional lattices can be neither ferro- nor antiferromagnetic.
Contents |
(Units are chosen as \(\hbar=1\ ,\) \(k_B=1\ ,\) \(g\mu_B=1\ .\))
This is a model for magnetic insulators and is specified by the Hamiltonian \[\tag{1} \begin{array}{lcl} \hat{H}_\Lambda(h) &=& - \sum_{{\mathbf x},{\mathbf y} \in \Lambda} J({\mathbf x}-{\mathbf y}) \vec{S}({\mathbf x}) \cdot \vec{S}({\mathbf y}) - \sum_{{\mathbf x} \in \Lambda} h({\mathbf x}) \hat{S}_3 ({\mathbf x}) \\ &=& \hat{H}^{(0)} + \hat{H}^{(h)} , \end{array} \]
where \(\vec{S}({\mathbf x}) = (\hat{S}_1 ({\mathbf x}),\hat{S}_2 ({\mathbf x}),\hat{S}_3 ({\mathbf x}))\) denotes 3-vectorial spin operators localized at the \(N\) lattice sites \({\mathbf x} = (x_1,x_2, \ldots, x_d)\) of a finite \(d\)-dimensional cubic box \(\Lambda \subset \mathbb{Z}^d\) with volume \(|\Lambda| = N\) (i.e. unit lattice constant) and periodic boundary conditions. The spin components \(\hat{S}_i({\mathbf x})\) obey the usual commutation relations, \[ [ \hat{S}_1 ({\mathbf x}), \hat{S}_2 ({\mathbf y})] = {\rm i} \delta_{{\mathbf x},{\mathbf y} } S_3 ({\mathbf x}) , {\rm etc.} \] and \(\vec{S}^2 = S(S+1)I_{(2S+1)}\ ,\) \(I_{(2S+1)}\) the identity in the spin space, \(S=\frac{1}{2}, 1, \frac{3}{2}, \ldots\ .\)
The exchange couplings \(J({\mathbf x}-{\mathbf y})=J({\mathbf y}-{\mathbf x})\ ,\) \(J({\mathbf 0})=0\) and \(J({\mathbf x}-{\mathbf y})\geq 0 (\leq 0)\) for (anti-)ferromagnets are assumed for the moment to have finite range.
The operators \(\hat{S}_i({\mathbf x})\ ,\) \(i=1,2,3\ ,\) act in a \((2S+1)\)-dimensional Hilbert space \({\mathcal H}({\mathbf x})\ ;\) the total \((2S+1)^N\)-dimensional Hilbert space is given by \({\mathcal H}_\Lambda = \otimes_{{\mathbf x} \in \Lambda} {\mathcal H}({\mathbf x})\ .\)
Let \({\mathcal A}_\Lambda^{Spin} = {\mathcal A}_\Lambda\) denote the polynomial algebra of spin operators in \({\mathcal H}_\Lambda\ .\) The \(SU(2)\)-symmetry of the Hamiltonian \(\hat{H}^{(0)}\) may be described by one-parameter groups of unitary automorphisms \(\gamma_n(\phi) : {\mathcal A}_\Lambda \rightarrow {\mathcal A}_\Lambda\ ,\) corresponding to simultaneous global rotations \(0 \leq \phi \leq 2\pi\) around fixed but arbitrary axes \(\vec{n}\ ,\) \(\vec{n}^2 =1\ ,\) of all \(N\) spins in the box \(\Lambda\ .\) In particular,
\[ \gamma_n(\phi) \hat{P} = \exp (-{\rm i} \phi \hat{G}_n) \hat{P} \exp ({\rm i} \phi \hat{G}_n), \] with generators \(\hat{G}_n = \sum_{{\mathbf x} \in \Lambda} \vec{n}\cdot\vec{S}({\mathbf x})\ ,\) and \(\hat{P} \in {\mathcal A}_\Lambda\ .\)
The continuous symmetry \(\gamma_n(\phi) \hat{H}^{(0)} = \hat{H}^{(0)}\) is explicitly broken by the external field \(\vec{h}({\mathbf x})=(0,0,h({\mathbf x}))\ ,\) \(h({\mathbf x})= h \exp ({\mathrm i} {\mathbf q}\cdot {\mathbf x})\ .\) In the case of a ferromagnet, we take \({\mathbf q} = 0\ .\) To deal with conventional two-sublattice \((\Lambda_\pm)\) Neel antiferromagnetism we choose q such that \(e^{{\rm i} {\mathbf q} {\mathbf x} } = +1 (-1)\) when \({\mathbf x} \in \Lambda_+ (\Lambda_-)\ .\)
The external field \(h({\mathbf x})\) induces a (sublattice) magnetisation per site as expressed by the thermal average (\(\beta^{-1} := T\)): \[\tag{2} \begin{array}{rcl} m_{{\mathbf q} }(T,h,N) &=& \frac{1}{N Z} {\rm Tr} \left(e^{-\beta \hat{H}_\Lambda(h)} \sum_{{\mathbf x} \in \Lambda} \hat{S}_3 ({\mathbf x}) e^{{\rm i} {\mathbf q}\cdot {\mathbf x} } \right) \\ &=& \frac{1}{N} \left\langle \hat{M}_{{\mathbf q} } \right\rangle =: \frac{1}{N} \omega( \hat{M}_{{\mathbf q} }), \\ Z &=& Z (T,h,N) = {\rm Tr} \exp (-\beta \hat{H}_\Lambda (h)) . \end{array} \]
Definition: The Heisenberg model describes (anti-)ferromagnetism at \(T>0\) if it exhibits a finite spontaneous (sublattice) magnetisation in the thermodynamic limit \(|\Lambda|=N \rightarrow \infty\ :\) \[ \begin{array}{rcl} {\rm thermodynamic\ limit:} & & m_{{\mathbf q} } (T,h) := \lim_{N\rightarrow\infty} m_{{\mathbf q} } (T,h,N) \\ {\rm spontaneous\ magnetisation} & & \sigma_{{\mathbf q} } := \lim_{h\rightarrow 0} m_{{\mathbf q} } (T,h) \end{array} \]
The absence of ferromagnetism in the 2d-Heisenberg model has been already anticipated by Felix Bloch (Bloch 1930), with physically plausible arguments, based on elementary spin wave theory. In the isotropic Heisenberg ferromagnet at temperatures \(T\ll T_{{\textstyle Curie} }\) the energetically low-lying thermal excitations may be described approximately as an ideal Bose gas of magnons (quantum spin waves) in the periodic box \(\Lambda\subset \mathbb{Z}^d\ .\) A magnon with wave vector \({\mathbf k}\) has the energy \(\epsilon_f ({\mathbf k}) \simeq \eta k^2\ ,\) \(\eta>0\ ,\) for small values of \(k\ .\) These thermal excitations cause a \(T\)-dependent reduction \(\Delta \sigma_f(T) = \sigma_f (T=0) - \sigma_f (T)\) of the average spontaneous magnetisation from its saturation value. In the spin wave approximation one finds \[ \Delta \sigma_f(T) \propto \int_{k\geq 0} \frac{k^{d-1} {\rm d} k}{\exp(\beta \epsilon_f(k))-1} . \] For small values of \(k\ ,\) the integrand goes as \(k^{d-3}\ ;\) therefore the integral diverges if \(d\leq 2\ .\) This result indicates a breakdown of the spin wave approximation and has been interpreted to mean that any spontaneous magnetisation is ultimately wiped out by the thermal excitation of too many magnons.
A similar argument has been put forward (Auerbach 1994) to exclude isotropic Heisenberg antiferromagnetism in \(d\leq 2\ .\) In this case, spin wave modes with small \(k\) have an energy \(\epsilon_{af} = \gamma k\ ,\) \(\gamma>0\ .\) Their thermal excitations are found in linear order of spin wave theory to reduce the sublattice magnetisation according to \[\tag{3} \Delta \sigma_{af} (T) \propto \int_{k\geq 0} \frac{k^{d-1} a(k) {\rm d} k}{\exp(\beta \epsilon_{af}(k))-1}. \]
The factor \(a(k) = const./\epsilon_{af}(k)\) is due to quantum zero-point motions, which originate from the fact that the alternating magnetisation is not a conserved quantity, in contrast to the ferromagnetic case, where \(\sum_{{\mathbf x} \in \Lambda} \hat{S}_i({\mathbf x})\ ,\) \(\alpha=1,2,3\ ,\) commutes with the Hamiltonian \(\hat{H}_\Lambda^{(0)}\ .\) Since one finds \(a(k) \propto k^{-1}\) for small values of \(k\ ,\) the integral (3) displays the same singularity as in the ferromagnetic case and leads to the same conclusion that \( \Delta \sigma_{af} (T) =0\) for \(T>0\) and \(d\leq 2\ .\)
The Mermin-Wagner theorem validates these intuitive spin wave arguments with rigorous bounds for the magnetisation.
Let \(m_{{\mathbf q} } (T,h)\) be the magnetisation defined in (2), where \(\hat{H}_\Lambda(h)\ ,\) with \(\sum_{{\mathbf R} } | J({\mathbf R})| {\mathbf R}^2 =: \overline{J} <\infty\ ,\) is given by (1).
Then we have \[\tag{4} \sigma_{{\mathbf q} } (T) = \lim_{h\rightarrow 0} m_{{\mathbf q} } (T,h) = 0 \]
if \(d \leq 2\) and \(T>0\) (Mermin and Wagner 1966).
In order to establish the proposition (4), we try to probe the stability of the spontaneous magnetisation against thermal fluctuations arising from energetically low-lying elementary excitations. In particular, our aim is to find an upper bound \(\delta(T,h)\geq |m_{{\mathbf q} }(T,h))|\) such that \(\delta(T,h) \rightarrow 0\) when \(h \rightarrow 0\ ,\) if \(d \leq 2\ .\)
To carry out this plan, we adopt an idea due to P. Hohenberg (1967) and utilize the Bogoliubov inequality, \[\tag{5} \frac{\beta}{2} \omega (\hat{A}^\dagger \hat{A} + \hat{A} \hat{A}^\dagger) \geq \frac{|\omega([\hat{C},\hat{A}^\dagger])|^2}{\omega([\hat{C}[\hat{H}_\Lambda(h),\hat{C}^\dagger]])} \]
for thermal averages \(\omega(\hat{Q})\ ,\) \(\hat{Q} \in {\mathcal A}_\Lambda\ ,\) as introduced in (2). (For convenience, we include below an elementary, model-independent derivation of this inequality.)
The crucial task is to choose suitable operators \(\hat{A}\) and \(\hat{C}\ .\) For this purpose, we consider locally twisted spin configurations, generated by \[ \hat{G}_n := \sum_{{\mathbf x} } f({\mathbf x}) \vec{n} \cdot \vec{S}({\mathbf x}), \] \[\tag{6} \left. i \frac{d}{d\phi} \gamma_n(\phi, f) \hat{Q} \right|_{\phi=0} = [ \hat{G}_n(f), \hat{Q}] =: i \dot{\gamma}_n(f) \hat{Q}, \quad \hat{Q}\in {\mathcal A}_\Lambda , \]
with smoothly varying mappings \(f : \mathbb{Z}^d \rightarrow \mathbb{C}\) (here and subsequently, sums over \({\mathbf x}\) are understood to run over \(\Lambda\)).
In particular,
\[\tag{7} \dot{\gamma}_n(f) ( (\dot{\gamma}_n(f) \hat{H}_\Lambda)^\dagger ) = [\hat{G}_n(f) , [\hat{H}_\Lambda,\hat{G}^\dagger_n(f)]] . \]
With the spin wave argument in mind, we expect that such thermally fluctuating twists may be sufficiently violent to prevent the appearance of a spontaneous magnetisation at low spatial dimensions.
By comparing (6) and (7) with the commutators in (5), it is suggestive to choose \[ \hat{C} = \sum_{{\mathbf x} } e^{- i {\mathbf k}\cdot {\mathbf x} } \hat{S}_1 ({\mathbf x}) =: \hat{S}_1 ({\mathbf k}) , \] with \({\mathbf k} \neq 0\) an arbitrary vector from the first Brillouin zone \(\Lambda^*\ .\) By taking \[ \hat{A} = \sum_{{\mathbf x} } e^{- i ({\mathbf k}+{\mathbf q})\cdot {\mathbf x} } \hat{S}_2 ({\mathbf x}) =: \hat{S}_2 ({\mathbf k}+{\mathbf q}), \] we find immediately \[ \begin{array}{rcl} |\omega( [\hat{C},\hat{A}^\dagger]) | &=& N | m_{{\mathbf q} } (T,h,N)| , \\ \frac{1}{2} \omega (\hat{A}^\dagger \hat{A} + \hat{A} \hat{A}^\dagger) &=& \omega (\hat{S}_2^\dagger ({\mathbf k} + {\mathbf q}) \hat{S}_2 ({\mathbf k} + {\mathbf q})). \end{array} \] The evaluation of \(\omega ( [ \hat{C}, [ \hat{H}_\Lambda, \hat{C}^\dagger ]])\) is a bit more tedious but elementary. By using translational symmetry and (2) we obtain
\[ D_{{\mathbf k} } = 2 \sum_{{\mathbf x}, {\mathbf y} } J ({\mathbf x} - {\mathbf y}) (1 - \cos ({\mathbf k} \cdot ({\mathbf x} - {\mathbf y}))) \omega (\hat{S}_2 ({\mathbf x}) \hat{S}_2 ({\mathbf y}) + \hat{S}_3 ({\mathbf x}) \hat{S}_3 ({\mathbf y})) + Nhm_{{\mathbf q} } . \] We note that \(D_{{\mathbf k} } > 0\) by construction.
The inequality (5) now takes the form \[\tag{8} \frac{\beta}{N^2} \omega (| \hat{S}_2^\dagger ({\mathbf k}+{\mathbf q}) \hat{S}_2 ({\mathbf k}+{\mathbf q}) |) \geq \frac{|m_{{\mathbf q} }|^2}{D_{{\mathbf k} } } . \]
With \(1 - \cos {\mathbf k}\cdot {\mathbf R} \leq \frac{1}{2} {\mathbf k}^2 {\mathbf R}^2\ ,\) the Schwarz inequality \(|\omega( \hat{S}_i ({\mathbf x})\hat{S}_i ({\mathbf y}) | \leq ( \omega( \hat{S}^2_i ({\mathbf x})) \omega( \hat{S}^2_i ({\mathbf y})) )^{1/2}\ ,\) and with the following upper bound for \(D_{{\mathbf k} }\ ,\) \[ D_{{\mathbf k} } \leq N \sum_{{\mathbf x} } | J({\mathbf x})| {\mathbf x}^2 {\mathbf k}^2 ( \omega ( \hat{S}^2_2 ({\mathbf x})) + \omega ( \hat{S}^2_3 ({\mathbf x}))) + N |hm_{{\mathbf q} }|, \] the inequality (8) turns into \[\tag{9} \frac{\beta}{N^2} \omega (\hat{S}_2^\dagger ({\mathbf k} + {\mathbf q}) \hat{S}_2 ({\mathbf k} + {\mathbf q})) \geq \frac{1}{N} \frac{|m_{{\mathbf q} }|^2}{\rho k^2 + | h m_{{\mathbf q} }|} , \]
where \(\rho := \overline{J} S(S+1) > 0\ .\) Since (9) holds for any \({\mathbf k} \in \Lambda^*\ ,\) we may sum both sides over \({\mathbf k}\ ,\) \[ \frac{\beta}{N^2} \sum_{{\mathbf k} \in \Lambda^*} \omega (\hat{S}_2^\dagger ({\mathbf k} + {\mathbf q}) \hat{S}_2 ({\mathbf k} + {\mathbf q})) = \frac{\beta}{N} \sum_{{\mathbf x} } \omega (\hat{S}^2 _2 ({\mathbf x})) , \] \[ \frac{\beta}{N} \sum_{{\mathbf x} } \omega (\hat{S}^2 _2 ({\mathbf x})) \geq \frac{|m_{{\mathbf q} }|^2}{N} \sum_{{\mathbf k} \in \Lambda^*} \frac{1}{\rho k^2 + | h m_{{\mathbf q} }|} . \] After replacing the left-hand side by the upper bound \(\beta S(S+1)\) and going to the TD limit, we arrive at \[\tag{10} \frac{1}{T} S(S+1) \geq |m_{{\mathbf q} }(T,h)|^2 \frac{\Omega_d d}{(2\pi)^d} \int_0^{k_0} dk \frac{k^{d-1}}{\rho k^2 + | h m_{{\mathbf q} }|} . \]
We have strengthened the inequality by restricting the \(k\)-integration to run over the volume \(\Omega_d k_0^{d}\) of a ball with radius \(0 < k_0 < \pi\ .\) The integration in (10) is elementary and yields for \(d=1\ :\) \[\tag{11} S(S+1) \geq \frac{T|m_{\mathbf q} (T,h) |^2 }{\pi \sqrt{\rho | h m_{{\mathbf q} }|} } \arctan \left( \sqrt{\frac{ | h m_{{\mathbf q} }|}{\rho k_0^2} } \right) \]
and for \(d=2\ :\) \[\tag{12} S(S+1) \geq \frac{T|m_{\mathbf q} (T,h) |^2 }{4\pi\rho} \ln \left( 1 + \frac{\rho k_0^2}{ | h m_{{\mathbf q} }| }\right) . \]
These results imply \(|m_{\mathbf q} (T,h) | \rightarrow 0\) when \(h \rightarrow 0\) for all finite values of \(T, \overline{J}, S\ .\)
Explicitly, (11) and (12) yield upper bounds on \(|m_{\mathbf q} (T,h) |\ ,\) which hold asymptotically when \(h\rightarrow 0\ :\) \[ |m_{\mathbf q} (T,h) | \leq \left\{ \begin{array}{ll} c_1 \overline{J}^{1/3} S(S+1) | h T^{-2} |^{1/3}, & d=1 \\ c_2 \overline{J}^{1/2} S(S+1) | T \ln |h| |^{-1/2}, & d=2 \end{array} \right. \] with numerical constants \(c_{1,2}\ .\)
Consequently, \(|\sigma_{\mathbf q} (T) | =0\) for \(T>0\) in \(d\leq 2\ .\)
with an SO(3) invariant coupling of classical unit vectors \(\vec{n}\ .\) Formally, we obtain (13) via the following scaling procedure where \[\tag{14} \lambda(S) := \sqrt{S(S+1)}, \quad \frac{1}{\lambda(S)} \vec{S} =: \vec{n}, \quad \lambda^2 (S) J =: J^{cl}, \quad \lambda(S) h =: h^{cl} .\] In the limit \(S\rightarrow\infty\) with \(J^{cl}\) and \(h^{cl}\) fixed, the 3-vector components \(n_i\ ,\) \(i=1,2,3\ ,\) turn into commuting degrees of freedom. Also the inequality (10) survives the limit with the replacements (14) together with \(m_{{\mathbf q} } / \lambda(S) =: m^{cl}_{{\mathbf q} } (T, h^{cl})\ .\) A variety of extensions and further applications of the theorem (4) are reviewed in Gelfert and Nolting (2001).
Be \({\mathcal H}\) a Hilbert space with a (general) self-adjoint Hamiltonian \(H\) such that \(\exp (-\beta H)\) is in the trace class. At the same time, we consider \({\mathcal B}({\mathcal H}) \ni A, B, C, \ldots\ ,\) the space of bounded linear operators on \({\mathcal H}\ .\)
We can then define thermal equilibrium states, which we denote by \(\omega_\beta = \omega\ ,\) as linear functionals on the (\(C^*\)-) algebra \({\mathcal A}\) of the operators in \({\mathcal B}({\mathcal H})\ :\)
\[\tag{15} \omega : {\mathcal A} \rightarrow \mathbb{C}, \quad {\mathcal A} \mapsto \omega({\mathcal A}) . \]
In particular, \(\omega_\beta(A) = {\rm Tr}(\rho_\beta A)\ ,\) \(\rho_\beta = Z^{-1} e^{-\beta H}\ ,\) \({\rm Tr} \rho_\beta = 1\ .\) We set \(\beta H =: K\) and define \[ A_\tau := e^{\tau K} A e^{-\tau K}, \quad 0 \leq \tau \leq 1. \] Note that \[\tag{16} (A^\dagger)_\tau = (A_{-\tau})^\dagger . \]
The thermal states (15) fulfill the fundamental Kubo-Martin-Schwinger (KMS) relation (Bratteli and Robinson 1981) \[\tag{17} \omega(A_\tau B) = \omega (A B_{-\tau}) , \]
which follows from the cyclic invariance of the trace operation.
At last, we introduce the sesquilinear Mori product that serves as an inner product on the operator algebra \(A\ :\) \[\tag{18} (A| B) := \int_0^1 {\rm d} \tau \, \omega ( (A^\dagger)_\tau B) = (B | A)^* = (B^\dagger | A^\dagger). \]
The following relations are elementary consequences of the Eqs. (16), (17) and (18):
First, \[\tag{19} \frac{{\rm d} }{{\rm d} \tau} \omega ( (A^\dagger)_\tau C) = \omega ((A^\dagger)_\tau [C, K]), \]
and \[ (A | [C, K]) = \int_0^1 {\rm d}\tau \frac{{\rm d} }{{\rm d} \tau} \omega ((A^\dagger)_\tau C) = \omega ([C, A^\dagger]). \] Second, \[ \frac{{\rm d}^2}{{\rm d} \tau^2} \omega ( (A^\dagger)_\tau A) = \omega ((Q_{-\tau/2})^\dagger Q_{-\tau/2}) \geq 0, \] where \(Q=[A,K]\ .\) Hence \(\omega ((A^\dagger)_\tau A) =: \Omega_A (\tau)\) is convex in \(\tau\ .\)
Therefore \[\tag{20} 0 \leq (A | A) = \int_0^1 {\rm d}\tau\, \Omega_A(\tau) \leq \frac{1}{2} (\Omega_A(1) + \Omega_A(0)) = \frac{1}{2} \omega (A^\dagger A + AA^\dagger) . \]
The Mori product satisfies all the properties of an inner product required to establish the Schwarz inequality: \[\tag{21} | ( A| B) |^2 \leq (A|A) (B| B) . \]
After setting \(B = [C, K]\) and using Eqs. (19) and (20) we finally arrive at the Bogoliubov inequality: \[\tag{22} | \omega([C,A^\dagger])|^2 \leq \frac{\beta}{2} \omega(A^\dagger A + AA^\dagger) \omega([C, [H,C^\dagger]]) . \]
The above choice for \(B\) is made in view of later applications.
The original derivation of the Bogoliubov inequality can be found in Bogoliubov 1971. The inequality was used for the first time by P. Hohenberg (Hohenberg 1967) to rule out the existence of long-range order in translationally invariant Bose and Fermi systems at \(d\leq 2\ ,\) \(T>0\ .\)
The mathematical status of Bogoliubov's inequality and its role as a tool in statistical physics is documented, for instance, in Bratteli and Robinson 1981 and Simon 1993.
Internal references
(pending)