Klein–Gordon equation

Relativistic wave equation in quantum mechanics From Wikipedia, the free encyclopedia

In particle physics, the Klein–Gordon equation is a relativistic wave equation for spinless particles. It was discovered 1926 as the relativistic generalization of the Schrödinger equation, developed independently by numerous authors, among them Oskar Klein and Walter Gordon after whom it is commonly named. Within relativistic quantum mechanics, it suffers from numerous conceptual problems that are only resolved in quantum field theory, where the equation describes the dynamics of spin-0 fields. Mathematically, it is a linear second-order hyperbolic partial differential equation that is manifestly Lorentz covariant and can be viewed as the wave equation form of the relativistic energy–momentum relation. It plays a fundamental role in many areas of modern physics, such as quantum field theory, particle physics, and cosmology.

History

Discovery

The Klein–Gordon equation was discovered independently in the mid-1920s by numerous physicists. As such, Wolfgang Pauli famously described it as "the equation with many fathers".[1]:356 It is sometimes referred to as Schrödinger's relativistic equation or the Klein–Gordon–Fock equation.[2]:679

The first to discover the equation in December 1925, but not publish it, was Erwin Schrödinger.[2]:679 His derivation was motivated by de Broglie's theory for matter waves, with Schrödinger attempting to find a wave equation describing their evolution.[3] He applied the Klein–Gordon equation to the hydrogen atom where he calculated the fine structure of its spectrum. Finding that this did not match the experimental results, it demotivated him from investigating the equation further. As a result, he only published the non-relativistic limit of the equation in early 1926, the Schrödinger equation. It was only in his fourth paper in July 1926 that he published the full Klein–Gordon equation.[4]

During the 1920s Oskar Klein was developing his own theory of waves, not dissimilar to de Broglie's theory.[3] In particular, he became convinced that his wave theory of atoms could be connected to a unification of Maxwellian electrodynamics and Einsteinian general relativity. This led him to the idea of unifying them in a five-dimensional theory of relativity, where he interpreted the extra dimension to not be a real physical direction, rather a formal internal coordinate encoding the electric charge.[5]:49 From this theory he derived the Klein–Gordon equation, but did not assign much importance to it as it only formed a small part of his April 1926 publication.[6] This was the first time that the equation appeared in print.[3] Klein later claimed to have discovered the equation originally in summer 1925, before Schrödinger, but there is no direct primary evidence for this.

After Schrödinger’s original publication of the non-relativistic equation, it was a straightforward matter to generalize the equation to its relativistic form. In this way Pauli got the relativistic equation in April 1926 but did not publish it.[7][1]:315 He lost confidence in the equation after he did not manage to use it to establish the equivalence between matrix mechanics and wave mechanics, like he could for the Schrödinger equation.[3]

Meanwhile, unaware of Kleins paper, Vladimir Fock likewise derived the equation and published it in June.[8] He focused on discussing the wave equation to study the Zeeman effect and Stark effect. He was also the first to publish the resulting fine structure for hydrogen using the equation, found earlier by Schrödinger. Shortly afterwards, Carl Eckart also calculated the Klein–Gordon hydrogen fine structure.[9]

Louis de Broglie likewise derived the equation and published it in July, motivated by Schrödinger’s non-relativistic formulation of his matter wave theory hypothesis.[10][11][12]:44

Walter Gordon also derived the equation and published it at the end of September,[13][14]:115 primarily focusing on applying it to the Compton effect, also deriving the current associated to the Klein–Gordon equation, a result found previously by Klein.[3]

Finally, there were three other discoverers of the Klein–Gordon equation. They were Théophile De Donder and Frans-H. van den Dungen who were treating general dynamical systems and were not explicitly focused on quantum theory at all.[15][5]:49 Their paper was published in July 1926.[16] Johann Kudar likewise published about the Klein–Gordon equation in 1926.[17]:21[18]

Consequences

The equation was used in the following years to investigate a handful of problems. For example, Victor Bursian used it to treat dispersion as a perturbation, finding only a small relativistic correction to the result found from the Schrödinger equation.[19] Apart from its use in studying dispersion and Compton scattering, the equation was not very useful for treating most physical problems, primarily due to its inability to incorporate spin.[3]:1031 More importantly, it was unappealing to the theorists as it failed to accommodate the principles of quantum mechanics as understood at the time, such as the Heisenberg uncertainty principle, Borns probabilistic interpretation, and the transformation theory of Paul Dirac and Pascual Jordan.[20]:35 As a result, the equation did not play an important role in the development of quantum mechanics.

While the equation virtually disappeared from physics after the Dirac equation was discovered in 1928, it was revived in 1934 by Pauli and Victor Weisskopf who reinterpreted it in light of the theory of field quantization developed at the time, giving it a tenable quantum interpretation as describing spin-0 particles.[21][2]:684 This allowed for the negative energy states that appeared in naive quantum mechanical interpretation to be instead reinterpreted as antiparticles.

The first spin-0 particle to be predicted was the meson in 1935 by Hideki Yukawa, which utilized a form of the equation.[22] The first mesons to be discovered were charged pions in 1947, whose kinematics are described by the Klein–Gordon equation. Many other mesons were discovered in the following decades. The first elementary spin-0 particle to be discovered was the Higgs boson in 2012.

Since relativistic scalar fields play a prominent role in modern physics, the Klein–Gordon is likewise indispensable in many areas. For example, such scalar fields are found in cosmology and the theory of inflation,[23]:154 in the description of dark matter candidates such as axions and many other Beyond the Standard Model scenarios,[24] in many areas of theoretical physics such as string theory in the form of moduli, and in the AdS/CFT correspondence.[25]

Formulation

Definition

The Klein–Gordon equation describes the time evolution of a real real scalar field , which is a field that assigns a real number to each point in spacetime. The equation is a second order hyperbolic partial differential equation given by[2]:680

Here is the Laplace operator, the speed of light, and is the reduced Planck constant. The parameter plays the role of the mass of the scalar field.[nb 1] The time-independent case of the equation is equivalent to the screened Poisson equation. A more concise expression for the equation employs natural units, where both the speed of light and the reduced Planck’s constant are set to unity . The differential operator takes the form of the d'Alembert operator , where is the raised flat metric. The Klein–Gordon equation then takes the form[26]:17

Klein–Gordon equation

The Klein–Gordon equation can also act on a complex scalar field , which is a field that assigns to each point in spacetime a complex number.[26]:18 In this case the equation can be decomposed into a pair of Klein–Gordon equations acting independently on the two degrees of freedom of the complex scalar field. These can be the real and imaginary components, or alternatively the field and its complex conjugate. This decomposition holds because a non-interacting theory of a single complex scalar field is equivalent to a theory of two decoupled real scalar fields.

In the case of a complex scalar field, electromagnetic interactions can be introduced by minimally coupling the fields through the introduction of a gauge covariant derivative

where is the gauge field and is the electric charge of the scalar field. The covariant derivative is introduced to ensure that the resulting theory is gauge invariant under gauge transformations where the gauge field and scalar field transform as

for a gauge parameter . The Klein–Gordon equation minimally coupled to electromagnetism then takes the form

This equation describes the evolution of a complex scalar fields in scalar electrodynamics.

Lagrangian formulation

In a Lagrangian formulation, the Klein–Gordon equation for a real scalar field can be obtained as the Euler–Lagrange equation of the Lagrangian[17]:21

In quantum field theory this Lagrangian describes a massive non-interacting real scalar field whose excitations are scalar bosons of mass . In the case of a complex scalar field , its Lagrangian is given by[26]:18

The Klein–Gordon equation for and is found by varying the Lagrangian with respect to and , respectively. This Lagrangian is manifestly invariant under a global symmetry, which when gauged results in scalar electrodynamics.[27]:140 This Lagrangian corresponding to the Klein–Gordon equation is useful when calculating the currents associated with the symmetries using Noether's theorem.[28]:92

Correspondence principle derivation

The equation can be derived analogously to how the Schrödinger equation is derived from the non-relativistic equation for the energy of a particle. In particular, in the non-relativistic limit, the energy of a free particle with momentum is given by

By elevating the momentum and energy to operators through the correspondence principle[14]:116–117

this gives the Schrödinger equation.[12]:368 The Klein–Gordon equation is similarly acquired by analogously replacing the energy and momentum by the corresponding operators in the relativistic energy-momentum relation[2]:680

Attempting to acquire the equation for the square root of the above expression is on the other hand problematic since time and space do not appear on an equal footing.[29]:4 Additionally, one would have square roots of differential operators, which are usually handled by Taylor expanding the square root. This leads to an infinite series of higher derivative terms, making the theory difficult to work with.[14]:118

Properties

Symmetries

The equation transforms covariantly under spacetime translations and the Lorentz group. Together, these form the Poincare group which encodes the isometries of flat spacetime. Scalar fields transform as scalars under Lorentz transformations, meaning that under , the scalar field transforms as .[12]:364–365

Global symmetries give rise to currents and charges through Noether's theorem. The four currents corresponding to the four spacetime translations are given by the stress-energy tensor, which for a real scalar field theory is given by[29]:138

where is the Lagrangian. The conserved charges are given by the integral of the zeroth component of the current over all space. In this case the conserved charges are the total energy and momentum of the field.[28]:95 For Lorentz transformations, the six currents are expressed in terms of the stress-energy tensor as[29]:139

The conserved charges arising from the three rotational degrees of freedom are the angular momenta, while for the three boosts they correspond to the motion of the centre of energy.

The real scalar field theory also has an internal discrete symmetry , although this has no dynamical consequences in the free theory.[29]:144–145 In the case of a complex scalar field, there is a continuous internal phase symmetry[17]:79

with an associated Noether current given by[26]:18

The conserved charge to this symmetry measures the difference in the number of particles to antiparticles. When coupled to electromagnetism the charge is the electric charge, the conservation of which corresponds to charge conservation.[28]:96

Green's functions

The Klein–Gordon equation can be elevated to the inhomogeneous Klein–Gordon equation through the introduction of a source term[28]:66

where is a time and space dependent function. This equation admits a solution of the form[28]:146

where is known as the Green's function. The Green's function is the formal solution to the equation[29]:55

The principle behind the Green's function is to relate the solution at to some forcing term at .[28]:145 While the Green's function can be seen as the formal inverse of the Klein–Gordon operator , this choice is not unique.[17]:133 A precise choice of the Green's function requires a choice of boundary conditions, specifying where it has support, such as vanishing for certain values of .

For example, the retarded Green's function is defined as relating the solution at the present to impulses strictly from the past .[29]:30 Meanwhile, the advanced Green's function relates solutions in the present to impulses strictly from the future. These are given by

where the positive sign corresponds to the advanced Green's function.[30]

Another common choice of boundary condition results in the Feynman propagator, which implements time-ordering and is given by[nb 2][29]:55

In quantum field theory, this propagator is proportional to the time-ordered two-point correlation function for scalar fields.[26]:31

Plane-wave solutions

The Klein–Gordon equation can be solved directly by plugging in the Fourier transform of the scalar field[28]:59

The d'Alembert operator only acts on the 4-position terms in the exponent, bringing down factors of the momentum. The equation then reduces to a constraint on the 4-momentum of the Fourier mode plane waves

Writing this in terms of the energy and 3-momentum reveals that it is equivalent to the energy-momentum relation for a massive particle .[12]:367 Therefore, the most general solution to the Klein–Gordon equation is a superposition of plane wave solutions whose momenta satisfy the energy-momentum relation.[27]:21

By imposing the on-shell condition, such as through the insertion of delta functions into the Fourier transform solution, a more explicit form for the solutions is found[nb 3]

where and are arbitrary functions of the 3-momentum, and is the energy of the modes. This form makes explicit the inevitable presence of both positive and negative frequency solutions.

Other solutions

When the Klein–Gordon equation is minimally coupled to electromagnetism with a Coulomb potential, it is used to study exotic atoms whose nuclei are orbited by spinless bosonic particles.[32]:34 For example, in a pionic atom, the atomic nucleus is orbited by negatively charged pions with mass . The lifetime of pions is long enough for this system to form bound states before the pion decays.

The case when only one pion orbits a nucleus of charge can be solved directly. Stationary solutions for a Coulomb potential use separation of variables, where the field is written as , to express the Klein–Gordon equation as[2]:681

where is the fine-structure constant. The energy levels of this system are then given by[32]:34–37

where is the principal quantum number and is the orbital angular momentum quantum number. In contrast to the nonrelativistic case, relativistic effects lift the degeneracy in the states.

These energy levels differ from traditional atoms due to the fact that electrons are fermions with spin half, while the Klein–Gordon equation describes bosonic spin 0 particles. However, the expression for the energy levels for regular atoms, known as the Sommerfeld formula, is very similar. It is directly acquired from the above through the replacement of the pion mass with the electron mass and by replacing , the spin-orbital angular quantum number. These energy levels have been observed in pionic atoms.[33]

The solution breaks down when , where the square root becomes imaginary, which first happens for .[34]:95 Physically, the attractive Coulomb potential is too strong, with the kinetic energy of the pion being insufficient to stop the pion from spiralling into the nucleus.[34]:97 Unlike in the non-relativistic case, this occurs at a classical heuristic level in the relativistic theory because the velocity of the pion cannot increase fast enough as its orbital radius decreases in order to maintain the centrifugal force needed to resist the Coulomb force.[14]:166 More precisely the issue arises because the wavefunction develops pathological short distance behaviour. For realistic atomic nuclei, this problem is partially avoided for larger due to them not have perfect point-like Coulomb potentials.[14]:166–167

Another notable solution to the Klein–Gordon equation is the scattering of an initial plane-wave off an potential step of height .[35] The properties of this solution are described by a reflection coefficient and a transmission coefficient , which satisfy . They describe the probability of the incident wave being reflected off the barrier or passing through it, respectively. For large incoming energies , both the reflection and transmission coefficients are non-zero, with there being some probability of the particle passing through the barrier and some probability of it being reflected.[32]:30–34 In the case when , the transmission coefficient is zero and the reflection coefficient is one, with the particle always being reflected. This behaviour matches that found for the non-relativistic Schrödinger equation.[34]:85–88

However, when the potential is very large , the solution acquires a negative transmission coefficients and a reflection coefficient greater than one . This behaviour is known as the Klein paradox and it was first described by Oskar Klein for the Dirac equation.[36] The paradox is resolved by reinterpreting the phenomenon as particle-antiparticle pair creation from the potential barrier itself.[32]:30–34 Antiparticles are then attracted to the potential while particles are repulsed, contributing to the reflected beam, increasing the amount of particles observed coming back from the barrier. A full analysis requires a relativistic multiparticle theory in the form of quantum field theory.[34]:88

Quantum theory

Quantum mechanics

The Klein–Gordon equation was originally devised in the context of relativistic quantum mechanics, where it serves the role of a relativistic generalization of the Schrödinger equation. There it describes the evolution of the complex-valued wavefunction through[37]:480

where is the Laplacian, is the reduced Planck constant, and is the speed of light.

If the Klein–Gordon equation is to describe a wavefunction, there needs to be a corresponding conserved probability density that can be built from the wavefunction.[37]:482 The candidate probability density, given by the zeroth component of the four-current corresponding to the complex phase symmetry , is given by[2]:683

While this is conserved, it is not positive definite and therefore cannot be interpreted as a probability density.[12]:365 This is despite the fact that its corresponding 3-current matches the form of the probability current for the Schrödinger equation in the non-relativistic limit.

Another issue is the presence of negative energy states. Plane wave solutions of the equation admit both positive energy solutions and negative energy solutions .[2]:683 In principle, the presence of an unbounded spectrum is not a problem in a theory with no interactions. However, the issues arise once interactions are included, such as through coupling to the electromagnetic field. Historically, this was seen as a problem because then there is nothing preventing indefinite decay of a state through emission of electromagnetic radiation. From a modern perspective this instead reflects the breakdown of the single-particle interpretation of the theory.[37]:487

Quantum field theory

In the second quantization framework of quantum field theory, fields are elevated to be operators acting on states.[37]:489 They can be used to create or destroy particles or antiparticles when acting on a state. For a complex scalar field, the operator annihilates particles and creates antiparticles, while creates particles and annihilates antiparticles.[17]:66 This is in contrast to relativistic quantum mechanics where the field rather corresponds to a wavefunction describing the state of a single particle. The Klein–Gordon equation describes the equation of motion for the field operator and the particles it creates.[38]:82 In the limit of large occupation numbers in a coherent or semiclassical state, the quantum field theory for a scalar field reduces to a classical field theory.[39]:209

The negative frequency states that cause issues in the quantum mechanical application of the Klein–Gordon equation are reinterpreted as positive energy antiparticles.[38]:18 For a real scalar field theory, particles are their own antiparticles, while for a complex scalar field theory the two have opposite charges. The candidate probability density for the quantum theory, which failed to be positive definite, is instead reinterpreted as describing the conserved charge density operator in the quantum field theory.[32]:26

Non-relativistic limit

The non-relativistic limit of the complex Klein–Gordon equation yields the Schrödinger equation at leading order. This can be derived by factoring out the oscillating rest mass energy through the field redefinition[17]:190

and by restricting to the positive frequency particle sector, thus dropping antiparticle from the spectrum.[28]:112–113 A more careful approach yields two independent Schrödinger equations for two complex fields , one for the particle and one for the antiparticle.[12]:366 The resulting non-relativistic theory has an expanded symmetry group associated with the complex pair . This has a subgroup, corresponding to a phase redefinition of each , which implies conservation of particle number and antiparticle number, separately.[40] When interactions are included, these symmetries can break down to a single global corresponding to charge conservation inherited from the full relativistic theory. Additionally, the non-relativistic effective field theory may have a limited range of viability, such as only holding in the case when particles and antiparticles do not meet and annihilate into relativistic states. If such events are allowed to occur and the effective field theory does not include the relativistic degrees of freedom, then the non-relativistic effective field theory is no longer a closed unitary theory.[41]

The non-relativistic limit of the real Klein–Gordon equation is more subtle due to the particle being its own antiparticle. It can however be defined, with a single relativistic real scalar field resulting in a theory for a single complex scalar field in the non-relativistic limit. Despite being a complex field, the degrees of freedom match since the relativistic theory is a second derivative theory in time while the non-relativistic theory is a first order theory, thus each non-relativistic field has half as many degrees of freedom as in the relativistic theory.[42] The naive definition of the non-relativistic complex scalar field from the relativistic field is[43]

This definition works well for a free theory where the non-relativistic theory for is obtained by dropping high frequency terms of order , giving the Schrödinger equation to leading order. However, when interactions are taken into account, a modified non-local definition of is usually used to more easily arrive at an effective non-relativistic theory.[44] The non-relativistic theory also has an emergent symmetry, which is exact in the non-interacting case. This symmetry implies particle number conservation with a number charge of

In a self-interacting theory, this symmetry holds to all orders in perturbation theory in the limit of sufficiently low energies and for low occupation numbers that kinematically disallow number violating processes.[44]

While the limit of the classical non-interacting Klein–Gordon equation to the classical Schrödinger equation is straight forward, more care has to be taken when taking the limit of the quantum field theory to non-relativistic quantum mechanics. This is because the fundamental variables used differ significantly in the two frameworks.

An crude way to take the non-relativistic limit is to construct the wavefunction of the particle state. In particular, for the complex Klein–Gordon field, this is done by using the quantum field theory field operator to create a state of a particle at position as[nb 4][37]:480

This is then used to construct a more general state as a superposition of these position states

where is now the wavefunction.[45] By constructing the field theory Hamiltonian, one can then show that this implies the Schrödinger equation for the wavefunction

This procedure is rather technically imprecise, and does not easily generalize to include interactions. It also does not address what happened to the antiparticle degrees of freedom. Instead, a more careful analysis shows that the non-relativistic limit gives rise to the Schrödinger equation for both the particle and antiparticle. This procedure can also be used in the case of a real Klein–Gordon field to get a non-relativistic quantum theory.[46]

Klein–Gordon equation in curved spacetime

In curved spacetime, the flat metric has to be elevated to a curved metric, and the derivatives need to be modified to account for the spacetime curvature through the introduction of a covariant derivative . In the mostly negative metric signature, the Klein–Gordon equation in curved spacetime takes the form[47]:182

An alternative form that replaces the covariant derivative with regular partial derivatives, avoiding having to explicitly calculate the Christoffel symbols, is given by[48]:105

Here is the inverse metric tensor and is the determinant of the metric.

A more general form of the Klein–Gordon equation that allows for non-minimal coupling to gravity is given by[49]:44

where is the Ricci scalar. The case of is known as minimal coupling, with the equation reducing to the aforementioned form. Meanwhile, in dimensional spacetime, when[50]:44

the Klein–Gordon field is said to be conformally coupled to gravity. When the conformally coupled field is also massless, the equation is conformally invariant. This case is often used as a particularly tractable model for a quantum field theory in curved spacetime calculations. In particular, for conformally flat spacetimes these fields do not experience any particle production due to the system being equivalent to a flat space theory.[49]:62

Other equations

The Klein–Gordon equation is the equation of motion for free relativistic scalar field theories. Once self-interactions are introduced through a potential , the resulting equation of motion takes the form of a non-linear Klein–Gordon equation[27]:32

Notable examples include the sine-Gordon equation and the equation of motion for theory.[39]:59–60

Interactions can also be introduced to the Klein–Gordon equation by gauging the scalar field under some gauge group.[27]:121 When it is gauged under a group, then this leads to scalar electrodynamics, while when it is gauged under an group, the result is referred to as scalar chromodynamics. In generality, when gauged under some Lie group , the Klein–Gordon equation takes the form[51]

where is a multicomponent scalar field belonging to some representation of the Lie group. This means that it transforms as , where is a matrix representation of the group element . Meanwhile, gauge covariant derivative is defined as[27]:482

where are the components of the gauge field, corresponding to the connection, and are the generators in the representation of the Lie algebra corresponding to the Lie group representation .

The components of all non-interacting spin fields satisfy the Klein–Gordon equation either directly or once appropriate constraints are imposed.[52] This is a group theoretic consequence, with all unitary irreducible representations of the Poincare group satisfying the equation.[39]:8 For example, any solution of the Dirac equation, which describe spin-1/2 spinor fields, is automatically a solution of the Klein–Gordon equation.[53] A similar result holds for the Maxwell equation and the Proca equation, describing spin-1 massless and massive fields , respectively.[28]:120–121 Their component equations both reduce to the massless and massive Klein–Gordon equation once the constraint is imposed, which is a necessary constraint for the Proca equation while only a gauge choice of the Lorenz gauge for the Maxwell equation.[39]:125[27]:119

See also

Notes

  1. This can be seen from the role that plays in the dispersion relation for plane wave solutions giving in natural units.
  2. In quantum field theory the Feynman propagator conventionally includes a factor of as .
  3. Different normalizations exist for the denominator, with sometimes one replacing with .[31]
  4. Usually this is done more precisely using smearing.

References

Related Articles

Wikiwand AI