• Register
PhysicsOverflow is a next-generation academic platform for physicists and astronomers, including a community peer review system and a postgraduate-level discussion forum analogous to MathOverflow.

Welcome to PhysicsOverflow! PhysicsOverflow is an open platform for community peer review and graduate-level Physics discussion.

Please help promote PhysicsOverflow ads elsewhere if you like it.


PO is now at the Physics Department of Bielefeld University!

New printer friendly PO pages!

Migration to Bielefeld University was successful!

Please vote for this year's PhysicsOverflow ads!

Please do help out in categorising submissions. Submit a paper to PhysicsOverflow!

... see more

Tools for paper authors

Submit paper
Claim Paper Authorship

Tools for SE users

Search User
Reclaim SE Account
Request Account Merger
Nativise imported posts
Claim post (deleted users)
Import SE post

Users whose questions have been imported from Physics Stack Exchange, Theoretical Physics Stack Exchange, or any other Stack Exchange site are kindly requested to reclaim their account and not to register as a new user.

Public \(\beta\) tools

Report a bug with a feature
Request a new functionality
404 page design
Send feedback


(propose a free ad)

Site Statistics

205 submissions , 163 unreviewed
5,047 questions , 2,200 unanswered
5,345 answers , 22,709 comments
1,470 users with positive rep
816 active unimported users
More ...

  What is the fundamental reason of the fermion doubling?

+ 8 like - 0 dislike

Recall that the fermion doubling is the problem in taking the $a \to 0$ limit of a naively discretized fermionic theory (defined on a lattice with lattice spacing $a$). After such a limit one finds themselves with an additional amount (precisely $2^d$) of fermionic fields. One can fix this by considering different discretizations of the action that make unwanted fields decouple in the continuum limit. The downside is that the additional terms have to spoil some nice features of the theory (chiral symmetry, locality, lattice symmetry, ...).

Now, I wonder what is the true reason for the appearance of new fields. Is it the fermionic nature of the theory? (In other words, is a similar problem ruled out for bosonic fields?) And do all (naive?) fermionic theories (that is, independent of the continuum form of the action) suffer from this problem?

More generally, how can one tell a priori what will the field content of a lattice theory in the continuum limit be? Or is the field content fundamentally a continuum limit property that has to be calculated?

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Marek
asked Jul 13, 2011 in Theoretical Physics by Marek (635 points) [ no revision ]

2 Answers

+ 6 like - 0 dislike

The fermion doubling is manifested through the existence of extra poles in the Dirac propagator on the lattice. These poles cannot be made to disappear at the continuum limit. (The number of doublers can be reduced by different discretizations but not eliminated at all, this is essentially the Nielsen-Ninomiya theorem).

The reason for the fermion doubling lies in the existence of chiral anomaly. This anomaly exists in the continuum limit due to the chiral non-invariance of the path integral measure and not because of the non-invariance of Lagrangian. In the lattice formulation of a chiral theory based on a discretization of the Lagrangian, the anomaly is absent and the lattice formulation generates the extra species just to cancel this anomaly in the continuum limit. Since the axial anomaly exists in nature $\pi^0 \rightarrow \gamma \gamma$, this situation is unacceptable.

The fermion doubling problem is an artifact of the realization of the theory by means of quarks where the axial anomaly is not present in the lagrangian but in the path integral measure. There are approaches of other types of discretizations such as by means of fuzzy spaces in noncommutative geometry, where the quarks are not the basic fields of the theory. In these approaches, the fermion doubling problems do not exist.


This is an update referring to Marek's first comment.

For fermions, the axial anomaly can be recovored only by a non-trivial regularization of the path integral measure of the fermions. Any finite dimensional approximation of this measure as a product of Berezin-Grassmann integrals does not produce the axial anomaly. This is the reason why the Lattice regularization does not produce the axial anomaly and as a consequence the phenomenon of doubling occurs where the different species of doublers are of opposite chirality to cancel the anomaly. This is a property of fermion fields represented by Grassmann variables. In effective field theories (where the basic fields are pions) such as sigma models, the axial anomaly is manifested through a Wess-Zumino-Witten term in the Lagrangian, but these theories are not even perturbatively renormalizable and I think this is the reason why they were not put on a lattice.

One approach that I know of which solves the fermion doubling problem is the regularization by means of a fuzzy space approximation of the space-time manifold. The philosophy of this approach is explained in the introduction of Mark's Rieffel's article. The resolution of the fermion doubling using this approach is given nicely in Badis Ydri's thesis. (There is also more recent work on the subject by A. Balachandran and B. Ydri in the arxiv).

The main idea is that the Poisson algebra of functions over certain spaces (such as the two sphere, and more generally coadjoint orbits of compacl Lie groups) can be approximated by finite dimensional matrices. These approximations are called fuzzy spaces. Gauge fields and Fernmions can be constructed on these fuzzy spaces, which have the correct continuum limit when the matrix dimensions become infinite. This formulation contains the axial anomaly inherently, thus it is free of the doubling problem. The only drawback that I can see of this approach is that it is applicable only to some special 4-dimensional manifolds such as $CP^2$ or $S^2 \times S^2$, because the fuzzy manifold is required to be Berezin-quantizable.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user David Bar Moshe
answered Jul 13, 2011 by David Bar Moshe (4,355 points) [ no revision ]
Thanks. So, am I correct in concluding that doubling problem is not really a fermionic effect but can appear also in bosonic theories as a consequence of chiral non-invariance of path-integral? Also, do you have any reference for this stuff (either book or paper would be fine)?

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Marek
@Marek: How can you get a chiral anomaly without fermions?

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user BebopButUnsteady
@Bebop: I don't know much about anomalies (much less the chiral ones), so no idea. But chirality isn't unique to fermions, so unless you are referring to some technical result where half-integer spin is required in some calculation where the anomaly appears, I don't follow. Could you elaborate?

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Marek
@Marek: I'll add an update to the answer referring to your first comment

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user David Bar Moshe
@BebopButUnsteady: The chiral anomaly appears explicitely in the lagrangian of effective theories in the form of a Wess-Zumino-Witten term. In his seminal paper "Global aspects of current algebra" Nuclear Physics B Volume 223, Issue 2, 22 August 1983, Pages 422-432, Witten computed the amplitude of $\pi^0 \rightarrow \gamma \gamma$ by gauging the Wess-Zumino term of the effective action of the $SU(3)$ sigma model + a Wess-Zumino term.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user David Bar Moshe
@David Bar Moshe: I had somehow forgotten about that... Thanks for the reminder.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user BebopButUnsteady
@David: thanks for the update!

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Marek
+ 2 like - 0 dislike

A density matrix view of the problem:

When $p_\mu$ is near zero, one is considering a momentum near zero and the discrete lattice works fine. It's when $p_\mu$ is near $\pm \pi/a$, or more generally, near $n\pi/a$ for $n\neq 0$ an integer, that one finds problems. Instead of having very large momenta, these values of $p_\mu$ essentially give momenta as small as those near $p_\mu=0$, but with signs in the $\gamma$ matrices negated.

This is a type of aliasing problem. As with the usual aliasing, the problem at $\pm \pi/a$ goes away when one makes the sample rate faster, that is, replaces $a$ with a smaller value. And just as with aliasing, making $a$ smaller does not eliminate aliasing entirely, but instead pushes the problem to a higher frequency.

This drawing shows the usual aliasing effect. Note that the high frequency black signal (which corresponds to a high momentum) appears as a low frequency red signal (with low momentum):
enter image description here

The difference with the usual aliasing is what happens when $p_\mu = (2n+1)\pi/a$. These are the values that give a continuous Dirac equation with negated gamma matrices. To understand better what is going on here, let's consider the density matrix form. Density matrices avoid unphysical complex phases. I'll work in 3+1 dimensions.

One obtains a density matrix $\rho$ by multiplying a ket by a bra:
$$\rho = |a\rangle\langle a|.$$ The reason there are four spin-1/2 particles in 3+1 dimensions is that there are four primitive (i.e. trace=1) solutions to the idempotency equation:
$$\rho^2 = \rho$$
One has latitude in how one chooses the four states. In general, one chooses two elements of the Dirac algebra that (a) square to unity, (b) commute, and (c) are independent. These are called a "complete set of commuting roots of unity."

The commuting roots of unity are operators; one chooses them according to what operators one wishes to diagonalize. Following the wikipedia article on the construction of Dirac spinors, if we choose z-spin and charge $Q$, our complete set of commuting roots of unity is:
$$\sigma_z = i\gamma^1\gamma^2,\;\;\; Q = -\gamma^0$$
The four independent states are then:
$$\rho = (1\pm \sigma_z)(1\pm Q)/4.$$ To get the spinors from a density matrix, one chooses a nonzero column and normalizes. Thus spinors and density matrices are alternative mathematical representations of wave functions; neither is more fundamental.

If we discretize the density matrix, we will end up with the usual aliasing problem. From the point of view of lattice type calculations this is acceptable; there will be no duplicated particles. But spinors carry an extra degree of freedom; the arbitrary complex phase. This makes their aliasing behavior more complicated.

So consider what happens for a frequency a little larger than $\pi/2$. In the following illustration we color every other sample red or blue:
enter image description here
In the above, the density matrix will see this frequency appropriately as a high frequency. But with a spinor, we have arbitrary complex phase freedom. So we can negate the blue dots; the result is a low frequency. Thus the arbitrary complex phases of spinors naturally give aliasing problems at half frequencies.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Carl Brannen
answered Jul 14, 2011 by Carl Brannen (240 points) [ no revision ]
I'm unsatisfied with the above and would delete it except I think it has a germ of truth in it. Maybe the argument should instead be that the (naive) conversion from wave function to density matrix eliminates the frequency aliasing. Or that the problem of making a unitary derivative operator is easier in the density matrix form than in the state vector form (which requires looking at the Dirac equation for density matrices). See: arxiv.org/abs/hep-lat/0207008 for another explanation of the relationship between aliasing and fermion doubling.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Carl Brannen
I have to say I don't understand this answer at all (even after reading it twice), sorry.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Marek
If you modeled wave functions in position instead of momentum, you would end up aliasing for all the usual reasons. And the aliasing would show up doubled in state vectors (compared to density matrix) because of the arbitrary complex phase. Yet density matrices and state vectors are equally fundamental. Translating the calculation to momentum space (where it's actually done), you'll end up with something similar.

This post imported from StackExchange Physics at 2014-04-04 16:37 (UCT), posted by SE-user Carl Brannen

Your answer

Please use answers only to (at least partly) answer questions. To comment, discuss, or ask for clarification, leave a comment instead.
To mask links under text, please type your text, highlight it, and click the "link" button. You can then enter your link URL.
Please consult the FAQ for as to how to format your post.
This is the answer box; if you want to write a comment instead, please use the 'add comment' button.
Live preview (may slow down editor)   Preview
Your name to display (optional):
Privacy: Your email address will only be used for sending these notifications.
Anti-spam verification:
If you are a human please identify the position of the character covered by the symbol $\varnothing$ in the following word:
Then drag the red bullet below over the corresponding character of our banner. When you drop it there, the bullet changes to green (on slow internet connections after a few seconds).
Please complete the anti-spam verification

user contributions licensed under cc by-sa 3.0 with attribution required

Your rights