Free Will and Falling Cats

Frank Wilczek
Center for Theoretical Physics, MIT, Cambridge, MA 02139 USA;
T. D. Lee Institute and Wilczek Quantum Center,
Shanghai Jiao Tong University, Shanghai, China;
Arizona State University, Tempe, AZ, USA;
Stockholm University, Stockholm, Sweden

Abstract

If we consider a cat to be an isolated mechanical system governed by T-invariant mechanics, then its ability to land on its feet after being released from rest is incomprehensible. It is more appropriate to treat the cat as a creature that can change its shape in order to accomplish a purpose. Within that framework we can construct a useful and informative of the observed motion. One can learn from this example.

{textblock*}

5cm(11cm,-8.2cm) MIT-CTP/5718

1 The “Bad News for Cats” Theorem

The ability of cats to recover from a sudden fall from a height to land on their feet is legendary. The related ability of trained human divers and gymnasts to stick an intended landing is at least as impressive, and much more reliably documented. These feats have attracted the interest of Maxwell [1] and of many other physicists – not to mention athletes and trainers – down to recent times [2] [3].

In 1904 the renowned mathematician Paul Painlevé contributed a short, rather cryptic note [4] announcing the following result:

… let S be a conservative system in which each element remains identical to itself, so that the state of the system at an instant t is completely defined by the position and velocity of each element. Given that this system S is abandoned without velocity in a vacuum above the ground, is it possible for it to return to its initial configuration, oriented DIFFERENTLY in space, at an instant t?

This is a question that has already been discussed before at the Academy, a few years ago (the problem of the cat that lands on its paws). When we don’t subjugate a subject the system S to being conservative, the answer is affirmative and has been supported by numerous examples.

On the contrary, when S is conservative, the answer to the question asked is negative: if S returns to its initial configuration, it is surely oriented in the same way in space. This is a remarkable fact …

As the subsequent text makes clear, by “conservative system” Painlevé means a system point of masses that interact through central potentials according to the laws of Newtonian mechanics. Later reconstructions of the proof [5] [6] are rather complicated. Here, working at a more abstract level, we give a simple conceptual proof of a more general result:

Theorem: (a) If a closed system governed by time-reversal invariant dynamics assumes time-reversal even states $S(t_{0})$ , $S(t_{0}+\tau)$ at two times $t_{0}$ , $t_{0}+\tau$ separated by an interval $\tau$ , then its motion is periodic with period $2\tau$ . (b) If, furthermore, those states are related to one another by a transformation $R$ that is a symmetry of the dynamical evolution, then $R^{2}$ acting on $S(t_{0})$ (and on $S(t_{0}+\tau)$ ) is trivial.

Proof: (a): By applying time reflection around $t_{0}$ we have $S(t_{0}+t)=S(t_{0}-t)$ , and by applying time reflection around $t_{0}+\tau$ we have $S(t_{0}-t)=S(t_{0}+\tau+(-\tau-t))=S(t_{0}+\tau-(-\tau-t))=S(t_{0}+t+2\tau)$ . Thus $S(t_{0}+t)=S(t_{0}+t+2\tau$ ).

(b): If we denote the operation that evolves states through time $t$ by $U(t)$ , so $U(t)S(t_{i})\equiv S(t_{i}+t)$ , then the mathematical formulation of the statement that $R$ is a symmetry of the dynamics is $RU(t)=U(t)R$ . So if $S(t_{0}+\tau)=RS(t_{0})$ and $R$ is a symmetry of the dynamics, then $S(t_{0})=S(t_{0}+2\tau)=U(\tau)S(t_{0}+\tau)=U(\tau)RS(t_{0})=RU(\tau)S(t_{0})% =R^{2}S(t_{0})$ .

Probably the most interesting application of part (b) concerns symmetry under rotation (including, for parity-invariant forces, improper rotation). In those cases, $R^{2}S=S$ is a severe constraint on $RS$ , basically limiting it to $\pi$ rotations or (assuming parity symmetry) reflections and inversions.

Time reversal symmetry is essential to the theorem, as is shown in the Appendix.

Painlevé’s interest in this problem was stimulated by the falling cat problem, and this theorem has been called the “Bad News for Cats Theorem” [7][8]. Taken at face value, of course, the theorem makes cats’ achievements seem paradoxical, since it contradicts their observed ability to re-orient.

2 Why Cats are OK

I suspect that everyone involved in this discussion has had tongue firmly planted in cheek, but at the risk of belaboring the obvious I want to highlight why this theorem is not relevant to real biological cats. While simple, the reason touches on profound issues, as signaled by the title of this note.

The point is that the framework assumed in the theorem, even in its generalized form, is not appropriate for the description of the behavior of biological cats, divers, or gymnasts. Specifically, none of those are closed systems, nor is their state time-reversal invariant even when they appear, macroscopically, to be at rest. They can readily and selectively consume stored energy, notably by converting ATP into ADP, empowering mechanical motion accompanied by radiation of heat. Indeed, while they are living, they never cease doing that. Also, they can use stored energy to process and transmit information, and to move in response to its flow. Thus, the assumptions of the theorem do not apply to biological actors.

3 Engaging With Will

Fortunately, a radically different approach offers useful insight into this class of problems. That is, we assume that the cat, diver, or gymnast has significant abilities to change its shape, by “will”, in response to external cues and internal plans. Within that framework many fascinating questions about what degrees of freedom can be called into play, how quickly they respond, how the systems can learn by practice and instruction, and so forth, in order to attain specific goals, can be discussed usefully. This is the approach adopted universally, though usually implicitly, by cats, biologists, athletes, and trainers. On the other hand it is deprecated, in effect if not by intention, by physicists and philosophers who deny the reality of will.

The utility of the concept of will, in this context, does not mean that anything goes. The “Astonishing Hypothesis” [9] that mind (of which will is an aspect) is an emergent property of matter is not here challenged. Instead, a broad-minded approach, that takes into account both the legitimacy of will and constraints that follow from the laws of physics, is most fruitful. For example, conservation of angular momentum holds to a good approximation for cats and divers in flight, since it is difficult to radiate away angular momentum. The kinematics of deformable bodies under that constraint is already quite rich [2] [3], and naturally brings in ideas from non-abelian gauge theory [10]. It helps us to track precisely how changes of shape induce changes in orientation.

Engineers rarely speak of “will” or “purpose”, but they speak frequently of “control” and “utility” [11] [12], which are parallel concepts. Recently it has become increasingly common also to speak of “intelligent” engineered systems. These quoted concepts are extra-physical ideas, yet they are central to the process of designing complex material systems and anticipating their behavior. And for those who observe the finished products, they are essential tools for understanding how they got to be the way they are and for making sense of their structure.

Complementarity, as discussed in [13], is the insight that very different and even superficially contradictory ways of describing the same system can provide ways to answer different questions about it. These uses of extra-physical concepts to address questions about the physical behavior of material systems nicely illustrate the utility of complementarity in action.

The preceding considerations about will and movement to achieve re-orientation apply to motion more generally. In everyday life most of our experience of motion is associated with the movement of our own bodies. We plan motions and will make them happen, for example in rising from sleep, walking, bringing food to our mouths, and a host of other activities that are essential to life. This experienced mechanics is superficially – and also deeply – different from classical Newtonian mechanics. We can change our arm’s velocity, for example, in apparent violation of the conservation of energy, or accelerate it without applying external forces (external, that is, to our bodies).

A big part of the reason why classical Newtonian mechanics was difficult to discover, and a big part of why it is such a towering intellectual achievement, is that it required a kind of cognitive dissonance. Specifically, it required – and still requires – people to reconcile two very different, quasi-contradictory descriptions of motion, both successful and convincing within their domains of application. There is the “common sense” understanding of movement, in which the concepts of will and purpose are central, that applies to experienced bodily activity; and there is classical mechanics proper, austere and rigorously mathematical, from which those concepts are excluded.

A few centuries ago, when classical mechanics first took shape, the difference between those descriptions was widely believed to be fundamental. The first description applied when souls or other animating principles were active, while the other applied while those animating principles were dormant. Today, science has understood that matter, embodying a system of known, (relatively) simple, mathematically formulated laws, can support the sort of complex emergent behavior we find, for example, in molecular biochemistry, neurobiology and synthetic intelligence. It has come to seem much more reasonable, therefore, to regard those two descriptions as complementary. They are alternative descriptions of the same reality, tailored to address different questions. We can use one or the other, or a hybrid. The conceptually enriched reality that results makes room for will and purposive motion without bringing in souls or animating principles.

4 Engaging With Free Will

I feel that nothing that I have said so far should be controversial. Nevertheless, it prepares us to address some notoriously contentious issues that have a similar logical structure.

4.1 Making Choices

Another common experience in human life is that of making deliberate choices. We imagine the consequences of different possible actions, and select which to carry out. This process can be discussed from many perspectives, that use different concepts and offer different insights. For present purposes, some brief indications will suffice:

•

Psychology: personality, drives
•

Economics: utility, game theory
•

Anthropology: culture

On the other hand one might be tempted, based on a crude interpretation of the Astonishing Hypothesis, to deny the reality of choice altogether. If mind emerges from matter, and the behavior of matter is determined by a fixed set of deterministic equations then, according to this interpretation, the whole notion of making choices is illusory. What is going to happen, happens, full stop.

And yet, to paraphrase Galileo, we choose. Thousands of books and papers in the above-mentioned disciplines, not to mention history and fiction, discuss the human world in terms of people making choices. People can have rational discussions using those ideas, at various levels of sophistication, and they can learn to use them better.

It is hard not to see the tension between the pair (choice)-(physics in general) as a grander version of the tension (will)-(classical mechanics) we saw in our discussion of falling cats. Whether or not it is possible “in principle” to anticipate or understand the behavior of people from the quantum wave function of their constituent quarks, gluons, electrons, and photons, that is rarely the best way to do so. We can have richer and more informative discussions, and make more and better predictions, if we bring in extra-physical concepts.

There are many reasons why it is useful to bring in such concepts. For while the fundamental equations of physics, as presently understood, evolve the present state of a closed physical system into a unique state at any future time, that fact has very limited relevance to the question of determinism in human behavior, because:

•

People are very far from being closed systems. They eat, breathe, absorb and radiate heat, get rained on, and so forth, and – most important – sense their environment and process and act upon that sensory information.
•

Calculating the evolution requires near-complete and accurate knowledge of the initial state, which is not available even for much smaller systems than human bodies.
•

Even given perfect knowledge of the initial state, the required calculations are absurdly impractical.

Thus, it becomes essential to discover and develop concepts that correspond to more robust observables and lead to simpler calculations.

The limitations flowing from the foregoing constraints are so severe as to make the project of calculating human behavior from the fundamental equations preposterous, in practice. Still, it can be entertaining to ask: What about in theory?

The first issue can be postponed by taking in larger parts of the universe than individual human beings, so as to achieve effective closure. But that stratagem dodges rather than resolves the issue of experienced choice – which is, of course, experienced by individuals. It also aggravates the other two issues. Moreover, it is unclear that effective closure can ever be achieved in our universe, since a) even small distant events can ramify into big consequences (butterfly effect [14]), and b) according to physical cosmology, at every moment previously unobservable parts of the universe just begin to extend their influence to us (expanding horizon).

The second issue is at least aggravated, and may be fatally compromised, by quantum theory. In that context, deterministic evolution holds for the wave-function (of a closed system). But, as articulated most clearly in the many-worlds interpretation of quantum theory, the relationship between the wave-function and experienced reality is not deterministic. Moreover, since measurements inevitably change the wave-function it is not at all clear that the ideal of complete knowledge of the wave-function corresponds to an operational, empirically based concept.

The third issue too might pass, upon closer investigation, from severely impractical to strictly impossible. Computation has a physical basis [15] [16], and the demands of absurdly complex computations might well exceed the resources of the physical universe. Also, the kind of self-referential computations that figure in paradoxes of the kind “I compute what I will do, and then will myself to do the opposite”, bear a family resemblance to the paradoxes that underly the famous no-go theorems of Gödel (incompleteness of deductive systems) and Turing (halting problem), and are likely to be subject to similar limitations.

For me, the entertainment value of sightseeing in these murky depths stales rapidly, so I will close this section with a related but more nearly grounded speculative comment.

It is possible to imagine that a more advanced neurobiology, powered by better understanding of how the organs of mind function and interact at a molecular level, will contribute incisive new concepts to descriptive psychology. This, combined with advanced imaging techniques, will usher in new levels of self-awareness. Specifically, it is not preposterous to imagine a future when people will be able to visualize and meaningfully interpret flows of information within their own or other people’s brains in real time, within an immersive, augmented reality. To do justice to that expanded reality, people will need to develop more refined and possibly radically different concepts around “choice”.

4.2 Influencing Choices

The idea that people can make choices based on considering alternative plans and weighing their merits is basic to morality and law. Concepts such as guilt, sin, social acceptability, and legality are meant to judge or influence people’s choices. In all these cases, there is an important distinction between acts and choices.

For the sake of concreteness, I will focus here on law. In a world where choice was illusory, the law would only refer to acts, and its only purpose would be retribution. Issues of intent would not – indeed, could not – arise, nor defense by reason of insanity. Fortunately, we do not live in that world. (At least, I’ve argued that we don’t.) The law as it actually operates in civilized society is instead largely designed to avoid the occurrence of illegal acts by making them, through the threat of punishment, less attractive choices. In cases where there is no intent, the law did not influence any choice at all. Therefore its deterrent purpose was not disrespected, but simply inoperative, and punishment loses some of its point.

The process of making choices is often phrased as “free will”. Indeed, the current Wikipedia entry for “Free Will” begins

Free will is the capacity or ability to choose between different possible courses of action [17].

In so far as this is a valid definition, it is not clear that “free will” has any advantage over the simpler “choice”. The historical origins of “free will” are in theology. In that context its connotations might be welcome. But in other contexts they are excess baggage. For the reason that follows, the expression “free will” is probably best avoided.

Unfortunately, the phrase free will invites, through its linguistic connotations, extrapolations from its proper use in descriptive psychology and law into dubious speculations or even claims that its utility points to serious oversights in physics and physical cosmology – specifically, the failure of these subjects to incorporate the direct dynamical influence of choices emanating from minds. Let me conclude by re-iterating a basic observation [13] that heavily constrains such speculations and claims. In modern physics we have learned to how to make very precise calculations of many very delicate effects, and to compare them with very precise measurements. For one example, the form and magnitude of the magnetic field associated to electrons (technically, their magnetic moment) is both theoretically predicted and, independently, measured to accuracy within fractions of a part per billion. For another, atomic clocks of several different kinds are predicted and experimentally measured to keep time consistently with one another to accuracies that correspond to a fraction of a second per billion years. In the course of making such accurate measurements, experimenters must pay careful attention to, and account for, many factors, including mechanical alignment and stability of their apparatus, isolation through high vacuum and ultra-low temperature within the experiment’s core, electrical isolation of sensors and amplifiers, screening of stray magnetic and electromagnetic fields, shielding from or vetoing on cosmic ray events, and so forth. But amidst all this care and delicacy it has never proved necessary to take any precautions against the possible influence of what people nearby the experiments – or, for that matter, anywhere – are thinking. Demonstration of effects of that kind, however minute, would be remarkable. Based on the existing evidence, we can be confident that they are subtle and elusive, if they exist at all.

Acknowledgement: I thank Juliana Baena for introducing me to the literature around Painlevé’s paper and for preparing a translation (quoted in part above). Also I thank her and Antti Niemi for discussions about the physics around it. I thank Brian Greene and Maulik Parikh for discussions about the natural philosophy of will and choice that prompted me to write down some of my thoughts on those matters. This work is supported by the U.S. Department of Energy under grant Contract Number DE-SC0012567 and by the Swedish Research Council under Contract No. 335-2014-7424.

References

[1] L. Campbell and W. Garnnett, The Life of James Clerk Maxwell (Macmillan and Co., 1888).
[2] G. Gbur, Falling Felines and Fundamental Physics (Yale, 2019).
[3] J. Huber, Springboard and Platform Diving (Human Kinetics, 2015).
[4] P. Painlevé, Comptes Rendus Hebdomadaires des Séances de l’Académie des Sciences 139, 1170–74 (1904).
[5] I. Geru, Time-Reversal Symmetry (Springer, 2018).
[6] R. Sachs, The Physics of Time Reversal Symmetry (Chicago, 1984).
[7] B. Roberts, Reversing the Arrow of Time (Cambridge, 2022).
[8] J. Earman, International Studies in the Philosophy of Science 16.3, 245 (2022).
[9] F. Crick, The Astonishing Hypothesis: The Scientific Search for the Soul (Scribner, 1994).
[10] A. Shapere and F. Wilczek, Am. J. Phys. 57, 514 (1989); F. Wilczek, in Dialogues Between Physics and Mathematics: C. N. Yang at 100 ed. M.-L. Ge and Y.-H He (Springer, 2023).
[11] Historical collection: Classic Papers in Control Theory, ed. R. Bellman and R. Kalaba (Dover, 2017).
[12] Recent text: Dynamic Programming and Optimal Control (2 volumes), D. Bertsekas (Athena Scientific, 2001).
[13] F. Wilczek, Fundamentals (Penguin, 2021).
[14] E. Lorenz, J. Atmospheric Sciences 20 130 (1963).
[15] C. Bennett, Int. J. Theor. Phys. 21 904 (1982).
[16] S. Lloyd, Nature 406 1047 (2000).
[17] P. Carus, The Monist 20 (Open Court, 1910).

Appendix: Minimal Mechanical Examples; Necessity of T

We can illustrate the application of our theorem, and see that T symmetry is essential, by considering a particle subject to the simple Lagrangian

L~{}=~{}\frac{1}{2}({\dot{r}}^{2}+r^{2}{\dot{\phi}}^{2})\,-\,V(r)\,+\,\alpha f% (r)\dot{\phi}

(1)

in polar coordinates. This Lagrangian describes planar motion for a unit charge, unit mass particle in the presence of a transverse, axially symmetric magnetic field $B_{z}(r)=\alpha\frac{1}{r}\frac{df}{dr}$

For the cyclic variable $\phi$ we have the equation of motion $\frac{d}{dt}\frac{\partial L}{\partial\dot{\phi}}=0$ , so

p_{\phi}~{}=~{}\frac{\partial L}{\partial\dot{\phi}}=r^{2}\dot{\phi}+\alpha f(r)

(2)

is a constant of the motion. $T$ -invariant states are defined by varnishing velocity, including $\dot{\phi}=0$ , so

p_{\phi}~{}\rightarrow~{}\alpha f(r_{0})

(3)

and

\dot{\phi}=\frac{\alpha(f(r_{0})-f(r))}{r^{2}}

(4)

T invariance of the dynamics implies $\alpha=0$ , and thus $\dot{\phi}=0$ . If the particle does not pass through the origin $r=0$ , then $\phi$ is uniquely defined and constant, so the orientation of our particle cannot change, as is consistent with the implication b. of our theorem. If the particle passes through the origin with non-zero velocity then to maintain continuity of the velocity we must allow $\phi$ to increment by $\pi$ . Following that the next turning point might occur at the same value of $r$ and be related to the original one by a $\pi$ rotation. This is consistent with the theorem, and illustrates that we must allow for the possibility of a non-trivial $R$ .

Essentially this same example can be realized within the context of Painlevé’s original framework of particles and central forces, by having two very heavy particles acting equally on a third light particle equally distant from both. This model demonstrates, in particular, that $R\neq 1$ can occur without collisions. That possibility contradicts [4] [5] [6].

With $\alpha\neq 0$ things are different. We can derive the equation of motion for $r$ straightforwardly, or more elegantly using the Routhian

R~{}=~{}-\frac{1}{2}{\dot{r}}^{2}+\frac{1}{2}\frac{(p_{\phi}-\alpha f(r))^{2}}% {r^{2}}+V(r)

(5)

in the form

	$\displaystyle\frac{d}{dt}\frac{\partial R}{\partial\dot{r}}~{}$	$\displaystyle=$	$\displaystyle~{}\frac{\partial R}{\partial r}$
	$\displaystyle\ddot{r}~{}$	$\displaystyle=$	$\displaystyle~{}-\frac{d}{dr}(V(r)+\frac{1}{2}\frac{(p_{\phi}-\alpha f(r))^{2}% }{r^{2}})$		(6)

Using the conserved effective energy

{\cal E}~{}=~{}\frac{1}{2}{\dot{r}}^{2}+V(r)+\frac{1}{2}\frac{(p_{\phi}-\alpha f% (r))^{2}}{r^{2}}

(7)

we see that resting points, where $\dot{r}=0$ , generally represent places where the $r$ -evolution reverses direction. If there is a second resting point on the same trajectory, we will have oscillations bounded by the resting points. Generically, the accumulated rotation

\Delta\phi~{}=~{}\oint\frac{\alpha(f(r_{0})-f(r))}{r^{2}}dt

(8)

over a cycle will be neither $0$ nor $\pi$ , and we can encounter re-orientations (i.e., in this context, rotations) of any magnitude.

For example, let us take a harmonic oscillator potential $V(r)=\frac{1}{2}kx^{2}$ and a constant magnetic field $\alpha f(r)=\frac{1}{2}B_{0}r^{2}$ . If at $r=r_{0}$ we have $\dot{r}=\dot{\phi}=0$ then

	$\displaystyle{\cal E}~{}$	$\displaystyle=$	$\displaystyle~{}\frac{1}{2}kr_{0}^{2}$
	$\displaystyle p_{\phi}~{}$	$\displaystyle=$	$\displaystyle~{}\frac{B_{0}}{2}r_{0}^{2}$		(9)

and

{\dot{r}}^{2}~{}=~{}k(r_{0}^{2}-r^{2})-\frac{B_{0}^{2}}{4}\frac{(r_{0}^{2}-r^{% 2})^{2}}{r^{2}}

(10)

There is a turning point where $\dot{r}=0$ at

r_{1}~{}=~{}\frac{r_{0}}{\sqrt{1+\frac{4k}{B_{0}^{2}}}}

(11)

(Note that $\dot{\phi}$ does not vanish when $r=r_{1}$ .) When $r$ cycles back to $r_{0}$ , the accumulated phase is, using $dt=\pm\frac{dr}{\dot{r}}$ appropriately in Eqn. (8),

\Delta\phi~{}=~{}-2\frac{B_{0}}{|B_{0}|}\int\limits^{r_{0}}_{r_{1}}\frac{dr}{r% }\frac{1}{\sqrt{1+\frac{4k}{B_{0}^{2}}}}\sqrt{\frac{r_{0}^{2}-r^{2}}{r^{2}-r_{% 1}^{2}}}

(12)

Since the integrand is positive, this does not vanish.