Free Will and Falling Cats

Free Will and Falling Cats

Frank Wilczek
Center for Theoretical Physics, MIT, Cambridge, MA 02139 USA;
T. D. Lee Institute and Wilczek Quantum Center,
Shanghai Jiao Tong University, Shanghai, China;
Arizona State University, Tempe, AZ, USA;
Stockholm University, Stockholm, Sweden
Abstract

If we consider a cat to be an isolated mechanical system governed by T-invariant mechanics, then its ability to land on its feet after being released from rest is incomprehensible. It is more appropriate to treat the cat as a creature that can change its shape in order to accomplish a purpose. Within that framework we can construct a useful and informative of the observed motion. One can learn from this example.

{textblock*}

5cm(11cm,-8.2cm) MIT-CTP/5718

1 The “Bad News for Cats” Theorem

The ability of cats to recover from a sudden fall from a height to land on their feet is legendary. The related ability of trained human divers and gymnasts to stick an intended landing is at least as impressive, and much more reliably documented. These feats have attracted the interest of Maxwell [1] and of many other physicists – not to mention athletes and trainers – down to recent times [2] [3].

In 1904 the renowned mathematician Paul Painlevé contributed a short, rather cryptic note [4] announcing the following result:

… let S be a conservative system in which each element remains identical to itself, so that the state of the system at an instant t is completely defined by the position and velocity of each element. Given that this system S is abandoned without velocity in a vacuum above the ground, is it possible for it to return to its initial configuration, oriented DIFFERENTLY in space, at an instant t?

This is a question that has already been discussed before at the Academy, a few years ago (the problem of the cat that lands on its paws). When we don’t subjugate a subject the system S to being conservative, the answer is affirmative and has been supported by numerous examples.

On the contrary, when S is conservative, the answer to the question asked is negative: if S returns to its initial configuration, it is surely oriented in the same way in space. This is a remarkable fact …

As the subsequent text makes clear, by “conservative system” Painlevé means a system point of masses that interact through central potentials according to the laws of Newtonian mechanics. Later reconstructions of the proof [5] [6] are rather complicated. Here, working at a more abstract level, we give a simple conceptual proof of a more general result:


Theorem: (a) If a closed system governed by time-reversal invariant dynamics assumes time-reversal even states S(t0)𝑆subscript𝑡0S(t_{0})italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), S(t0+τ)𝑆subscript𝑡0𝜏S(t_{0}+\tau)italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ ) at two times t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, t0+τsubscript𝑡0𝜏t_{0}+\tauitalic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ separated by an interval τ𝜏\tauitalic_τ, then its motion is periodic with period 2τ2𝜏2\tau2 italic_τ. (b) If, furthermore, those states are related to one another by a transformation R𝑅Ritalic_R that is a symmetry of the dynamical evolution, then R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT acting on S(t0)𝑆subscript𝑡0S(t_{0})italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) (and on S(t0+τ)𝑆subscript𝑡0𝜏S(t_{0}+\tau)italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ )) is trivial.

Proof: (a): By applying time reflection around t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT we have S(t0+t)=S(t0t)𝑆subscript𝑡0𝑡𝑆subscript𝑡0𝑡S(t_{0}+t)=S(t_{0}-t)italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_t ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_t ), and by applying time reflection around t0+τsubscript𝑡0𝜏t_{0}+\tauitalic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ we have S(t0t)=S(t0+τ+(τt))=S(t0+τ(τt))=S(t0+t+2τ)𝑆subscript𝑡0𝑡𝑆subscript𝑡0𝜏𝜏𝑡𝑆subscript𝑡0𝜏𝜏𝑡𝑆subscript𝑡0𝑡2𝜏S(t_{0}-t)=S(t_{0}+\tau+(-\tau-t))=S(t_{0}+\tau-(-\tau-t))=S(t_{0}+t+2\tau)italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_t ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ + ( - italic_τ - italic_t ) ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ - ( - italic_τ - italic_t ) ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_t + 2 italic_τ ). Thus S(t0+t)=S(t0+t+2τS(t_{0}+t)=S(t_{0}+t+2\tauitalic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_t ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_t + 2 italic_τ).

(b): If we denote the operation that evolves states through time t𝑡titalic_t by U(t)𝑈𝑡U(t)italic_U ( italic_t ), so U(t)S(ti)S(ti+t)𝑈𝑡𝑆subscript𝑡𝑖𝑆subscript𝑡𝑖𝑡U(t)S(t_{i})\equiv S(t_{i}+t)italic_U ( italic_t ) italic_S ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≡ italic_S ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_t ), then the mathematical formulation of the statement that R𝑅Ritalic_R is a symmetry of the dynamics is RU(t)=U(t)R𝑅𝑈𝑡𝑈𝑡𝑅RU(t)=U(t)Ritalic_R italic_U ( italic_t ) = italic_U ( italic_t ) italic_R. So if S(t0+τ)=RS(t0)𝑆subscript𝑡0𝜏𝑅𝑆subscript𝑡0S(t_{0}+\tau)=RS(t_{0})italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ ) = italic_R italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) and R𝑅Ritalic_R is a symmetry of the dynamics, then S(t0)=S(t0+2τ)=U(τ)S(t0+τ)=U(τ)RS(t0)=RU(τ)S(t0)=R2S(t0)𝑆subscript𝑡0𝑆subscript𝑡02𝜏𝑈𝜏𝑆subscript𝑡0𝜏𝑈𝜏𝑅𝑆subscript𝑡0𝑅𝑈𝜏𝑆subscript𝑡0superscript𝑅2𝑆subscript𝑡0S(t_{0})=S(t_{0}+2\tau)=U(\tau)S(t_{0}+\tau)=U(\tau)RS(t_{0})=RU(\tau)S(t_{0})% =R^{2}S(t_{0})italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 2 italic_τ ) = italic_U ( italic_τ ) italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_τ ) = italic_U ( italic_τ ) italic_R italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = italic_R italic_U ( italic_τ ) italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_S ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).


Probably the most interesting application of part (b) concerns symmetry under rotation (including, for parity-invariant forces, improper rotation). In those cases, R2S=Ssuperscript𝑅2𝑆𝑆R^{2}S=Sitalic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_S = italic_S is a severe constraint on RS𝑅𝑆RSitalic_R italic_S, basically limiting it to π𝜋\piitalic_π rotations or (assuming parity symmetry) reflections and inversions.

Time reversal symmetry is essential to the theorem, as is shown in the Appendix.

Painlevé’s interest in this problem was stimulated by the falling cat problem, and this theorem has been called the “Bad News for Cats Theorem” [7][8]. Taken at face value, of course, the theorem makes cats’ achievements seem paradoxical, since it contradicts their observed ability to re-orient.

2 Why Cats are OK

I suspect that everyone involved in this discussion has had tongue firmly planted in cheek, but at the risk of belaboring the obvious I want to highlight why this theorem is not relevant to real biological cats. While simple, the reason touches on profound issues, as signaled by the title of this note.

The point is that the framework assumed in the theorem, even in its generalized form, is not appropriate for the description of the behavior of biological cats, divers, or gymnasts. Specifically, none of those are closed systems, nor is their state time-reversal invariant even when they appear, macroscopically, to be at rest. They can readily and selectively consume stored energy, notably by converting ATP into ADP, empowering mechanical motion accompanied by radiation of heat. Indeed, while they are living, they never cease doing that. Also, they can use stored energy to process and transmit information, and to move in response to its flow. Thus, the assumptions of the theorem do not apply to biological actors.

3 Engaging With Will

Fortunately, a radically different approach offers useful insight into this class of problems. That is, we assume that the cat, diver, or gymnast has significant abilities to change its shape, by “will”, in response to external cues and internal plans. Within that framework many fascinating questions about what degrees of freedom can be called into play, how quickly they respond, how the systems can learn by practice and instruction, and so forth, in order to attain specific goals, can be discussed usefully. This is the approach adopted universally, though usually implicitly, by cats, biologists, athletes, and trainers. On the other hand it is deprecated, in effect if not by intention, by physicists and philosophers who deny the reality of will.

The utility of the concept of will, in this context, does not mean that anything goes. The “Astonishing Hypothesis” [9] that mind (of which will is an aspect) is an emergent property of matter is not here challenged. Instead, a broad-minded approach, that takes into account both the legitimacy of will and constraints that follow from the laws of physics, is most fruitful. For example, conservation of angular momentum holds to a good approximation for cats and divers in flight, since it is difficult to radiate away angular momentum. The kinematics of deformable bodies under that constraint is already quite rich [2] [3], and naturally brings in ideas from non-abelian gauge theory [10]. It helps us to track precisely how changes of shape induce changes in orientation.

Engineers rarely speak of “will” or “purpose”, but they speak frequently of “control” and “utility” [11] [12], which are parallel concepts. Recently it has become increasingly common also to speak of “intelligent” engineered systems. These quoted concepts are extra-physical ideas, yet they are central to the process of designing complex material systems and anticipating their behavior. And for those who observe the finished products, they are essential tools for understanding how they got to be the way they are and for making sense of their structure.

Complementarity, as discussed in [13], is the insight that very different and even superficially contradictory ways of describing the same system can provide ways to answer different questions about it. These uses of extra-physical concepts to address questions about the physical behavior of material systems nicely illustrate the utility of complementarity in action.


The preceding considerations about will and movement to achieve re-orientation apply to motion more generally. In everyday life most of our experience of motion is associated with the movement of our own bodies. We plan motions and will make them happen, for example in rising from sleep, walking, bringing food to our mouths, and a host of other activities that are essential to life. This experienced mechanics is superficially – and also deeply – different from classical Newtonian mechanics. We can change our arm’s velocity, for example, in apparent violation of the conservation of energy, or accelerate it without applying external forces (external, that is, to our bodies).

A big part of the reason why classical Newtonian mechanics was difficult to discover, and a big part of why it is such a towering intellectual achievement, is that it required a kind of cognitive dissonance. Specifically, it required – and still requires – people to reconcile two very different, quasi-contradictory descriptions of motion, both successful and convincing within their domains of application. There is the “common sense” understanding of movement, in which the concepts of will and purpose are central, that applies to experienced bodily activity; and there is classical mechanics proper, austere and rigorously mathematical, from which those concepts are excluded.

A few centuries ago, when classical mechanics first took shape, the difference between those descriptions was widely believed to be fundamental. The first description applied when souls or other animating principles were active, while the other applied while those animating principles were dormant. Today, science has understood that matter, embodying a system of known, (relatively) simple, mathematically formulated laws, can support the sort of complex emergent behavior we find, for example, in molecular biochemistry, neurobiology and synthetic intelligence. It has come to seem much more reasonable, therefore, to regard those two descriptions as complementary. They are alternative descriptions of the same reality, tailored to address different questions. We can use one or the other, or a hybrid. The conceptually enriched reality that results makes room for will and purposive motion without bringing in souls or animating principles.

4 Engaging With Free Will

I feel that nothing that I have said so far should be controversial. Nevertheless, it prepares us to address some notoriously contentious issues that have a similar logical structure.

4.1 Making Choices

Another common experience in human life is that of making deliberate choices. We imagine the consequences of different possible actions, and select which to carry out. This process can be discussed from many perspectives, that use different concepts and offer different insights. For present purposes, some brief indications will suffice:

  • Psychology: personality, drives

  • Economics: utility, game theory

  • Anthropology: culture

On the other hand one might be tempted, based on a crude interpretation of the Astonishing Hypothesis, to deny the reality of choice altogether. If mind emerges from matter, and the behavior of matter is determined by a fixed set of deterministic equations then, according to this interpretation, the whole notion of making choices is illusory. What is going to happen, happens, full stop.

And yet, to paraphrase Galileo, we choose. Thousands of books and papers in the above-mentioned disciplines, not to mention history and fiction, discuss the human world in terms of people making choices. People can have rational discussions using those ideas, at various levels of sophistication, and they can learn to use them better.

It is hard not to see the tension between the pair (choice)-(physics in general) as a grander version of the tension (will)-(classical mechanics) we saw in our discussion of falling cats. Whether or not it is possible “in principle” to anticipate or understand the behavior of people from the quantum wave function of their constituent quarks, gluons, electrons, and photons, that is rarely the best way to do so. We can have richer and more informative discussions, and make more and better predictions, if we bring in extra-physical concepts.

There are many reasons why it is useful to bring in such concepts. For while the fundamental equations of physics, as presently understood, evolve the present state of a closed physical system into a unique state at any future time, that fact has very limited relevance to the question of determinism in human behavior, because:

  • People are very far from being closed systems. They eat, breathe, absorb and radiate heat, get rained on, and so forth, and – most important – sense their environment and process and act upon that sensory information.

  • Calculating the evolution requires near-complete and accurate knowledge of the initial state, which is not available even for much smaller systems than human bodies.

  • Even given perfect knowledge of the initial state, the required calculations are absurdly impractical.

Thus, it becomes essential to discover and develop concepts that correspond to more robust observables and lead to simpler calculations.


The limitations flowing from the foregoing constraints are so severe as to make the project of calculating human behavior from the fundamental equations preposterous, in practice. Still, it can be entertaining to ask: What about in theory?

The first issue can be postponed by taking in larger parts of the universe than individual human beings, so as to achieve effective closure. But that stratagem dodges rather than resolves the issue of experienced choice – which is, of course, experienced by individuals. It also aggravates the other two issues. Moreover, it is unclear that effective closure can ever be achieved in our universe, since a) even small distant events can ramify into big consequences (butterfly effect [14]), and b) according to physical cosmology, at every moment previously unobservable parts of the universe just begin to extend their influence to us (expanding horizon).

The second issue is at least aggravated, and may be fatally compromised, by quantum theory. In that context, deterministic evolution holds for the wave-function (of a closed system). But, as articulated most clearly in the many-worlds interpretation of quantum theory, the relationship between the wave-function and experienced reality is not deterministic. Moreover, since measurements inevitably change the wave-function it is not at all clear that the ideal of complete knowledge of the wave-function corresponds to an operational, empirically based concept.

The third issue too might pass, upon closer investigation, from severely impractical to strictly impossible. Computation has a physical basis [15] [16], and the demands of absurdly complex computations might well exceed the resources of the physical universe. Also, the kind of self-referential computations that figure in paradoxes of the kind “I compute what I will do, and then will myself to do the opposite”, bear a family resemblance to the paradoxes that underly the famous no-go theorems of Gödel (incompleteness of deductive systems) and Turing (halting problem), and are likely to be subject to similar limitations.


For me, the entertainment value of sightseeing in these murky depths stales rapidly, so I will close this section with a related but more nearly grounded speculative comment.

It is possible to imagine that a more advanced neurobiology, powered by better understanding of how the organs of mind function and interact at a molecular level, will contribute incisive new concepts to descriptive psychology. This, combined with advanced imaging techniques, will usher in new levels of self-awareness. Specifically, it is not preposterous to imagine a future when people will be able to visualize and meaningfully interpret flows of information within their own or other people’s brains in real time, within an immersive, augmented reality. To do justice to that expanded reality, people will need to develop more refined and possibly radically different concepts around “choice”.

4.2 Influencing Choices

The idea that people can make choices based on considering alternative plans and weighing their merits is basic to morality and law. Concepts such as guilt, sin, social acceptability, and legality are meant to judge or influence people’s choices. In all these cases, there is an important distinction between acts and choices.

For the sake of concreteness, I will focus here on law. In a world where choice was illusory, the law would only refer to acts, and its only purpose would be retribution. Issues of intent would not – indeed, could not – arise, nor defense by reason of insanity. Fortunately, we do not live in that world. (At least, I’ve argued that we don’t.) The law as it actually operates in civilized society is instead largely designed to avoid the occurrence of illegal acts by making them, through the threat of punishment, less attractive choices. In cases where there is no intent, the law did not influence any choice at all. Therefore its deterrent purpose was not disrespected, but simply inoperative, and punishment loses some of its point.


The process of making choices is often phrased as “free will”. Indeed, the current Wikipedia entry for “Free Will” begins

Free will is the capacity or ability to choose between different possible courses of action [17].

In so far as this is a valid definition, it is not clear that “free will” has any advantage over the simpler “choice”. The historical origins of “free will” are in theology. In that context its connotations might be welcome. But in other contexts they are excess baggage. For the reason that follows, the expression “free will” is probably best avoided.

Unfortunately, the phrase free will invites, through its linguistic connotations, extrapolations from its proper use in descriptive psychology and law into dubious speculations or even claims that its utility points to serious oversights in physics and physical cosmology – specifically, the failure of these subjects to incorporate the direct dynamical influence of choices emanating from minds. Let me conclude by re-iterating a basic observation [13] that heavily constrains such speculations and claims. In modern physics we have learned to how to make very precise calculations of many very delicate effects, and to compare them with very precise measurements. For one example, the form and magnitude of the magnetic field associated to electrons (technically, their magnetic moment) is both theoretically predicted and, independently, measured to accuracy within fractions of a part per billion. For another, atomic clocks of several different kinds are predicted and experimentally measured to keep time consistently with one another to accuracies that correspond to a fraction of a second per billion years. In the course of making such accurate measurements, experimenters must pay careful attention to, and account for, many factors, including mechanical alignment and stability of their apparatus, isolation through high vacuum and ultra-low temperature within the experiment’s core, electrical isolation of sensors and amplifiers, screening of stray magnetic and electromagnetic fields, shielding from or vetoing on cosmic ray events, and so forth. But amidst all this care and delicacy it has never proved necessary to take any precautions against the possible influence of what people nearby the experiments – or, for that matter, anywhere – are thinking. Demonstration of effects of that kind, however minute, would be remarkable. Based on the existing evidence, we can be confident that they are subtle and elusive, if they exist at all.



Acknowledgement: I thank Juliana Baena for introducing me to the literature around Painlevé’s paper and for preparing a translation (quoted in part above). Also I thank her and Antti Niemi for discussions about the physics around it. I thank Brian Greene and Maulik Parikh for discussions about the natural philosophy of will and choice that prompted me to write down some of my thoughts on those matters. This work is supported by the U.S. Department of Energy under grant Contract Number DE-SC0012567 and by the Swedish Research Council under Contract No. 335-2014-7424.



References

  • [1] L. Campbell and W. Garnnett, The Life of James Clerk Maxwell (Macmillan and Co., 1888).
  • [2] G. Gbur, Falling Felines and Fundamental Physics (Yale, 2019).
  • [3] J. Huber, Springboard and Platform Diving (Human Kinetics, 2015).
  • [4] P. Painlevé, Comptes Rendus Hebdomadaires des Séances de l’Académie des Sciences 139, 1170–74 (1904).
  • [5] I. Geru, Time-Reversal Symmetry (Springer, 2018).
  • [6] R. Sachs, The Physics of Time Reversal Symmetry (Chicago, 1984).
  • [7] B. Roberts, Reversing the Arrow of Time (Cambridge, 2022).
  • [8] J. Earman, International Studies in the Philosophy of Science 16.3, 245 (2022).
  • [9] F. Crick, The Astonishing Hypothesis: The Scientific Search for the Soul (Scribner, 1994).
  • [10] A. Shapere and F. Wilczek, Am. J. Phys. 57, 514 (1989); F. Wilczek, in Dialogues Between Physics and Mathematics: C. N. Yang at 100 ed. M.-L. Ge and Y.-H He (Springer, 2023).
  • [11] Historical collection: Classic Papers in Control Theory, ed. R. Bellman and R. Kalaba (Dover, 2017).
  • [12] Recent text: Dynamic Programming and Optimal Control (2 volumes), D. Bertsekas (Athena Scientific, 2001).
  • [13] F. Wilczek, Fundamentals (Penguin, 2021).
  • [14] E. Lorenz, J. Atmospheric Sciences 20 130 (1963).
  • [15] C. Bennett, Int. J. Theor. Phys. 21 904 (1982).
  • [16] S. Lloyd, Nature 406 1047 (2000).
  • [17] P. Carus, The Monist 20 (Open Court, 1910).

Appendix: Minimal Mechanical Examples; Necessity of T


We can illustrate the application of our theorem, and see that T symmetry is essential, by considering a particle subject to the simple Lagrangian

L=12(r˙2+r2ϕ˙2)V(r)+αf(r)ϕ˙𝐿12superscript˙𝑟2superscript𝑟2superscript˙italic-ϕ2𝑉𝑟𝛼𝑓𝑟˙italic-ϕL~{}=~{}\frac{1}{2}({\dot{r}}^{2}+r^{2}{\dot{\phi}}^{2})\,-\,V(r)\,+\,\alpha f% (r)\dot{\phi}italic_L = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( over˙ start_ARG italic_r end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over˙ start_ARG italic_ϕ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - italic_V ( italic_r ) + italic_α italic_f ( italic_r ) over˙ start_ARG italic_ϕ end_ARG (1)

in polar coordinates. This Lagrangian describes planar motion for a unit charge, unit mass particle in the presence of a transverse, axially symmetric magnetic field Bz(r)=α1rdfdrsubscript𝐵𝑧𝑟𝛼1𝑟𝑑𝑓𝑑𝑟B_{z}(r)=\alpha\frac{1}{r}\frac{df}{dr}italic_B start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT ( italic_r ) = italic_α divide start_ARG 1 end_ARG start_ARG italic_r end_ARG divide start_ARG italic_d italic_f end_ARG start_ARG italic_d italic_r end_ARG

For the cyclic variable ϕitalic-ϕ\phiitalic_ϕ we have the equation of motion ddtLϕ˙=0𝑑𝑑𝑡𝐿˙italic-ϕ0\frac{d}{dt}\frac{\partial L}{\partial\dot{\phi}}=0divide start_ARG italic_d end_ARG start_ARG italic_d italic_t end_ARG divide start_ARG ∂ italic_L end_ARG start_ARG ∂ over˙ start_ARG italic_ϕ end_ARG end_ARG = 0, so

pϕ=Lϕ˙=r2ϕ˙+αf(r)subscript𝑝italic-ϕ𝐿˙italic-ϕsuperscript𝑟2˙italic-ϕ𝛼𝑓𝑟p_{\phi}~{}=~{}\frac{\partial L}{\partial\dot{\phi}}=r^{2}\dot{\phi}+\alpha f(r)italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT = divide start_ARG ∂ italic_L end_ARG start_ARG ∂ over˙ start_ARG italic_ϕ end_ARG end_ARG = italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over˙ start_ARG italic_ϕ end_ARG + italic_α italic_f ( italic_r ) (2)

is a constant of the motion. T𝑇Titalic_T-invariant states are defined by varnishing velocity, including ϕ˙=0˙italic-ϕ0\dot{\phi}=0over˙ start_ARG italic_ϕ end_ARG = 0, so

pϕαf(r0)subscript𝑝italic-ϕ𝛼𝑓subscript𝑟0p_{\phi}~{}\rightarrow~{}\alpha f(r_{0})italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT → italic_α italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) (3)

and

ϕ˙=α(f(r0)f(r))r2˙italic-ϕ𝛼𝑓subscript𝑟0𝑓𝑟superscript𝑟2\dot{\phi}=\frac{\alpha(f(r_{0})-f(r))}{r^{2}}over˙ start_ARG italic_ϕ end_ARG = divide start_ARG italic_α ( italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_f ( italic_r ) ) end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (4)

T invariance of the dynamics implies α=0𝛼0\alpha=0italic_α = 0, and thus ϕ˙=0˙italic-ϕ0\dot{\phi}=0over˙ start_ARG italic_ϕ end_ARG = 0. If the particle does not pass through the origin r=0𝑟0r=0italic_r = 0, then ϕitalic-ϕ\phiitalic_ϕ is uniquely defined and constant, so the orientation of our particle cannot change, as is consistent with the implication b. of our theorem. If the particle passes through the origin with non-zero velocity then to maintain continuity of the velocity we must allow ϕitalic-ϕ\phiitalic_ϕ to increment by π𝜋\piitalic_π. Following that the next turning point might occur at the same value of r𝑟ritalic_r and be related to the original one by a π𝜋\piitalic_π rotation. This is consistent with the theorem, and illustrates that we must allow for the possibility of a non-trivial R𝑅Ritalic_R.

Essentially this same example can be realized within the context of Painlevé’s original framework of particles and central forces, by having two very heavy particles acting equally on a third light particle equally distant from both. This model demonstrates, in particular, that R1𝑅1R\neq 1italic_R ≠ 1 can occur without collisions. That possibility contradicts [4] [5] [6].

With α0𝛼0\alpha\neq 0italic_α ≠ 0 things are different. We can derive the equation of motion for r𝑟ritalic_r straightforwardly, or more elegantly using the Routhian

R=12r˙2+12(pϕαf(r))2r2+V(r)𝑅12superscript˙𝑟212superscriptsubscript𝑝italic-ϕ𝛼𝑓𝑟2superscript𝑟2𝑉𝑟R~{}=~{}-\frac{1}{2}{\dot{r}}^{2}+\frac{1}{2}\frac{(p_{\phi}-\alpha f(r))^{2}}% {r^{2}}+V(r)italic_R = - divide start_ARG 1 end_ARG start_ARG 2 end_ARG over˙ start_ARG italic_r end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG divide start_ARG ( italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT - italic_α italic_f ( italic_r ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + italic_V ( italic_r ) (5)

in the form

ddtRr˙𝑑𝑑𝑡𝑅˙𝑟\displaystyle\frac{d}{dt}\frac{\partial R}{\partial\dot{r}}~{}divide start_ARG italic_d end_ARG start_ARG italic_d italic_t end_ARG divide start_ARG ∂ italic_R end_ARG start_ARG ∂ over˙ start_ARG italic_r end_ARG end_ARG =\displaystyle== Rr𝑅𝑟\displaystyle~{}\frac{\partial R}{\partial r}divide start_ARG ∂ italic_R end_ARG start_ARG ∂ italic_r end_ARG
r¨¨𝑟\displaystyle\ddot{r}~{}over¨ start_ARG italic_r end_ARG =\displaystyle== ddr(V(r)+12(pϕαf(r))2r2)𝑑𝑑𝑟𝑉𝑟12superscriptsubscript𝑝italic-ϕ𝛼𝑓𝑟2superscript𝑟2\displaystyle~{}-\frac{d}{dr}(V(r)+\frac{1}{2}\frac{(p_{\phi}-\alpha f(r))^{2}% }{r^{2}})- divide start_ARG italic_d end_ARG start_ARG italic_d italic_r end_ARG ( italic_V ( italic_r ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG divide start_ARG ( italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT - italic_α italic_f ( italic_r ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) (6)

Using the conserved effective energy

=12r˙2+V(r)+12(pϕαf(r))2r212superscript˙𝑟2𝑉𝑟12superscriptsubscript𝑝italic-ϕ𝛼𝑓𝑟2superscript𝑟2{\cal E}~{}=~{}\frac{1}{2}{\dot{r}}^{2}+V(r)+\frac{1}{2}\frac{(p_{\phi}-\alpha f% (r))^{2}}{r^{2}}caligraphic_E = divide start_ARG 1 end_ARG start_ARG 2 end_ARG over˙ start_ARG italic_r end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_V ( italic_r ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG divide start_ARG ( italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT - italic_α italic_f ( italic_r ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (7)

we see that resting points, where r˙=0˙𝑟0\dot{r}=0over˙ start_ARG italic_r end_ARG = 0, generally represent places where the r𝑟ritalic_r-evolution reverses direction. If there is a second resting point on the same trajectory, we will have oscillations bounded by the resting points. Generically, the accumulated rotation

Δϕ=α(f(r0)f(r))r2𝑑tΔitalic-ϕcontour-integral𝛼𝑓subscript𝑟0𝑓𝑟superscript𝑟2differential-d𝑡\Delta\phi~{}=~{}\oint\frac{\alpha(f(r_{0})-f(r))}{r^{2}}dtroman_Δ italic_ϕ = ∮ divide start_ARG italic_α ( italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_f ( italic_r ) ) end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_d italic_t (8)

over a cycle will be neither 00 nor π𝜋\piitalic_π, and we can encounter re-orientations (i.e., in this context, rotations) of any magnitude.

For example, let us take a harmonic oscillator potential V(r)=12kx2𝑉𝑟12𝑘superscript𝑥2V(r)=\frac{1}{2}kx^{2}italic_V ( italic_r ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_k italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and a constant magnetic field αf(r)=12B0r2𝛼𝑓𝑟12subscript𝐵0superscript𝑟2\alpha f(r)=\frac{1}{2}B_{0}r^{2}italic_α italic_f ( italic_r ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. If at r=r0𝑟subscript𝑟0r=r_{0}italic_r = italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT we have r˙=ϕ˙=0˙𝑟˙italic-ϕ0\dot{r}=\dot{\phi}=0over˙ start_ARG italic_r end_ARG = over˙ start_ARG italic_ϕ end_ARG = 0 then

\displaystyle{\cal E}~{}caligraphic_E =\displaystyle== 12kr0212𝑘superscriptsubscript𝑟02\displaystyle~{}\frac{1}{2}kr_{0}^{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_k italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
pϕsubscript𝑝italic-ϕ\displaystyle p_{\phi}~{}italic_p start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT =\displaystyle== B02r02subscript𝐵02superscriptsubscript𝑟02\displaystyle~{}\frac{B_{0}}{2}r_{0}^{2}divide start_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (9)

and

r˙2=k(r02r2)B024(r02r2)2r2superscript˙𝑟2𝑘superscriptsubscript𝑟02superscript𝑟2superscriptsubscript𝐵024superscriptsuperscriptsubscript𝑟02superscript𝑟22superscript𝑟2{\dot{r}}^{2}~{}=~{}k(r_{0}^{2}-r^{2})-\frac{B_{0}^{2}}{4}\frac{(r_{0}^{2}-r^{% 2})^{2}}{r^{2}}over˙ start_ARG italic_r end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_k ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - divide start_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG divide start_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (10)

There is a turning point where r˙=0˙𝑟0\dot{r}=0over˙ start_ARG italic_r end_ARG = 0 at

r1=r01+4kB02subscript𝑟1subscript𝑟014𝑘superscriptsubscript𝐵02r_{1}~{}=~{}\frac{r_{0}}{\sqrt{1+\frac{4k}{B_{0}^{2}}}}italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG 1 + divide start_ARG 4 italic_k end_ARG start_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG end_ARG (11)

(Note that ϕ˙˙italic-ϕ\dot{\phi}over˙ start_ARG italic_ϕ end_ARG does not vanish when r=r1𝑟subscript𝑟1r=r_{1}italic_r = italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.) When r𝑟ritalic_r cycles back to r0subscript𝑟0r_{0}italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the accumulated phase is, using dt=±drr˙𝑑𝑡plus-or-minus𝑑𝑟˙𝑟dt=\pm\frac{dr}{\dot{r}}italic_d italic_t = ± divide start_ARG italic_d italic_r end_ARG start_ARG over˙ start_ARG italic_r end_ARG end_ARG appropriately in Eqn. (8),

Δϕ=2B0|B0|r1r0drr11+4kB02r02r2r2r12Δitalic-ϕ2subscript𝐵0subscript𝐵0subscriptsuperscriptsubscript𝑟0subscript𝑟1𝑑𝑟𝑟114𝑘superscriptsubscript𝐵02superscriptsubscript𝑟02superscript𝑟2superscript𝑟2superscriptsubscript𝑟12\Delta\phi~{}=~{}-2\frac{B_{0}}{|B_{0}|}\int\limits^{r_{0}}_{r_{1}}\frac{dr}{r% }\frac{1}{\sqrt{1+\frac{4k}{B_{0}^{2}}}}\sqrt{\frac{r_{0}^{2}-r^{2}}{r^{2}-r_{% 1}^{2}}}roman_Δ italic_ϕ = - 2 divide start_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG | italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | end_ARG ∫ start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_d italic_r end_ARG start_ARG italic_r end_ARG divide start_ARG 1 end_ARG start_ARG square-root start_ARG 1 + divide start_ARG 4 italic_k end_ARG start_ARG italic_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG end_ARG square-root start_ARG divide start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG (12)

Since the integrand is positive, this does not vanish.