Standing on the shoulders of giants

“Standing on the shoulders of giants” was the title of my Polish blog post that was addressing the issue I need to address here and now, in my English blog.
From Wikipedia:

The metaphor of dwarfs standing on the shoulders of giants (Latin: nanos gigantum humeris insidentes) expresses the meaning of “discovering truth by building on previous discoveries”.[1] This concept has been traced to the 12th century, attributed to Bernard of Chartres. Its most familiar expression in English is by Isaac Newton in 1676: “If I have seen further, it is by standing on the shoulders of giants.”[2]

With us the story is that we also want to see further. But can we? Is that possible? Can a simple blog provide better information than original papers and monographs written by experts? To my own surprise I have found that yes, it can. Experts are human beings, they are always in a hurry, changing subjects of their research, making errors, leaving holes in arguments, and hoping that if there is someone really interested in the subject, then she/he will fix the errors, fill the holes, simplify the arguments, bring new observations.

Here is the image from Wikipedia:

And indeed we can provide additional pairs of eyes. My own adventure with the mechanics of flipping tops was possible because of several papers that I tried to study. I say “tried”, because I was not able to understand everything from these papers. Usually, after a year I have problems with understanding my own papers! But the paper that I really-really liked was this one:

Ramses van Zon, Jeremy Schofield, “Numerical implementation of the exact dynamics of free rigid bodies“, J. Comput. Phys. 225, 145-164 (2007)

Abstract
In this paper the exact analytical solution of the motion of a rigid body with arbitrary mass distribution is derived in the absence of forces or torques. The resulting expressions are cast into a form where the dependence of the motion on initial conditions is explicit and the equations governing the orientation of the body involve only real numbers. Based on these results, an efficient method to calculate the location and orientation of the rigid body at arbitrary times is presented. This implementation can be used to verify the accuracy of numerical integration schemes for rigid bodies, to serve as a building block for event-driven discontinuous molecular dynamics simulations of general rigid bodies, and for constructing symplectic integrators for rigid body dynamics.

Then I also learned a couple of ideas from

Celledoni, Elena; Zanna, Antonella.E Celledoni, F Fassò, N Säfström, A Zanna, The exact computation of the free rigid body motion and its use in splitting methods, SIAM Journal on Scientific Computing 30 (4), 2084-2112 (2008)

Then I have found even more recent paper:

Marcello Romano, “Concise Form of the Dynamic and Kinematic Solutions of the Euler-Poinsot Problem“, Advances in the Astronautical Sciences Volume 145, (IAA-AAS-DyCoSS1-01-05), (2012)

It is this last paper that encouraged me to study the subject deeper. Romano, in particular, wrote:


It is a surprising fact that the exact solutions of the Euler-Poinsot problem are rarely taught in graduate courses of astrodynamics, or used in the engineering practice. It is even more surprising, in particular, that the kinematic solutions by Jacobi in terms of rotation matrix elements, as well as the associated remarkable geometric interpretation, are overlooked in the literature, and do not appear in any textbook of astrodynamics or celestial mechanics (to the best knowledge of the author). Two possible reasons of this fact are: the general lack of familiarity regarding the Jacobi elliptic and theta functions, and the inconvenient treatment of initial conditions and body geometry related to the usual formulation of the solutions (as given, for instance, limited to the dynamics, by Hughes and Wittenburg).
It is the goal of the author to foster, with the present paper, an increased familiarity, dissemination and utilization of the exact solutions of the Euler-Poinsot problem.

Well, well. For dissemination what can be better than the blog? So, here I am – disseminating! And doing it according to my own taste and feeling of what is simple and what is interesting.

Here we go, with the dissemination task. We are done with exact solutions of Euler’s equation for the free asymmetric top. But that is only half of the whole problem. We are dealing with mechanics or, better, dynamics. It was created by Newton and then, having particularly in mind rotating bodies, developed by Euler. The fundamental equation of dynamics is known from schools:

    \[m\vec{a}=\vec{F}.\]

We have speed v

    \[\vec{v}=\frac{\Delta \vec{x}}{\Delta t},\]

and we have acceleration a

    \[\vec{a}=\frac{\Delta \vec{v}}{\Delta t}.\]

To solve a mechanical problem we need to find \vec{x}(t), that is how position changes with time. Quite often we do it in two steps: we know the forces, so we know the acceleration. We integrate to find how velocity is changing with time. This is half of the work. But then we have to integrate again in order to find how position changes with time.

With rotating bodies it is similar, except that instead of “velocity” we have “angular velocity”, and instead of “position” we have “orientation” (described by the “attitude matrix” Q(t)).

Employee Attitude Matrix
Employees can be divided into 4 groups using the employee commitment and employee satisfaction. They can be apostles, mercenaries, hostages and terrorists. The organisation should have a plan for all groups because they have different needs. They need to receive different messages from the leadership team.
:)

In previous posts we have learned how to solve Euler’s equations. They give us angular velocity \vec{\Omega}(t) in the body frame. But in order to find the orientation of the body with respect to the laboratory, we need to integrate the equation that relates the attitude matrix Q(t) to \vec{\Omega}(t). Here I will rely on the content of the post Angular momentum. From Eq. (1) therein we have

(1)   \begin{equation*}\frac{dQ(t)}{dt}=Q(t)W(t),\end{equation*}

where

    \[W(t)=\begin{bmatrix}0&-\Omega_3(t)&\Omega_2(t)\\\Omega_3(t)&0&-\Omega_1(t)\\-\Omega_2(t)&\Omega_1(t)&0\end{bmatrix}=\begin{bmatrix}0&-\frac{L_3(t)}{I_3}&\frac{L_2(t)}{I_2}\\\frac{L_3(t)}{I_3}&0&-\frac{L_1(t)}{I_1}\\-\frac{L_2(t)}{I_2}&\frac{L_1(t)}{I_1}&0\end{bmatrix},\]

where we have used the relation between angular momentum and angular velocity:

    \[L_i=I_i\Omega_i,\quad (i=1,2,3).\]

Q(t), the attitude matrix, is an orthogonal 3\times3 real matrix. When we find it, we will know how the body rotates in space, as seen from the laboratory inertial frame. In order to find Q(t) we need to solve the differential equation. In fact Q(t) is a matrix with 3\times 3=9 coefficients, so we need to solve a system of nine differential equations. From a general theory of differential equations (linear in our case) we know that if Q(t) is given at the initial time t=t_0, then the solution exists and is unique. This is encouraging but not very helpful. From the matrix form (1) we also see that if Q(t) is a solution, and if R is a fixed orthonormal matrix, then Q'(t)=RQ(t) is also a solution. It has very simple interpretation: Q'(t) represents the motion with respect to a rotated laboratory, rotated by the rotation matrix R. Moreover, every solution Q'(t) of our equation is of that form. Thus, if we find just one solution of our equation, then we know them all. But we need to find one. It is from the paper by van Zon and Schofield mentioned above that I learned the trick of how to simplify the task of solving nine differential equations. By using geometric reasoning it is possible to reduce the number of equations that need to be solved from nine to just one! It is quite possible that van Zone and Schofield learned the trick from someone else. That is what “standing on the shoulders of giants” means in practice. Albert Einstein was known for standing on the shoulders of others quite often, without quoting them explicitly. Apparently in his mind it is not important who does something, it is important what is being accomplished. I am not that original, I like to quote other people work, who were working in a similar direction, even if I am not using their results.

But let us return back to “what”, that is what the simplifying trick is about. We have reference frames, and we have angular momentum vector \vec{L}, which is constant with respect to the laboratory frame (conservation of angular momentum). Therefore we can choose the laboratory frame so that \vec{L} has in this frame components (0,0,L), where L is the length of \vec{L}. How to do it in practice?? We do not have to worry about it now, after all we are solving our equations in theory. When we are done with the theory, then we can return to the real world.

So, in the laboratory our vector \vec{L} has components (0,0,L), while in the rotating frame attached to the body it has components (L_1,L_2,L_3). Matrix Q transforms components of vectors in the body frame into components of these vectors in laboratory. In particular it transforms \vec{L}=(L_1,L_2,L_3) into \vec{l}=(0,0,L). There are many orthogonal matrices that do just that. Suppose Q\vec{L}=\vec{l}, and Q_0\vec{L}=\vec{l}. Then

    \[ \vec{l}=Q\vec{L}=Q{Q_0}^{-1}\vec{l}.\]

Therefore Q_1=Q Q_0^{-1} is an orthogonal matrix that keeps the vector \vec{l} fixed. It must be a rotation about the third axis. Therefore Q_1 is of the form:

(2)   \begin{equation*} \begin{bmatrix}\cos\psi&-\sin\psi&0\\\sin\psi&\cos\psi&0\\0&0&1\end{bmatrix},\end{equation*}

and Q=Q_1Q_0. Can we find easily some Q_0? Yes, we can. And this is the trick:

(3)   \begin{equation*}\begin{bmatrix}\frac{L_1L_3}{LL_p}&\frac{L_2L_3}{LL_p}&-\frac{L_p}{L}\\-\frac{L_2}{L_p}&\frac{L_1}{L_p}&0\\\frac{L_1}{L}&\frac{L_2}{L}&\frac{L_3}{L}\end{bmatrix},\quad L_p=\sqrt{L_1^2+L_2^2}.\end{equation*}

Here is simple REDUCE code verifying that Q_0 is indeed orthogonal, and that Q_0\vec{L}=\vec{l} (even though checking it by hand is easy):

Lp:=sqrt(L1^2+L2^2);
L:=sqrt(L1^2+L2^2+L3^2);
Q:=mat(
(L1*L3/(L*Lp),L2*L3/(L*Lp),-Lp/L),
(-L2/Lp,L1/Lp,0),
(L1/L,L2/L,L3/L) );
Q*tp(Q);
Q*tp mat((L1,L2,L3))-tp mat((0,0,L));
END;

We have \vec{L}(t) given as our solution of Euler’s equations. Therefore we know Q_0(t). It remains now to substitute Q(t) defined as

    \[Q(t)=\begin{bmatrix}\cos\psi(t)&-\sin\psi(t)&0\\\sin\psi(t)&\cos\psi(t)&0\\0&0&1\end{bmatrix}\begin{bmatrix}\frac{L_1(t)L_3(t)}{LL_p(t)}&\frac{L_2(t)L_3(t)}{LL_p(t)}&-\frac{L_p(t)}{L}\\-\frac{L_2(t)}{L_p(t)}&\frac{L_1(t)}{L_p}&0\\\frac{L_1(t)}{L}&\frac{L_2(t)}{L}&\frac{L_3(t)}{L}\end{bmatrix},\quad L_p(t)=\sqrt{L_1(t)^2+L_2(t)^2},\]

into Eq (1), find differential equation for \psi(t), and solve it. It is an adventure. Standing on the shoulders of giants is always an adventure. What if one falls down from that height?

8 thoughts on “Standing on the shoulders of giants

  1. I have been quite lately. Have to refigure out how to animate something in Maple. Still refiguring! Last night I did realise, oh.. we have velocity equations. I need position equations in time. Was going to try integrating the 3 Euler equationns. Would that be a valid approach? Was definately wondering what to do about the a rotation matrix too. Thanks for posting that paper.

  2. A couple of questions on the paper nomenclature. What do ~ mean/represtent above. I am interpreting it to mean “in the body frame of reference”. A^t (I dont have the proper symbol used in the paper), I presume is the transpose of A. These questions might sound like I already know the answer to them. When I started typing here I didn’t. I find clarifying what my actual questions are often brings the answer as I ask them.

  3. The tilde usually just means it’s a vector and I think he may be using it to differentiate vector vs scalar for the same name and yeah that all seems related to using the body frame of reference. I think it said somewhere the A^t is the transpose.

    There’s a blog called “A QUantum Diaries Survivor”. I think before I found it and it didn’t have the “Survivor” in its name, it was a research diary. I do think seeing active research in blog form (like here) is great.

    I remember writing my first two papers for a journal and yes the actual paper is somewhat anticlimactic. It looks like you just woke up one day and wrote down what you knew when it was really a lot of reading of giants with shoulders mixed with lots of confusion and wows.

    My first paper was almost all via a library and the second almost all via the internet even though they were only a couple years apart (late 90s to early 2000s). Kind of miss the library but the internet has more information.

  4. we have angular momentum vector \vec{L} ->
    we have angular momentum vector \vec{l}

    laboratory frame so that \vec{L} ->
    laboratory frame so that \vec{l}

    where L is the length of \vec{L} ->
    where L is the length of \vec{l}

    in the laboratory our vector \vec{L} ->
    in the laboratory our vector \vec{l}

    1. I am not sure if I really want to make these changes. When I write \vec{L}, i mean a vector that may have different components in different frames. It will have components (L_1,L_2,L_3) in the body frame, and it will have components (l_1,l_2,l_3) in the laboratory frame. When I use the notation \vec{l} what I would like to mean is : the triple of numbers (l_1,l_2,l_3).
      I know it can be confusing. One should have a different notation for a vector, and for a triple of numbers. For instance vector would be \vec{L} and a column of three numbers would be \mathbf{L}.
      Other wise on has to guess the meaning from the context. What is not that difficult, and everyone should get some training in that art. It is useful in life.

Leave a Reply