Behind the Guesses: Quaternions -- Part 1: How many?

[Click here for a PDF of this post with nicer formatting or see below]
The Setup
Quaternions seem to be one of the least understood mathematical things amongst physicists. I have sat in countless lectures where at some point the lecturer pointed out that a particular topic could be understood or explained using quaternions, but, when pressed, could not really explain what, precisely, one of these quaternion thingies actually is.

The first encounter people have with quaternions is generally after they learn about the complex plane and its relationship to the regular 2D Cartesian plane. After seeing all sorts of nifty properties and uses of this relationship (we'll see one shortly) it's only natural to ask if there's a 3D complex analogue to the 2D complex plane. And, therefore, most books ask precisely this question. However, they usually give less-than-satisfactory attempts at generalizing, highlighting the mysterious algebraic problem of ``closure'' or something to that effect.

Then, often retelling the story of Hamilton and a bridge¹ they pull some strange, ``4D'' quaternions out of a hat and show how they happily resolve all the algebraic problems. This, it seems, should be enough to placate even the most thoughtful reader, and stands in place of an actual explanation. And even though there is a lot of information about these buggers out there on the intertubes, all that I've seen is of the same approach.

So, it doesn't surprise me, honestly, that ``quaternion'' is also one of the most popular searches on this blog. The topic of quaternions is really too big to handle fully in one post (and, for full disclosure, I do not completely understand them myself), so this post will deal primarily with a rationale for the initial guess of a ``4D'' quaternion.

This post assumes you have read, and thoroughly grokked my discussion of dot and cross products,[1] and have a solid understanding of traditional complex numbers.

Complex Numbers and 2D Vectors
My approach in this section is based on the fantastic book, Visual Complex Analysis by Tristan Needham.[2] If something here isn't clear (and it's not the fault of my writing), or is different from the way you learned complex numbers, read this book. Even if everything is perfectly clear, read this book. What I'm trying to say is: Read this book.²

Recall that a complex number $z=x+iy$ can be represented by a vector $\vec{z}=x\hat\imath+y\hat\jmath$ in a 2D $xy$ -plane. Also, if $r=\sqrt{x^2+y^2}$ -- i.e. the ``modulus'' of $z$ or the length of $\vec z$ -- and $\theta$ is the ``argument'' of $z$ or the angle between $\vec z$ and the $x$ -axis, we can also write $z=re^{i\theta}$ in ``polar form.'' See [2] for pictures. We also have Euler's identity

$e^{i\theta}=\cos\theta+i\sin\theta$

(1)

Furthermore, recall that multiplying two complex numbers together effects a rotation and scaling. For example, multiplying a complex number $z=re^{i\phi}$ -- graphically, a vector of length $r$ making angle $\phi$ with respect to the real axis ( $x$ -axis) -- by $\xi=\rho e^{i\theta}$ gives $\xi z=(\rho r)e^{i(\phi+\theta)}$ . This can be understood graphically as a scaling of $r$ by $\rho$ and a rotation of the direction of $z$ by the angle $\theta$ . Finally, the complex conjugate $z^*$ of $z$ is given either by $z^*=x-iy$ or $z=re^{-i\theta}$ .

From Complex Multiplication to Vector Products
For two complex numbers $A=a e^{i\alpha}$ and $B=b e^{i\beta}$ let's see what $A^* B$ is. This demonstration (at least initially) is based on [2].³ Anyway,

$A^* B = a b e^{i (\beta - \alpha)}$

$=a b e^{i\theta}.$

(2)

Graphically this is a vector with length $ab$ at an angle $\theta=(\beta-\alpha)$ from the $x$ -axis. Expanding this into a real and complex part using Euler's identity (1) gives:

$A^* B=ab\cos\theta + i\,ab\sin\theta.$

(3)

We now note that the real part of this expression corresponds to the dot product between the two vectors
$\vec A$ and $\vec B$ . But should we do with the imaginary part?

Well, the magnitude of the imaginary part certainly corresponds to the magnitude of one dimension of the cross product between the two vectors. That is, if we relate the complex plane to the Cartesian $xy$ -plane then the imaginary part of $A^* B$ is the $z$ -component of $\vec{A}\times\vec{B}$ . This important point is often lost in passing, and thus this property of complex multiplication is relegated to the realm of ``cool trick.'' However, we'll make good use of this detail.

Rethinking complex numbers
Now we are ready for the conceptual jump. Although we got to the representation of dot and cross products through use of a 2D complex plane, we're going to distance ourselves from this wonderful visualization for the moment and note that an arbitrary complex number has two parts: One corresponds to a dot product, the other corresponds to one dimension of a cross product of two vectors.[3]⁴ If we want to find a relationship between complex numbers and 3D vectors we need to pick one of these parts to generalize.

Now, recall that the dot product yields a scalar quantity equal to the amount that two vectors point in the same direction. Since there is no directionality or dimensionality inherent in this quantity -- it's just a length -- there's really no way to add extra bits here. Length stays a scalar in any dimension.

So, instead we turn to the cross-product part. In the preceding section I repeatedly stressed that the imaginary part of $A^*B$ corresponds to one dimension of a 3D cross product. However, which single dimension of the cross product we choose is completely arbitrary: Just as with the calculation of area for the cross product, the 2D Cartesian plane we choose to map to the complex plane could just as easily be the $xy$ -, $zx$ - or the $yz$ -planes.

Recall, that to resolve this ambiguity in in cross-product land we chose to identify which plane we were talking about by a right-hand rule normal vector to the plane. However, here we're attempting to generalize complex numbers, not cross products per se. So, instead of assigning different normal vectors to each cross product term, let's assign a different complex number to each term. That is, $i^2=-1$ and $j^2=-1$ , but $i\neq j$ for example. Then, we assign $i$ to the cross product of two vectors in the $yz$ -plane and $j$ to the cross product of two vectors in the $zx$ -plane.

The one question remaining, though, as we generalize our complex plane, is how many additional complex numbers do we need? Maybe, naively, we can try adding just one extra cross-product dimension. That is, $i$ and $j$ only. The problem, though, can be seen in cross-product land.

Closure
Remember, that a cross product resultant vector is a normal vector to an arbitrary plane in 3D Cartesian space, and thus always requires all three unit vectors $\hat{x}$ , $\hat y$ and $\hat{z}$ . For example, the cross product $\hat{x}\times\hat y$ is $\hat z$ . That is, in order to make sense of $\hat x$ cross $\hat y$ which can exist in 2D, you must already have a third unit vector $\hat z$ .

Physically, in Cartesian vector space, it means that you must be able to add any arbitrary 3D cross product resultant vectors and still get a 3D vector. In fact, if this wasn't true, there'd be no way to even write the 3D cross product in the first place since you need to project the arbitrary vectors to three (independent) 2D planes and then add the resulting normal vectors. You can't have just two cross-product parts and get a result that always makes sense. This is the requirement of ``closure.''

The reason there's no problem in the 2D plane version is simply because there's only one possible normal vector, so we only look at the magnitude of the cross product -- i.e. the amount of area -- and the sign. And that is just a scalar! In 2D land nothing is preventing you from adding the cross product to the dot product -- they're both scalars -- so you can write a two-element complex number $x+i\,y$ combination with no trouble.

However, in 3D we can't simply add a vector to a scalar, and therefore we need all three parts of the cross product. So too, then, if we want a generalized complex number to have a dot-product part and a cross-product part that makes physical sense, we need three complex numbers: $i$ and $j$ from above, plus a $k^2=-1$ , $k\neq i,j$ corresponding to the cross product of the projection of vectors in the $xy$ -plane.

Thus, we now have a generalized complex number -- quaternion -- of the form

$z=\ell+i\,x+j\,y+k\,z.$

(4)

References
[1] E. Lansey. The dot product and cross products [online]. April 2009. Available from: http://behindtheguesses.blogspot.com/2009/04/dot-and-cross-products.html.
[2] Tristan Needham. Visual Complex Analysis. Oxford University Press, Oxford, UK, 1997.
[3] C. Doran and A.N. Lasenby. Geometric algebra for physicists. Cambridge University Press New York, Cambridge, UK, 1st edition, 2003.

¹ Just Google it, it's not really worth retelling, in my opinion.
² If I was stranded on an island forever but could bring only one math book, this would be it.
³ Just go out and get that book already! What are you waiting for?
⁴Thanks, Peeter, for recommending the book ([3]) which highlighted this point.

7 comments:

Peeter JootDecember 1, 2009 at 12:17 AM
A nicely written post once again. It will be interesting where you go with this.
Bharath PrabhuswamyDecember 3, 2009 at 10:12 AM
Nice post!.. Ive been sitting in front of the camera and trying to map its 2D Capture back to 3D and yes, funny things happened.

We probably believe that assuming i = sqrt(-1) and calling it imaginary will do the job.
The tricky part when we try to define root of a negative number. However if we see an argand diagram, its just rotation in the other direction.
Due to this I always think that 'i' means 'independent'.
So if we have x + iy and a + ib, x and b shall interact only if b gets rid 'i'. Similarly for a and y.
AnonymousDecember 23, 2009 at 1:56 AM
In mathematics, the quaternions are a number system that extends the complex numbers. They were first described by Irish mathematician Sir William Rowan Hamilton in 1843 and applied to mechanics in three-dimensional space. A striking feature of quaternions is that the product of two quaternions is noncommutative, meaning that the product of two quaternions depends on which factor is to the left of the multiplication sign and which factor is to the right. Hamilton defined a quaternion as the quotient of two directed lines in a three-dimensional space or equivalently as the quotient of two vectors. Quaternions can also be represented as the sum of a scalar and a vector. Quaternions find uses in both theoretical and applied mathematics, in particular for calculations involving three-dimensional rotations such as in three-dimensional computer graphics and epipolar geometry, although they have been superseded in many applications by vectors and matrices. I am a college sophomore with a dual major in Physics and Mathematics @ University of California, Santa Barbara. By the way, i came across these excellent physics flash cards. Its also a great initiative by the FunnelBrain team. Amazing!!!
AnonymousAugust 31, 2010 at 6:48 AM
Hi,

very nice article, thank you for sharing your insights :)

There is one thing I fail to understand in the section "From Complex Multiplication to Vector Products": why does A*B equal abe^((beta-alpha)i) rather than abe^((alpha+beta)i)? As far as I understood, the product of two complex numbers A and B is given by the rotation of A (resp. B) by the degree which B (resp. A) forms with the real line (plus a stretching of its modulus of course), so the angle of A*B should be the sum, rather than the difference, of the two angles... Am I missing something?

Anyway, thanks again for sharing your intuitions, I just ordered the book you recommend!

Andrew
Eli LanseyAugust 31, 2010 at 7:17 AM
Hi Andrew,
Thanks for your comment.
I intended the '*' to represent complex conjugate, not multiplication. But, you're exactly right otherwise. Except, because of the conjugate, the angle of A represents rotation in the opposite direction, thus beta + (-alpha).
Hope that clears things up.
AnonymousAugust 31, 2010 at 7:23 AM
Oh, sure! That explains everything, thank you! I should have been more careful and interpret the "*" operator the way you defined it above ;) Thanks a lot and sorry for bothering.

Andrew
AdamJune 29, 2011 at 6:48 PM
Hi, I have read through your posts. Please do NOT stop writing if you have any more ideas..
These articles are very useful for an undergradute, ( or anyone who wants good examples for "definition is not equal explanation").

Adam from Hungary

Behind the Guesses

Pages

Monday, November 30, 2009

Quaternions -- Part 1: How many?

7 comments:

About Me

Subscription Options

Other sites to check out

Followers

Blog Archive