Add post on quaternions

2020-11-09 20:31:37 +01:00 · 2020-11-09 20:31:37 +01:00 · bac177d458
commit bac177d458
parent a80c305a6f
2 changed files with 300 additions and 0 deletions
--- a/bib/bibliography.bib
+++ b/bib/bibliography.bib
@ -288,3 +288,72 @@
  address =	 {Cambridge, Massachusetts},
  isbn =	 9780262039420,
 }
+
+@Book{stillwell2010_mathem_its_histor,
+  author =	 {John Stillwell},
+  title =	 {Mathematics and Its History},
+  year =	 2010,
+  publisher =	 {Springer},
+  url =		 {https://doi.org/10.1007/978-1-4419-6053-5},
+  DATE_ADDED =	 {Fri Nov 6 14:39:47 2020},
+  doi =		 {10.1007/978-1-4419-6053-5},
+  isbn =	 9781441960528,
+  series =	 {Undergraduate Texts in Mathematics},
+}
+
+@article{sola2017_quater_kinem_error_state_kalman_filter,
+  author =	 {Sol{\`a}, Joan},
+  title =	 {Quaternion Kinematics for the Error-State Kalman
+                  Filter},
+  journal =	 {CoRR},
+  year =	 {2017},
+  url =		 {http://arxiv.org/abs/1711.02508v1},
+  abstract =	 {This article is an exhaustive revision of concepts
+                  and formulas related to quaternions and rotations in
+                  3D space, and their proper use in estimation engines
+                  such as the error-state Kalman filter.  The paper
+                  includes an in-depth study of the rotation group and
+                  its Lie structure, with formulations using both
+                  quaternions and rotation matrices. It makes special
+                  attention in the definition of rotation
+                  perturbations, derivatives and integrals. It
+                  provides numerous intuitions and geometrical
+                  interpretations to help the reader grasp the inner
+                  mechanisms of 3D rotation.  The whole material is
+                  used to devise precise formulations for error-state
+                  Kalman filters suited for real applications using
+                  integration of signals from an inertial measurement
+                  unit (IMU).},
+  archivePrefix ={arXiv},
+  eprint =	 {1711.02508},
+  primaryClass = {cs.RO},
+}
+
+@article{welchIntroductionKalmanFilter2006,
+  title = {An {{Introduction}} to the {{Kalman Filter}}},
+  volume = {7},
+  issn = {10069313},
+  url = {http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.79.6578&rep=rep1&type=pdf},
+  doi = {10.1.1.117.6808},
+  abstract = {In 1960, R.E. Kalman published his famous paper describing a recursive solution to the discrete-data linear filtering problem. Since that time, due in large part to advances in digital computing, the Kalman filter has been the subject of extensive research and application, particularly in the area of autonomous or assisted navigation. The Kalman filter is a set of mathematical equations that provides an efficient computational (recursive) means to estimate the state of a process, in a way that minimizes the mean of the squared error. The filter is very powerful in several aspects: it supports estimations of past, present, and even future states, and it can do so even when the precise nature of the modeled system is unknown. The purpose of this paper is to provide a practical introduction to the discrete Kalman filter. This introduction includes a description and some discussion of the basic discrete Kalman filter, a derivation, description and some discussion of the extended Kalman filter, and a relatively simple (tangible) example with real numbers \& results.},
+  number = {1},
+  journaltitle = {In Practice},
+  date = {2006},
+  pages = {1--16},
+  author = {Welch, Greg and Bishop, Gary},
+  file = {/home/dimitri/Nextcloud/Zotero/storage/LJ7QQCXF/Bishop, Welch - Unknown - An Introduction to the Kalman Filter.pdf},
+  eprinttype = {pmid},
+  eprint = {20578276}
+}
+
+@inproceedings{joldes2020_algor_manip_quater_float_point_arith,
+  author =	 {M. {Jolde{\c{s}}} and J. -M. {Muller}},
+  title =	 {Algorithms for Manipulating Quaternions in
+                  Floating-Point Arithmetic},
+  booktitle =	 {2020 {IEEE} 27th Symposium on Computer Arithmetic
+                  {(ARITH)}},
+  year =	 2020,
+  pages =	 {48-55},
+  doi =		 {10.1109/ARITH48897.2020.00016},
+  url =		 {https://doi.org/10.1109/ARITH48897.2020.00016},
+}
--- a/posts/quaternions.org
+++ b/posts/quaternions.org
@ -0,0 +1,231 @@
+---
+title: "Quaternions: what are they good for?"
+date: 2020-11-09
+toc: false
+---
+
+* A bit of history
+
+Quaternions come from the quest to find more numbers. A bunch of
+mathematicians from the 19th century were so impressed by the
+unreasonable effectiveness of complex numbers that they wondered
+whether the trick could be extended to other structures, notably other
+$n$-tuples of real numbers.
+
+As it turns out, this only works for some values of $n$, namely 2, 4,
+and 8. There are profound reasons for this, with connections to
+various areas of mathematics.[fn:: {-} If you want to learn more about
+the historical background of hypercomplex numbers and their
+properties, I cannot recommend Stillwell's /Mathematics and Its
+History/ enough citep:stillwell2010_mathem_its_histor.]
+
+Here is a quick recap, in case you'd like a quick overview of all the
+stuff that could reasonably be considered "numbers".
+
+/Complex numbers/ are of the form $z = a + ib$, with $a$ and $b$ real
+numbers, and $i$ (the /imaginary unit/ defined such that $i^2 =
+-1$. They form a /field/, and have all the nice properties that we can
+expect of well-behaved numbers, such as:
+- Associativity: $(xy)z = x(yz)$
+- Commutativity: $xy = yx$
+for all complex numbers $x$, $y$, and $z$.
+
+/Quaternions/ are an extension of complex numbers, but in /four/
+dimensions instead of just two. They are of the form $\mathbf{q} = a +
+ib + jc + kd$, where $i$, $j$, and $k$ the imaginary units. They
+follow a bunch of rules, which are $i^2 = j^2 = k^2 = ijk = -1$, and
+\[ ij = -ji = k,\quad jk = -kj = i,\quad \text{and } ki = -ik = j. \]
+
+One thing we can notice straightaway: quaternions are /not
+commutative!/ That is, in general, $\mathbf{q_1} \mathbf{q_2} \neq
+\mathbf{q_2} \mathbf{q_1}$. However, like the complex numbers, they
+are associative.
+
+Finally, /octonions/ are the last members of the club. They can be
+described with 8 real components. Their imaginary units behave
+similarly to quaternions imaginary units, with more complicated
+rules. And furthermore, they are neither commutative nor even
+associative! (At this point, one wonders whether they deserve the
+title of number at all.)
+
+And /that's it!/ There is something really strange happening here: we
+can define something looking like numbers only for dimensions 2, 4,
+and 8. Not for 3, not for 6, and not for 17. Moreover, we're losing
+important properties along the way. Starting from the full, complete,
+/perfect/ structure of the complex numbers[fn:complex], we gradually
+lose the things that make working with numbers easy and intuitive.
+
+[fn:complex] Yes, as many authors have pointed out, complex numbers
+are actually the most "complete" numbers. They have all the
+interesting properties, and fill in the gaps where so-called "real"
+numbers are failing. You can define any polynomial you want with
+complex coefficients, they will always have the correct number of
+roots in the complex numbers.
+
+
+* Why are quaternions interesting?
+
+So we can build these kinds of 4- or 8-dimensional numbers. That is
+fascinating in a way, even if only to answer the more philosophical
+question of what numbers are, and what their properties mean. But as
+it turns out, quaternions also have direct applications, notably in
+Physics.
+
+Indeed, quaternions are extremely useful for representing /rotations/.
+There are many different ways to represent rotations:
+- [[https://en.wikipedia.org/wiki/Euler_angles][Euler angles]] are arguably the most intuitive: it's just three angles
+  representing yaw, pitch, and roll.
+- [[https://en.wikipedia.org/wiki/Rotation_matrix][Rotation matrices]] are the most natural from a mathematical point of
+  view. Rotations are invertible linear transformation in 3D space,
+  after all, so they should be represented by matrices in
+  $\mathrm{GL}_3(\mathbb{R})$.
+
+However, both of these representations suffer from serious
+drawbacks. Euler angles are nice for intuitively understanding what's
+going on, and for visualisation. However, even simple operations like
+applying a rotation to a vector, or composing two rotations, quickly
+lead to a lot of messy and unnatural computations. As soon as
+rotations become larger that $\pi$, you get positions that are
+ill-defined (and you have to implement a lot of wraparound in
+$[-\pi,\pi]$ or $[0,2\pi]$).
+
+Rotation matrices are more straightforward. Composing two rotations
+and rotating a vector is matrix-matrix or matrix-vector
+multiplication. However, a $3\times 3$ matrix contains 9 scalar elements, just
+to represent an object that is intrinsically 3-dimensional. Because of
+this, computations can become costly, or even unstable in
+floating-point representations.
+
+The quaternion is therefore an elegant compromise in space
+requirements (4 elements) and ease of computations. It is moreover
+easy and numerically stable to get rotation matrices from quaternions
+and vice-versa.
+
+From a numerical stability point of view, it is easier to deal with
+computation errors in the case of quaternions: a renormalized
+quaternion is always a valid rotation, while it is difficult to
+correct a matrix that is not orthogonal any more due to rounding
+errors.
+
+* Unit quaternions and rotations
+
+Rotations are represented by unit quaternions, i.e. quaternions of
+norm 1:[fn:proofs]
+
+\[ \mathbf{q} = \exp\left((u_x i + u_y j + u_z k) \frac{\theta}{2}\right). \]
+
+[fn:proofs] {-} For a complete derivation of quaternion properties and
+rotation representation, see
+cite:sola2017_quater_kinem_error_state_kalman_filter.
+
+
+This represents a rotation of angle $\theta$ around the vector
+$\mathbf{u} = [u_x\; u_y\; u_z]^T$. The unit quaternion $\mathbf{q}$
+can also be written as
+
+\[ \mathbf{q} = \cos\left(\frac{\theta}{2}\right) + \mathbf{u}\sin\left(\frac{\theta}{2}\right). \]
+
+Composition of rotations is simply the quaternion multiplication. To
+orient a vector $\mathbf{x}$ by applying a rotation, we conjugate it
+by our quaternion $\mathbf{q}$:
+
+\[ \mathbf{x'} = \mathbf{q} \mathbf{x} \mathbf{q'}, \]
+
+where all products are quaternion multiplications, and the vectors
+$\mathbf{x}$ and $\mathbf{x'}$ are represented as /pure quaternions/,
+i.e. quaternions whose real part is zero:
+
+\[ \mathbf{x} = x_1 i + x_2 j + x_3 k. \]
+
+The fact that quaternions are not commutative also corresponds
+directly to the fact that rotations themselves are not commutative.
+
+* Quaternion conventions
+
+People are using various quaternions conventions, depending on their
+choice of multiplication formula ($ij = -k$ or $ij = k$) and on their
+choice of representation (real part first or real part last). The case
+$ij = k$ and real part first used in this article is called the
+/Hamilton convention/, whereas the convention where $ij = -k$ and the
+real part is last is called the /JPL convention/.[fn:conventions]
+
+[fn:conventions] As the name suggest, the JPL convention is used
+mostly by NASA's [[https://www.jpl.nasa.gov/][Jet Propulsion Laboratory]], and by extension in
+aerospace applications. The Hamilton convention in more frequent in
+robotics and in the state estimation literature, such as Kalman
+filtering citep:sola2017_quater_kinem_error_state_kalman_filter.
+
+
+As always, it is important to clearly define ahead of time what
+convention is used in your projects, especially if you're getting
+inspirations from books and articles. Check if they are all using the
+same conventions, or hard-to-debug issues may arise!
+
+[[https://en.wikipedia.org/wiki/Quaternions_and_spatial_rotation#Alternative_conventions][Wikipedia]] and especially
+cite:sola2017_quater_kinem_error_state_kalman_filter contain a very
+useful reference of the various possible conventions, and where they
+are used.
+
+* Applications
+
+Quaternions are often the best choice whenever rotation or attitude
+representations are required. This includes robotics, aerospace
+engineering, 3D graphics, video games, and so on.
+
+They are of particular use in optimal control or state estimation
+scenarios: they are often the representation of choice for the
+attitude of an object in a [[https://en.wikipedia.org/wiki/Kalman_filter][Kalman filter]] for instance.[fn:kalman]
+
+[fn:kalman] {-} For a nice introduction to Kalman filters, see [[https://www.bzarg.com/p/how-a-kalman-filter-works-in-pictures/][this
+blog post]] or the introductory article by
+cite:welchIntroductionKalmanFilter2006.
+
+
+* Software and libraries
+
+When working with quaternions, it may be tiresome to reimplement all
+the basic functions you might need (composition, conjugation,
+conversions to and from rotation matrices and [[https://en.wikipedia.org/wiki/Conversion_between_quaternions_and_Euler_angles][Euler angles]], and so
+on).[fn:algos] Thankfully, quaternions are a very standard part of engineers
+toolboxes, so many libraries were written for a variety of scientific
+programming languages.
+
+[fn:algos] {-} For an overview of efficient floating-point algorithms
+for manipulating quaternions, see
+cite:joldes2020_algor_manip_quater_float_point_arith.
+
+
+For [[https://julialang.org/][Julia]] (easily the best programming language for this kind of
+application in my opinion):
+- [[https://github.com/JuliaGeometry/Quaternions.jl][Quaternions.jl]], for basic quaternion representation and
+  manipulation,
+- [[https://github.com/JuliaGeometry/Rotations.jl][Rotations.jl]], for representing rotations and operating on them more
+  generally,
+- [[https://github.com/JuliaGeometry/CoordinateTransformations.jl][CoordinateTransformations.jl]], for a general framework not limited to
+  rotations.
+
+In Python:
+- [[https://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.transform.Rotation.html][scipy.spatial.transform.Rotation]], which has the benefit of being
+  included in SciPy directly,
+- [[https://quaternion.readthedocs.io/en/latest/][numpy-quaternion]], for a more feature-complete implementation.
+
+And if you have to work in Matlab, unfortunately the functionality is
+locked away in the Robotics System Toolbox, or in the Aerospace
+Toolbox. Use open source software if you can!
+
+* The structure of the group of unit quaternions
+
+As it turns out, quaternions are even more interesting than
+expected. Not satisfied with representing rotations efficiently, they
+also have the structure of a [[https://en.wikipedia.org/wiki/Lie_group][Lie group]]. A Lie group is a structure
+that combines the properties of a group and of a differentiable
+manifold, thus allowing to compute derivatives, and solve differential
+equations, of functions taking quaternion values.[fn::{-} Rotation
+matrices also have a Lie group structure.]
+
+This is obviously extremely interesting when studying dynamical
+systems, as these are often modelled as systems of differential
+equations. Having a way to define rigorously derivatives and
+uncertainties on quaternions is a very significant result.
+
+* References