34

Loomis famously wrote hundreds of proofs of Pythagoras' Theorem (reference below), but these are all basically proofs "from below". Today on Twitter @panlepan mentioned Carnot's theorem which has Pythagoras' Theorem as a special case and this got me thinking. Which theorems have Pythagoras' Theorem as a special case?

@BOOK{Loomis1968,
  author =       {Loomis, E.},
  title =        {The {Pythagorean} Proposition},
  publisher =    {National Council of Teachers of Mathematics},
  year =         {1968},
  address =      {Washington},
}
  • Carnot's theorem is in fact equivalent to Pythagoras' theorem. We don't usually call a pair of theorem like this "one is the special case of the other". – Zerox May 05 '22 at 11:54
  • 8
    Moreover, saying that Pithagoras is a special case of Theorem X means that it is not used (not even indirectly) in the proof of $X$. – Francesco Polizzi May 05 '22 at 12:31
  • 4
    @FrancescoPolizzi Why? To me it just means that the statement of Theorem X becomes Pithagoras theorem's statement if you strengthen the assumptions. – Alessandro Della Corte May 05 '22 at 12:44
  • 3
    @AlessandroDellaCorte: It is a matter of definitions. What is a "special case"? Maybe you are right and yours is the common meaning. However, when I think of Theorem Y as a "special case" of Theorem X, I imagine that I first prove X and then, specializing the assumptions, I get Y. But If I strongly used Y in order to prove X, it seems strange to me to call it a "special case". In fact, in this situation X and Y are equivalent statements. – Francesco Polizzi May 05 '22 at 12:52
  • @FrancescoPolizzi Your definition makes sense, and when I do research I tend to think that way as well. However, it seems to me rather subjective, in a way. At least, it depends on the proof strategy you choose. See my (very obvious) answer below: of course you can prove that relying on PT, but you can also prove it just using the axioms of an inner product space. Which one counts? – Alessandro Della Corte May 05 '22 at 12:56
  • 7
    I think one has to be careful describing statements as equivalent—of course, any two true (and any two false!) statements are equivalent. One often "knows what it means" to say that two statements are equivalent, or that one implies the other, but in the pure logical sense I think that there's no rigorous ground for distinguishing these two cases. (Of course more interesting is, perhaps like @Dirk's answer, a property of a class of spaces that may or may not hold, and that holds exactly when Pythagoras does … in whatever sense that should be interpreted.) – LSpice May 05 '22 at 13:42
  • 3
    I assumed by Carnot's theorem you meant the thermodynamical one and was interested to see how that implied Pythagoras' theorem. I can see how such a thing might happen, but it turns out there is a different Carnot's theorem, named after the father of the thermo guy. – Harry Wilson May 05 '22 at 14:02
  • 1
    @HarryWilson The reversible case corresponds to $\alpha=\frac{\pi}{2}$, of course ;) – Alessandro Della Corte May 05 '22 at 15:33
  • Do you mean other than the Euclidian distance metric? – RBarryYoung May 06 '22 at 15:38
  • 5
    @FrancescoPolizzi I definitely do not interpret the term "special case" in the way you suggest. In my book, whenever T generalizes S, S is a special case of T. On the other hand, if someone says that S is a corollary of T, then it does suggest to me that S is not used in the proof of T. – Timothy Chow May 07 '22 at 00:02
  • @TimothyChow: as I said, it is a matter of definitions. What is a "generalization"? Personally, if S and T turn out to be equivalent (i.e., if S implies T and conversely), I prefer not to call S a "special case", even if it is apparently so. But I agree that this is open to debate. – Francesco Polizzi May 07 '22 at 06:28
  • I interpret "Corollary" as a (more or less) immediate consequence of T, non necessarily obtained by specializing the assumptions. For instance, the Brauer Fixed Point Theorem is a corollary of the non-existence of a retraction of $\mathbb{D}^{n+1}$ to $S^{n}$, but it is not a "special case" of it. – Francesco Polizzi May 07 '22 at 06:39
  • 2
    @FrancescoPolizzi I'm just reporting how I use the terms in practice and, by extension, how I have seen others use the term in practice. I might say, "Let's see what Theorem 1 says in the special case that n = 2." What would you say instead? Incidentally, I don't think I have ever seen the words "special case," "generalization," or "corollary" defined formally in any mathematical research paper or monograph. – Timothy Chow May 07 '22 at 12:12
  • @TimothyChow; Yes, I understand your point. Well, if in a lecture I use the "special case" S in order to prove T, I just make a remark about this ("please note that the case $n=1$ was actually used in the proof of the general case" or something similar). – Francesco Polizzi May 08 '22 at 07:42
  • @LSpice Normally the way it works is that one says two statements are equivalent in the presence of given assumptions $A_1, \ldots, A_n$; in other words, one proves a statement of the form $(A_1 \wedge \ldots \wedge A_n) \Rightarrow (B \Leftrightarrow C)$. There's nothing very tricky about this -- all TFAE theorems are of this type. Similarly, a careful writer will write "in ZF, the axiom of choice and the well-ordering principle are equivalent". – Todd Trimble May 13 '22 at 21:07
  • 1

16 Answers16

34

The Law of cosines is the first that comes to my mind:

$$c^{2}=a^{2}+b^{2}-2ab\cos \gamma$$

enter image description here
(source: Wikipedia)

If $\gamma$ is a right angle, its cosine is 0 and all that remains is Pythagoras' Theorem.

Glorfindel
  • 2,743
29

Parseval identities in the theory of Fourier series and integrals.

22

The parallelogram law says that if $\mathcal{V}$ is an inner product space and $\mathbf{v},\mathbf{w} \in \mathcal{V}$, then $$ 2\|\mathbf{v}\|^2 + 2\|\mathbf{w}\|^2 = \|\mathbf{v} + \mathbf{w}\|^2 + \|\mathbf{v} - \mathbf{w}\|^2, $$ where the norm here is the one induced by the inner product (in fact, a result of John von Neumann says that a norm is induced by an inner product if and only if the above parallelogram law holds). Its name comes from the fact that, if $\mathcal{V} = \mathbb{R}^2$ then the equality above simply tells us something about the sides lengths of a parallelogram compared to the lengths of its diagonals.

In the special case when $\langle \mathbf{v},\mathbf{w}\rangle = 0$, we have $\|\mathbf{v} + \mathbf{w}\|^2 = \|\mathbf{v} - \mathbf{w}\|^2$, so the parallelogram law simplifies to $$ \|\mathbf{v}\|^2 + \|\mathbf{w}\|^2 = \|\mathbf{v} + \mathbf{w}\|^2, $$ which is the Pythagorean theorem (if $\mathcal{V} = \mathbb{R}^2$).

22

The Pythagorean theorem is a limit of the general formula for spherical or hyperbolic space: $\cos(a\sqrt{\kappa})\cos(b\sqrt{\kappa})=\cos(c\sqrt{\kappa})$, where $\kappa$ is the curvature.

20

So far no one has mentioned the original generalization!

Early in Euclid's Elements, the Pythagorean theorem is stated by comparing square areas:

Book I, Proposition 47: In right-angled triangles the square on the side opposite the right angle equals the sum of the squares on the sides containing the right angle.

enter image description here

Later on, the Elements generalizes this far beyond squares. First it defines similar figures, meaning any similar figures with straight line segments:

Book VI, Definition 1: Similar rectilinear figures are such as have their angles severally equal and the sides about the equal angles proportional.

So the generalization that appears in the Elements is the Pythageorean theorem for any similar figures, as known some 2500 years ago:

Book VI, Proposition 31: In right-angled triangles the figure on the side opposite the right angle equals the sum of the similar and similarly described figures on the sides containing the right angle.

enter image description here

16

This is probably the simplest: $(a+b)^2=a^2+b^2+2ab$, if you take $a,b$ elements of some inner product vector space and $(\cdot)^2$ means inner product with itself.

  • Same spirit as Nathaniel's answer – wlad May 08 '22 at 08:31
  • @wlad yes, although not exactly the same: there are inner spaces in which the parallelogram law doesn’t hold. I don’t remember which answer was posted first, btw, but below there is an answer on the law if cosine which is closer to this one. Also there, however, I’d say that there is some difference, as this does not rely on Euclidean geometry and can be deduced only by the axioms verified by any inner product vector space. – Alessandro Della Corte May 08 '22 at 09:13
13

Pythagoras' theorem is a special case of the three point identity for Bregman distances: Let $h$ be convex and lower semi-continuous on a Banach space - further assume differentiability of $h$ for simplicity. Then $h$ induces a Bregman distance $$D_h(u,v) = h(u)-h(v) - \langle \nabla h(v), u-v\rangle $$ and this Bregman distance fulfills the mentioned three point equality $$D_h(w,v) + D_h(v,u) + \langle \nabla h(u) - \nabla h(v),v-w\rangle = D_h(w,u).$$ The special case appears for $h(u) = \tfrac12\|u\|^2$ on a Hilbert space $X$ (or on $X = \mathbb{R}^2$). Here we get $\nabla h(u) = u$ and $D_h(u,v) = \tfrac12\|x-y\|^2$ and you get Pythagoras.

Dirk
  • 12,325
  • I think you have some terms on the wrong side of the equations? Looking at my own proof and e.g. https://math.stackexchange.com/questions/3668286/amaris-pythagorean-theorem or https://math.stackexchange.com/questions/2766399/what-is-the-intuition-behind-the-bregman-divergence-of-three-points – usul May 07 '22 at 05:03
  • More specifically, I believe the "generic" Pythagorean special case is that when the vector $w-v$ is orthogonal to $\nabla h(u) - \nabla h(v)$, we should get $D_h(w,v)+D_h(v,u)=D_h(w,u)$. Choosing $h(u) = \tfrac{1}{2}|u|^2$, and in particular choosing $v = $ the origin, gets us to the most basic Pythagoras as the condition collapses to $w \perp u$. – usul May 07 '22 at 05:18
11

One of the most attractive generalisations is de Gua's theorem: If a tetrahedron $ABCD$ is rectangular at $A$, then $$ |BCD|^2=|ABC|^2+|ACD|^2+|ABD|^2$$ where the absolute value signs denote area.

This can easily be proved analytically by the $p,q$ method. One assumes,as one can, that the vertices are $(0,0,0)$, $(1,0,0)$, $(0,q,0)$ and $(0,0,t)$--the calculations are then done in a matter of minutes.

Given the objection below, here is the hierarchy of results: The square of the measure of a measurable set in $m$-dimensional affine subspace of euclidean $n$-space is the sum of the squares of the measures of the $n \choose m$ orthogonal projections on all the $m$-dimensional coordinate subspaces of $n$-space with respect to a fixed ONB basis of the latter (Conant and Beyer).

segunda
  • 11
  • 3
10

$$1=\cos^2 x+\sin^2x$$ (which can be proven without using Pythagoras) holds for arbitrary $x$ in $\mathbb C$ and yields Pythagoras for real $x$.

Roland Bacher
  • 17,432
  • 1
    This is because in $a^2+b^2=c^2$ you divide both sides by $c^2$ ? – BCLC May 06 '22 at 16:53
  • Wait ...is pythagoras' theorem perhaps equivalent to this? – BCLC May 08 '22 at 10:05
  • 1
    @BCLC yes I think so because this is the special case when triangle's hypotenuse=1, you just scale up the triangle by whatever hypotenuse length you'd like after that. – Pineapple Fish May 08 '22 at 16:34
  • 2
    I don't want to turn this into an answer, but it's also a special case of the angle sum formulas from trigonometry. Special case of angle sum formulas (derivable visually and using linear algebra): $\cos(x+y)=\cos(x)\cos(y)-\sin(x)\sin(y)$. now just set $y=-x$ to get the above identity in this answer and you're done, since $\sin^2(x)+\cos^2(x)=1$. – Pineapple Fish May 08 '22 at 16:39
  • And the Pythagorean identity is itself a special case of the generalized half-angle formula: $$ad\sin^2{\frac{\alpha}{2}}+bc\cos^2{\frac{\gamma}{2}}=(s-a)(s-d).$$ – Emmanuel José García Jul 24 '22 at 16:32
10

I like to think about Pythagoras theorem as a corollary/special case of the following theorem:

Theorem: Let $X$ be a finite dimensional real Banach space such that the group of linear isometries (that is, isometries which fix $0$) of $X$ acts transitively on the unit sphere of $X$. Then $X$ is a Hilbert space.

That is, having enough rotations forces the norm to come from an inner product, and from this Pythagoras theorem follows

user49822
  • 2,033
9

The Binet-Cauchy formula says that if $A$ and $B$ are a $n\times m$ and $B$ a $m\times n$ real matrices, respectively, and for $s\subseteq\{1,\ldots ,m\}$ with $|s|=n$ we denote by $A_s$ the $n\times n$ submatrix of $A$ obtained deleting the columns not in $s$, and by $B^s$ the $n\times n$ submatrix of $B$ obtained deleting the rows not in $s$, then

$$\det(AB)=\sum_{\begin{array}{c} s\subseteq\{1,\ldots ,m\}\\ |s=n|\end{array}} \det(A_s)\det(B^s)$$

In the case $B = A^T$, the transpose of $A$, since $B_s = (A^T)_s=(A_s)^T$ the formula gives

$$\det(AA^T) = \sum_{\begin{array}{c} s\subseteq\{1,\ldots ,m\}\\ |s=n|\end{array}} \det(A_s)\det((A_s)^T)=\sum_{\begin{array}{c} s\subseteq\{1,\ldots ,m\}\\ |s=n|\end{array}} (\det(A_s))^2$$

Since the parallelotope in $\mathbb{R}^n$ generated by the $n$ row vectors of $A$ has measure $\sqrt{\det(AA^T)}$, then the formula says that the square of the $n$-dimensional measure of an $n$-dimensional parallelotope equals the sum of the squares of the measures of its projections onto all possible $n$-dimensional coordinate hyperplanes. If $n = 1$ this reduces to the Pythagorean theorem.

user6530
  • 101
8

The discrete form of the parallel axes theorem for the second moment of area for $\,n\,$ points $\,A_k\,$ with centroid $\,G\,$ and an arbitrary point $\,P\,$ is $\,\sum_{k=1}^n PA_k^2 = n \cdot PG^2 + \sum_{k=1}^n GA_k^2 \;\;(*)\,$.

Let $\,\triangle ABC\,$ be a right triangle and $\,O\,$ the midpoint of hypotenuse $\,BC\,$, known to also be its circumcenter, so $\,OA = OB = OC = \frac{1}{2} BC\,$. Then$\,(*)\,$ for $\,A_1A_2A_3 \equiv BOC\,$, $\,G\equiv O\,$ and $\,P \equiv A\,$ reduces to Pythagoras' theorem:

$$ \require{cancel} \begin{align} AB^2+AO^2+AC^2 = 3 \, AO^2 + OB^2+\bcancel{0}+OC^2 \iff AB^2+AC^2 &=4 AO^2 = BC^2 \end{align} $$

dxiv
  • 416
6

In a less “higher” maths fashion : This Numberphile video somewhat says that Pythagoras theorem is a special case of Ptolemy’s theorem which is a more general view of properties of a cyclic quadrilateral. But navigating between the 2 always seems some sort of tautology to me …

There is also Casey’s theorem which reduces to Ptolemy’s (so which could reduce to Pythagoras’).

Quoting the Ptolemy’s theorem Wikipedia article :

More generally, if the quadrilateral is a rectangle with sides $a$ and $b$ and diagonal $d$ then Ptolemy's theorem reduces to the Pythagorean theorem. In this case the center of the circle coincides with the point of intersection of the diagonals. The product of the diagonals is then $d^2$, the right hand side of Ptolemy's relation is the sum $a^2 + b^2$.

Later on, it is cited as a corollary theorem.

Somos
  • 2,464
4

There’s an “n-dimensional Pythagorean theorem” https://billcookmath.com/papers/2012-06_nD_pythag.pdf, saying that the square of the $k$-dimensional area of a $k$-dimensional parallelogram $P$ in $n$-space is equal to the sum of the squares of the $k$-dimensional areas of the projections of $P$ onto all $k$-dimensional planes spanned by the coordinate axes.

This theorem is in particular relevant when introducing differential forms; I learned about it from this MSE question.

D.R.
  • 741
2

There is a generalisation of Pythagoras to the case of a non-right angled triangle. Inscribe an isosceles triangle $\triangle ADE$ in $\triangle ABC$ so that $[DE]\subseteq[BC]$ and $|\angle BAC|=|\angle ADC|=|\angle AEB|$. Set $r=|BE|$ and $s=|DC|$. (See a picture here.) Then by observing that $\triangle ABC$, $\triangle DAC$ and $\triangle EBA$ are similar, we get $\frac{a}{b}=\frac{b}{s}$ and $\frac{a}{c}=\frac{c}{r}$, which yields $$b^2+c^2=a(r+s).$$

If $\angle BAC$ is acute, then $[BE]$ and $[DC]$ overlap, so that $r+s>a$, while if $\angle BAC$ is obtuse we have $r+s<a$. And of course if $\angle BAC$ is a right angle, we get $r+s=a$ and recover Pythagoras' Theorem.

Wikipedia attributes this to Thābit ibn Qurra.

1

The law of total variance says that if $X,Y$ are real-valued random variables and $\operatorname E(X^2)<+\infty,$ then $$ \operatorname{var}(X) = \overbrace{\operatorname{var}(\operatorname E(X\mid Y))}^{\begin{smallmatrix} \text{explained} \\ \text{component} \\ \text{of the variance} \end{smallmatrix}} {} + {} \overbrace{ \operatorname E(\operatorname{var}(X\mid Y)) }^{\begin{smallmatrix} \text{unexplained} \\ \text{component} \\ \text{of the variance} \end{smallmatrix}} $$

A more concrete version is that in the analysis of variance, the sum of squares of residuals ("residuals" are not to be confused with "errors") plus the sum of squares due to regression equals the total corrected sum of squares (that last being the sum of squares of deviations from the sample mean).

Most statistics texts tell you that the proportion of the total variance that is "explained" is the square of the correlation between $X$ and $Y.$ (For that you need to assume $Y$ also has a finite second moment. But the validity of the identity above does not depend on $Y$ being real-valued at all, and in cases where $Y$ takes values in a set with no structure or in $\mathbb R^n$ or something else, it is still standard to call the explained proportion of the total variance $\text{“}R^2\text{”},$ even though in such cases there is no quantity called $R.$) Some textbooks go on to say that that is also the square of the cosine of an angle (one of the angles of the right triangle that is involved), and then deduce certain bounds on the correlation between $X$ and $Z$ given those between $X$ and $Y$ and between $Y$ and $Z.$

A further generalization is Brillinger's law of total cumulance, of which the following is the case $n=4{:}$

\begin{align} \text{joint cumulant} = {} & \kappa(X_1,X_2,X_3,X_4) \\[8pt] = {} & \kappa(\kappa(X_1,X_2,X_3,X_4\mid Y)) \\[6pt] & {} + {} \underbrace{ \kappa(\kappa(X_1,X_2,X_3\mid Y), \kappa(X_4\mid Y)) + \cdots}_\text{4 terms} \\[2pt] & {} + {} \underbrace{ \kappa(\kappa(X_1,X_2\mid Y),\kappa(X_3,X_4\mid Y)) + \cdots }_\text{3 terms} \\[6pt] & {} + {} \underbrace{\kappa(\kappa(X_1,X_2\mid Y),\kappa(X_3\mid Y), \kappa(X_4\mid Y)) + \cdots}_\text{6 terms} \\[6pt] & {} + \kappa(\kappa(X_1\mid Y),\kappa(X_2\mid Y), \kappa(X_3\mid Y), \kappa(X_4\mid Y)) \end{align} where the sum is over the set of all partitions of the set $\{X_1,\ldots,X_n\}$ of random variables.

Michael Hardy
  • 11,922
  • 11
  • 81
  • 119