16

Let's suppose I have a spacetime manifold $M$. Let $p$ be a point on my manifold. Now I move from $p$ to some other point $p'$. Presumably I should have moved some "distance" right? How can I speak of notions of space and time if I have no conception of distance?

But now consider light moving through spacetime. Suppose my light starts at $p = (0,0,0,0)$ and travels to $p' = (1,1,0,0)$. By the definition of the spacetime interval $ds^2 = dt^2 - dx^2 - dy^2 - dz^2$, this should mean $ds^2 = (1)^2 - (1)^2 - 0 - 0 = 0$. So $ds^2=0$.

Yet I have moved from point $p$ to $p'$. So I clearly have moved along some path along the curve, but the length of this path is zero. Shouldn't that mean $p$ and $p'$ are the same point?

Note: I think I may be suffering from an overly Euclidean mindset and my brain hasn't adapted yet enough to the non-Euclidean logic of semi-Riemannian manifolds.

My Question

Can someone resolve this contradiction?

4 Answers4

14

Let's separate out some definitions:

metric(1): Given a set $X$, a function $d : X \times X \to \mathbb{R}$ such that the following axioms hold for all $x,y,z \in X$:

  • $d(x,y) \geq 0$,
  • $d(x,y) = 0 \Leftrightarrow x = y$,
  • $d(x,y) = d(y,x)$, and
  • $d(x,z) \leq d(x,y) + d(y,z)$.

pseudo-metric(1): Given a set $X$, a function $d : X \times X \to \mathbb{R}$ such that the following axioms hold for all $x,y,z \in X$:

  • $d(x,x) = 0$,
  • $d(x,y) = d(y,x)$, and
  • $d(x,z) \leq d(x,y) + d(y,z)$.

metric(2): (aka "inner product") Given a vector space $V$ over a field $F$, which is either $\mathbb{R}$ or $\mathbb{C}$, a function $g : V \times V \to F$ such that the following axioms hold for all $x,y,z \in V$ and $a \in F$:

  • $g(x,y) = \overline{g(y,x)}$;
  • $g(ax,y) = a g(x,y)$,
  • $g(x+y,z) = g(x,z) + g(y,z)$,
  • $g(x,x) \geq 0$, and
  • $g(x,x) = 0 \rightarrow x = 0$.

pseudo-metric(2): (aka "pseudo inner product") Given a vector space $V$ over a field $F$, which is either $\mathbb{R}$ or $\mathbb{C}$, a function $g : V \times V \to F$ such that the following axioms hold for all $x,y,z \in V$ and $a \in F$:

  • $g(x,y) = \overline{g(y,x)}$;
  • $g(ax,y) = a g(x,y)$,
  • $g(x+y,z) = g(x,z) + g(y,z)$, and
  • $\exists\ v \in V : g(x,v) \neq 0$.

Now you want to define a distance between points on a manifold. You are intuitively looking for a (pseudo-)metric(1) here, a distance function on a set without any extra structure. The problem is all you are given is a (pseudo-)metric(2) on the tangent space at each point. Your (pseudo-)metric(2) can only give you magnitudes of tangent vectors at points. Intuitively, these are "infinitesimal distances." You need to integrate such magnitudes along a path in order to get distances between points.

But this is the crux of the issue: What path do you choose? Even for a nice manifold like the surface of a 2-sphere (that is, something with a real metric(2), not just a pseudo-metric(2), on its tangent bundle), the distance between points is path dependent. You could fly directly from New York to London along a great circle (geodesic), or you could stop by in Beijing.

If you have positive-definiteness working for you, you could take the infimum over all paths from one point to another. Consider curves of the form \begin{align} \gamma : [0,1] & \to M \\ \lambda & \mapsto p \\ 0,1 & \mapsto p_1,p_2. \end{align} Then $$ d(p_1,p_2) = \inf_\gamma \int_0^1 \left(g_p \left(\frac{\mathrm{d}p}{\mathrm{d}\lambda}, \frac{\mathrm{d}p}{\mathrm{d}\lambda}\right)\right)^{1/2} \, \mathrm{d}\lambda $$ defines a distance function in the metric(1) sense as long as $g_p$ is an honest metric(2) inner product at each $p$.

Unfortunately, when you try this with a Lorentzian manifold equipped with a pseudo-metric(2), the construction fails to produce anything useful. Even taking an absolute value before the square root, there will always be a piecewise differentiable null path between any two points. Thus there will be differentiable curves of length arbitrarily close to $0$, and so the pseudo-metric(1) you induce is trivial: all distances are $0$.

  • I usually see the last definition as $d(..)=..\int \sqrt{g(..)}$. Everything should work even without the square root, the only thing I am not sure about is the triangle inequality. – Void Mar 04 '15 at 08:15
  • Actually I am now sure that it will violate the triangle inequality - it can be seen in terms of infinitesimals: take the vector $XZ$ representing an infinitesimal path to have a norm $1$. Now you can take the point $Y$ halfway between $X$ and $Z$ and the prescription without the square root will tell you that the path $XY + YZ$ has half the length of $XZ$. – Void Mar 04 '15 at 08:25
  • @Void You're quite right. I just forgot the square root. –  Mar 04 '15 at 17:32
9

I think it might help to think about the spacetime interval $\text{d}s^2$ as a measure of movement in spacetime relative to the speed of light. Let's say that you want to move from a point $p=(0,0,0,0)$ to another point $p'=(t,x,0,0)$. The quantity $\text{d}s^2 = c^2\text{d}t^2-\text{d}x^2$ is then:

  • Positive if $x<ct$, which means that you traversed the distance slower than the speed of light;
  • Zero if $x = ct$, which means that you traversed it exactly at the speed of light;
  • Negative if $x>ct$, which means that you traversed the distance faster than the speed of light.

So with the metric convention that you use, special relativity dictates that any massive particle can only traverse positive spacetime intervals, and any massless particle can only traverse zero spacetime intervals, since all the distances are measured relative to how a photon would move between the points.

jabirali
  • 1,820
  • So are you saying that, if I am at a point $p$ and I move to a point $p'$, that the rules for traveling on the manifold depend if the particle is massive or not? – Stan Shunpike Mar 04 '15 at 02:36
  • Manifolds in general don't come with a notion of "traveling on" them, but yes, massive particles have speeds less than $c$ and massless particles travel at speed $c$ unless you include phenomena like, say, glass. – CR Drost Mar 04 '15 at 02:39
  • 2
    @StanShunpike, what I mean is that according to special relativity, a massless particle can only move from $p$ to $p'$ if $(p'-p)^2=0$, a massive particle can only move from $p$ to $p'$ if $(p'-p)^2>0$, and no particle can move from $p$ to $p'$ if $(p'-p)^2<0$. So the spacetime distance $\text{d}s^2$ between the two points determines which kinds of particles are allowed to move from $p$ to $p'$ and not. – jabirali Mar 04 '15 at 02:50
  • @jabirali Ohhh...I'm still not sure that completely answers my question but +1 because I didn't realize that. So you are saying that massless particles travel exclusively on light-like intervals, whereas massive particles travel exclusively on space-like and no particles (besides hypothetical ones) travel along space-like. Wow! Somehow this seems obvious in retrospect but I never thought about it like that. Cool! – Stan Shunpike Mar 04 '15 at 02:57
  • Specifically, the part that I don't think is answered is....okay, let's say then that the only particles that travel along light-like intervals are massless particles...that still leaves me with my original problem of traveling some distance between two points and the interval being zero. Why doesn't that imply they are the same point? – Stan Shunpike Mar 04 '15 at 02:59
  • Unless I am somehow misunderstanding what the spacetime interval represents? But surely there is some notion of length on the manifold. And if that's not the spacetime interval, what is it? – Stan Shunpike Mar 04 '15 at 02:59
  • Massive particles travel exclusively on time-like intervals, but yeah, sounds like you got it :). I think I found another way to visualize what's going on. Say that you're at the origin, and shoot a photon towards point $x$. At exactly the same time (in your frame of reference), you start running towards point $x$ as well. The spacetime distance $s = \sqrt{c^2t^2-x^2}$ will then be some kind of measure of how far ahead of you the photon is, at the time $t$ when you arrive at point $x$. – jabirali Mar 04 '15 at 03:12
  • As for a measure of length on the interval: you can still use the Euclidean length $L = \sqrt{x^2 + y^2 + z^2}$ to measure distances in relativity as well; but different observers will in general not agree on its value. – jabirali Mar 04 '15 at 03:19
  • @jabirali Because observers disagree about its value, spatial distance is no longer an intrinsic geometric object. Many people would advocate not bothering with it. – Ryan Unger Mar 04 '15 at 03:21
  • But I never meant spatial distance. I meant spacetime distance. Isn't that what a metric space is? It's has a distance function. Ah, but I forgot. The positive definite thing is violated on semi-Riemannian manifolds. But let's suppose we forget about spacelike intervals. Then doesn't the semi-Riemannian manifold form a metric space? Because then it would be $\geq 0$ which is the criteria for a distance function. That's the "distance" I meant. As @0celo7 says, I never bother with the spatial distance because it's not an intrinsic concept.... – Stan Shunpike Mar 04 '15 at 04:12
  • @jabirali I'm also confused now. Am I supposed to think of this manifold like some sort of highway where particles can only travel in certain lanes lol. We are saying massive particles can never travel along trajectories where you move with equal amounts of time and space because that would yield $ds^2 = 0$. But if that's so, doesn't that effectively limit the trajectories the particle can take? I've always sort of imagined particles as having equal footing. They can travel wherever they please. But this sounds more restrictive in that mass affects which trajectories they move on. – Stan Shunpike Mar 04 '15 at 04:15
  • Okay, first, don't listen to jabirali's $s = \sqrt{c^2 t^2 - x^2}$ because the actual distance of how far ahead of you the photon is is simply $c t - x$ and there is little motivation for multiplying by $c t + x$ and taking the square root... Yes, massive particles are fundamentally affected in terms of what trajectories they can do; they can do timelike paths (since they experience time) but not spacelike paths (since two points on a spacelike path have arbitrary time order, as I indicated below). Massless particles only go on lightlike paths, $ds^2 = 0$. The reasons are a bit complicated. – CR Drost Mar 04 '15 at 04:38
  • Also I think that the key here is that the spacetime metric is not a "metric" in the most common mathematical sense. This is obvious because it gives negative numbers for some distances between points in spacetime (what, are we going to say that space is imaginary?). It is merely a number which is preserved under a Lorentz transformation. Now, when we do special relativity we find that for massive particles, [E/c, p] is a 4-vector whose spacetime norm is $m^2 c^2$, and for massless particles we recover $E = p c$, which was already known for light waves before special relativity. – CR Drost Mar 04 '15 at 04:50
  • @0celo7 I didn't say either. Please read jabirali's comment starting "Massive particles travel..." for needed context. – CR Drost Mar 04 '15 at 16:31
  • @0celo7 Sure it is. (Whose notion of simultaneous are we using here? I was using the stationary observer who fires a pulse past you because they actually think that you are at position x, while you always think you're at x=0. The alternative is your co-moving distance, which is $\sqrt{c^2 t^2 - x^2}$ due to length contraction, but then, you don't think it takes you a time $t$ or a distance $x$, which is why it doesn't make sense to use that coordinate system.) – CR Drost Mar 04 '15 at 19:10
5

Yeah, you've not yet adapted. That's OK. Let me take you through it.

In this conventional world of classical physics we have separate notions of distance and time, with the idea that either two events happen at the same time and therefore have an objective distance between them, or two events happen at different times and therefore have an objective time between them. It is always one or the other, not both, not neither: if there is a time gap between two events, then there is some reference frame which sees them both at distance L for any L you'd like; otherwise if the time gap is zero then everyone agrees on the distance between the two events. This allows you to freeze a moment in time and speak of distances.

In relativity, we make things a little more complicated, but also more realistic. It's almost the same story but not quite. Think about, say, a supernova explosion -- what we see as a bright flash in the starry sky would, if you were to "look down upon it", look like an event in spacetime with a horizon of light announcing the event expanding outward at the speed of light. That "expanding bubble" is important. In relativity we call that bubble a "light cone".

Imagine two expanding-at-speed-$c$ bubbles. Topologically, either one bubble is inside the other, or they both intersect on a ring (when they're large enough to intersect at all!), or they both intersect at just one point where they "kiss" each other. Those are your three possibilities: time-separated, space-separated, and null-separated. They correspond to a positive metric, negative metric, and 0 metric. The Lorentz transforms preserve this topology because they preserve the metric in general; the light cones get mapped to other light cones with the same structure relative to each other.

Given time-separated, you still have that two things are not objectively space-separated, as there is a reference frame for a spaceship which passes through both events in spacetime. Given space-separated, you also have that they are not objectively time-separated; that's a little more subtle, but imagine someone on the expanding "ring" where the bubbles "intersect": they see both events simultaneous with their local apparatus. The claim is, given the right velocity, they would draw the distances back to the original events both as L, so that they think that both events were simultaneous, proving that there is no objective time-ordering.

Null-separated is the new beast that you'll have to accept, lying between those. It's objectively "time separated" because up to a point, one light cone is "inside" the other one, so you can say that one comes "before" the other. However, in the limit as you go faster and faster, trying to be at both events at the same time, you literally see no time elapse between the two events. Similarly it's objectively "space separated" but there are reference frames which make the distance between the two events arbitrarily small.

Those are the metric values and what they mean. "Distance" is relative to this new notion of "basically at the same place, basically at the same time" and can go negative for proper distances or positive for proper times.

You can view this, too, if you like, as broadening the notion of "present" up from a plane in classical physics to the space between two light cones. One light cone represents all the events in the "past" which "have been seen" by this spacetime point (light from them has had an opportunity to reach the point); another light code represents all the events in the "future" which "have seen" this spacetime point; the spacelike-separated stuff is all in a "relativistic present" relative to this point; different Lorentz transforms choose different planes through the point as "present planes" but this choice is more arbitrary.

CR Drost
  • 37,682
0

While the others have provided more technical answers, I hope to offer a more straight-forward explanation. The interval $ds^2$ is:

$ds^2 = g_{\mu\nu}dx^\mu dx^\nu = 0$

But this is a sum and it does not imply that the $dx^\mu dx^\nu$ are zero.

  • But zero sum would imply the vectors were zero if $g$ were a positive definite billinear form, which condition would make it a "real" metric (in the mathematician's sense). It's the nontrivial signature that allows nonzero null vectors. – Selene Routley Jun 19 '17 at 13:17