1

I have some confusions about proper time and the calculations involved in Einstein's famous thought experiment where two lightning strikes occur simultaneously at the ends of a train for an observer observing the train to be moving at velocity v (Observer 1) while an observer stationary with respect to the train (Observer 2) observes these strikes to non-simultaneous.

In a lecture, my teacher presented the two "events" as being (for each lightning strike):

  1. The lightning strikes the end of the train
  2. The light travels to the observer's eyes

To show that there was a discrepancy between Observer 1 and 2 wrt the train and the observer watching the train move at velocity v, the time dilation formula was used. My understanding is that the time dilation formula relates proper time and coordinate time. Aren't neither observers measuring the proper time between these two events? Proper time is the time measured by an observer stationary in a frame in which the events occur at the same spatial coordinates, and this does not happen for either observer here.

I will go through the derivation presented: Let L be the length of the train from Observer 1's frame (it is dilated from the rest length observed in Observer 2's frame). Let t be the time for for Observer 1 to see light from the front of the train hit Observer 2, and t be that time for Observer 2. In Observer 1's frame, Observer 2 is moving towards the incoming light from the front of the train, so we have $$vt = L/2 - ct \rightarrow t = \frac{L}{2(c+v)}.$$ In Observer 2's frame, the time they observe $\tau$ is given by $$t = \gamma \tau.$$

The last step is the source of my confusion as I mentioned above.

Another question I have concerns the Lorentz transformations, as I see in this thread. From this, it looks to be that doing the Lorentz transformations is equivalent to applying time dilation and length contraction together. Is this understanding correct in some sense?

4 Answers4

1

So it sounds like you're confused about time dilation. You can complete this derivation without time dilation, but it requires some more advanced mathematics.

How does the train argument work?

The standard train argument works like this:

  1. We tell our students that everyone sees light move at the same speed $c$. We tell our students to be very careful comparing times on different clocks because of an effect that we will see in step (4), so we are going to carefully route all light to the same clock and see what it measures. We are then going to derive some rules to transform this into frames where that clock is moving horizontally on a train of constant speed $v$.
  2. We consider a light pulse which travels in the train in a vertical direction. It goes a distance $h$ from the ground to the ceiling, bounces against a mirror, and then travels the same distance back to the ground. A clock at the ground measures the time $\tau = 2h/c$ for it to travel this distance. But now in the frame that sees this thing moving, the principle of constancy of the speed of light demands that we have triangles of base $v~t$ and height $h$ and hypotenuse $c~t$, so we find $t=2h/\sqrt{c^2-v^2}$. We tell the students that this factor of $\gamma = 1/\sqrt{1-(v/c)^2}$ is a universal time-dilation factor, any frame which sees a clock moving sees it tick slowly by this factor.
  3. We consider now a light pulse which travels in the train in a horizontal direction. It goes a distance $L$ and again reflects off a mirror back to its origin, where a clock measures the time to travel this distance as $\tau = 2L/c.$ Now the calculation gets more fun, as we know that the time elapsed is $t = \gamma ~2L/c$ from the previous section, but that conflicts with the equivalent analysis: in this frame the relative velocity of the mirror and the light is $c-v$ and the relative velocity of the absorber and the light is $c+v$ and we would expect to find $$t=\frac{L}{c+v} + \frac{L}{c-v} = \frac{2Lc}{c^2-v^2} = \gamma^2 2 L/c.$$ We allow this discrepancy between $\gamma$ and $\gamma^2$ to sink in for a bit and finally we resolve the student's confusion: there is only one parameter here that can absorb the paradox and it is $L$, so we find that $\tau = 2 L/c$ and that $t=\gamma~2L/c = \gamma^2~2L'/c$ and this means that $L' = L/\gamma.$ There is evidently also a universal effect: if some frame thinks that two objects are not moving and are separated by a constant distance $L$ then every other frame which sees them both moving at speed $v$ thinks that the true distance between them is $L'=L/\gamma.$
  4. We finally return to this discrepancy about clocks at different locations by using two clocks that are in sync upon the train, at $x=0$ and $x=L,$ having the property that someone on the train at position $x=L/2$ sees both of them tick identically. We now just consider the forward half of that trajectory: the time elapsed in this frame is $\tau = L/c.$ We know that the ground frame must see the same light travel a distance $L/\gamma$ and that the time taken must be $$t = \frac{L/\gamma}{c - v} = \frac{(L/\gamma)~(c + v)}{c^2 - v^2} = \gamma~\frac Lc~\left(1 + \frac vc\right).$$We ask our students to think about why this is not $t = \gamma~\tau$ as we might expect given (2) above. If the ground frame saw these two clocks as being in sync, then the conclusion that $t=\gamma~\tau$ would be unavoidable, but we can see that actually we have $t = \gamma~\tau~(1 + v/c)$ and this must mean that we do not see those clocks as being in sync. In particular when the light pulse is emitted and the first clock reads $0$ we on the ground must think that the second clock reads $-vL/c^2$. We tell our students that this is a universal effect called "the relativity of simultaneity."
  5. We usually then derive the Lorentz transformation somewhat hastily. Our students at this point are uncertain enough that they accept the equation $x' = \gamma~(x - vt/c^2)$ without asking the very important question of why this factor is $\gamma$ and not $1/\gamma$ as might be expected from length contraction.

It looks like your teacher may have skipped through 2 and 3 in order to get to 4. The chief strength of the train argument is that it immediately shows you by the Pythagorean theorem where this factor of $\gamma$ comes from. It has several weaknesses including that when a student runs back through these ideas they start asking why $h$ wasn't length-contracted earlier, or asking why the Lorentz transform has $\gamma$ and not $1/\gamma$ for the transform of position coordinates, or we get to the velocity addition formula and we introduce hyperbolic tangents and it seems terribly ad-hoc. It also does not seem to introduce much causality into the system.

Deriving the Lorentz transform with matrix mathematics.

We tell our students that everyone sees light move at the same speed $c$. We then imagine that someone quickly turns on-and-off a light while they are moving at speed $v$. We draw a circle of the light around her, in her reference frame, pointing out that it is a distance $L = c~t$ around her in all directions.

Now we invite them to think about a reference frame in which this person is moving at some slow speed $v \ll c$. We draw the same circle, with a point at its center, this is where she was when she flashed that light on and off; but now she has moved forward a distance $v~t$. So we put a point very close to the center but perhaps slightly to the right and remind that she has not moved very fast relative to light. We now put points at the same distance $L=c~t$ around her, and we notice that the dots orthogonal to the motion are basically on the circle, but the ones "in front of her" from our perspective are also "in front of the light" and the ones "behind her" are also "behind the light".

We explain that what must be happening, given that she thinks that the light is hitting these dots "right now" while we think that the light "has already" hit these dots behind her and "is yet to" hit the dots ahead of her, is that we must disagree on when "right now" is at distant positions. We draw clocks on these dots reflecting our in-sync time T, she must see the clock in front of her show a slightly later time and the clock behind her show a slightly earlier time: the time when the light actually reaches the clocks, which is given by $L/(c\mp v),$ where the top operator $-$ is for the one ahead of her and the bottom is for the one behind her. Since $v$ is small we can Taylor-expand this as $L/c \pm Lv/c^2.$ We note that clocks are out-of-sync proportional to their distance and the velocity that she is going. This is where we get back to that causality aspect that we mentioned earlier: we say to the students, "So it is a strange feature of acceleration in our world that when you accelerate, you see clocks ahead of you by a distance $L$ tick faster at a rate $1+L~a/c^2$ and the clocks behind you tick slower at a rate $1-L~a/c^2.$ This is going to be important because we're going to see that in gravitational frames the 'correct' frames are free-falling, and for us to stay at a constant distance from Earth's center, Nature thinks that we are in a non-inertial frame accelerating upwards with acceleration $g$. Therefore all of the clocks in the satellites in the sky must seem to be ticking faster! This has been observed and is part of how GPS has to work."

Now comes the "advanced mathematics": it is not terribly advanced; it is linear algebra. We impose on students the desire to work with a space-unit dimension of time $w = c~t$ and dimensionless velocities $\beta = v/c$ and then observe that our present transform is this: when moving to a frame with velocity $v$ we used to use the Galilean transform, $$\begin{bmatrix}w'\\x'\\y'\end{bmatrix} = \begin{bmatrix}1&0&0\\-\beta&1&0\\0&0&1\end{bmatrix} \begin{bmatrix} w\\x\\y\end{bmatrix}$$where the $y$ is just to show this orthogonal axis. But now we see that we actually need something which for small velocities looks like,$$\begin{bmatrix}w'\\x'\\y'\end{bmatrix} = \begin{bmatrix}1&-\beta&0\\-\beta&1&0\\0&0&1\end{bmatrix} \begin{bmatrix} w\\x\\y\end{bmatrix}.$$We observe for the student's sake that now, instead of just one of these components mixing unilaterally with the other, the two are getting jumbled up together and it's hard to see what the effects of that are going to be on anything other than this uninvolved $y$-component as we accelerate towards $v\approx c.$ There are now two ways that I know of to get the right answer from this first-order matrix $F(\beta) = \begin{bmatrix}1&-\beta\\-\beta&1\end{bmatrix}.$

The first is to observe that $F(-\beta) F(\beta)$ should be the identity matrix $I$ as we should be able to transform to new coordinates and then back and recover our original ones. This matrix product is not $I$ but $I~(1-\beta^2).$ The minimal solution for mathematical consistency is therefore to distribute this factor evenly to get $L(\beta) = F(\beta)/\sqrt{1-\beta^2}$ which will be mathematically consistent and we recover $F(\beta)$ to first order in $\beta.$ This can seem "magical" to the student in an unpleasant way but I have met many who think "oh, that is very obvious!"

The second is more elaborate, we consider the matrix exponentiation $$\lim_{n\to\infty} \begin{bmatrix}1 & -\alpha/n\\-\alpha/n&1\end{bmatrix}^n.$$Here we clearly are trying to derive what the large-scale behavior must be by a sequence of $n$ small steps towards it. To do a matrix exponentiation we need eigenvectors: this is not hard as $[1;1]$ is an obvious one and symmetric matrices have orthogonal eigenspaces hence $[1;-1]$ is a logical next guess, and it works just fine. The eigenvalues are $1-\alpha/n$ for the first one, $1+\alpha/n$ for the second. This matrix exponentiation works out to$$\lim_{n\to\infty} \sqrt{\frac12} \begin{bmatrix}1&1\\1&-1\end{bmatrix} ~ \begin{bmatrix}1-\alpha/n&0\\0&1+\alpha/n\end{bmatrix}^n ~ \sqrt{\frac12} \begin{bmatrix}1&1\\1&-1\end{bmatrix} = \frac12 \begin{bmatrix}1&1\\1&-1\end{bmatrix} ~ \begin{bmatrix}e^{-\alpha}&0\\0&e^\alpha\end{bmatrix} ~ \begin{bmatrix}1&1\\1&-1\end{bmatrix},$$using the familiar limit that $\lim_{n\to\infty}\left(1 \pm \frac\alpha n\right)^n = e^{\pm \alpha}.$ Now we can introduce these functions $\sinh\alpha = (e^\alpha - e^{-\alpha})/2$ and $\cosh \alpha = (e^\alpha + e^{-\alpha})/2$ as the odd and even parts of the exponential function, in terms of which this matrix product takes the form,$$ \begin{bmatrix}\cosh \alpha&-\sinh\alpha\\-\sinh\alpha&\cosh \alpha\end{bmatrix}.$$Now one observes that a particle at rest in our frame would appear to be moving backwards with velocity $-c~\sinh\alpha / \cosh\alpha = -\tanh\alpha$ in this other frame, thus coming to a conclusion that $\tanh \alpha = \beta.$ We can then observe that $\cosh^2\alpha - \sinh^2\alpha = 1$ to find that $1 - \tanh^2\alpha = 1/\cosh^2\alpha$ hence $\cosh\alpha = 1/\sqrt{1-\beta^2}.$ Since $\sinh \alpha = \cosh\alpha \tanh\alpha$ we have $\sinh \alpha = \beta/\sqrt{1-\beta^2}.$ We show the interpretation of $\cosh\alpha$ as a time dilation factor and then transform the lines $[w; L]$ to be $[w'; L'] = \gamma~[w-\beta L; L - \beta w]$ to find $L' = \gamma L - \beta~(w' + \beta L),$ so at constant $w'$ these are spaced by $L' = \gamma L (1 - \beta^2) = L/\gamma,$ and we have derived length contraction as well.

Finally we derive the rules $$\sinh(\alpha \pm \beta) = \sinh \alpha \cosh\beta \pm \cosh\alpha\sinh\beta,\\ \cosh(\alpha \pm \beta) = \cosh \alpha \cosh\beta \pm \sinh\alpha\sinh\beta.$$From them we show that the composition of two Lorentz transforms is also a Lorentz transform with the summed components, from which the idea of adding "rapidities" to find velocities now naturally flows.

CR Drost
  • 37,682
0

My understanding is that the time dilation formula relates proper time and coordinate time. Aren't neither observers measuring the proper time between these two events? Proper time is the time measured by an observer stationary in a frame in which the events occur at the same spatial coordinates, and this does not happen for either observer here.

Correct, but we are not measuring "the proper time between these two events". We are measuring the dilation of time of the moving observer relative to the proper time of the stationary observer. (Of course each inertial observer is stationary in his own frame.)

it looks to be that doing the Lorentz transformations is equivalent to applying time dilation and length contraction together. Is this understanding correct in some sense?

Yes. The Lorentz transformation is the transformation of the time and space coordinates and therefore reflects the length contraction and time dilation. In other words, the contraction and dilation directly follow from the Lorentz transformation formulas.

safesphere
  • 12,640
  • "We are measuring the dilation of time of the moving observer relative to the proper time of the stationary observer. "

    Why are you able to call the time of the stationary observer the "proper time?" Do you mean Observer 1 or 2 in my question?

    – Quantum Dot Sep 24 '17 at 05:36
  • It doesn't matter which observer, because each of them is stationary in his own frame. The Observer 1 sees the time of the Observer 2 dilated relative to the proper time of the Observer 1. At the same time, the Observer 2 sees the time of the Observer 1 dilated relative to the proper time of the Observer 2. In other words, if you move fast away from me, I see your time moving slower than mine while you see my time moving slower than yours. – safesphere Sep 24 '17 at 05:44
  • So then when is it appropriate to talk about the "proper time between events" as opposed to just the proper time of an observer? I see this distinction being made but do not see the direct relationship between them or why one is done over the other, etc. – Quantum Dot Sep 24 '17 at 06:05
  • The proper time between two events is measured by an observer who sees both events happen at the same place. Every inertial observer's measurements of events at their own position is a measurement of a proper time in their own reference frame. Any other observer can derive the proper time as $\sqrt{c^2 t^2 - d^2}$ where $t$ is the time they measure and $d$ is the distance between the positions that they saw the events happen at. – CR Drost Sep 24 '17 at 06:08
  • That was the point of my answer, there is no such thing as "the proper time" between two spatially separated events. The proper time is by definition local. You can say, "the time between two events as measured by the observer". So he is in his frame uses his proper clock to measure the time interval between two events the way he sees them. This would not be "the proper time between the events", because a different observer in his own proper frame would measure (using his proper clock) a different time interval between the same two events. – safesphere Sep 24 '17 at 06:14
  • So does this mean that the proper time for events and observers is "different?" Or that, for an observer, the "events" are simply recording the ticking of his own clock?

    Also, @CR Drost, what would be the "proper time" between events in this thought experiment, if it is even applicable to talk about it?

    – Quantum Dot Sep 24 '17 at 06:23
  • The proper time is the time between two local events measured by the local clock. For example, you can use your wristwatch to measure the time between your two heartbeats. If at this moment I see you from a moving train, by my clock the time between your two heartbeats would be different and not proper. However, as @CRDrost pointed out, I can calculate your proper time to know what you measure. As repeated once and again, there is no such thing as "the proper time" between two events that happen at different locations. No such thing as "the proper time" between the lightnings in your case. – safesphere Sep 24 '17 at 06:37
  • Okay, so for clarification: In this thought experiment, is it correct to say that the "local events" are just the ticking of Observer 2's wristwatch? So really we are just trying to see how much Observer 2's wristwatch has elapsed from Observer 1's frame, just with the lightning strike reaching him defining the time interval we care about? – Quantum Dot Sep 24 '17 at 07:12
  • Yes, the ticking of Observer 2's wristwatch is local only for this observer. The lightnings are not local for any of the observers and neither lightning is local to the other lightning, because they are spatially separated. It's all really very simple once it lines up properly in your mind. Special relativity is easy to understand, but hard to believe. – safesphere Sep 24 '17 at 07:46
  • So, when I try to find $\tau$ for some observer in the formula, I am really just trying to find the time elapsed by that observer in their own frame of reference? That's all? I don't need to think of events and such?

    Also, in the formula, what precisely is $t$? For example in this thought experiment, what is Observer 1's $t$ really referring to? Is it the time they record between the lightning striking and reaching Observer 2, from their own wristwatch? Or is it the time they observe Observer 2's wristwatch to elapse for these events? I'm assuming it is the former.

    – Quantum Dot Sep 24 '17 at 09:54
  • Consider you are at the station and I am in the moving train. You see my time moving slower by gamma=2. I measure the period between my heartbeats by my watch as 1 second. It is my proper time tau=1 sec. You see me moving twice slower. By your wristwatch the period between my heartbeats is 2 seconds. This is the coordinate time, my time in your coordinates t=2 sec. So you know my t, but you want to know my tau. The formula gives you tau=t/gamma or tau=2/2=1 sec. (TBC) – safesphere Sep 24 '17 at 15:07
  • When the lightnings strike, they emit light that eventually reaches the observer. While there is no "proper time between the lightnings", because they are at different places, consider the events of their light reaching the observer. He can measure the time period between the flashes he sees by his watch as his proper time between the events of the light flashes coming to him. This is not the "proper time between the lightnings", but it is the proper time between the flashes he sees, because the events of him seeing the flashes happen at his location and are local to him. – safesphere Sep 24 '17 at 15:18
0

We have two events here : \begin{align} &\text{event f : A light pulse from the train front} \tag{01}\\ &\text{event b: A light pulse from the train back} \tag{02} \end{align} Let these two events have the following space-time coordinates

For Observer 1: \begin{align} \text{event f : } &\left(x^{f}_{1},t^{f}_{1}\right) \tag{03}\\ \text{event b : } &\left(x^{b}_{1},t^{b}_{1}\right) \tag{04}\\ x_{1} \text{ difference :} \quad & \Delta x_{1}=x^{f}_{1}-x^{b}_{1}=L_{1} \tag{05}\\ t_{1} \text{ difference :} \quad & \Delta t_{1}=t^{f}_{1}-t^{b}_{1}=0 \tag{06} \end{align}

For Observer 2: \begin{align} \text{event f : } &\left(x^{f}_{2},t^{f}_{2}\right) \tag{07}\\ \text{event b : } &\left(x^{b}_{2},t^{b}_{2}\right) \tag{08}\\ x_{2} \text{ difference :} \quad & \Delta x_{2}=x^{f}_{2}-x^{b}_{2}=L_{2} \tag{09}\\ t_{2} \text{ difference :} \quad & \Delta t_{2}=t^{f}_{2}-t^{b}_{2} \tag{10} \end{align}

Lorentz Transformation : \begin{align} \Delta x_{2}&=\gamma\left(\Delta x_{1}-v\Delta t_{1}\right)=\gamma\left(L_{1}-v\cdot 0\right)=\dfrac{L_{1}}{\sqrt{1-\dfrac{v^{2}}{c^{2}}}}=L_{2} \tag{11}\\ \Delta t_{2}&=\gamma\left(\Delta t_{1}-\dfrac{v\Delta x_{1}}{c^{2}}\right)=\gamma\left(0-\dfrac{v L_{1}}{c^{2}}\right)=-\dfrac{vL_{1}}{c^{2}\sqrt{1-\dfrac{v^{2}}{c^{2}}}}=-\dfrac{v}{c^{2}}L_{2} \tag{12} \end{align} Nothing more, nothing less.

Frobenius
  • 15,613
  • So here you have not considered the time it takes for the light to reach either Observer, just the time that the flashes occur themselves, correct? – Quantum Dot Sep 24 '17 at 22:45
  • I am confused as to how you have reached the same time difference as I did above without invoking anything related to the time light takes to travel, whatsoever. – Quantum Dot Sep 24 '17 at 23:47
  • For reference, here is the link to the problem I am referencing in my post: https://i.imgur.com/HZAeBj0.jpg. Ms U is Observer 1, Mr P is Observer 2. It seems to me that the problem is asking for the difference in two sequences of events rather than the difference in two events themselves. How can both this and your Lorentz transformations result in the same time difference? – Quantum Dot Sep 24 '17 at 23:58
  • Where does the factor of c come in when you have written $\frac{c}{c+v} \frac{d}{\gamma}$? – Quantum Dot Sep 26 '17 at 00:20
  • @Quantum Dot : Of course your comment for $\frac{c}{c+v} \frac{d}{\gamma}$ refers to my answer in the above link (1) "Are my intuitions about special relativity right?" and might be post therein. Anyway, we don't insert factors in our equations at random. See therein equation (5.1) and Figure 2. See also my footnote '(2)Kinematics' to interpret intuitively the results with length contraction, AFTER WRITING DOWN THE RIGHT RESULTS USING THE LORENTZ TRANSFORMATION. $\gamma(1-\frac{v}{c}) =\frac{c}{c+v} \frac{1}{\gamma}$. – Frobenius Sep 26 '17 at 04:34
0

You dont need to invoke time dilation explicitly to explain relativity of simultaneity. If we agree that lightnings strike both the ends of the train cart and the ground arround it in such a way to make the Obs1 see the strikes as simultaneous, and he can check this by checking the markings on the ground, and by measuring the light speed as c, than Obs2 must also see that Obs1 saw these events as simultaneous but since he thinks that these events happend at the ends of the train cart in regard to which Obs1 is actually moving he than concludes that events obviously happend at different times but in such a way that rays come at the same instance to the observer 1.So, if the train is moving to the right that means that the observer 1 is rushing towards the left end of the train and away from the right end of the train. IN order for him to see these events simult. from Ob2 perspective left lightning HAD to hit second and right lightning had to hit first. If all of this happend when Ob1 and 2 were alligned than if velocity of motion of the train (or from Ob2 perspective velocity of Ob1) is v Ob1 will move distance vt and light rays will have to cover the distance (L/2)+vt and (L/2)-vt so light ray which covers more distance has to be the one that started the travel first. If we divide this by c (speed of light) then we see two different intervals. Difference of these two is the non simultaneity:

(γ^2)(β/c)L. (eq1)

Maybe here you can see something of worth to you.

Regarding the other question, yes of course we see time and space intervals contractions and dilations in Lorentz transformations. Just compare equation 1 with transformation of time interval if we set the time interval in non primed frame to zero. Of course, L of the train is just space separation dx.

Žarko Tomičić
  • 1,803
  • 11
  • 15