26

[See final additions at bottom for a restatement of the paradox.]

I have seen the assertion in a couple of places that if you trap light in a reflective resonator, or any other kind of sealed trap, then (as another user here put it in answering a different question) by "calculating the Doppler shift at either end and then finding the difference between the red and blue photon momentums", one can show that it's equivalent to "the same mass $m$... that defines how it responds to force". The net momentum of the light will match that of an equivalent mass moving at the speed of the resonator, the net energy will match the relativistic total energy of that mass at that speed, and the net force when accelerated will match the inertia of that mass, all due to Doppler shifts.

I have been trying to do this calculation, getting the doppler-shifted frequencies and then finding the change of energy and momentum that results from viewing it in another reference frame, to show that they are the same as the energy and momentum of an equivalent rest mass with the velocity of the enclosing resonator. But I can't do it -- the results keep coming out wrong.

As I understand it, you calculate doppler shifts with a factor of $\sqrt{\frac{1-\beta}{1+\beta}}$ where $\beta$ is the fraction of lightspeed by which the frames differ in velocity. I'll call that $R$. It can also be expressed as $\gamma(1-\beta)$, where $\gamma$ is the Lorentz factor $1/\sqrt{1-\beta^2}$. If I have things understood correctly, when a light source is receding then the light you observe from it has its wavelength divided by $R$ and its frequency multiplied by $R$. But if light hits a moving mirror, the factor needs to be applied twice: the outgoing light gets shifted by $R^2$.

So I modeled a simple resonator, or laser, and looked at a small sample of light in it, which might be one photon. In its rest frame the laser has length $d$, and the light sample has frequency $f$, wavelength $\lambda$, and energy $E$. So when moving the forward-going light will have frequency $f/R$ and wavelength $\lambda R$, and vice versa for the reverse-moving light. And energy is proportional to frequency. And I thought, if we average the forward momentum with the backward momentum we should get the total, and likewise for energy. This led me to a pair of naive first-approximation formulas: for momentum, $\frac{1}{2}(\frac{1}{R} – R)E∕c$, and for energy, $\frac{1}{2}(\frac{1}{R} + R)E$. For a numeric example, consider $\beta=\frac{1}{2}$: in this case $R=\sqrt{\frac{1}{3}}$, and the blue-shifted forward light has three times the frequency of the red-shifted backward light. The resulting energy is $1.1547E$ and the momentum is $0.57735E/c$, which are correct. Then I did some algebra and verified that this momentum formula is indeed equivalent to $\gamma\beta E/c$ and the energy formula is equivalent to $\gamma E$. Problem solved, right?

Not so fast. I then noticed a problem. I'd been doing a simple average of forward and backward globs of light, giving them equal weight. But the light isn't weighted equally! It spends more time going forward than going backward. In the $\beta=\frac{1}{2}$ case, it spends three fourths of its time going forward and one fourth going backward. This means that at any given time, three fourths of the photons in the laser are going forward, and have blue-shifted energy rather than red-shifted energy.

Here, I made a spacetime diagram of it: enter image description here

Another way to look at it is that if the Doppler shift increases frequency by $R$, it increases power by $R^2$. The rate of photon reception per second goes up, as well as the energy in each photon. No matter how I look at it, I have to conclude that in this case, there is nine times as much energy and momentum in the forward-moving light as in the backward-moving light: three times as many photons with three times as much oomph apiece.

But this produces a completely wrong result when I try to compare the resulting mass or energy to what it should be. Like, instead of $1.1547E$ for the energy, I get $1.443E$. I got this by weighting the forward light by $(1+\beta)$ and the backward light by $(1-\beta)$ before averaging. And that is way too much, apparently violating conservation of energy.

Where have I gone wrong in my understanding?


Update: In an earlier update which I've deleted, I said that I thought I was getting to an answer. I was wrong. I fooled myself and didn't notice I was subtly slipping back into the rest frame of reference.

During the time I thought my new approach was working, I made a new spacetime diagram, showing a more continuously operating resonator, which at some given time opens both its mirrors, dumping half of the light out of each end: enter image description here

This diagram confirms that at speed $\frac{1}{2}c$, photons or wavefronts moving to the right are three times as frequent in both space and time as those going to the left. (Though if you measure along the rest-frame diagonals everything is even.) This difference in photons per second is enough to completely account for all of the expected change of net energy in the system, without leaving any room for each photon to also have differing energy due to its changed frequency.

While I was at it, I recast some of my equations using momentary energy in terms of continuous power. In these, the light's "mass" is $2Pd/c^3$, where $P$ is the power in one direction in the rest frame. There were no surprises and it left the question unchanged. The boosted state still has too much energy relative to the resting state.


Later update, ignoring the mirrors: I tried to clarify exactly how the energy would change with frame of reference, and ended up kind of ignoring the mirrors and looking at two opposing beams of light, which have equal frequencies and power in the rest frame. How much energy does each beam have within a window of length $d$? The energy is $hfk$ where $h$ is Planck's constant and $k$ is the count of photons. How does that transform when the window moves at speed $\beta$? I believe $d_\beta=d_0/ \gamma$, $f_\beta=f_0/R$, and $k_\beta=k_0(1+\beta)=k_0/\gamma R$. More photons fit in the window when blue-shifted (photon density changes with frequency), but the narrowing of the window cuts that down a bit. So $E_\beta=E_0(1+\beta)/R = E_0\gamma(1+\beta)^2$. You would then apply a boost of $-\beta$ to the opposing beam. Note that the total photon count is still $2k_0$. The total energy is $E_\beta+E_{-\beta}$, and that's supposed to add up to $2\gamma E_0$ (the energy that an equivalent mass would have at that speed), but it doesn't. [Edit: it adds up to $2\gamma(1+\beta^2)E_0$, just as in udrv's answer.]

Similarly, the momentum in each direction would be $p_\beta=E_0(1+\beta)/Rc$, and the total would be $p_\beta-p_{-\beta}$. That's supposed to add up to $2\beta\gamma E_0/c$. It doesn't.


Summation, after udrv's answer: We now have a total of five different mathematical approaches, from my first try which took a weighted average of a small sample of light over time, to udrv's final approach using a segment of spacetime shaped like a hyperwedge. I believe all five ended up producing the same answer. They all say that the energy of the system in a moving observer's frame is greater than the energy of the bare photons in the rest frame by a factor of $\gamma(1+\beta^2)$, when I would expect the change to be a factor of $\gamma$ only. So it's pretty clear that the answer to this question is not to fix the math. No matter how you come at it, the math is coming out consistent.

So the question now is, how can this make sense physically? Where can that much energy come from? How can it be reconciled with the apparent fact that if we were boosting massive particles instead of light, the resulting energy would be smaller?

I end up imagining the following scenario. Let's say you have a reservoir of cold electrons and positrons, and when you release them, you convert them into gamma rays which then get trapped in the resonator. (Yeah, I know this is a stretch.) If you convert them in the rest frame, the energy of the resulting photons equals the energy of the particles' mass. If you first boost the particles into a moving frame, their mass-energy goes up by a factor of $\gamma$, and if you then convert them into photons, the energy must be the same. But if you convert them into photons in the rest frame and then switch to a moving view of those photons while they're in the resonator, this finding seems to say that you end up with more total energy than you do if you convert already boosted particles. This is a paradox.

How do we resolve this? How can we either show that these boosted energies which appear to be different are really the same, or show that energies which appear the same in the rest frame are actually different, or show that it's somehow not contradictory for the two to have conflicting results?


Udrv's second answer (the one with the +100 bounty awarded) points out that the issue doesn't just apply to light, but to trapped massy particles as well. This is true. If I model massy particles trapped between things that bounce them back into the enclosed volume, or equivalently if I examine an arbitrary volume into which streams of massy particles flow from opposite directions (the former may be imagined as a string on two pulleys, and the latter as two rods passing through an open-ended box), then I get the same paradox. The forward-moving mass contains more particles and spends more time in transit than the backward-moving mass does, to a degree that increases with the relative speed between the forward and backward masses, apparently becoming lightlike at the limit. This paradox goes away if you imagine two masses that stay entirely enclosed in the volume and duration under examination, so that the forward and backward groups have the same number of particles.

The two scenarios (reflector vs selecting a window from a larger set of passing particles) can be seen as equivalent when viewed in terms of flux at the boundaries. Each end has an equal number of particles passing inward and outward at that surface, but not an equal amount of energy except in the rest frame.

ANYWAY, where it stands now, it sounds like the original idea -- that light trapped in a resonator is externally equivalent to mass of equal energy -- must be false. Is that what people are concluding?


I am going to try to frame the paradox one last time, in terms which are as clear and unambiguous as possible. The physical situation depicted in the thought experiment is quite implausible in practical terms, but setting it up this way is beneficial, from my point of view, for the absence of potential distractions.

The thought experiment in question is the one I mentioned earlier, where we annihilate electron-positron pairs inside a resonator. The setup is as follows: we have $n$ electrons and $n$ positrons, which have an arbitrarily small velocity relative to each other. At some time, they start to collide and annihilate in pairs. The annihilation occurs at a steady rate for a finite time $\Delta t$. By an amazing coincidence, all of the annihilations produce photons that happen to be directed parallel to some given x axis, half of them going each way along it. The result is two beams emitted in opposite directions, each having steady power while it lasts (to some arbitrary degree of precision), and then ceasing simultaneously.

If we observe this experiment from the rest frame of the particles, each electron and positron has a mass of 0.5110 MeV, and each emitted photon has that same energy. The total energy in the system never changes.

Now we repeat the experiment, except this time it happens between a pair of perfectly reflective gamma-ray mirrors. Once again, everything coincidentally lines up on the x axis. The distance between the mirrors is exactly $c\Delta t$, which means that the reflected photons from the first annihilations reach the center just as the last annihilations are finishing up. The result is that photons are shuttling continuously in both directions with no gaps. The numbers and energies of the photons are exactly as they were without the mirrors -- they're just confined to a smaller volume. The total energy again does not change.

Now let us watch these two experiments from a viewpoint boosted to a speed of ½$c$ along the x axis. First, without mirrors. Before annihilation, each electron and positron now has its apparent mass increased by a factor of $\sqrt{4/3}$, so they now weigh 0.59005 MeV. After annihilation, we have two groups of $n$ photons, one being blueshifted by a factor of $\sqrt{3}$ and the other redshifted by $\sqrt{⅓}$, resulting in energies of 0.8851 and 0.2950 MeV, which adds up to 2 × 0.59005. Again the total does not change.

But how does it look to the ½$c$ observer with the mirrors in place? After the annihilation is complete, we have $2n$ photons, but instead of being split fifty-fifty between red and blue frequencies, we've got $1.5n$ blue photons and only $0.5n$ red ones. All of the math we've thrown at this agrees that with the mirrors in place, the total energy of the system is now no longer 1.1801 MeV per pair, it's now 1.4752 MeV per pair. The average over time per photon says the excess is there, the stress tensor integration says it's there, the spacetime wedgie says it's there, the half-volume pressure balance says it's there... (for details, see entire rest of thread.)

BUT THIS ADDITIONAL ENERGY CANNOT EXIST. There is no source for it. It comes from nowhere, and if you open both ends of the resonator simultaneously in proper time to let the photons out, it vanishes into nowhere, reverting to what we saw in the mirrorless case. Why does our math keep saying it's there? It cannot be real actual energy, it is clearly illusory in some sense. So why do we see more than is actually there?

  • 3
    This is a great calculation (with many subtleties), kudos for writing out the details well! – knzhou Jul 05 '16 at 03:29
  • Going through your calculation, the first thing I'm puzzled about is the statement "But if light hits a moving mirror, the factor needs to be applied twice." Why is this? There's only one velocity in the problem, the relative velocity between the source and the receiver, so you should never apply the shift twice. Applying it once accounts for both of their motions. – knzhou Jul 05 '16 at 03:30
  • 2
    "But the light isn't weighted equally! It spends more time going forward than going backward." Nice catch! That's totally right. – knzhou Jul 05 '16 at 03:33
  • 1
    knzhou, the idea that a mirror does a double doppler shift is something I found more than once in reading around about the problem. Think of observing the laser's original light source with no mirrors: from the front you'd see it blueshifted by 1/R, and from the back it would be redshifted by R for a ratio of R squared. Add the mirrors and these frequencies don't change, as the reflected light joins in with the existing source light. – Paul Kienitz Jul 05 '16 at 03:36
  • 2
    Ah, you're right. Okay, now I need to go think about this more carefully, you got me nerd sniped! – knzhou Jul 05 '16 at 03:40
  • I'm just trying to investigate a steady state case now; I'll add acceleration when I have this part straight. Your two photon case no longer has them at the same freq in the rest frame. And I'm assuming that my sample is just a portion of the light in the resonator. – Paul Kienitz Jul 05 '16 at 04:00
  • 1
    Well, I worked on it and got stuck at exactly the same place you did. The problem statement is excellent, though, as are the diagrams; this question deserves a canonical answer to go with them. – knzhou Jul 09 '16 at 07:03
  • Wow, thanks for adding a bounty. I was hesitating about doing that myself. You rock. – Paul Kienitz Jul 10 '16 at 05:41
  • " a small sample of light in it, which might be one photon." No, light in bulk cannot be the same as one photon . Light is composed of photons in a nonlinear probabilistic way, because photons are quantum mechanical entities. Your statement for me sounds as if a "brick is a sample of a building". – anna v Jul 10 '16 at 06:25
  • If this finding can be proven, it should be applicable to any arbitrary irregular mix of photons. – Paul Kienitz Jul 10 '16 at 06:56
  • But the mix of photons is not arbitrary in building up the classical field. Have a look at http://motls.blogspot.com/2011/11/how-classical-fields-particles-emerge.html . With your complicated imaginary setup it will be equally more complicated than in the link, but for the photons to add up to the classical field they have to go through the addition of their wavefunctions. I cannot think what it would mean when frames are moving at 1/2 c velocities, but the basic "impact" of the photon in terms of creation and annihilation operators on the moving surfaces has to be taken into account. – anna v Jul 10 '16 at 07:37
  • 1
    @annav while I agree with everything you say and "one photon" might not be the right formulation to choose in the question I think that this question is really not about the nature of light in all it's complexities. a classical ray approximation (or a many-photon wavepacket) should be sufficient. – Wolpertinger Jul 10 '16 at 10:56
  • 1
    @Numrok I am not good enough in optics to follow the arguments even classically. I am pointing out that IF there exists a discrepancy it should exist because of ignoring the quantum mechanical nature of the photon, since in the text the "photon" is invoked again and again. After all a photon interacting with a surface which is moving at 1/2 c in the lab is moving with 1c with respect to the surface. There cannot be faster and slower for photons, as there might for wavefronts built up by zillionsof photons – anna v Jul 10 '16 at 13:02
  • It's late and I'm tired, but I don't really understand your setup. You have a laser that is being fed into a resonator moving at constant velocity wrt the laser? What work is being done in holding the mirrors at the same separation? Why isn't the resonator accelerating? – ProfRob Jul 11 '16 at 04:41
  • The idea is that there are just two ideal mirrors with light trapped between them. The word "laser" is an overcomplication and probably shouldn't have been used. – Paul Kienitz Jul 11 '16 at 04:53
  • I'm not sure I understand the initial claim, wouldn't such a resonator have zero mass in the rest frame? – Daniel Kerr Jul 11 '16 at 22:19
  • @DanielKerr No, by $E = mc^2$. – knzhou Jul 12 '16 at 05:26
  • Ah I see, thanks @knzhou, I misinterpreted the claim that you get the mass from the momentum relationship. – Daniel Kerr Jul 12 '16 at 05:43
  • 1
    I'm trying to figure momentum and energy as a set. Sorry if that led to any confusion of the two. ...........Maybe someday, if this all gets resolved, I'll be able to add some acceleration and figure out the inertia. I originally figured the energy and momentum would just be uninteresting preliminarues. – Paul Kienitz Jul 12 '16 at 07:02
  • To be honest, the average in the first place makes me uncomfortable. The claim as quoted is that the difference in momenta is related to the mass, not the total momentum of the system. Perhaps a clearer statement of the claim would help. – Daniel Kerr Jul 13 '16 at 01:16
  • 1
    Density of photon would change – RoderickLee Jul 16 '16 at 03:03
  • I believe density would change in direct proportion to frequency, so the number of photons per wavefront would be invariant. – Paul Kienitz Jul 16 '16 at 08:32
  • 1
    @PaulKienitz Well, the bounty has expired and it looks like Phys.SE has collectively struck out, which is pretty rare! If you solve the question, you can post it as an answer and accept it yourself; I'm still interested in seeing a resolution of this. – knzhou Jul 16 '16 at 21:53
  • 1
    @PaulKienitz I'm still pretty sure it has to be a modeling problem, that there's something unphysical going on in all the calculations. Let's consider the following alternative situation: two extremely light particles are connected by a string which always has tension $T$. Then the two particles oscillate back and forth. (This system is relevant to yours, because you can take the masses of the particles to zero, in which case the particles are always traveling nearly at $c$ except at the turnaround points. I'm basically replacing the mirrors with a spring.) – knzhou Jul 21 '16 at 03:40
  • 1
    If you carefully calculate the energy of this system in a boosted frame, one finds the correct result, i.e. an increase by $\gamma$. I know this is true for two reasons: 1) this is a simple model for quarks confined inside hadrons, and we know that hadron energy behaves right, and 2) I've done this calculation myself once, in extreme detail. – knzhou Jul 21 '16 at 03:41
  • 1
    However, if you neglect the elastic potential energy of the string, the result is wrong! You get a totally different scaling. As such, I'm 90% sure that in the mirror case, there must be a similar extra energy we're missing. – knzhou Jul 21 '16 at 03:43
  • I kind of doubt you could put the masses at zero in that model, but you could certainly approach zero as a limit. – Paul Kienitz Jul 21 '16 at 03:47
  • @PaulKienitz Mhm, that's what I meant to say. – knzhou Jul 21 '16 at 03:49
  • 1
    I think the crucial point is hiding here. It feels very intuitive that adding mirrors can't contribute to energy. Yet a spring definitely can. And in a pinch, can't you model the mirrors as just an extremely stiff spring, that exerts zero force for length less than $L$ and a huge one for lengths greater? – knzhou Jul 21 '16 at 03:51
  • Maybe so. Any real frame holding mirrors has to stretch, so stuffing photons inside would add energy to the frame. But that energy can only come from what was already inside. Stretching the cavity redshifts the light. – Paul Kienitz Jul 21 '16 at 04:32
  • Here's a question we might want to clarify. Does packing photons together as a "gas" add energy beyond what the photons have individually, due to pressure intrinsic to the group? I would say no, as one photon by itself already can exert force, and the pressure of a beam is only the sum of that. – Paul Kienitz Jul 21 '16 at 04:36
  • 1
    @PaulKienitz I don't think there's any difference between one photon and many photons. Everything in the setup is perfectly linear. – knzhou Jul 21 '16 at 05:46
  • I think there are several subquestions here that can be split off into their own questions. At this point, I don't think anybody new coming in will read this long exchange... a fresh start might get more perspectives. – knzhou Jul 21 '16 at 05:46
  • 1
    I put on another bounty. Both the question and the answer below are exemplary, even more so than when I put the first bounty on! – knzhou Jul 25 '16 at 05:46
  • Thanks -- I hope it attracts someone with a fresh insight. – Paul Kienitz Jul 25 '16 at 17:16
  • My dad has a physics degree, and he doesn't have an answer either. – Paul Kienitz Jul 30 '16 at 04:46
  • For another neat example where this idea comes up, see this question. – knzhou Feb 18 '19 at 21:36

4 Answers4

9

Update 1:

1) Note added in proof: The photon stress-energy densities obtained below more or less heuristically are identical to those obtained in more rigorous approaches from the electromagnetic stress-energy density tensor.

2) The physical reason why the stress-energy argument retrieves the detailed balance result in the OP, but is inequivalent to simply boosting the total energy-momentum, seems to be radiation pressure.

Here is a simpler way to look at it:

As seen in the resonator at rest, the photon gas is in a macroscopic stationary state of total energy density $2e_0 = 2 n_0\hbar\omega_0$ and total momentum density $p_0c = 0$, with $n_0$ the volume number-density for photons propagating in one direction along the resonator. However, despite the null momentum density, the photon gas also has a very finite radiation pressure $\pi_0 = e_0$, which gives an additional contribution to the stress-energy density. Keeping only the relevant components (time $\mu=0$, and $\mu =1$ along the resonator and its direction of motion), the latter reads then $$ T_0 = \left(\begin{array}{cc} 2e_0 & 0 \\0 & 2e_0\end{array}\right) \equiv \left(\begin{array}{cc} 2n_0\hbar\omega_0\ & 0 \\0 & 2n_0\hbar\omega_0 \end{array}\right) $$ Under a Lorentz boost to the external observer frame by $$ \Lambda = \left(\begin{array}{cc} \gamma & \gamma\beta \\ \gamma\beta & \gamma \end{array}\right) $$ $T_0$ produces the boosted stress-energy density $T^{\mu\nu} = \Lambda^\mu_{\;\alpha} \Lambda^\nu_{\;\beta} T_0^{\alpha\beta}$ or $$ T = \left(\begin{array}{cc} 2\gamma^2(1+\beta^2)e_0 & 4\gamma^2\beta n_0\hbar\omega_0 \\ 4\gamma^2\beta n_0\hbar\omega_0 & 2\gamma^2 (1+\beta^2)e_0\end{array}\right) $$ The energy density, momentum density, and radiation pressure in the observer frame are then $$ e \equiv T^{00} = 2\gamma^2(1+\beta^2)e_0;\;\;\; pc = 4\gamma^2\beta e_0, \;\;\; \pi = 2\gamma^2(1+\beta^2)e_0 $$ and the corresponding macroscopic total energy and total momentum read $$ E = \frac{L}{\gamma} 2\gamma^2(1+\beta^2)e_0 = 2\gamma(1+\beta^2) E_0, \;\;\; Pc = \frac{L}{\gamma} 4\gamma^2\beta e_0 = 4\gamma\beta E_0 $$ with $E_0 = Le_0 = Ln_0\hbar\omega_0$. This is the detailed balance OP result.

Check: If the radiation pressure in the resonator frame were null, we would have the resonator stress-energy density as $$ T'_0 = \left(\begin{array}{cc} 2n_0\hbar\omega_0\ & 0 \\0 & 0 \end{array}\right) $$ and the boosted one as $$ T' = \left(\begin{array}{cc} 2\gamma^2 n_0\hbar\omega_0 & 2\gamma^2\beta n_0\hbar\omega_0 \\ 2\gamma^2\beta n_0\hbar\omega_0 & 2\gamma^2\beta^2 n_0\hbar\omega_0 \end{array}\right) $$ For the boosted densities this would mean $$ e' \equiv T'^{00} = 2\gamma^2 e_0 ;\;\;\; p'c = 2\gamma^2\beta e_0, \;\;\; \pi' = 2\gamma^2\beta^2 e_0 $$ and macroscopically $$ E = \frac{L}{\gamma} 2\gamma^2 e_0 = 2\gamma E_0, \;\;\; Pc = \frac{L}{\gamma} 2\gamma^2\beta e_0 = 2\gamma\beta E_0 $$ The latter look now as if obtained from boosting the total energy-momentum in the resonator frame as a 4-vector, $$ E = \gamma(2E_0 +\beta P_0 c) = 2\gamma E_0, \;\;\; Pc = \gamma(P_0c + \beta 2E_0) = 2\gamma\beta E_0 $$ but of course do not apply.


Update 2: Regarding energy-momentum conservation

We already have that $$ e^2 - p^2c^2 = 4 \gamma^4 [(1+\beta^2)^2 - 4\beta^2] e^2_0 = 4 \gamma^4 (1-\beta^2)^2 e_0^2 = (2 e_0)^2 $$ and in this sense energy-momentum density is locally invariant under Lorentz boosts. It may be objected that $e^2 - p^2c^2 = (2 e_0)^2 \neq 0$, when one would naively expect that for photons $e^2 - p^2c^2 = 0$. But this is because the total stress-energy density no longer records the momentum densities of forward and backward photons separately. When we consider the separate stress-energy densities for forward and backward photons it can be checked that $$ e_\pm^2 - p_\pm^2 c^2 = e_0^2 - p_0^2c^2 = 0 $$ as they should, and again energy-momentum densities are locally invariant.

The main problem, however, is that the same cannot be said for the finite volume counterpart, since $E^2 -P^2c^2 \neq (2E_0)^2$. But the stress-energy density satisfies a covariant conservation law, $\partial_\mu T^{\mu\nu} =0$, that works as usual: whatever enters an infinitesimal space-time volume, along all directions, equals what comes out. The finite 4-volume version ensures that the energy-momentum flux through the 3D-boundaries of any 4-volume is null. In particular, it guarantees that the total 4-momentum is a Lorentz invariant on constant-time hyperplanes corresponding to different observers (the fancy way to say "under boosts").

The following shows that the detailed balance resonator result also follows from stress-energy conservation.

Consider the world-tube of the resonator as traced in its own rest frame, and let us take a slice of this world-tube bounded by two constant-time hyperplanes (3D-spaces), $S_R$ for the resonator frame, $S_O$ for the observer frame, and the time-like sides of the world-tube. The observer moves at velocity $-\beta c$ in this view. The two hyperplane cut-outs lying within the world-tube correspond to the resonator as observed at the given times in the two frames. For convenience, and without lack of generality, let us choose the common time origin at the moment when the resonator's rear end passes the common space origin, and let the two hyperplanes correspond to $ct=0$ in both frames. The world-tube slice is then simplified to a wedge, as in the figure.

enter image description here

For this arrangement, stress-energy conservation implies that the total resonator 4-momentum on the observer $S_O$ hyperplane must be (the Lorentz transform of) the sum of the total 4-momentum on $S_R$ plus the contribution of the side time-like hypersurface $S_{RO}$ with outward normal, in the positive x-direction (caution: directions of normals follow the switch $-S_O + S_R + S_{RO} = 0 \rightarrow S_O = S_R + S_{RO}$). The latter lies between proper times $ct_1=0$ and $ct_2$ corresponding to the front end of the resonator at time $ct_O = 0$ in the observer frame, that is at $ct_O = 0 = \gamma(ct_2+\beta L)$, or $ct_2 = -\beta L$. Indeed, the $S_R + S_{RO}$ contributions amount to $$ \tau^\mu = \int_0^L{dx \left( \bf{e}_{t} \cdot T \right)^\mu } + \int_{-\beta L}^0{ \left( \bf{e}_x \cdot T \right)^\mu} = \int_0^L{dx\; T^{\mu 0}} + \int_{-\beta L}^0{dt \; T^{\mu 1}} = L T^{\mu 0} + \beta L T^{\mu 1} $$ Their Lorentz boost to the observer frame gives, for $T^{01} = T^{10} = 0$, $T^{00} = T^{11} = 2n_0\hbar\omega_0 = 2e_0$, $$ E = \gamma(\tau^0 + \beta \tau^1) = \gamma\left[ (L T^{00} + \beta L T^{01}) + \beta \left( L T^{10} + \beta L T^{11} \right) \right] = \gamma \left( L T^{00} + \beta^2 L T^{11} \right) = 2 \gamma (1 + \beta^2)E_0 $$ $$ Pc = \gamma(\tau^1 + \beta \tau^0) = \gamma\left[ (L T^{10} + \beta L T^{11}) + \beta \left( L T^{00} + \beta L T^{01} \right) \right] = \gamma \left( \beta L T^{11} + \beta L T^{00} \right) = 2 \gamma \beta E_0 $$ These are precisely the same values as obtained before.

The really interesting thing is that what was identified previously as the radiation pressure contribution is now seen to arise entirely from the time-like $S_{RO}$ terms, and therefore appears as a dynamical effect. And although with the current choice of world-tube wedge these terms occur asymmetrically at the front mirror, shifting the $S_O$ hyperplane at another time shows that they represent a difference of contributions from both mirrors.

----------

Original answer:

I think it all comes down to the fact that the energy-momentum of an extended body, $P^\mu$, must be calculated from its stress-energy density $T^{\mu\nu}$, which is a tensor, not a 4-vector, and behaves differently under Lorentz transforms: $$ T^{\mu\nu} \rightarrow \Lambda^\mu_{\;\alpha} \Lambda^\nu_{\;\beta} T^{\alpha\beta} \;\;\;\text{vs.}\;\;\; P^\mu \rightarrow \Lambda^\mu_{\;\alpha} P^\alpha $$ The general idea is that, by analogy with a "dust" of massive particles, the stress-energy densities for forward and backward traveling photons are of the form $$ T_\pm^{\mu\nu} \sim p_\pm^\mu p_\pm^\nu \equiv p_\pm \otimes p_\pm $$ where $p_\pm^\mu$ are corresponding 4-momentum densities. Adding up the forward and backward components gives $$ T^{\mu\nu} \sim p_+^\mu p_+^\nu + p_-^\mu p_-^\nu \\ \neq (p_+^\mu + p_-^\mu) (p_+^\nu + p_-^\nu) = p^\mu p^\nu $$ In other words, if in the rest frame we have $$ T_0^{\mu\nu} \sim p_{0,+}^\mu p_{0,+}^\nu + p_{0,-}^\mu p_{0,-}^\nu $$ after a boost we find $$ T^{\mu\nu} \sim \left(\Lambda^\mu_\alpha p_{0,+}^\alpha\right) \left( \Lambda^\nu_\beta p_{0,+}^\beta\right) + \left(\Lambda^\mu_\alpha p_{0,-}^\alpha \right) \left(\Lambda^\nu_\beta p_{0,-}^\beta\right) $$ Hence simply boosting the total 4-momentum does not retrieve the correct energy and momentum in the new frame, wherefrom the apparent paradox.

How to fix the problem:

1) To avoid confusion with simultaneity issues, divide the moving resonator into thin slices perpendicular to its direction of motion. Each such slice will have a slightly different proper time, those toward the front lagging behind the ones toward the back, but what is important is that the proper time is uniform throughout each slice. Calculate the stress-energy density for an arbitrary slice.

2) In any given frame, integrate relevant stress-energy components along the entire resonator to obtain the total energy and total momentum.

3) When transforming from one frame to another, boost the stress-energy density through the tensor transform, and only then integrate for total energy and momentum.

Finding the stress-energy density:

For a massive "dust" the stress-energy density reads $$ T^{\mu\nu} = p^\mu J^\nu $$ where $p^\mu$ is the 4-momentum of the "dust" and $J^\nu$ the particle 4-flux. To follow this analogy, pick one resonator slice, located in the rest frame at $[x_0, x_0+dx_0]$ and having rest volume $dV_0$. Say at each moment there are on average $dN_0$ photons passing through it in each direction, each of energy $\hbar \omega_0$. The slice as seen in its rest frame at moment $ct_0$ will be visible to the outside observer at $$ ct = \gamma(ct_0 + \beta x_0), \;\;\; x = \gamma(x_0 + \beta ct_0) $$ The instant number of photons passing thru the boosted slice in either direction is obviously the same as in the rest frame, $dN_0$, but the corresponding photon number-densities change due to length contraction of the volume as $$ n_0 = \frac{dN_0}{dV_0} \rightarrow n = \frac{dN_0}{dV} = \frac{dN_0}{dV_0/\gamma} = \gamma n_0 $$ So the number-densities are not scalars. Note however that since the slice is co-moving with the resonator, it is transporting a photon flux $2n\vec{\beta}c$ relative to the external observer. In fact, by analogy with the massive "dust", the slice densities and their fluxes are components of flux 4-vectors. For a massive "dust" the flux 4-vector reads $$ J^\mu = n_0 u^\mu $$ where $u^\mu$ is the "dust" 4-velocity. For our photons the 4-velocity is ill-defined, but the flux may still be defined analogously using the direction ${\bar u}$ of the 4-momentum, given by $p^\mu = (\hbar \omega/c) {\bar u}$. Then in the resonator rest frame the forward and backward photon fluxes must read (resonator along $\mu = 1$) $$ J_{0, \pm}^\mu = n_0 c {\bar u}_\pm^\mu = n_0 (c, \pm c, 0, 0) $$ and after boost to the observer frame become $$ J_+^\mu = \left( \;\gamma(J_{0, \pm}^0 + \beta J_{0, \pm}^1), \;\gamma(J_{0, \pm}^1 + \beta J_{0, \pm}^0), \;0, \;0 \right) = (1\pm\beta)\gamma n_0 \;(c, \pm c, 0, 0) $$

  • Check: Since the 4-fluxes are uniform and time-independent, they integrate trivially along the resonator to give the total number of photons propagating in the forward and backward directions as $$ N_\pm = \frac{L}{\gamma}\frac{J_\pm^0}{c} = \frac{L}{\gamma} (1 \pm \beta)\gamma n_0 = (1\pm \beta) L n_0 = (1\pm\beta) N_0 $$ in agreement with estimates from the simpler length-contraction arguments. The total photon count is $N = N_+ + N_- = 2N_0$ as it should.

To complete the "dust" analogy, the stress-energy densities for forward and backward propagating photons become $$ T_\pm^{\mu\nu} = p_\pm^\mu J^\nu_\pm $$ With the Doppler-shifted forward and backward 4-momenta reading $$ p_\pm^\mu = (\hbar \omega/c, \pm \hbar\omega/c, 0, 0) \equiv \gamma(1\pm \beta) (\hbar\omega_0/c) (1, \pm 1, 0, 0) $$ the total stress-energy density is then $$ T_\pm^{\mu\nu} = \gamma^2(1+\beta)^2 n_0\hbar\omega_0 \;[{\bf w}_+\otimes {\bf w}_+] + \gamma(1-\beta)^2 n_0\hbar\omega_0 \;[{\bf w}_-\otimes {\bf w}_-] $$ with ${\bf w}_\pm = (1, \pm 1, 0, 0)$. For a unit resonator cross-sectional area this gives the photon total energy density and total energy as $$ e = T^{00} = \gamma^2 \left[(1+\beta)^2 + (1-\beta)^2 \right] n_0\hbar\omega_0 = \gamma^2 (1 + \beta^2) (2 n_0 \hbar\omega_0) $$ $$ E = \frac{L}{\gamma} e = \frac{L}{\gamma} \gamma^2 (1 + \beta^2) (2 n_0 \hbar\omega_0) = 2\gamma (1 + \beta^2) E_0 \neq 2\gamma E_0 $$ for $E_0 = Ln_0\hbar\omega_0 = N_0\hbar\omega_0$, and the photon total momentum density and total momentum as $$ p^1c = T^{01} = \gamma^2 \left[(1+\beta)^2 - (1-\beta)^2 \right] n_0\hbar\omega_0 = \beta \gamma^2 (4 n_0 \hbar\omega_0) $$ $$ P^1c = \frac{L}{\gamma} p^1 = \frac{L}{\gamma} \beta \gamma^2 (4 n_0 \hbar\omega_0) = 4 \beta\gamma E_0 \neq 2\beta \gamma E_0 $$ That is, the detailed photon counting paid off after all.

udrv
  • 10,371
  • Wow, it's gonna take me some time to even follow that -- I never learned tensors. So the math required for this problem is beyond my current capability. Thanks... I think. – Paul Kienitz Jul 17 '16 at 04:41
  • 3
    Well, if you know vectors you must know matrices from vector transformations. If you know matrices you have the simplest way to look at tensors: as generalized matrices that transform in a certain way when changing coordinates, reference frames in particular. You can find a very intuitive intro in http://www.ita.uni-heidelberg.de/~dullemond/lectures/tensor/tensor.pdf, if you are interested. – udrv Jul 17 '16 at 05:27
  • A lot of the notation still baffles me, but I do see you concluding that the total energy $E = 2\gamma (1 + \beta^2) E_0 \neq 2\gamma E_0$. This appears to nearly agree with my "wrong" result in the second to last paragraph of the question, and confirm that the energy of light in a boosted resonator is greater than the energy of a boosted equivalent mass. How does that not violate a conservation law? Is it in some sense illusory? – Paul Kienitz Jul 17 '16 at 06:58
  • 1
    The total energy - actually that is your result, and so is the total momentum. Your $E_\beta + E_{-\beta}$ is my $E$: you have $E_{\pm\beta} = \gamma(1\pm\beta)^2E_0$, $E_\beta + E_{-\beta} = \gamma\left[(1+\beta)^2 + (1-\beta)^2\right] E_0 = 2\gamma\left[ 1 + \beta^2\right] E_0$. Same for total momentum. About notations, I tried to avoid the heavier things, but do let me know which ones trouble you. As for the significance of the result, see my update. I wrote in a hurry, so please let me know if it is too telegraphic. – udrv Jul 17 '16 at 18:32
  • Holy crap -- I never thought of how the radiation pressure against the mirrors is an additional store of energy in the rest frame. This is starting to make physical sense now. I think we've probably got it now -- lemme study this a bit more. Sorry it's too late for the bounty. – Paul Kienitz Jul 17 '16 at 20:33
  • No, wait... adding mirrors around an existing light beam does not add energy. – Paul Kienitz Jul 18 '16 at 01:58
  • 1
    Radiation pressure is defined at every point in the photon gas, not just at the mirrors. Notice that I never mentioned a macroscopic "total pressure", although pressure on the mirrors, or on any surface embedded in the gas, can be easily calculated. And never mind the bounty, the problem is surely fun anyway. – udrv Jul 18 '16 at 14:46
  • As best as I can understand it, treating the pressure as a store of energy just gives you the energy you already had in the photons. If you do work against that pressure to push the mirrors closer together, that increases energy by blueshifting the light inside, and if you let it push the mirrors very far apart, the light's frequency drops toward zero. So I'm still left looking at a starting energy of $2E_0$ and an ending energy of $2\gamma(1+\beta^2)E_0$, and wondering how it's possible to bridge that gap. – Paul Kienitz Jul 18 '16 at 18:06
  • 1
    Well, I didn't treat "pressure as a store of energy" just because it fit the bill here. I merely reformulated the problem in terms of stress-energy densities, which is in fact the standard approach for relativistic fields. The immediate advantage is that energy-momentum conservation is firmly observed, provided the stress-energy tensor is correctly calculated. I added another formal proof on this, and it shows that the source of the discrepancy is of the nature of a work differential on the mirrors. – udrv Jul 20 '16 at 18:54
  • 1
    But I confess there is still something about it that escapes me at the moment. In any case, the pressure does not just redistribute energy, and I don't think you need to move the mirrors much to gauge the work against them. – udrv Jul 20 '16 at 18:54
  • I haven't read your second update in detail yet, but in the meantime, I'd like to know the answer to what should be an easy question: including the pressure effect, what is the total energy in the rest frame, relative to $Ln_0\hbar\omega_0$? And is its ratio to the moving energy a factor of $\gamma$ or of $\gamma(1+\beta)$? – Paul Kienitz Jul 20 '16 at 20:28
  • I added a new section to the bottom of the question. It's clear now that the math is going to keep on agreeing -- we've now arrived at the same answer five different ways. The question is how that can not be a paradox. – Paul Kienitz Jul 21 '16 at 03:24
  • 1
    Was hoping to have a definitive answer by now, but am still looking into it. For that it's worth, here is what I have at the moment. In my notation, the total energy in the resonator rest frame is $2E_0 = L(2n_0\hbar\omega_0)$, so $E_0 = n_0\hbar\omega_0 = E_{0,\text{forward}} = E_{0,\text{backward}}$. – udrv Jul 22 '16 at 17:29
  • 1
    As for the factor to the moving energy: My best calculation, as in the last update, says that stress-energy conservation gives the total energy in the moving resonator as $E (= E_{\text{forward}} + E_{\text{backward}}) = \gamma(1+\beta^2) (2E_0)$, not $\gamma (2E_0)$, due to a certain contribution from boundary conditions on the mirrors, which ultimately come from radiation pressure. – udrv Jul 22 '16 at 17:31
  • 1
    Following standard interpretations, I could have handwaved this term as due to momentum flux or (work of) pressure on the front mirror and tension on the back one, something that has to be compensated by strain/potential energy in the resonator (as knzhou suggested), but I would still have to explain at least two other things: – udrv Jul 22 '16 at 17:33
  • 1
  • Why this still happens if we leave the resonator in place and move the observer from the resonator frame to the moving frame (work done on observer, not resonator). 2) If we follow your suggestion and remove the resonator altogether to look at the e.m. field itself, the boundary term is still there and the result still holds, but there isn't any external strain or potential energy to invoke. Bottom line: either my math is missing something despite my efforts to be thorough, or it is correct, but I still have to make meaningful sense, physically, of the term that explains it all.
  • – udrv Jul 22 '16 at 17:34
  • 1
    Long way to say I got the math, I don't get what it represents D: In the meantime, looking into one other idea, will post if it adds anything new. – udrv Jul 22 '16 at 17:35
  • So we're all in the same boat now. Thanks for your hard work. – Paul Kienitz Jul 22 '16 at 18:26
  • 2
    I'm just gonna step in to say, if this ever gets resolved, you two might want to write a paper! It's a fantastic paradox and all the figures are already drawn. I think AJP and EJP especially love this kind of stuff. – knzhou Jul 25 '16 at 17:48
  • 2
    Wow, thanks! If it turns out it's not been done before, why not? This is one of those awesomely simple, yet maddeningly nontrivial problems that raise a host of good questions. – udrv Jul 25 '16 at 21:40
  • @PaulKienitz Regarding my question (2) above: If the field inside the region considered (no resonator) is to hold its pressure, there must also be pressure from the field outside the region, so this is equivalent to strain/potential energy. Same applies to perfect gas of massive particles, as opposed to "dust". Dust has common rest frame for its particles but no pressure there, gas has no common rest frame for all particles, but does have non-zero pressure. – udrv Jul 28 '16 at 15:21
  • @urdv But the pressure in this case consists only of a hypothetical potential to transfer momentum if anything should happen to be moved into that space. No actual force is being exerted on anything. I would describe it as like two dusts passing each other at high speed with no collisions. – Paul Kienitz Jul 28 '16 at 18:52
  • Yes, pressure as in an ideal gas, or your two collision-less dusts passing each other. It gives momentum flux or transfer across the boundary surface (or any other surface), even if the net transfer is obviously null. Since momentum transfer per unit time in either direction is force, the outside-in component acts as restoring force (or mirror) against the inside-out component. Not saying that this takes away the elephant though. – udrv Jul 29 '16 at 04:16