International Journal of Theoretical and Mathematical Physics
p-ISSN: 2167-6844 e-ISSN: 2167-6852
2024; 14(3): 67-69
doi:10.5923/j.ijtmp.20241403.01
Received: Jun. 10, 2024; Accepted: Jun. 22, 2024; Published: Jun. 24, 2024

Edward P. Krisner
Department of Mathematics, University of Pittsburgh at Greensburg, Greensburg, USA
Correspondence to: Edward P. Krisner, Department of Mathematics, University of Pittsburgh at Greensburg, Greensburg, USA.
| Email: | ![]() |
Copyright © 2024 The Author(s). Published by Scientific & Academic Publishing.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/

One would be hard pressed to find a derivation of the Lorentz transformation (LT) that uses only the two postulates of special relativity, and that is entirely mathematically rigorous. Most, if not all, derivations presume that, in addition to the two postulates, the LT has a particular form such as linearity.We will give several examples of such derivations from several esteemed textbooks. We will also discuss how Einstein derived the LT in his revolutionary 1905 paper, “On the Electrodynamics of Moving Bodies.” Then, we will provide a rigorous derivation of the LT using only the two postulates.
Keywords: Lorentz Transform, Mathematically Rigorous Derivation, Special Relativity, Postulates of Relativity
Cite this paper: Edward P. Krisner, Does Linearity of the Lorentz Transformation Follow from the two Postulates of Special Relativity, International Journal of Theoretical and Mathematical Physics, Vol. 14 No. 3, 2024, pp. 67-69. doi: 10.5923/j.ijtmp.20241403.01.
![]() | (1) |
are constants to be determined. As will be elaborated upon in the following section, some authors will, at best, give a “hand-waving” argument as to why the transformation is linear. Failure to provide a rigorous proof of why the two postulates imply linearity leaves open the possibility that there may be a nonlinear transformation that is consistent with the two postulates. Our ultimate goal is to eliminate nonlinearity as a possibility. In the following section we will discuss several example derivations of the LT that appear in the literature. Subsequently, we will provide a rigorous proof that the LT is a consequence of the two postulates without making any additional assumptions.
and
. We’ll assume that the
frame is moving at a speed
in the direction of increasing values of
. Moreover, our derivation will be based on the conventional that
whenever
.Since there is no motion in either the
or
direction it is taken that
and
. So all that remains to determine the interrelatedness between
and
.
was previously established with rigor. Hence, an observer at rest in S’ will also see these same three events occur at equally spaced times,
and
.Subsequently, the transformation given by
where
is a constant, is considered. Under this particular transformation, the authors demonstrate that an observer in
does not see the three events occurring at three equally spaced times. The fact that equally spaced times in one frame yields equally spaced times in the other may indeed only be possible if the frames are related via a linear transformation. But to provide a rigorous proof, the task of ruling out an arbitrary nonlinear transformation must be completed. ![]() | (2) |
are the coordinates of an arbitrary inertial reference frame. This motivates the likes of [2] and [3] to consider a transformation of the form ![]() | (3) |
follows from both (2) and (3). However, once again, we have a situation whereby
is tacitly assumed to be independent of both
and
. So not only is (3) a linear transformation, it’s a linear transformation of a particular form. It’s worth re-emphasizing that we are not arguing that the (3) is incorrect. We will ultimately prove that it is correct. But if
is simply assumed to be independent of
and
, then a transformation of the form (3) in which
depends on
and/or
remains possible. It is worth pursuing the argument presented in [2] a bit further. In letting
it follows from (3) that
But
is the velocity of
relative to the
. Thus, ![]() | (4) |
![]() | (5) |
is known as the Lorentz factor.Substituting (5) into (3) yields the familiar LT. The drawback of this approach is that there’s still no guarantee that
is independent of x and t. For example, in choosing
the above argument can be implemented to draw the same conclusion.
if and only if
. This is certainly true in the way we described the frames
and
. Then they claim that ![]() | (6) |
is a function to be determined. Clearly, (6) certainly implies that
if and only if
. The converse, however, is not true. As a random example,
also ensures that
if and only if
. Yet again we have a situation in which prior knowledge of the LT is used to derive the LT.
denote a physical quantity of some kind at a spatial location
. Homogeneity requires the function
to be invariant under spatial translations,
. How this implies that two inertial reference frames are related by a linear transformation is unclear.
frame is related to the time in the
frame by the equation![]() | (7) |
frame, and is of length
as measured by an observer who is at rest in that frame. An observer in the
frame will determine the length of the rod to be ![]() | (8) |
can be written in terms of
and
. As asserted in Section 2,
travels in the direction of increasing values of
at a speed
. That is, given an arbitrary constant
the plane
in the
frame travels in the direction of increasing values of
at speed
. So a stationary observer in
will describe the motion of such a traveling plane via the kinematic equation
where
is the distance between the traveling plane and the plane
at
according to the observer in
.Let us consider a specific example in which
.![]() | Figure 1. The plane at time ![]() |
at
the moment in which the planes
and
align. While a stationary observer in
at
will see the plane
as stationary and 17 units away, a stationary observer in
at
will see the plane at a contracted length of
units away at that moment. Thus, we substitute
for
in the kinematic equation. Doing so yields
which can be rewritten as
Of course there is nothing special about the number 17. An arbitrary but fixed value of
refers to a stationary plane in
and a plane traveling at speed
in
. Thus, the very same argument can be applied to
that was applied to 17 to obtain ![]() | (9) |
in terms of
and
. In proving (9) we viewed planes of the form
as traveling in the direction of increasing values of
at speed
. It is equally legitimate to consider
as the moving frame and
as the stationary frame. That is, we can repeat the same method that we used to obtain (9) to deduce ![]() | (10) |
or equivalently
By (9), we can substitute
for
in the above equation. Doing this and a little algebra yields ![]() | (11) |
which holds whenever
.