General relativity is a theory of how gravity works. In General Relativity, gravity is not a force, but rather an effect caused by curved spacetime. This conclusion is based on two fundamental principles:
The Equivalence Principle, which says that gravity is indistinguishable from the effect of an accelerating reference frame
The Principle of Covariance, which says that all laws of physics should be in the same form in all reference frames
The culminating breakthrough of General Relativity is summarized succinctly by the Einstein Field Equations:
However, to truly understand what the equation means, we need to go slowly and build our understanding of relativity first. And it often helps to start with the physical intuition underlying relativity - the equivalence principle.
Consider an observer inside a closed room. This room is accelerating upwards at a constant rate of 9.81m/s2. The observer holds a 1-kilogram ball. What would happen if the observer would drop a ball?
Well, we know that the room is under constant upwards acceleration, so when the observer releases the ball, the floor of the room will travel upwards towards it at 9.81m/s2. However, to the observer, who is moving upwards along with the floor, it would look like everything is stationary, and the ball is the object that is falling down.
If we use Newton’s second law of motion, we find that the force experienced by the ball would be given by:
Now, consider another observer, inside another closed room. This room is placed on the surface of the Earth. The observer inside this second room drops another 1-kilogram ball. What would happen next?
Well, the ball will experience the force of Earth’s gravity, causing it to fall downwards as well. If we use Newton’s law of universal gravitation, we find that the force experienced by the ball would be given by:
Notice that this is the same result as our closed room moving upwards through space at 9.81m/s2. The effect of gravity and of an accelerated reference frame is the same. But this is just a coincidence, right? Or is it...?
Imagine you were in either the closed room in space or the closed room on Earth, but you weren’t told which one. Is there any way you could tell which room it was? No, it would be impossible to perform an experiment to tell the closed room in space from the closed room on Earth.
So, gravity is indistinguishable from accelerated reference frames. This is the equivalence principle.
To understand General Relativity, we must first be familiar with the idea of events.
An event is anything that happens. This could be, “a spaceship flew through my window”. Yes, that is an event.
We can describe the event by finding the position and time it occured, relative to a chosen reference frame:
E.g. “a spaceship flew through my window at 5 meters left and 6 meters in front of my head, at 2 meters above sea level, at 2:30 pm, on January 15th, 2021”
We have a x-coordinate (5 meters left of my head), a y-coordinate (6 meters in front of my head), a z-coordinate (2 meters above sea level), and a time coordinate (2:30 pm 1/15/21)
To describe the distance between two events, we use a spacetime metric. This could be the Euclidean metric δαβ, the Minkowski metric ηαβ, or the general metric gαβ. As we’ve seen before, the Minkowski metric in particular is given by ηαβ, where α and β represent the (α,β)-th entry of the matrix:
Additionally, since the components of metric tensors can vary as spacetime is curved and distances change, we shouldn’t expect that the metric of spacetime will always be ηαβ; in fact, that would only be true for flat (uncurved) spacetime. So, we replace ηαβ with the more general form of the metric tensor gαβ, which applies to all spacetime metrics. We finally arrive at the general metric tensor:
Previously, we have already been introduced to index notation - for example, we saw that position could be represented by xμ=x,y,z, and that an equation such as vμ=dtdxμ is actually a system of three equations, one each for x, y, and z. W’ve also seen that we can generally let the letters we use for tensor indices to be whatever we want, and the equations will still be consistent. For example, gαβ=gij=gαγ=gμγ. There is a difference between these two forms of indices, however.
The first type, where we have a single index, is called a free index. Free indices result in a different equation for each coodinate - for instance, given Fμ=maμ, then we have the system of equations:
The second type, where we have an index that repeats once as a lower index and once as an upper index in a term, is used to stand for summation. For example, recall the multivariable chain rule:
Note that we have an index i that appears in the lower index and one that appears in the upper index, so by the Einstein summation convention, we can get rid of the summation sign:
Notice that j appears once as a top index, and once as a bottom index. Thus, j must be a summation index - in GR we call these dummy indices. In contrast, i and k don’t appear twice on the top and bottom, so are free indices, representing a system of equations. Therefore, if we were to expand the dummy summation index, where j goes from 1 - 3, we’ll get:
In general, dummy indices can be changed at will, but free indices cannot. This is because changing a dummy index is just changing the index you use for the summation, which is totally arbitrary, but changing a free index would change the system of equations into a completely different system of equations.
The use of the Einstein summation convention allows the equations of General Relativity to be written very compactly. For instance, take the definition of the Ricci tensor, given by:
Observe that only i and j are free indices - the other two indices m and k appear both on upper and on lower indices, making them summation indices. If we were to fully write out just the summation of the Ricci tensor, where m and k both sum from 0 to 3, and we assume 0=t, 1=x, 2=y, 3=z, we’d get:
And remember, this is just expanding the summations! This isn’t even writing out the system of equations for each free index! You can go and see the full Einstein field equations with each system of equations written out at https://github.com/bnschussler/Fully-Expanded-Einstein-Field-Equations, and it is 22 pages long!
Finally, in General Relativity, partial derivatives are often written in a compact way:
We know from Newton’s first law of motion that an object in motion stays in motion at constant speed - that is, it undergoes no acceleration. In other terms:
This is why, for instance, a ball rolling along an infinitely long hallway will keep going in a path in the same direction - its velocity vector, and thus its directions, stays constant. In nice Euclidean space, we call this path a “straight line” - the effect of going ahead in the same direction forever.
As we know, in Euclidean space, a straight line is the shortest path between two points, which we call a geodesic. We might be tempted to phrase Newton’s first law to say that “a particle in motion will travel along a straight line”. However, in non-Euclidean geometries, a geodesic is not necessarily a straight line. So, we must generalize Newton’s first law with modifications: a particle in motion will move along a geodesic.
To formulate this law mathematically, we can say that the action along the path xk(λ) between 2 points A=xk(0) and B=xk(1) must be minimized. We include the units of mc to get the units for action right, so the full action along the path xk(λ) is:
Now, to actually be able to solve, we need to write the integrand in terms of our path parameter λ. To do this, we can divide dxi and dxj both by dλ, then, to keep the integrand the same, multiply by dλ⋅dλ=dλ2:
We can apply the familiar Euler-Lagrange equations to our Lagrangian, as we’ve done before, to find the equations of motion for the particle traveling along the path:
Let’s first take the derivative with respect to xk. We use the chain rule and the fact that ∂x∂u=21u−21∂x∂u. In our Lagrangian, the only part that actually depends on xk is gij, so the rest of the Lagrangian can be thought of as a constant. So we have:
Finally, let’s remember the Einstein summation convention, which tell us that dummy indices can be changed to whatever indices we want, because they are just summation indices. Here, since i and j appear both as lower indices and as upper indices, they are dummy indices. That means:
Here, a can be any index that isn’t already one of the free indices (the free indices are i and k here). We’ll choose a, but really it can be anything.
And we can factor out the common factor of the Lagrangian and get rid of it by dividing it from both sides of the equation (remember zero divided by anything is still zero):
Notice that this equation has three dummy indices - i (which appears both in d2xi as upper index and gik as lower index), a (which appears both in the lower part of a partial derivative and upper part of another derivative dxa), and j (which appears as a lower index in gij and an upper index in dxj). Remember this! It’ll be very important later!
Note here that again, a and i are dummy indices - a appears on the lower partial derivative and upper partial derivative terms, and i appears in the lower gik and upper dxi term. So let’s do two index substitutions which will make the equation so much easier to solve. First, note that the equation we extracted this expression from had three dummy indices - let’s reduce it to just two by swapping a with j:
Where (this is clearer if we expand out the partial derivatives term by term, but it can be seen by just glancing at the equation too) the free indices are μ, j, and i (which is why they appear on the Christoffel symbol itself), while the dummy index k is used for summation (which is why it’s only present in the partial derivatives).
Note that for particles with mass, the paramter λ is interpreted to be the proper time τ, but really λ can be any invariant parameter along a particle’s trajectory. If we set λ=τ, then the equations become:
Any path that obeys the geodesic equation in spacetime is a geodesic. Since spacetime can be curved, these geodesics are not straight lines. The curvature of spacetime - what we experience as gravity - causes distances to change. This causes the metric to change, which in turn affects the paths of particles.