Circular and Elliptical Orbits (\(e < 1\))#
For circular and elliptical orbits, combining Eq. (189) and Eq. (190) results in:
Mean Anomaly#
We define the term in the square brackets in Eq. (193) to be the mean anomaly, \(M_e\), where the subscript \(e\) indicates that this is for the ellipse. We will have different equations for the parabola and hyperbola.
The mean anomaly is a monotonically increasing function of the true anomaly, as shown in Fig. 49. This is good because it means that \(M_e\) can be used in place of \(\nu\) for all four quadrants on the \(x\)-\(y\) plane. If \(M_e\) had a peak, we would have to be concerned about which quadrant we were in.
Notice that when \(e = 0\) (a circular orbit), the mean anomaly and true anomaly are equal.
Plugging Eq. (194) into Eq. (193) and solving for \(M_e\), we find:
If we know the period of the orbit, we can simplify the equation for the mean anomaly:
For a circle, the mean anomaly equals the true anomaly:
Further solution is not needed if the orbit is circular.
Eccentric Anomaly#
We can simplify the equation for \(M_e\) even further by using an auxiliary angle, \(E\), called the eccentric anomaly. \(E\) is defined as the angle from the \(x\) axis to a point on a circle that circumscribes the orbital ellipse and where the point is located vertically above a point with true anomaly \(\nu\) on the ellipse. This is shown in Fig. 48 and Fig. 50.
From Fig. 50, we see that the distance \(OP\) is equal to:
where \(a\) is the semimajor axis of the ellipse, \(e\) is the eccentricity, and the distance \(FP\) is related to the true anomaly:
Combining Eq. (198) and Eq. (199), we find:
Replacing \(r\) with the orbit equation in terms of the semimajor axis and simplifying, we find:
or, solving for \(\nu\):
Unfortunately, these equations result in a quadrant ambiguity. We can resolve this by some further trigonometric transformations:
The inverse tangent function is not multivalued for a given value of \(\nu\) or \(E\), so this resolves the quadrant ambiguity.
Note
Note: Eq. (203) can be solved in Python with np.arctan()
and in Matlab with atan()
. It is not necessary to use np.arctan2()
or atan2()
. Make sure all your arguments are in terms of radians!
Kepler’s Equation#
Some further trigonometry with Eq. (201) and Eq. (203) yields Kepler’s Equation:
This gives the relationship between mean anomaly and eccentric anomaly. The value of \(M_e\) monotonically increases as a function of \(E\), as shown in Fig. 51 for several values of \(e\).
Solution Procedures#
Now we have all the pieces we need to apply the solution procedures discussed in the introductory section.
Given True Anomaly, Find Time Since Periapsis#
Given a value of the true anomaly \(\nu\), the eccentric anomaly \(E\) can be calculated from Eq. (203). Then, Kepler’s Equation can be solved to find \(M_e\), and the time since periapsis is found by:
Given Time Since Periapsis, Find True Anomaly#
If, on the other hand, we are given the time since periapsis and want to find the true anomaly, we must first solve Eq. (196) for \(M_e\). Then we need to solve Kepler’s equation for \(E\). Unfortunately, this equation is transcendental in \(E\), so it cannot be solved analytically. There are several methods to solve Kepler’s equation, depending on the level of accuracy required and the access to computational tools.
Once \(E\) is determined, Eq. (203) can be solved for \(\nu\).
Newton’s Method to Solve Kepler’s Equation#
For Newton’s method, we seek the roots of a function, \(f(E) = 0\). In this case, we can rearrange Kepler’s equation as:
It is also convenient to supply the derivative, when it is available, since this improves the convergence rate of the algorithm. The derivative is:
These equations can be provided to standard computational solvers for root finding algorithms.
Infinite Series Solutions of Kepler’s Equation#
Although there are no analytical solutions for Kepler’s equation, several people have developed infinite series solutions. The first was developed by Lagrange:
where the coefficients \(a_n\) are given by:
and where \(\mathrm{floor}(x)\) means to take the next lowest integer relative to \(x\).
This infinite series converges for \(e < 0.6627434193\), a limit that was proven by Laplace, and is therefore typically called the Laplace limit. For larger values of \(e\), the series diverges.
Another infinite series was developed by Bessel, which converges for all \(e < 1\):
where \(J_n\) are the Bessel functions of the first kind, defined by their own infinite series:
There are other feasible series solutions to Kepler’s equation, some of which are discussed by Colwell [Col92].