mirror of
https://github.com/saitohirga/WSJT-X.git
synced 2025-10-24 01:20:22 -04:00
596 lines
25 KiB
Plaintext
596 lines
25 KiB
Plaintext
|
[section:root_finding_examples Examples of Root-Finding (with and without derivatives)]
|
||
|
|
||
|
[import ../../example/root_finding_example.cpp]
|
||
|
[import ../../example/root_finding_n_example.cpp]
|
||
|
[import ../../example/root_finding_multiprecision_example.cpp]
|
||
|
|
||
|
The examples demonstrate how to use the various tools for
|
||
|
[@http://en.wikipedia.org/wiki/Root-finding_algorithm root finding].
|
||
|
|
||
|
We start with the simple cube root function `cbrt` ( C++ standard function name
|
||
|
[@http://en.cppreference.com/w/cpp/numeric/math/cbrt cbrt])
|
||
|
showing root finding __cbrt_no_derivatives.
|
||
|
|
||
|
We then show how use of derivatives can improve the speed of convergence.
|
||
|
|
||
|
(But these examples are only a demonstration and do not try to make
|
||
|
the ultimate improvements of an 'industrial-strength'
|
||
|
implementation, for example, of `boost::math::cbrt`, mainly by using a better computed initial 'guess'
|
||
|
at [@boost:/libs/math/include/boost/math/special_functions/cbrt.hpp cbrt.hpp]).
|
||
|
|
||
|
Then we show how a higher root (__fifth_root) [super 5][radic] can be computed,
|
||
|
and in
|
||
|
[@../../example/root_finding_n_example.cpp root_finding_n_example.cpp]
|
||
|
a generic method for the __nth_root that constructs the derivatives at compile-time.
|
||
|
|
||
|
These methods should be applicable to other functions that can be differentiated easily.
|
||
|
|
||
|
[section:cbrt_eg Finding the Cubed Root With and Without Derivatives]
|
||
|
|
||
|
First some `#includes` that will be needed.
|
||
|
|
||
|
[root_finding_include_1]
|
||
|
|
||
|
[tip For clarity, `using` statements are provided to list what functions are being used in this example:
|
||
|
you can, of course, partly or fully qualify the names in other ways.
|
||
|
(For your application, you may wish to extract some parts into header files,
|
||
|
but you should never use `using` statements globally in header files).]
|
||
|
|
||
|
Let's suppose we want to find the root of a number ['a], and to start, compute the cube root.
|
||
|
|
||
|
So the equation we want to solve is:
|
||
|
|
||
|
__spaces ['f(x) = x[cubed] -a]
|
||
|
|
||
|
We will first solve this without using any information
|
||
|
about the slope or curvature of the cube root function.
|
||
|
|
||
|
Fortunately, the cube-root function is 'Really Well Behaved' in that it is monotonic
|
||
|
and has only one root (we leave negative values 'as an exercise for the student').
|
||
|
|
||
|
We then show how adding what we can know about this function, first just the slope
|
||
|
or 1st derivative ['f'(x)], will speed homing in on the solution.
|
||
|
|
||
|
Lastly, we show how adding the curvature ['f''(x)] too will speed convergence even more.
|
||
|
|
||
|
[h3:cbrt_no_derivatives Cube root function without derivatives]
|
||
|
|
||
|
First we define a function object (functor):
|
||
|
|
||
|
[root_finding_noderiv_1]
|
||
|
|
||
|
Implementing the cube-root function itself is fairly trivial now:
|
||
|
the hardest part is finding a good approximation to begin with.
|
||
|
In this case we'll just divide the exponent by three.
|
||
|
(There are better but more complex guess algorithms used in 'real life'.)
|
||
|
|
||
|
[root_finding_noderiv_2]
|
||
|
|
||
|
This snippet from `main()` in [@../../example/root_finding_example.cpp root_finding_example.cpp]
|
||
|
shows how it can be used.
|
||
|
|
||
|
[root_finding_main_1]
|
||
|
|
||
|
[pre
|
||
|
cbrt_noderiv(27) = 3
|
||
|
cbrt_noderiv(28) = 3.0365889718756618
|
||
|
]
|
||
|
|
||
|
The result of `bracket_and_solve_root` is a [@http://www.cplusplus.com/reference/utility/pair/ pair]
|
||
|
of values that could be displayed.
|
||
|
|
||
|
The number of bits separating them can be found using `float_distance(r.first, r.second)`.
|
||
|
The distance is zero (closest representable) for 3[super 3] = 27
|
||
|
but `float_distance(r.first, r.second) = 3` for cube root of 28 with this function.
|
||
|
The result (avoiding overflow) is midway between these two values.
|
||
|
|
||
|
[h3:cbrt_1st_derivative Cube root function with 1st derivative (slope)]
|
||
|
|
||
|
We now solve the same problem, but using more information about the function,
|
||
|
to show how this can speed up finding the best estimate of the root.
|
||
|
|
||
|
For the root function, the 1st differential (the slope of the tangent to a curve at any point) is known.
|
||
|
|
||
|
This algorithm is similar to this [@http://en.wikipedia.org/wiki/Nth_root_algorithm nth root algorithm].
|
||
|
|
||
|
If you need some reminders, then
|
||
|
[@http://en.wikipedia.org/wiki/Derivative#Derivatives_of_elementary_functions derivatives of elementary functions]
|
||
|
may help.
|
||
|
|
||
|
Using the rule that the derivative of ['x[super n]] for positive n (actually all nonzero n) is ['n x[super n-1]],
|
||
|
allows us to get the 1st differential as ['3x[super 2]].
|
||
|
|
||
|
To see how this extra information is used to find a root, view
|
||
|
[@http://en.wikipedia.org/wiki/Newton%27s_method Newton-Raphson iterations]
|
||
|
and the [@http://en.wikipedia.org/wiki/Newton%27s_method#mediaviewer/File:NewtonIteration_Ani.gif animation].
|
||
|
|
||
|
We define a better functor `cbrt_functor_deriv` that returns
|
||
|
both the evaluation of the function to solve, along with its first derivative:
|
||
|
|
||
|
To '['return]' two values, we use a [@http://en.cppreference.com/w/cpp/utility/pair std::pair]
|
||
|
of floating-point values.
|
||
|
|
||
|
[root_finding_1_deriv_1]
|
||
|
|
||
|
The result of [@boost:/libs/math/include/boost/math/tools/roots.hpp `newton_raphson_iterate`]
|
||
|
function is a single value.
|
||
|
|
||
|
[tip There is a compromise between accuracy and speed when chosing the value of `digits`.
|
||
|
It is tempting to simply chose `std::numeric_limits<T>::digits`,
|
||
|
but this may mean some inefficient and unnecessary iterations as the function thrashes around
|
||
|
trying to locate the last bit. In theory, since the precision doubles with each step
|
||
|
it is sufficient to stop when half the bits are correct: as the last step will have doubled
|
||
|
that to full precision. Of course the function has no way to tell if that is actually the case
|
||
|
unless it does one more step to be sure. In practice setting the precision to slightly more
|
||
|
than `std::numeric_limits<T>::digits / 2` is a good choice.]
|
||
|
|
||
|
Note that it is up to the caller of the function to check the iteration count
|
||
|
after the call to see if iteration stoped as a result of running out of iterations
|
||
|
rather than meeting the required precision.
|
||
|
|
||
|
Using the test data in [@../../test/test_cbrt.cpp /test/test_cbrt.cpp] this found the cube root
|
||
|
exact to the last digit in every case, and in no more than 6 iterations at double
|
||
|
precision. However, you will note that a high precision was used in this
|
||
|
example, exactly what was warned against earlier on in these docs! In this
|
||
|
particular case it is possible to compute ['f(x)] exactly and without undue
|
||
|
cancellation error, so a high limit is not too much of an issue.
|
||
|
|
||
|
However, reducing the limit to `std::numeric_limits<T>::digits * 2 / 3` gave full
|
||
|
precision in all but one of the test cases (and that one was out by just one bit).
|
||
|
The maximum number of iterations remained 6, but in most cases was reduced by one.
|
||
|
|
||
|
Note also that the above code omits a probable optimization by computing z[sup2]
|
||
|
and reusing it, omits error handling, and does not handle
|
||
|
negative values of z correctly. (These are left as the customary exercise for the reader!)
|
||
|
|
||
|
The `boost::math::cbrt` function also includes these and other improvements:
|
||
|
most importantly it uses a much better initial guess which reduces the iteration count to
|
||
|
just 1 in almost all cases.
|
||
|
|
||
|
[h3:cbrt_2_derivatives Cube root with 1st & 2nd derivative (slope & curvature)]
|
||
|
|
||
|
Next we define yet another even better functor `cbrt_functor_2deriv` that returns
|
||
|
both the evaluation of the function to solve,
|
||
|
along with its first [*and second] derivative:
|
||
|
|
||
|
__spaces['f''(x) = 6x]
|
||
|
|
||
|
using information about both slope and curvature to speed convergence.
|
||
|
|
||
|
To [''return'] three values, we use a `tuple` of three floating-point values:
|
||
|
[root_finding_2deriv_1]
|
||
|
|
||
|
The function `halley_iterate` also returns a single value,
|
||
|
and the number of iterations will reveal if it met the convergence criterion set by `get_digits`.
|
||
|
|
||
|
The no-derivative method gives a result of
|
||
|
|
||
|
cbrt_noderiv(28) = 3.0365889718756618
|
||
|
|
||
|
with a 3 bits distance between the bracketed values, whereas the derivative methods both converge to a single value
|
||
|
|
||
|
cbrt_2deriv(28) = 3.0365889718756627
|
||
|
|
||
|
which we can compare with the [@boost:/libs/math/doc/html/math_toolkit/powers/cbrt.html boost::math::cbrt]
|
||
|
|
||
|
cbrt(28) = 3.0365889718756627
|
||
|
|
||
|
Note that the iterations are set to stop at just one-half of full precision,
|
||
|
and yet, even so, not one of the test cases had a single bit wrong.
|
||
|
What's more, the maximum number of iterations was now just 4.
|
||
|
|
||
|
Just to complete the picture, we could have called
|
||
|
[link math_toolkit.roots.roots_deriv.schroder `schroder_iterate`] in the last
|
||
|
example: and in fact it makes no difference to the accuracy or number of iterations
|
||
|
in this particular case. However, the relative performance of these two methods
|
||
|
may vary depending upon the nature of ['f(x)], and the accuracy to which the initial
|
||
|
guess can be computed. There appear to be no generalisations that can be made
|
||
|
except "try them and see".
|
||
|
|
||
|
Finally, had we called `cbrt` with [@http://shoup.net/ntl/doc/RR.txt NTL::RR]
|
||
|
set to 1000 bit precision (about 300 decimal digits),
|
||
|
then full precision can be obtained with just 7 iterations.
|
||
|
To put that in perspective,
|
||
|
an increase in precision by a factor of 20, has less than doubled the number of
|
||
|
iterations. That just goes to emphasise that most of the iterations are used
|
||
|
up getting the first few digits correct: after that these methods can churn out
|
||
|
further digits with remarkable efficiency.
|
||
|
|
||
|
Or to put it another way: ['nothing beats a really good initial guess!]
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_finding_example.cpp root_finding_example.cpp],
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:lambda Using C++11 Lambda's]
|
||
|
|
||
|
Since all the root finding functions accept a function-object, they can be made to
|
||
|
work (often in a lot less code) with C++11 lambda's. Here's the much reduced code for our "toy" cube root function:
|
||
|
|
||
|
[root_finding_2deriv_lambda]
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_finding_example.cpp root_finding_example.cpp],
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:5th_root_eg Computing the Fifth Root]
|
||
|
|
||
|
Let's now suppose we want to find the [*fifth root] of a number ['a].
|
||
|
|
||
|
The equation we want to solve is :
|
||
|
|
||
|
__spaces['f](x) = ['x[super 5] -a]
|
||
|
|
||
|
If your differentiation is a little rusty
|
||
|
(or you are faced with an function whose complexity makes differentiation daunting),
|
||
|
then you can get help, for example, from the invaluable
|
||
|
[@http://www.wolframalpha.com/ WolframAlpha site.]
|
||
|
|
||
|
For example, entering the commmand: `differentiate x ^ 5`
|
||
|
|
||
|
or the Wolfram Language command: ` D[x ^ 5, x]`
|
||
|
|
||
|
gives the output: `d/dx(x ^ 5) = 5 x ^ 4`
|
||
|
|
||
|
and to get the second differential, enter: `second differentiate x ^ 5`
|
||
|
|
||
|
or the Wolfram Language command: `D[x ^ 5, { x, 2 }]`
|
||
|
|
||
|
to get the output: `d ^ 2 / dx ^ 2(x ^ 5) = 20 x ^ 3`
|
||
|
|
||
|
To get a reference value, we can enter: [^fifth root 3126]
|
||
|
|
||
|
or: `N[3126 ^ (1 / 5), 50]`
|
||
|
|
||
|
to get a result with a precision of 50 decimal digits:
|
||
|
|
||
|
5.0003199590478625588206333405631053401128722314376
|
||
|
|
||
|
(We could also get a reference value using __multiprecision_root).
|
||
|
|
||
|
The 1st and 2nd derivatives of x[super 5] are:
|
||
|
|
||
|
__spaces['f]\'(x) = 5x[super 4]
|
||
|
|
||
|
__spaces['f]\'\'(x) = 20x[super 3]
|
||
|
|
||
|
[root_finding_fifth_functor_2deriv]
|
||
|
[root_finding_fifth_2deriv]
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_finding_example.cpp root_finding_example.cpp] and
|
||
|
[@../../example/root_finding_n_example.cpp root_finding_n_example.cpp].
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:multiprecision_root Root-finding using Boost.Multiprecision]
|
||
|
|
||
|
The apocryphally astute reader might, by now, be asking "How do we know if this computes the 'right' answer?".
|
||
|
|
||
|
For most values, there is, sadly, no 'right' answer.
|
||
|
This is because values can only rarely be ['exactly represented] by C++ floating-point types.
|
||
|
What we do want is the 'best' representation - one that is the nearest __representable value.
|
||
|
(For more about how numbers are represented see __floating_point).
|
||
|
|
||
|
Of course, we might start with finding an external reference source like
|
||
|
__WolframAlpha, as above, but this is not always possible.
|
||
|
|
||
|
Another way to reassure is to compute 'reference' values at higher precision
|
||
|
with which to compare the results of our iterative computations using built-in like `double`.
|
||
|
They should agree within the tolerance that was set.
|
||
|
|
||
|
The result of `static_cast`ing to `double` from a higher-precision type like `cpp_bin_float_50` is guaranteed
|
||
|
to be the [*nearest representable] `double` value.
|
||
|
|
||
|
For example, the cube root functions in our example for `cbrt(28.)` compute
|
||
|
|
||
|
`std::cbrt<double>(28.) = 3.0365889718756627`
|
||
|
|
||
|
WolframAlpha says `3.036588971875662519420809578505669635581453977248111123242141...`
|
||
|
|
||
|
`static_cast<double>(3.03658897187566251942080957850) = 3.0365889718756627`
|
||
|
|
||
|
This example `cbrt(28.) = 3.0365889718756627`
|
||
|
|
||
|
[tip To ensure that all potentially significant decimal digits are displayed use `std::numeric_limits<T>::max_digits10`
|
||
|
(or if not available on older platforms or compilers use `2+std::numeric_limits<double>::digits*3010/10000`).[br]
|
||
|
|
||
|
Ideally, values should agree to `std::numeric-limits<T>::digits10` decimal digits.
|
||
|
|
||
|
This also means that a 'reference' value to be [*input] or `static_cast` should have
|
||
|
at least `max_digits10` decimal digits (17 for 64-bit `double`).
|
||
|
]
|
||
|
|
||
|
If we wish to compute [*higher-precision values] then, on some platforms, we may be able to use `long double`
|
||
|
with a higher precision than `double` to compare with the very common `double`
|
||
|
and/or a more efficient built-in quad floating-point type like `__float128`.
|
||
|
|
||
|
Almost all platforms can easily use __multiprecision,
|
||
|
for example, __cpp_dec_float or a binary type __cpp_bin_float types,
|
||
|
to compute values at very much higher precision.
|
||
|
|
||
|
[note With multiprecision types, it is debatable whether to use the type `T` for computing the initial guesses.
|
||
|
Type `double` is like to be accurate enough for the method used in these examples.
|
||
|
This would limit the exponent range of possible values to that of `double`.
|
||
|
There is also the cost of conversion to and from type `T` to consider.
|
||
|
In these examples, `double` is used via `typedef double guess_type`.]
|
||
|
|
||
|
Since the functors and functions used above are templated on the value type,
|
||
|
we can very simply use them with any of the __multiprecision types. As a reminder,
|
||
|
here's our toy cube root function using 2 derivatives and C++11 lambda functions to find the root:
|
||
|
|
||
|
[root_finding_2deriv_lambda]
|
||
|
|
||
|
Some examples below are 50 decimal digit decimal and binary types
|
||
|
(and on some platforms a much faster `float128` or `quad_float` type )
|
||
|
that we can use with these includes:
|
||
|
|
||
|
[root_finding_multiprecision_include_1]
|
||
|
|
||
|
Some using statements simplify their use:
|
||
|
|
||
|
[root_finding_multiprecision_example_1]
|
||
|
|
||
|
They can be used thus:
|
||
|
|
||
|
[root_finding_multiprecision_example_2]
|
||
|
|
||
|
A reference value computed by __WolframAlpha is
|
||
|
|
||
|
N[2^(1/3), 50] 1.2599210498948731647672106072782283505702514647015
|
||
|
|
||
|
which agrees exactly.
|
||
|
|
||
|
To [*show] values to their full precision, it is necessary to adjust the `std::ostream` `precision` to suit the type, for example:
|
||
|
|
||
|
[root_finding_multiprecision_show_1]
|
||
|
|
||
|
[root_finding_multiprecision_example_3]
|
||
|
|
||
|
which outputs:
|
||
|
|
||
|
[pre
|
||
|
cbrt(2) = 1.2599210498948731647672106072782283505702514647015
|
||
|
|
||
|
value = 2, cube root =1.25992104989487
|
||
|
value = 2, cube root =1.25992104989487
|
||
|
value = 2, cube root =1.2599210498948731647672106072782283505702514647015
|
||
|
]
|
||
|
|
||
|
[tip Be [*very careful] about the floating-point type `T` that is passed to the root-finding function.
|
||
|
Carelessly passing a integer by writing
|
||
|
`cpp_dec_float_50 r = cbrt_2deriv(2);` or `show_cube_root(2);`
|
||
|
will provoke many warnings and compile errors.
|
||
|
|
||
|
Even `show_cube_root(2.F);` will produce warnings because `typedef double guess_type` defines the type
|
||
|
used to compute the guess and bracket values as `double`.
|
||
|
|
||
|
Even more treacherous is passing a `double` as in `cpp_dec_float_50 r = cbrt_2deriv(2.);`
|
||
|
which silently gives the 'wrong' result, computing a `double` result and [*then] converting to `cpp_dec_float_50`!
|
||
|
All digits beyond `max_digits10` will be incorrect.
|
||
|
Making the `cbrt` type explicit with `cbrt_2deriv<cpp_dec_float_50>(2.);` will give you the desired 50 decimal digit precision result.
|
||
|
] [/tip]
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_finding_multiprecision_example.cpp root_finding_multiprecision_example.cpp].
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:nth_root Generalizing to Compute the nth root]
|
||
|
|
||
|
If desired, we can now further generalize to compute the ['n]th root by computing the derivatives [*at compile-time]
|
||
|
using the rules for differentiation and `boost::math::pow<N>`
|
||
|
where template parameter `N` is an integer and a compile time constant. Our functor and function now have an additional template parameter `N`,
|
||
|
for the root required.
|
||
|
|
||
|
[note Since the powers and derivatives are fixed at compile time, the resulting code is as efficient as as if hand-coded as the cube and fifth-root examples above.
|
||
|
A good compiler should also optimise any repeated multiplications.]
|
||
|
|
||
|
Our ['n]th root functor is
|
||
|
|
||
|
[root_finding_nth_functor_2deriv]
|
||
|
|
||
|
and our ['n]th root function is
|
||
|
|
||
|
[root_finding_nth_function_2deriv]
|
||
|
|
||
|
[root_finding_n_example_2]
|
||
|
|
||
|
produces an output similar to this
|
||
|
|
||
|
[root_finding_example_output_1]
|
||
|
|
||
|
[tip Take care with the type passed to the function. It is best to pass a `double` or greater-precision floating-point type.
|
||
|
|
||
|
Passing an integer value, for example, `nth_2deriv<5>(2)` will be rejected, while `nth_2deriv<5, double>(2)` converts the integer to `double`.
|
||
|
|
||
|
Avoid passing a `float` value that will provoke warnings (actually spurious) from the compiler about potential loss of data,
|
||
|
as noted above.]
|
||
|
|
||
|
[warning Asking for unreasonable roots, for example, `show_nth_root<1000000>(2.);` may lead to
|
||
|
[@http://en.wikipedia.org/wiki/Loss_of_significance Loss of significance] like
|
||
|
`Type double value = 2, 1000000th root = 1.00000069314783`.
|
||
|
Use of the the `pow` function is more sensible for this unusual need.
|
||
|
]
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_finding_n_example.cpp root_finding_n_example.cpp].
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:elliptic_eg A More complex example - Inverting the Elliptic Integrals]
|
||
|
|
||
|
The arc length of an ellipse with radii ['a] and ['b] is given by:
|
||
|
|
||
|
[pre L(a, b) = 4aE(k)]
|
||
|
|
||
|
with:
|
||
|
|
||
|
[pre k = [sqrt](1 - b[super 2]/a[super 2])]
|
||
|
|
||
|
where ['E(k)] is the complete elliptic integral of the second kind - see __ellint_2.
|
||
|
|
||
|
Let's suppose we know the arc length and one radii, we can then calculate the other
|
||
|
radius by inverting the formula above. We'll begin by encoding the above formula
|
||
|
into a functor that our root-finding algorithms can call.
|
||
|
|
||
|
Note that while not
|
||
|
completely obvious from the formula above, the function is completely symmetrical
|
||
|
in the two radii - which can be interchanged at will - in this case we need to
|
||
|
make sure that `a >= b` so that we don't accidentally take the square root of a negative number:
|
||
|
|
||
|
[import ../../example/root_elliptic_finding.cpp]
|
||
|
|
||
|
[elliptic_noderv_func]
|
||
|
|
||
|
We'll also need a decent estimate to start searching from, the approximation:
|
||
|
|
||
|
[pre L(a, b) [approx] 4[sqrt](a[super 2] + b[super 2])]
|
||
|
|
||
|
Is easily inverted to give us what we need, which using derivative-free root
|
||
|
finding leads to the algorithm:
|
||
|
|
||
|
[elliptic_root_noderiv]
|
||
|
|
||
|
This function generally finds the root within 8-10 iterations, so given that the runtime
|
||
|
is completely dominated by the cost of calling the ellliptic integral it would be nice to
|
||
|
reduce that count somewhat. We'll try to do that by using a derivative-based method;
|
||
|
the derivatives of this function are rather hard to work out by hand, but fortunately
|
||
|
[@http://www.wolframalpha.com/input/?i=d%2Fda+\[4+*+a+*+EllipticE%281+-+b^2%2Fa^2%29\]
|
||
|
Wolfram Alpha] can do the grunt work for us to give:
|
||
|
|
||
|
[pre d/da L(a, b) = 4(a[super 2]E(k) - b[super 2]K(k)) / (a[super 2] - b[super 2])]
|
||
|
|
||
|
Note that now we have [*two] elliptic integral calls to get the derivative, so our
|
||
|
functor will be at least twice as expensive to call as the derivative-free one above:
|
||
|
we'll have to reduce the iteration count quite substantially to make a difference!
|
||
|
|
||
|
Here's the revised functor:
|
||
|
|
||
|
[elliptic_1deriv_func]
|
||
|
|
||
|
The root-finding code is now almost the same as before, but we'll make use of
|
||
|
Newton-iteration to get the result:
|
||
|
|
||
|
[elliptic_1deriv]
|
||
|
|
||
|
The number of iterations required for `double` precision is now usually around 4 -
|
||
|
so we've slightly more than halved the number of iterations, but made the
|
||
|
functor twice as expensive to call!
|
||
|
|
||
|
Interestingly though, the second derivative requires no more expensive
|
||
|
elliptic integral calls than the first does, in other words it comes
|
||
|
essentially "for free", in which case we might as well make use of it
|
||
|
and use Halley-iteration. This is quite a typical situation when
|
||
|
inverting special-functions. Here's the revised functor:
|
||
|
|
||
|
[elliptic_2deriv_func]
|
||
|
|
||
|
The actual root-finding code is almost the same as before, except we can
|
||
|
use Halley, rather than Newton iteration:
|
||
|
|
||
|
[elliptic_2deriv]
|
||
|
|
||
|
While this function uses only slightly fewer iterations (typically around 3)
|
||
|
to find the root, compared to the original derivative-free method, we've moved from
|
||
|
8-10 elliptic integral calls to 6.
|
||
|
|
||
|
Full code of this example is at
|
||
|
[@../../example/root_elliptic_finding.cpp root_elliptic_finding.cpp].
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
|
||
|
[endsect] [/section:root_examples Examples of Root Finding (with and without derivatives)]
|
||
|
|
||
|
[section:bad_guess The Effect of a Poor Initial Guess]
|
||
|
|
||
|
It's instructive to take our "toy" example algorithms, and use deliberately bad initial guesses to see how the
|
||
|
various root finding algorithms fair. We'll start with the cubed root, and using the cube root of 500 as the test case:
|
||
|
|
||
|
[table
|
||
|
[[Initial Guess=][-500% ([approx]1.323)][-100% ([approx]3.97)][-50% ([approx]3.96)][-20% ([approx]6.35)][-10% ([approx]7.14)][-5% ([approx]7.54)][5% ([approx]8.33)][10% ([approx]8.73)][20% ([approx]9.52)][50% ([approx]11.91)][100% ([approx]15.87)][500 ([approx]47.6)]]
|
||
|
[[bracket_and_solve_root][12][8][8][10][11][11][11][11][11][11][7][13]]
|
||
|
[[newton_iterate][12][7][7][5][5][4][4][5][5][6][7][9]]
|
||
|
[[halley_iterate][7][4][4][3][3][3][3][3][3][4][4][6]]
|
||
|
[[schroder_iterate][11][6][6][4][3][3][3][3][4][5][5][8]]
|
||
|
]
|
||
|
|
||
|
As you can see `bracket_and_solve_root` is relatively insensitive to starting location - as long as you don't start many orders of magnitude away from the root it will
|
||
|
take roughly the same number of steps to bracket the root and solve it. On the other hand the derivative-based methods are slow to start, but once they have some digits
|
||
|
correct they increase precision exceptionally fast: they are therefore quite sensitive to the initial starting location.
|
||
|
|
||
|
The next table shows the number of iterations required to find the second radius of an ellipse with first radius 50 and arc-length 500:
|
||
|
|
||
|
[table
|
||
|
[[Initial Guess=][-500% ([approx]20.6)][-100% ([approx]61.81)][-50% ([approx]61.81)][-20% ([approx]98.9)][-10% ([approx]111.3)][-5% ([approx]117.4)][5% ([approx]129.8)][10% ([approx]136)][20% ([approx]148.3)][50% ([approx]185.4)][100% ([approx]247.2)][500 ([approx]741.7)]]
|
||
|
[[bracket_and_solve_root][11][5][5][8][8][7][7][8][9][8][6][10]]
|
||
|
[[newton_iterate][4][4][4][3][3][3][3][3][3][4][4][4]]
|
||
|
[[halley_iterate][4][3][3][3][3][2][2][3][3][3][3][3]]
|
||
|
[[schroder_iterate][4][3][3][3][3][2][2][3][3][3][3][3]]
|
||
|
]
|
||
|
|
||
|
Interestingly this function is much more resistant to a poor initial guess when using derivatives.
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[section:bad_roots Examples Where Root Finding Goes Wrong]
|
||
|
|
||
|
There are many reasons why root root finding can fail, here are just a few of the more common examples:
|
||
|
|
||
|
[h3 Local Minima]
|
||
|
|
||
|
If you start in the wrong place, such as z[sub 0] here:
|
||
|
|
||
|
[$../roots/bad_root_1.svg]
|
||
|
|
||
|
Then almost any root-finding algorithm will descend into a local minima rather than find the root.
|
||
|
|
||
|
[h3 Flatlining]
|
||
|
|
||
|
In this example, we're starting from a location (z[sub 0]) where the first derivative is essentially zero:
|
||
|
|
||
|
[$../roots/bad_root_2.svg]
|
||
|
|
||
|
In this situation the next iteration will shoot off to infinity (assuming we're using derivatives that is). Our
|
||
|
code guards against this by insisting that the root is always bracketed, and then never stepping outside those bounds.
|
||
|
In a case like this, no root finding algorithm can do better than bisecting until the root is found.
|
||
|
|
||
|
Note that there is no scale on the graph, we have seen examples of this situation occur in practice ['even when
|
||
|
several decimal places of the initial guess z[sub 0] are correct.]
|
||
|
|
||
|
This is really a special case of a more common situation where root finding with derivatives is ['divergent]. Consider
|
||
|
starting at z[sub 0] in this case:
|
||
|
|
||
|
[$../roots/bad_root_4.svg]
|
||
|
|
||
|
An initial Newton step would take you further from the root than you started, as will all subsequent steps.
|
||
|
|
||
|
[h3 Micro-stepping / Non-convergence]
|
||
|
|
||
|
Consider starting at z[sub 0] in this situation:
|
||
|
|
||
|
[$../roots/bad_root_3.svg]
|
||
|
|
||
|
The first derivative is essentially infinite, and the second close to zero (and so offers no correction if we use it),
|
||
|
as a result we take a very small first step. In the worst case situation, the first step is so small
|
||
|
- perhaps even so small that subtracting from z[sub 0] has no effect at the current working precision - that our algorithm
|
||
|
will assume we are at the root already and terminate. Otherwise we will take lot's of very small steps which never converge
|
||
|
on the root: our algorithms will protect against that by reverting to bisection.
|
||
|
|
||
|
An example of this situation would be trying to find the root of e[super -1/z[super 2]] - this function has a single
|
||
|
root at ['z = 0], but for ['z[sub 0] < 0] neither Newton nor Halley steps will ever converge on the root, and for ['z[sub 0] > 0]
|
||
|
the steps are actually divergent.
|
||
|
|
||
|
[endsect]
|
||
|
|
||
|
[/
|
||
|
Copyright 2015 John Maddock and Paul A. Bristow.
|
||
|
Distributed under the Boost Software License, Version 1.0.
|
||
|
(See accompanying file LICENSE_1_0.txt or copy at
|
||
|
http://www.boost.org/LICENSE_1_0.txt).
|
||
|
]
|
||
|
|