In this post we will consider minimax rational approximations used for computation of modified Bessel functions of the second kind – . We will follow our usual workflow – we study properties of the existing methods and then propose our original approximations optimized for computations with double precision.
We try to keep the post brief and stick to results. Detailed description of our methodology can be found in previous reports of the series: K0(x)
, I0(x)
and I1(x)
.
In their pioneering work[1], Blair & Russon proposed the following approximation forms for :
All software libraries relying on rational approximations for computing use (1) and (2) with small variations in coefficients of polynomials and .
Relative accuracy shown below is measured in terms of machine epsilon ( eps = 2-52 ˜ 2.22e-16
) against “true” values of the function computed in quadruple precision by Maple. Red lines show natural limits for relative error we expect for double precision computations.
Blair & Russon[1], Boost 1.59.0[2] and SPECFUN 2.5[3]
Boost and SPECFUN use the same coefficients from original report of Blair & Russon.
Numerical Recipes, 3rd edition, 2007[4]
Strangely enough Numerical Recipes delivers good accuracy for , , whereas it shows very poor performance for all other functions – , and .
GNU GSL 2.0[5]
(GSL uses Chebyshev series)
MATLAB 2015a[6]
(MATLAB uses Chebyshev series)
Similar to K0(x)
, MATLAB provides the lowest accuracy for compared to all other libraries. Relative error is up to 10.14eps
near the . GSL and MATLAB use Chebyshev expansion for computations and apparently both suffer from similar issues (incorrectly rounded Chebyshev coefficients as GSL source code revealed).
Method/Software | ||
---|---|---|
Minimax Rational Approximations: | ||
Blair & Russon/Boost/SPECFUN | 2.06 | 3.77 |
Numerical Recipes, 2007 | 1.95 | 8.51 |
Refined – (4), (5) | 1.63 | 2.25 |
Chebyshev Expansions: | ||
GNU GSL 2.0 | 2.95 | 7.11 |
MATLAB 2015a | 2.83 | 10.14 |
†computed over 50K uniformly distributed random samples in each interval
Refined Approximations
In a search for better accuracy we use extended approximation form for near zero:
(3)
This form reflects behavior of for small arguments well and showed good results in our previous study. For large arguments we use the same formula as in (2).
Instead of using pure approximation error (usually computed in extended precision when rounding errors are ignored), we use additional criteria to find the optimal approximations. Highest priority is assigned to the actual accuracy delivered by schema in double precision. In other words, we take into account conditioning of the approximation in double precision.
This allows us to find approximations which are much more suitable for double precision computations with following properties:
Maximum error is 1.63eps
for small arguments and 2.25eps
for .
Final expressions for optimal approximations:
Coefficients of the polynomials:
n | |
---|---|
0 | -3.0796575782920622440538935e-01 |
1 | -8.5370719728650778045782736e-02 |
2 | -4.6421827664715603298154971e-03 |
3 | -1.1253607036630425931072996e-04 |
4 | -1.5592887702110907110292728e-06 |
5 | -1.4030163679125934402498239e-08 |
6 | -8.8718998640336832196558868e-11 |
7 | -4.1614323580221539328960335e-13 |
8 | -1.5261293392975541707230366e-15 |
n | |
---|---|
0 | 8.3333333333333325191635191e-02 |
1 | 6.9444444444467956461838830e-03 |
2 | 3.4722222211230452695165215e-04 |
3 | 1.1574075952009842696580084e-05 |
4 | 2.7555870002088181016676934e-07 |
5 | 4.9724386164128529514040614e-09 |
n | |
---|---|
0 | 1.0234817795732426171122752e-01 |
1 | 9.4576473594736724815742878e-01 |
2 | 2.1876721356881381470401990e+00 |
3 | 6.0143447861316538915034873e-01 |
4 | -1.3961391456741388991743381e-01 |
5 | 8.8229427272346799004782764e-02 |
6 | -8.5494054051512748665954180e-02 |
7 | 1.0617946033429943924055318e-01 |
8 | -1.5284482951051872048173726e-01 |
9 | 2.3707700686462639842005570e-01 |
10 | -3.7345723872158017497895685e-01 |
11 | 5.6874783855986054797640277e-01 |
12 | -8.0418742944483208700659463e-01 |
13 | 1.0215105768084562101457969e+00 |
14 | -1.1342221242815914077805587e+00 |
15 | 1.0746932686976675016706662e+00 |
16 | -8.4904532475797772009120500e-01 |
17 | 5.4542251056566299656460363e-01 |
18 | -2.7630896752209862007904214e-01 |
19 | 1.0585982409547307546052147e-01 |
20 | -2.8751691985417886721803220e-02 |
21 | 4.9233441525877381700355793e-03 |
22 | -3.9900679319457222207987456e-04 |
n | |
---|---|
0 | 8.1662031018453173425764707e-02 |
1 | 7.2398781933228355889996920e-01 |
2 | 1.4835841581744134589980018e+00 |
Conclusion
We have derived new rational approximations for the modified Bessel function of the second kind – . On the contrary to existing methods we minimize the actual error delivered by approximation in double precision. Proposed methods provide highest relative accuracy among all publicly available approximations (published and/or implemented in software).
MATLAB suffers from accuracy loss near zero and should be avoided if accuracy is of high priority.
References
- Russon A.E., Blair J.M., Rational function minimax approximations for the modified Bessel functions and , AECL-3461, Chalk River Nuclear Laboratories, Chalk River, Ontario, October 1969.
- Boost – free peer-reviewed portable C++ source libraries. Version 1.59.0. Boost: Modified Bessel function of the second kind.
- Cody W.J., Algorithm 715: SPECFUN – A Portable FORTRAN Package of Special Function Routines and Test Drivers, ACM Transactions on Mathematical Software, Volume 19, Number 1, March 1993, pages 22-32. SPECFUN: Modified Bessel function of the second kind.
- Press, William H. and Teukolsky, Saul A. and Vetterling, William T. and Flannery, Brian P. Numerical Recipes 3rd Edition: The Art of Scientific Computing, 2007, Cambridge University Press, New York, USA.
- M. Galassi et al, GNU Scientific Library Reference Manual, 3rd Edition (January 2009), ISBN 0954612078.
- MATLAB 2015a, The MathWorks, Inc., Natick, Massachusetts, United States.
{ 2 comments… read them below or add one }
I thought you would like to know that I’ve just implemented something similar to this for Boost.Math.
However it’s actually possible to do a little better, consider (5) for a moment, it turns out that K1(x)x^1/2 e^x is approximately constant for x > 1, and therefore there’s an old trick we can use – to replace P22/Q2 by C+R(x) where C is an arbitrary constant chosen to be the approximate value of the function over the range in question (truncated to an exact binary value), and R(x) is a rational approximation optimised for low *absolute* error compared to C. Now as long as the error with which we evaluate R(x) is small compared to C, we effectively “erase” the error when we perform the addition. Of course we may still round incorrectly as a result of the erroneous bits from R(x), so the error is slightly more than 0.5ulp, but we should be close. Add in 0.5ulp from the other 2 terms and we should achieve an overall maximum error just slightly more than 1.5ulp ~ 1.5eps.
I’m not sure how far you minimised P22/Q2 but another observation is that interpolated solutions for (5) are really quite far from minimax solutions. For example P22/Q2 interpolated is just accurate enough for double precision (error 8e-17), while the minimax solution is nearer to 3e-18. Which is my way of saying that many fewer terms can be used 😉
Using a P8/Q8 solution:
and C = 1.45034217834472656
Yielded an approximation with a peek experimental error for K1 of 1.6eps. Just as importantly, it’s much faster to execute: the issue with large polynomials is that each arithmetic operation is dependent on the previous result which completely stalls the processor pipeline. You can mitigate this somewhat by using a second order Horner scheme, but even that doesn’t seem to be enough to keep the pipeline full, here’s the approximate performance results I obtained on Intel-x64 (all relative times):
P22/Q2 (Horner) 1.12
P22/Q2 (2nd order Horner) 0.743
P8/Q8 (Horner) 0.41
P8/Q8 (2nd order Horner) 0.38
[aside – we can of course go even further with rational approximations and use SIMD instructions to evaluate the 2 polynomials in parallel.]
Similar arguments apply to(4) where your P5 and P8 are being added to terms > 1, so these need only be optimised for low absolute error, similarly for K0. Both functions also extend to 128-bit precision fairly easily using these techniques BTW.
Apologies for the long post, but hopefully someone will find this useful, for the record, Boost’s code is here:
https://github.com/boostorg/math/blob/develop/include/boost/math/special_functions/detail/bessel_k0.hpp
and here:
https://github.com/boostorg/math/blob/develop/include/boost/math/special_functions/detail/bessel_k1.hpp
Thank you for the wealth of information and congratulations on results!
I am sure it will be useful for many people as well.
Just for information, here is improvement of max relative error on 1M of random points in each interval:
The next step would be to reduce error to
< eps
and provide correctly rounded values.IEEE standard requires this for elementary functions, special functions is the natural next step.
Intel already does this in its libraries: http://www.advanpix.com/2016/05/12/accuracy-of-bessel-functions-in-matlab/
In this case approximations should be based on simple polynomials with coefficients representable exactly as machine numbers, with lookup tables for worst cases (to resolve table maker dilemma), etc.
Quite boring and tedious work, far from actual math :).