Adaptive integration - Fundamentals of Numerical Computation

To this point, we have used only equally spaced nodes to compute integrals. Yet there are problems in which non-uniformly distributed nodes would clearly be more appropriate, as demonstrated in Demo 5.7.1.

Example 5.7.1 (Motivation for adaptive integration)

Julia

MATLAB

Python

Example 5.7.1

This function gets increasingly oscillatory as $x$ increases.

using Plots
f = x -> (x + 1)^2 * cos((2x + 1) / (x - 4.3))
plot(f, 0, 4, xlabel=L"x", ylabel=L"f(x)")

Accordingly, the trapezoid rule is more accurate on the left half of this interval than on the right half.

using QuadGK
left_val, _ = quadgk(f, 0, 2, atol=1e-14, rtol=1e-14)
right_val, _ = quadgk(f, 2, 4, atol=1e-14, rtol=1e-14)

n = [50 * 2^k for k in 0:3]
err = zeros(length(n), 2)
for (k, n) in enumerate(n)
    T, _ = FNC.trapezoid(f, 0, 2, n)
    err[k, 1] = T - left_val

    T, _ = FNC.trapezoid(f, 2, 4, n)
    err[k, 2] = T - right_val
end

@pt :header=["n", "left error", "right error"] [n err]

Both the picture and the numerical results suggest that more nodes should be used on the right half of the interval than on the left half.

Example 5.7.1

This function gets increasingly oscillatory as $x$ increases.

f = @(x) (x + 1).^2 .* cos((2 * x + 1) ./ (x - 4.3));
clf
fplot(f, [0, 4], 2000)
xlabel('x'), ylabel(('f(x)'));

Accordingly, the trapezoid rule is more accurate on the left half of this interval than on the right half.

left_val = integral(f, 0, 2, abstol=1e-14, reltol=1e-14);
right_val = integral(f, 2, 4, abstol=1e-14, reltol=1e-14);

n = round(50 * 2 .^ (0:3)');
err = zeros(length(n), 2);
for i = 1:length(n)
    T = trapezoid(f, 0, 2, n(i));
    err(i, 1) = T - left_val;
    T = trapezoid(f, 2, 4, n(i));
    err(i, 2) = T - right_val;
end
table(n, err(:, 1), err(:, 2), variableNames=["n", "left error", "right error"])

Both the picture and the numerical results suggest that more nodes should be used on the right half of the interval than on the left half.

Example 5.7.1

This function gets increasingly oscillatory as $x$ increases.

f = lambda x: (x + 1) ** 2 * cos((2 * x + 1) / (x - 4.3))
x = linspace(0, 4, 600)
plot(x, f(x))
xlabel("$x$")
ylabel("$f(x)$");

Accordingly, the trapezoid rule is more accurate on the left half of this interval than on the right half.

n_ = 50 * 2 ** arange(4)
Tleft = zeros(4)
Tright = zeros(4)
for i, n in enumerate(n_):
    Tleft[i] = FNC.trapezoid(f, 0, 2, n)[0]
    Tright[i] = FNC.trapezoid(f, 2, 4, n)[0]
print("left half:", Tleft)
print("right half:", Tright)

left half: [2.00419122 2.00605957 2.00652661 2.00664337]
right half: [-4.32798637 -4.73621129 -4.80966839 -4.82666144]

from scipy.integrate import quad
left_val, err = quad(f, 0, 2, epsabs=1e-13, epsrel=1e-13)
right_val, err = quad(f, 2, 4, epsabs=1e-13, epsrel=1e-13)

print("    n     left error   right error")
for k in range(n_.size):
    print(f"  {n_[k]:4}    {Tleft[k]-left_val:8.3e}    {Tright[k]-right_val:8.3e}")

    n     left error   right error
    50    -2.491e-03    5.042e-01
   100    -6.227e-04    9.600e-02
   200    -1.557e-04    2.255e-02
   400    -3.892e-05    5.554e-03

Both the picture and the numerical results suggest that more nodes should be used on the right half of the interval than on the left half.

We would like an algorithm that automatically detects and reacts to a situation like that in Demo 5.7.1, a trait known as adaptivity.

5.7.1Error estimation¶

Ideally, we would like to make adaptation decisions based on the error of the integration result. Knowing the error exactly would be equivalent to knowing the exact answer, but we can estimate it using the extrapolation technique of Numerical integration. Consider the Simpson formula (5.6.15) resulting from one level of extrapolation from trapezoid estimates:

S_f(2n) = \frac{1}{3} \Bigl[ 4 T_f(2n) - T_f(n) \Bigr].

(5.7.1)

We expect this method to be fourth-order accurate, i.e.,

\int_a^b f(x)\, dx = S_f(2n) + O(n^{-4}),

(5.7.2)

We can further extrapolate to sixth-order accuracy using (5.6.17):

R_f(4n) = \frac{1}{15} \Bigl[ 16 S_f(4n) - S_f(2n) \Bigr].

(5.7.3)

By virtue of higher order of accuracy, $R_f(4n)$ should be more accurate than $S_f(4n)$ . Hence, a decent estimate of the error in the better of the two Simpson values is

E = R_f(4n) - S_f(4n) = \frac{S_f(4n) - S_f(2n)}{15}.

(5.7.4)

5.7.2Divide and conquer¶

If $|E|$ is judged to be acceptably small, we are done. This judgment takes some care. For instance, suppose the exact integral is 10²⁰. Requiring $|E| < \delta\ll 1$ would be fruitless in double precision, since it would require more than 20 accurate digits. Hence checking the absolute size of the error alone is not appropriate. Conversely, consider the integral

\int_{10^{-6}}^{2\pi} 2 \sin x\, dx \approx -10^{-12}.

(5.7.5)

We are likely to sample values of the integrand that are larger than, say, $1/2$ in absolute value, so obtaining this very small result has to rely on subtractive cancellation. We cannot hope for more than 4-5 accurate digits, so a strict test of the relative error is also not recommended. In other words, we can seek an error that is small relative to the data (the integrand), which is $O(1)$ , but not relative to the answer itself.

Typically, we use both relative and absolute error, stopping when either one is considered small enough. Algebraically, the test is

|E| < \delta_a + \delta_r |S_f(n)|,

(5.7.6)

where $\delta_a$ and $\delta_r$ are given absolute and relative error tolerances, respectively.

When $|E|$ fails to meet (5.7.6), we bisect the interval $[a,b]$ to exploit the identity

\int_a^b f(x)\, dx = \int_a^{(a+b)/2} f(x)\, dx + \int_{(a+b)/2}^b f(x)\, dx,

(5.7.7)

and independently compute estimates to each of the half-length integrals. Each of these half-sized computations recursively applies Simpson’s formula and the error estimation criterion, making further bisections as necessary. Such an approach is called divide and conquer in computer science: recursively split the problem into easier pieces and glue the results together.

5.7.3Implementation¶

It is typical to use just the minimal formula $S_f(4)$ and its error estimate $E$ to make decisions about adaptivity. A computation of $S_f(4)$ requires three trapezoid estimates $T_f(1)$ , $T_f(2)$ , and $T_f(4)$ . As observed in (5.6.18) and Demo 5.6.3, the five integrand evaluations in $T_f(4)$ are sufficient to compute all of these values.

Function 5.7.1 shows an implementation. It uses five function values to compute three trapezoid estimates with $n=1$ , $n=2$ , and $n=4$ , applying the updating formula (5.6.18) twice. It goes on to find the two Simpson approximations and to estimate the error by (5.7.4).

If the error estimate passes the test (5.7.6), the better Simpson value is returned as the integral over the given interval. Otherwise, the interval is bisected, integrals over the two pieces are computed using recursive calls, and those results are added to give the complete integral.

Algorithm 5.7.1 (intadapt)

Julia

MATLAB

Python

Adaptive integration

intadapt.jl

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
"""
    intadapt(f, a, b, tol)

Adaptively integrate `f` over [`a`,`b`] to within target error
tolerance `tol`. Returns the estimate and a vector of evaluation
nodes.
"""
function intadapt(f, a, b, tol, fa = f(a), fb = f(b), m = (a + b) / 2, fm = f(m))
    # Use error estimation and recursive bisection.
    # These are the two new nodes and their f-values.
    xl = (a + m) / 2
    fl = f(xl)
    xr = (m + b) / 2
    fr = f(xr)

    # Compute the trapezoid values iteratively.
    h = (b - a)
    T = [0.0, 0.0, 0.0]
    T[1] = h * (fa + fb) / 2
    T[2] = T[1] / 2 + (h / 2) * fm
    T[3] = T[2] / 2 + (h / 4) * (fl + fr)

    S = (4T[2:3] - T[1:2]) / 3      # Simpson values
    E = (S[2] - S[1]) / 15           # error estimate

    if abs(E) < tol * (1 + abs(S[2]))  # acceptable error?
        Q = S[2]                   # yes--done
        nodes = [a, xl, m, xr, b]      # all nodes at this level
    else
        # Error is too large--bisect and recurse.
        QL, tL = intadapt(f, a, m, tol, fa, fm, xl, fl)
        QR, tR = intadapt(f, m, b, tol, fm, fb, xr, fr)
        Q = QL + QR
        nodes = [tL; tR[2:end]]   # merge the nodes w/o duplicate
    end
    return Q, nodes
end

Adaptive integration

intadapt.m

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
function [Q,t] = intadapt(f,a,b,tol)
%INTADAPT   Adaptive integration with error estimation.
% Input:
%   f     integrand (function)
%   a,b   interval of integration (scalars)
%   tol   acceptable error
% Output:
%   Q     approximation to integral(f,a,b)
%   t     vector of nodes used

m = (b+a)/2;
[Q,t] = do_integral(a,f(a),b,f(b),m,f(m),tol);

    % Use error estimation and recursive bisection. 
    function [Q,t] = do_integral(a,fa,b,fb,m,fm,tol)
        
        % These are the two new nodes and their f-values.
        xl = (a+m)/2;  fl = f(xl);
        xr = (m+b)/2;  fr = f(xr);
        t = [a;xl;m;xr;b];              % all 5 nodes at this level

        % Compute the trapezoid values iteratively. 
        h = (b-a);
        T(1) = h*(fa+fb)/2;
        T(2) = T(1)/2 + (h/2)*fm;
        T(3) = T(2)/2 + (h/4)*(fl+fr);
        
        S = (4*T(2:3)-T(1:2)) / 3;      % Simpson values
        E = (S(2)-S(1)) / 15;           % error estimate
                
        if abs(E) < tol*(1+abs(S(2)))   % acceptable error?
            Q = S(2);                   % yes--done
        else
            % Error is too large--bisect and recurse. 
            [QL,tL] = do_integral(a,fa,m,fm,xl,fl,tol);
            [QR,tR] = do_integral(m,fm,b,fb,xr,fr,tol);
            Q = QL + QR;
            t = [tL;tR(2:end)];         % merge the nodes w/o duplicate
        end        
    end

end  % main function

Adaptive integration

intadapt.py

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
def intadapt(f, a, b, tol):
    """
    intadapt(f, a, b, tol)

    Do adaptive integration to estimate the integral of f over [a,b] to desired
    error tolerance tol. Returns estimate and a vector of evaluation nodes used.
    """

    # Use error estimation and recursive bisection.
    def do_integral(a, fa, b, fb, m, fm, tol):
        # These are the two new nodes and their f-values.
        xl = (a + m) / 2
        fl = f(xl)
        xr = (m + b) / 2
        fr = f(xr)
        t = np.array([a, xl, m, xr, b])  # all 5 nodes at this level

        # Compute the trapezoid values iteratively.
        h = b - a
        T = np.zeros(3)
        T[0] = h * (fa + fb) / 2
        T[1] = T[0] / 2 + (h / 2) * fm
        T[2] = T[1] / 2 + (h / 4) * (fl + fr)

        S = (4 * T[1:] - T[:-1]) / 3  # Simpson values
        E = (S[1] - S[0]) / 15  # error estimate

        if abs(E) < tol * (1 + abs(S[1])):  # acceptable error?
            Q = S[1]  # yes--done
        else:
            # Error is too large--bisect and recurse.
            QL, tL = do_integral(a, fa, m, fm, xl, fl, tol)
            QR, tR = do_integral(m, fm, b, fb, xr, fr, tol)
            Q = QL + QR
            t = np.hstack([tL, tR[1:]])  # merge the nodes w/o duplicate
        return Q, t

    m = (b + a) / 2
    Q, t = do_integral(a, f(a), b, f(b), m, f(m), tol)
    return Q, t

Example 5.7.2 (Using adaptive integration)

Julia

MATLAB

Python

Example 5.7.2

We’ll integrate the function from Demo 5.7.1.

f = x -> (x + 1)^2 * cos((2x + 1) / (x - 4.3));

We perform the integration and show the nodes selected underneath the curve.

A, t = FNC.intadapt(f, 0, 4, 0.001)
@show num_nodes = length(t);

plot(f, 0, 4;
    color=:black, legend=:none,
    xlabel=L"x",  ylabel=L"f(x)", 
    title="Adaptive node selection")
plot!(t, f.(t), seriestype=:sticks, m=(:o, 2))

The error turns out to be a bit more than we requested. It’s only an estimate, not a guarantee.

Q, _ = quadgk(f, 0, 4, atol=1e-14, rtol=1e-14);    # 'exact' value
println("error: $(Q-A)");

error: -0.022002813037627522

Let’s see how the number of integrand evaluations and the error vary with the requested tolerance.

tol = [1 / 10^k for k in 4:14]
err, n = [], []
for tol in 10.0 .^ (-4:-1:-14)
    A, t = FNC.intadapt(f, 0, 4, tol)
    push!(err, Q - A)
    push!(n, length(t))
end
@pt :header=["tolerance", "error", "number of nodes"] [tol err n][1:2:end, :]

As you can see, even though the errors are not smaller than the tolerances, the two columns decrease in tandem. If we consider now the convergence not in $h$ , which is poorly defined now, but in the number of nodes actually chosen, we come close to the fourth-order accuracy of the underlying Simpson scheme.

plot(n, abs.(err);
    m=:o, label="results",
    xaxis=(:log10, "number of nodes"),  yaxis=(:log10, "error"),
    title="Convergence of adaptive integration")

order4 = @. 0.01 * (n / n[1])^(-4)
plot!(n, order4, l=:dash, label=L"O(n^{-4})")

Example 5.7.2

We’ll integrate the function from Demo 5.7.1.

f = @(x) (x + 1).^2 .* cos((2 * x + 1) ./ (x - 4.3));

We perform the integration and show the nodes selected underneath the curve.

[Q, t] = intadapt(f, 0, 4, 0.001);
clf, fplot(f, [0, 4], 2000)
hold on
stem(t, f(t), '.-')
title('Adaptive node selection')
xlabel('x'), ylabel('f(x)')
fprintf("number of nodes = %d", length(t))

number of nodes =

The error turns out to be a bit more than we requested. It’s only an estimate, not a guarantee.

I = integral(f, 0, 4, abstol=1e-14, reltol=1e-14);    % 'exact' value
fprintf("error = %.2e", abs(Q - I))

error =

2.20e-02

Let’s see how the number of integrand evaluations and the error vary with the requested tolerance.

tol = 1 ./ 10.^(4:14)';
err = zeros(size(tol));
n = zeros(size(tol));
for k = 1:length(tol)
    [A, t] = intadapt(f, 0, 4, tol(k));
    err(k) =  I - A;
    n(k) = length(t);
end
table(tol, err, n, variableNames=["tolerance", "error", "number of nodes"])

As you can see, even though the errors are not smaller than the estimates, the two columns decrease in tandem. If we consider now the convergence not in $h$ , which is poorly defined now, but in the number of nodes actually chosen, we come close to the fourth-order accuracy of the underlying Simpson scheme.

clf
loglog(n, abs(err), "-o", displayname="results")
xlabel("number of nodes"), ylabel("error")
title("Convergence of adaptive integration")
order4 = 0.1 * abs(err(end)) * (n / n(end)).^(-4);
hold on
loglog(n, order4, "k--", displayname="O(n^{-4})")
legend();

Example 5.7.2

We’ll integrate the function from Demo 5.7.1.

from scipy.integrate import quad
f = lambda x: (x + 1) ** 2 * cos((2 * x + 1) / (x - 4.3))
I, errest = quad(f, 0, 4, epsabs=1e-12, epsrel=1e-12)
print("integral:", I)    # 'exact' value

integral: -2.8255333734374504

We perform the integration and show the nodes selected underneath the curve.

Q, t = FNC.intadapt(f, 0, 4, 0.001)
print("number of nodes:", t.size)

x = linspace(0, 4, 600)
plot(x, f(x), "k")
stem(t, f(t))
xlabel("$x$"); ylabel("$f(x)$");

number of nodes: 69

The error turns out to be a bit more than we requested. It’s only an estimate, not a guarantee.

print("error:", I - Q)

error: -0.02200281303763152

Let’s see how the number of integrand evaluations and the error vary with the requested tolerance.

tol_ = 10.0 ** arange(-4, -12, -1)
err_ = zeros(tol_.size)
num_ = zeros(tol_.size, dtype=int)
print("    tol         error     # f-evals")
for i, tol in enumerate(tol_):
    Q, t = FNC.intadapt(f, 0, 4, tol)
    err_[i] = I - Q
    num_[i] = t.size
    print(f"  {tol:6.1e}    {err_[i]:10.3e}    {num_[i]:6d}")

    tol         error     # f-evals
  1.0e-04    -4.195e-04       113
  1.0e-05     4.790e-05       181
  1.0e-06     6.314e-06       297
  1.0e-07    -6.639e-07       489
  1.0e-08     7.181e-08       757
  1.0e-09     1.265e-08      1193
  1.0e-10    -8.441e-10      2009
  1.0e-11     2.612e-11      3157

loglog(num_, abs(err_), "-o", label="results")
order4 = 0.01 * (num_ / num_[0]) ** (-4)
loglog(num_, order4, "--", label="$O(n^{-4})$")
xlabel("number of nodes"), ylabel("error")
legend()
title("Convergence of adaptive quadrature");

Although adaptivity and the error estimation that goes with it can be very powerful, they come at some cost. The error estimation cannot be universally perfect, so sometimes the answer will not be as accurate as requested, and sometimes the function will be evaluated more times than necessary. Subtle problems may arise when the integral is a step within a larger computation (see Exercise 6).

5.7.4Exercises¶

⌨ For each integral below, use Function 5.7.1 with error tolerance $10^{-2},10^{-3},\ldots,10^{-12}$ . Make a table of errors and the number of integrand evaluation nodes used, and use a convergence plot as in Demo 5.7.2 to compare to fourth-order accuracy. (These integrals were taken from Bailey et al. (2005).)
(a) $\displaystyle \int_0^1 x\log(1+x)\, dx = \frac{1}{4}$
(b) $\displaystyle \int_0^1 x^2 \tan^{-1}x\, dx = \frac{\pi-2+2\log 2}{12}$
(c) $\displaystyle \int_0^{\pi/2}e^x \cos x\, dx = \frac{e^{\pi/2}-1}{2}$
(d) $\displaystyle \int_{0}^1 \sqrt{x} \log(x) \, dx = -\frac{4}{9}$ (Note: Although the integrand has the limiting value zero as $x\to 0$ , you have to implement the function carefully to return zero as the value of $f(0)$ , or start the integral at $x=\macheps$ .)
(e) $\displaystyle \int_0^1 \sqrt{1-x^2}\, dx = \frac{\pi}{4}$
⌨ For each integral below: (i) use quadgk to find the value to at least 12 digits; (ii) use Function 5.7.1 to evaluate the integral to a tolerance of 10^-8; (iii) compute the absolute error and the number of nodes used; (iv) use the $O(h^2)$ term in the Euler–Maclaurin formula (5.6.9) to estimate how many nodes are required by the fixed-stepsize trapezoidal formula to reach an absolute error of 10^-8.
(a) $\displaystyle \int_{0.1}^3 \operatorname{sech}(\sin(1/x))\, d x$
(b) $\rule[2em]{0pt}{0pt} \displaystyle\int_{-0.9}^9 \ln((x+1)^3))\, d x$
(c) $\rule[2em]{0pt}{0pt} \displaystyle\int_{-\pi}^\pi \cos(x^3)\, d x$
⌨ An integral such as $\displaystyle \int_0^1 x^{-\gamma}\, dx$ for $\gamma>0$ , in which the integrand blows up at one or both ends, is known as an improper integral. It has a finite value if $\gamma<1$ , despite the singularity. One way to deal with the problem of the infinite value for $f(t_0)$ is to replace the lower limit with a small number ε. (A more robust way to handle improper integrals is discussed in Chapter 9.)
Using Function 5.7.1 with a small tolerance, make a log-log plot of the error as a function of ε when $\gamma=2/3$ , for $\epsilon=10^{-15},10^{-16},\ldots,10^{-45}$ .
⌨ A curious consequence of our logic in Function 5.7.1 is that the algorithm uses what we believe to be a more accurate, sixth-order answer only for estimating error; the returned value is the supposedly less accurate $S_f(2n)$ . The practice of returning the extrapolated $R_f(4n)$ instead is called local extrapolation.
Modify Function 5.7.1 to use local extrapolation and repeat parts (a) and (e) of Exercise 1 above, comparing the observed convergence to both fourth order and sixth order.
⌨ The sine integral function is defined by
$\operatorname{Si}(x) = \int_0^x \frac{\sin z}{z}\, dz.$
(5.7.8)
Use Function 5.7.1 to plot Si over the interval $[1,10]$ . Note: You will need to replace the lower bound of integration by $\macheps$ .

⌨ Adaptive integration can have subtle drawbacks. This exercise is based on the error function, a smooth function defined as
$\operatorname{erf}(x) = \frac{2}{\sqrt{\pi}}\int_0^x e^{-s^2}\,ds.$
(5.7.9)
(a) Define a function $g$ that approximates erf by applying Function 5.6.1 with $n=300$ . Make a plot of the error $g(x)-\operatorname{erf}(x)$ at 500 points in the interval $[0,3]$ .
(b) Define another approximation $h$ that applies Function 5.7.1 with error tolerance 10^-7. Plot the error in $h$ as in part (a). Why does it look so different from the previous case?
(c) Suppose you wished to find $x$ such that $\operatorname{erf}(x) = .95$ by using rootfinding on one of your two approximations. Why is the version from part (a) preferable?

References¶

Bailey, D. H., Jeyabalan, K., & Li, X. S. (2005). A Comparison of Three High-Precision Quadrature Schemes. Experimental Mathematics, 14(3), 317–329. 10.1080/10586458.2005.10128931

Preface

Numerical integration

Preface

Next steps