Row pivoting - Fundamentals of Numerical Computation

As mentioned in LU factorization, the $\mathbf{A}=\mathbf{L}\mathbf{U}$ factorization is not stable for every nonsingular $\mathbf{A}$ . Indeed, the factorization does not always even exist.

Example 2.6.1 (Failure of naive LU factorization)

Julia

MATLAB

Python

Example 2.6.1

Here is a previously encountered matrix that factors well.

A = [2 0 4 3 ; -4 5 -7 -10 ; 1 15 2 -4.5 ; -2 0 2 -13];
L, U = FNC.lufact(A)
L

4×4 LowerTriangular{Float64, Matrix{Float64}}:
  1.0   ⋅     ⋅    ⋅ 
 -2.0  1.0    ⋅    ⋅ 
  0.5  3.0   1.0   ⋅ 
 -1.0  0.0  -2.0  1.0

If we swap the second and fourth rows of $\mathbf{A}$ , the result is still nonsingular. However, the factorization now fails.

A[[2, 4], :] = A[[4, 2], :]  
L, U = FNC.lufact(A)
L

4×4 LowerTriangular{Float64, Matrix{Float64}}:
  1.0     ⋅      ⋅    ⋅ 
 -1.0  NaN       ⋅    ⋅ 
  0.5   Inf   NaN     ⋅ 
 -2.0   Inf   NaN    1.0

The presence of NaN in the result indicates that some impossible operation was required. The source of the problem is easy to locate. We can find the first outer product in the factorization just fine:

U[1, :] = A[1, :]
L[:, 1] = A[:, 1] / U[1, 1]
A -= L[:, 1] * U[1, :]'

4×4 Matrix{Float64}:
 0.0   0.0  0.0    0.0
 0.0   0.0  6.0  -10.0
 0.0  15.0  0.0   -6.0
 0.0   5.0  1.0   -4.0

The next step is U[2, :] = A[2, :], which is also OK. But then we are supposed to divide by U[2, 2], which is zero. The algorithm cannot continue.

Example 2.6.1

Here is a previously encountered matrix that factors well.

A = [
    2 0 4 3
    -4 5 -7 -10
    1 15 2 -4.5
    -2 0 2 -13
    ];
[L, U] = lufact(A);
L

If we swap the second and fourth rows of $\mathbf{A}$ , the result is still nonsingular. However, the factorization now fails.

A([2, 4], :) = A([4, 2], :);    % swap rows 2 and 4
[L, U] = lufact(A);
L

U(1, :) = A(1, :);
L(:, 1) = A(:, 1) / U(1, 1)
A = A - L(:, 1) * U(1, :)

The next step is U(2, :) = A(2, :), which is also OK. But then we are supposed to divide by U(2, 2), which is zero. The algorithm cannot continue.

Example 2.6.1

Here is a previously encountered matrix that factors well.

A = array([
    [2, 0, 4, 3],
    [-4, 5, -7, -10],
    [1, 15, 2, -4.5],
    [-2, 0, 2, -13]
    ])
L, U = FNC.lufact(A)
print(L)

[[ 1.   0.  -0.   0. ]
 [-2.   1.  -0.   0. ]
 [ 0.5  3.   1.   0. ]
 [-1.   0.  -2.   1. ]]

If we swap the second and fourth rows of $\mathbf{A}$ , the result is still nonsingular. However, the factorization now fails.

A[[1, 3], :] = A[[3, 1], :]  
L, U = FNC.lufact(A)
print(L)

[[ 1.   nan  nan  0. ]
 [-1.   nan  nan  0. ]
 [ 0.5  inf  nan  0. ]
 [-2.   inf  nan  1. ]]

/Users/driscoll/Dropbox/Mac/Documents/GitHub/fnc/python/fncbook/fncbook/chapter02.py:47: RuntimeWarning: divide by zero encountered in divide
  L[:, k] = A_k[:, k] / U[k,k]
/Users/driscoll/Dropbox/Mac/Documents/GitHub/fnc/python/fncbook/fncbook/chapter02.py:47: RuntimeWarning: invalid value encountered in divide
  L[:, k] = A_k[:, k] / U[k,k]
/Users/driscoll/mambaforge/envs/myst/lib/python3.13/site-packages/numpy/_core/numeric.py:983: RuntimeWarning: invalid value encountered in multiply
  return multiply(a.ravel()[:, newaxis], b.ravel()[newaxis, :], out)

U[0, :] = A[0, :]
L[:, 0] = A[:, 0] / U[0, 0]
A -= outer(L[:, 0],  U[0, :])
print(A)

[[  0.   0.   0.   0.]
 [  0.   0.   6. -10.]
 [  0.  15.   0.  -6.]
 [  0.   5.   1.  -4.]]

The next step is U[1, :] = A[1, :], which is also OK. But then we are supposed to divide by U[1, 1], which is zero. The algorithm cannot continue.

In LU factorization we remarked that LU factorization is equivalent to Gaussian elimination with no row swaps. However, those swaps are necessary in situations like those encountered in Demo 2.6.1, in order to avoid division by zero. We will find a modification of the outer product procedure that allows us to do the same thing.

2.6.1Choosing a pivot¶

The diagonal element of $\mathbf{U}$ that appears in the denominator of line 17 of Function 2.4.1 is called the pivot element of its column. In order to avoid a zero pivot, we will use the largest available element in the column we are working on as the pivot. This technique is known as row pivoting.

Example 2.6.2 (Row pivoting in LU factorization)

Julia

MATLAB

Python

Example 2.6.2

Here is the trouble-making matrix from Demo 2.6.1.

A₁ = [2 0 4 3 ; -2 0 2 -13; 1 15 2 -4.5 ; -4 5 -7 -10]

4×4 Matrix{Float64}:
  2.0   0.0   4.0    3.0
 -2.0   0.0   2.0  -13.0
  1.0  15.0   2.0   -4.5
 -4.0   5.0  -7.0  -10.0

We now find the largest candidate pivot in the first column. We don’t care about sign, so we take absolute values before finding the max.

i = argmax( abs.(A₁[:, 1]) )

4

This is the row of the matrix that we extract to put into $\mathbf{U}$ . That guarantees that the division used to find $\boldsymbol{\ell}_1$ will be valid.

L, U = zeros(4,4),zeros(4,4)
U[1, :] = A₁[i, :]
L[:, 1] = A₁[:, 1] / U[1, 1]
A₂ = A₁ - L[:, 1] * U[1, :]'

4×4 Matrix{Float64}:
 0.0   2.5   0.5   -2.0
 0.0  -2.5   5.5   -8.0
 0.0  16.25  0.25  -7.0
 0.0   0.0   0.0    0.0

Observe that $\mathbf{A}_2$ has a new zero row and zero column, but the zero row is the fourth rather than the first. However, we forge on by using the largest possible pivot in column 2 for the next outer product.

@show i = argmax( abs.(A₂[:, 2]) ) 
U[2, :] = A₂[i, :]
L[:, 2] = A₂[:, 2] / U[2, 2]
A₃ = A₂ - L[:, 2] * U[2, :]'

i = argmax(abs.(A₂[:, 2])) = 3

4×4 Matrix{Float64}:
 0.0  0.0  0.461538  -0.923077
 0.0  0.0  5.53846   -9.07692
 0.0  0.0  0.0        0.0
 0.0  0.0  0.0        0.0

Now we have zeroed out the third row as well as the second column. We can finish out the procedure.

@show i = argmax( abs.(A₃[:, 3]) ) 
U[3, :] = A₃[i, :]
L[:, 3] = A₃[:, 3] / U[3, 3]
A₄ = A₃ - L[:, 3] * U[3, :]'

i = argmax(abs.(A₃[:, 3])) = 2

4×4 Matrix{Float64}:
 0.0  0.0  0.0  -0.166667
 0.0  0.0  0.0   0.0
 0.0  0.0  0.0   0.0
 0.0  0.0  0.0   0.0

@show i = argmax( abs.(A₄[:, 4]) ) 
U[4, :] = A₄[i, :]
L[:, 4] = A₄[:, 4] / U[4, 4];

i = argmax(abs.(A₄[:, 4])) = 1

We do have a factorization of the original matrix:

A₁ - L * U

4×4 Matrix{Float64}:
 0.0  -1.38778e-16  0.0  0.0
 0.0   1.38778e-16  0.0  0.0
 0.0   0.0          0.0  0.0
 0.0   0.0          0.0  0.0

And $\mathbf{U}$ has the required structure:

4×4 Matrix{Float64}:
 -4.0   5.0   -7.0      -10.0
  0.0  16.25   0.25      -7.0
  0.0   0.0    5.53846   -9.07692
  0.0   0.0    0.0       -0.166667

However, the triangularity of $\mathbf{L}$ has been broken.

4×4 Matrix{Float64}:
 -0.5    0.153846  0.0833333   1.0
  0.5   -0.153846  1.0        -0.0
 -0.25   1.0       0.0        -0.0
  1.0    0.0       0.0        -0.0

Example 2.6.2

Here is the trouble-making matrix from Demo 2.6.1.

A_1 = [2 0 4 3; -2 0 2 -13; 1 15 2 -4.5; -4 5 -7 -10]

We now find the largest candidate pivot in the first column. We don’t care about sign, so we take absolute values before finding the max.

[~, i] = max( abs(A_1(:, 1)) )

This is the row of the matrix that we extract to put into $\mathbf{U}$ . That guarantees that the division used to find $\boldsymbol{\ell}_1$ will be valid.

L = zeros(4, 4);
U = zeros(4, 4);
U(1, :) = A_1(i, :);
L(:, 1) = A_1(:, 1) / U(1, 1);
A_2 = A_1 - L(:, 1) * U(1, :)

[~, i] = max( abs(A_2(:, 2)) )
U(2, :) = A_2(i, :);
L(:, 2) = A_2(:, 2) / U(2, 2);
A_3 = A_2 - L(:, 2) * U(2, :)

Now we have zeroed out the third row as well as the second column. We can finish out the procedure.

[~, i] = max( abs(A_3(:, 3)) ) 
U(3, :) = A_3(i, :);
L(:, 3) = A_3(:, 3) / U(3, 3);
A_4 = A_3 - L(:, 3) * U(3, :)

[~, i] = max( abs(A_4(:, 4)) ) 
U(4, :) = A_4(i, :);
L(:, 4) = A_4(:, 4) / U(4, 4);

We do have a factorization of the original matrix:

A_1 - L * U

And $\mathbf{U}$ has the required structure:

However, the triangularity of $\mathbf{L}$ has been broken.

Example 2.6.2

Here is the trouble-making matrix from Demo 2.6.1.

A_1 = array([
    [2, 0, 4, 3],
    [-2, 0, 2, -13],
    [1, 15, 2, -4.5],
    [-4, 5, -7, -10]
    ])

We now find the largest candidate pivot in the first column. We don’t care about sign, so we take absolute values before finding the max.

i = argmax( abs(A_1[:, 0]) )
print(i)

This is the row of the matrix that we extract to put into $\mathbf{U}$ . That guarantees that the division used to find $\boldsymbol{\ell}_1$ will be valid.

L, U = eye(4), zeros((4, 4))
U[0, :] = A_1[i, :]
L[:, 0] = A_1[:, 0] / U[0, 0]
A_2 = A_1 - outer(L[:, 0], U[0, :])
print(A_2)

[[ 0.    2.5   0.5  -2.  ]
 [ 0.   -2.5   5.5  -8.  ]
 [ 0.   16.25  0.25 -7.  ]
 [ 0.    0.    0.    0.  ]]

i = argmax( abs(A_2[:, 1]) ) 
print(f"new pivot row is {i}")
U[1, :] = A_2[i, :]
L[:, 1] = A_2[:, 1] / U[1, 1]
A_3 = A_2 - outer(L[:, 1], U[1, :])
print(A_3)

new pivot row is 2
[[ 0.          0.          0.46153846 -0.92307692]
 [ 0.          0.          5.53846154 -9.07692308]
 [ 0.          0.          0.          0.        ]
 [ 0.          0.          0.          0.        ]]

Now we have zeroed out the third row as well as the second column. We can finish out the procedure.

i = argmax( abs(A_3[:, 2]) ) 
print(f"new pivot row is {i}")
U[2, :] = A_3[i, :]
L[:, 2] = A_3[:, 2] / U[2, 2]
A_4 = A_3 - outer(L[:, 2], U[2, :])
print(A_4)

new pivot row is 1
[[ 0.          0.          0.         -0.16666667]
 [ 0.          0.          0.          0.        ]
 [ 0.          0.          0.          0.        ]
 [ 0.          0.          0.          0.        ]]

i = argmax( abs(A_4[:, 3]) ) 
print(f"new pivot row is {i}")
U[3, :] = A_4[i, :]
L[:, 3] = A_4[:, 3] / U[3, 3];

new pivot row is 0

We do have a factorization of the original matrix:

A_1 - L @ U

array([[0.00000000e+00, 0.00000000e+00, 0.00000000e+00, 0.00000000e+00],
       [0.00000000e+00, 0.00000000e+00, 2.22044605e-16, 0.00000000e+00],
       [0.00000000e+00, 0.00000000e+00, 0.00000000e+00, 0.00000000e+00],
       [0.00000000e+00, 0.00000000e+00, 0.00000000e+00, 0.00000000e+00]])

And $\mathbf{U}$ has the required structure:

print(U)

[[ -4.           5.          -7.         -10.        ]
 [  0.          16.25         0.25        -7.        ]
 [  0.           0.           5.53846154  -9.07692308]
 [  0.           0.           0.          -0.16666667]]

However, the triangularity of $\mathbf{L}$ has been broken.

print(L)

[[-0.5         0.15384615  0.08333333  1.        ]
 [ 0.5        -0.15384615  1.         -0.        ]
 [-0.25        1.          0.         -0.        ]
 [ 1.          0.          0.         -0.        ]]

We will return to the loss of triangularity in $\mathbf{L}$ momentarily. First, though, there is a question left to answer: what if at some stage, all the elements of the targeted column are zero, i.e., there are no available pivots? Fortunately that loose end ties up nicely, although a proof is a bit beyond our scope here.

A linear system with a singular matrix has either no solution or infinitely many solutions. Either way, a technique other than LU factorization is needed to handle it.

2.6.2Permutations¶

Even though the resulting $\mathbf{L}$ in Demo 2.6.2 is no longer of unit lower triangular form, it is close. In fact, all that is needed is to reverse the order of its rows.

Example 2.6.3 (Pivoting as row permutation)

Julia

MATLAB

Python

Example 2.6.3

Here again is the matrix from Demo 2.6.2.

A = [2 0 4 3 ; -2 0 2 -13; 1 15 2 -4.5 ; -4 5 -7 -10]

4×4 Matrix{Float64}:
  2.0   0.0   4.0    3.0
 -2.0   0.0   2.0  -13.0
  1.0  15.0   2.0   -4.5
 -4.0   5.0  -7.0  -10.0

As the factorization proceeded, the pivots were selected from rows 4, 3, 2, and finally 1. If we were to put the rows of $\mathbf{A}$ into that order, then the algorithm would run exactly like the plain LU factorization from LU factorization.

B = A[[4, 3, 2, 1], :]
L, U = FNC.lufact(B);

We obtain the same $\mathbf{U}$ as before:

4×4 UpperTriangular{Float64, Matrix{Float64}}:
 -4.0   5.0   -7.0      -10.0
   ⋅   16.25   0.25      -7.0
   ⋅     ⋅     5.53846   -9.07692
   ⋅     ⋅      ⋅        -0.166667

And $\mathbf{L}$ has the same rows as before, but arranged into triangular order:

4×4 LowerTriangular{Float64, Matrix{Float64}}:
  1.0     ⋅         ⋅          ⋅ 
 -0.25   1.0        ⋅          ⋅ 
  0.5   -0.153846  1.0         ⋅ 
 -0.5    0.153846  0.0833333  1.0

Example 2.6.3

Here again is the matrix from Demo 2.6.2.

A = [2 0 4 3; -2 0 2 -13; 1 15 2 -4.5; -4 5 -7 -10]

B = A([4, 3, 2, 1], :);
[L, U] = lufact(B);

We obtain the same $\mathbf{U}$ as before:

And $\mathbf{L}$ has the same rows as before, but arranged into triangular order:

Example 2.6.3

Here again is the matrix from Demo 2.6.2.

A = array([
    [2, 0, 4, 3],
    [-2, 0, 2, -13],
    [1, 15, 2, -4.5],
    [-4, 5, -7, -10]
    ])

As the factorization proceeded, the pivots were selected from rows 4, 3, 2, and finally 1 (with NumPy indices being one less). If we were to put the rows of $\mathbf{A}$ into that order, then the algorithm would run exactly like the plain LU factorization from LU factorization.

B = A[[3, 2, 1, 0], :]
L, U = FNC.lufact(B);

We obtain the same $\mathbf{U}$ as before:

print(U)

[[ -4.           5.          -7.         -10.        ]
 [  0.          16.25         0.25        -7.        ]
 [  0.           0.           5.53846154  -9.07692308]
 [  0.           0.           0.          -0.16666667]]

And $\mathbf{L}$ has the same rows as before, but arranged into triangular order:

print(L)

[[ 1.          0.          0.          0.        ]
 [-0.25        1.          0.          0.        ]
 [ 0.5        -0.15384615  1.          0.        ]
 [-0.5         0.15384615  0.08333333  1.        ]]

In principle, if the permutation of rows implied by the pivot locations is applied all at once to the original $\mathbf{A}$ , no further pivoting is needed. In practice, this permutation cannot be determined immediately from the original $\mathbf{A}$ ; the only way to find it is to run the algorithm. Having obtained it at the end, though, we can use it to state a simple relationship.

Function 2.6.2 shows our implementation of PLU factorization.^[1]

Algorithm 2.6.2 (plufact)

Julia

MATLAB

Python

LU factorization with partial pivoting

plufact.jl

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
"""
    plufact(A)

Compute the PLU factorization of square matrix `A`, returning the
triangular factors and a row permutation vector.
"""
function plufact(A)
    n = size(A, 1)
    L = zeros(n, n)
    U = zeros(n, n)
    p = fill(0, n)
    Aₖ = float(copy(A))

    # Reduction by outer products
    for k in 1:n
        p[k] = argmax(abs.(Aₖ[:, k]))    # best pivot in column k
        U[k, :] = Aₖ[p[k], :]
        L[:, k] = Aₖ[:, k] / U[k, k]
        if k < n    # no update needed on last iteration
            Aₖ -= L[:, k] * U[k, :]'
        end
    end
    return LowerTriangular(L[p, :]), U, p
end

LU factorization with partial pivoting

plufact.jl

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
"""
    plufact(A)

Compute the PLU factorization of square matrix `A`, returning the
triangular factors and a row permutation vector.
"""
function plufact(A)
    n = size(A, 1)
    L = zeros(n, n)
    U = zeros(n, n)
    p = fill(0, n)
    Aₖ = float(copy(A))

    # Reduction by outer products
    for k in 1:n
        p[k] = argmax(abs.(Aₖ[:, k]))    # best pivot in column k
        U[k, :] = Aₖ[p[k], :]
        L[:, k] = Aₖ[:, k] / U[k, k]
        if k < n    # no update needed on last iteration
            Aₖ -= L[:, k] * U[k, :]'
        end
    end
    return LowerTriangular(L[p, :]), U, p
end

LU factorization with partial pivoting

plufact.py

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
def plufact(A):
    """
        plufact(A)

    Compute the PLU factorization of square matrix A, returning the
    triangular factors and a row permutation vector.
    """
    n = A.shape[0]
    L = np.zeros((n, n))
    U = np.zeros((n, n))
    p = np.zeros(n, dtype=int)
    A_k = np.copy(A)

    # Reduction by np.outer products
    for k in range(n):
        p[k] = np.argmax(abs(A_k[:, k]))
        U[k, :] = A_k[p[k], :]
        L[:, k] = A_k[:, k] / U[k, k]
        if k < n-1:
            A_k -= np.outer(L[:, k], U[k, :])
    return L[p, :], U, p

Ideally, the PLU factorization takes $\sim \frac{2}{3}n^3$ flops asymptotically, just like LU without pivoting. The implementation in Function 2.6.2 does not achieve this optimal flop count, however. Like Function 2.4.1, it does unnecessary operations on structurally known zeros for the sake of being easier to understand.

2.6.3Linear systems¶

The output of Function 2.6.2 is a factorization of a row-permuted $\mathbf{A}$ . Therefore, given a linear system $\mathbf{A}\mathbf{x}=\mathbf{b}$ , we have to permute $\mathbf{b}$ the same way before applying forward and backward substitution. This is equivalent to changing the order of the equations in a linear system, which does not affect its solution.

Example 2.6.4 (PLU factorization for solving linear systems)

Julia

MATLAB

Python

Example 2.6.4

The third output of plufact is the permutation vector we need to apply to $\mathbf{A}$ .

A = rand(1:20, 4, 4)
L, U, p = FNC.plufact(A)
A[p,:] - L * U   # should be ≈ 0

4×4 Matrix{Float64}:
 0.0   0.0          0.0          0.0
 0.0   0.0          0.0          0.0
 0.0   0.0          3.55271e-15  0.0
 0.0  -8.88178e-16  1.77636e-15  3.55271e-15

Given a vector $\mathbf{b}$ , we solve $\mathbf{A}\mathbf{x}=\mathbf{b}$ by first permuting the entries of $\mathbf{b}$ and then proceeding as before.

b = rand(4)
z = FNC.forwardsub(L,b[p])
x = FNC.backsub(U,z)

4-element Vector{Float64}:
  0.01399443922210889
  0.014538620453601336
 -0.014792442049713922
  0.03240571682877231

A residual check is successful:

b - A*x

4-element Vector{Float64}:
 0.0
 0.0
 0.0
 0.0

Example 2.6.4

The third output of plufact is the permutation vector we need to apply to $\mathbf{A}$ .

A = randi(20, 4, 4);
[L, U, p] = plufact(A);
A(p, :) - L * U    % should be ≈ 0

Given a vector $\mathbf{b}$ , we solve $\mathbf{A}\mathbf{x}=\mathbf{b}$ by first permuting the entries of $\mathbf{b}$ and then proceeding as before.

b = rand(4, 1);
z = forwardsub(L, b(p));
x = backsub(U, z)

A residual check is successful:

b - A*x

Example 2.6.4

The third output of plufact is the permutation vector we need to apply to $\mathbf{A}$ .

A = random.randn(4, 4)
L, U, p = FNC.plufact(A)
A[p, :] - L @ U   # should be ≈ 0

array([[ 0.00000000e+00,  0.00000000e+00,  0.00000000e+00,
         0.00000000e+00],
       [ 0.00000000e+00,  0.00000000e+00,  0.00000000e+00,
         0.00000000e+00],
       [ 0.00000000e+00,  0.00000000e+00, -1.11022302e-16,
         5.55111512e-17],
       [ 0.00000000e+00,  1.11022302e-16,  0.00000000e+00,
         2.22044605e-16]])

Given a vector $\mathbf{b}$ , we solve $\mathbf{A}\mathbf{x}=\mathbf{b}$ by first permuting the entries of $\mathbf{b}$ and then proceeding as before.

b = random.randn(4)
z = FNC.forwardsub(L, b[p])
x = FNC.backsub(U, z)

A residual check is successful:

b - A @ x

array([ 0.00000000e+00, 0.00000000e+00, -2.22044605e-16, -1.11022302e-16])

The lu function from the built-in package LinearAlgebra returns the same three outputs as Function 2.6.2. If you only request one output, it will be a factorization object that can be used with a backslash. This is useful when you want to solve with multiple versions of $\mathbf{b}$ but do the factorization only once.

Example 2.6.5 (Built-in PLU factorization)

Julia

MATLAB

Python

Example 2.6.5

With the syntax A \ b, the matrix A is PLU-factored, followed by two triangular solves.

A = randn(500, 500)   # 500x500 with normal random entries
A \ rand(500)          # force compilation
@elapsed for k=1:50; A \ rand(500); end

1.552101792

In Efficiency of matrix computations we showed that the factorization is by far the most costly part of the solution process. A factorization object allows us to do that costly step only once per unique matrix.

factored = lu(A)     # store factorization result
factored \ rand(500)   # force compilation
@elapsed for k=1:50; factored \ rand(500); end

0.006267875

Example 2.6.5

With the syntax A \ b, the matrix A is PLU-factored, followed by two triangular solves.

A = randn(500, 500);    % 500x500 with normal random entries
tic; for k=1:50; A \ rand(500, 1); end; toc

Elapsed time is 0.591412 seconds.

[L, U, p] = lu(A, 'vector');    % keep factorization result
tic
for k=1:50
    b = rand(500, 1);
    U \ (L \ b(p));
end
toc

Elapsed time is 0.010308 seconds.

Example 2.6.5

In linalg.solve, the matrix A is PLU-factored, followed by two triangular solves. If we want to do those steps seamlessly, we can use the lu_factor and lu_solve from scipy.linalg.

from scipy.linalg import lu_factor, lu_solve
A = random.randn(500, 500) 
b = ones(500)  
LU, perm = lu_factor(A)
x = lu_solve((LU, perm), b)

Why would we ever bother with this? In Efficiency of matrix computations we showed that the factorization is by far the most costly part of the solution process. A factorization object allows us to do that costly step only once per matrix, but solve with multiple right-hand sides.

start = timer()
for k in range(50): linalg.solve(A, random.rand(500))
print(f"elapsed time for 50 full solves: {timer() - start}")

start = timer()
LU, perm = lu_factor(A)
for k in range(50): lu_solve((LU, perm), random.rand(500))
print(f"elapsed time for 50 shortcut solves: {timer() - start}")

elapsed time for 50 full solves: 0.760492041001271
elapsed time for 50 shortcut solves: 0.019780083999648923

2.6.4Stability¶

There is one detail of the row pivoting algorithm that might seem arbitrary: why choose the pivot of largest magnitude in a column, rather than, say, the uppermost nonzero in the column? The answer is numerical stability.

Example 2.6.6 (Stability of PLU factorization)

Let

\mathbf{A} = \begin{bmatrix} -\epsilon & 1 \\ 1 & -1 \end{bmatrix}.

(2.6.2)

If $\epsilon=0$ , LU factorization without pivoting fails for $\mathbf{A}$ . But if $\epsilon\neq 0$ , we can go without pivoting, at least in principle.

Julia

MATLAB

Python

Example 2.6.6

We construct a linear system for this matrix with $\epsilon=10^{-12}$ and exact solution $[1,1]$ :

ϵ = 1e-12
A = [-ϵ 1; 1 -1]
b = A * [1, 1]

2-element Vector{Float64}:
 0.999999999999
 0.0

We can factor the matrix without pivoting and solve for $\mathbf{x}$ .

L, U = FNC.lufact(A)
x = FNC.backsub( U, FNC.forwardsub(L, b) )

2-element Vector{Float64}:
 0.9999778782798785
 1.0

Note that we have obtained only about 5 accurate digits for $x_1$ . We could make the result even more inaccurate by making ε even smaller:

ϵ = 1e-20; A = [-ϵ 1; 1 -1]
b = A * [1, 1]
L, U = FNC.lufact(A)
x = FNC.backsub( U, FNC.forwardsub(L, b) )

2-element Vector{Float64}:
 -0.0
  1.0

This effect is not due to ill conditioning of the problem—a solution with PLU factorization works perfectly:

A \ b

2-element Vector{Float64}:
 1.0
 1.0

Example 2.6.6

We construct a linear system for this matrix with $\epsilon=10^{-12}$ and exact solution $[1, 1]$ :

ep = 1e-12
A = [-ep 1; 1 -1];
b = A * [1; 1];

We can factor the matrix without pivoting and solve for $\mathbf{x}$ .

[L, U] = lufact(A);
x = backsub( U, forwardsub(L, b) )

Note that we have obtained only about 5 accurate digits for $x_1$ . We could make the result even more inaccurate by making ε even smaller:

ep = 1e-20; A = [-ep 1; 1 -1];
b = A * [1; 1];
[L, U] = lufact(A);
x = backsub( U, forwardsub(L, b) )

This effect is not due to ill conditioning of the problem—a solution with PLU factorization works perfectly:

A \ b

Example 2.6.6

We construct a linear system for this matrix with $\epsilon=10^{-12}$ and exact solution $[1,1]$ :

ep = 1e-12
A = array([[-ep, 1], [1, -1]])
b = A @ array([1, 1])

We can factor the matrix without pivoting and solve for $\mathbf{x}$ .

L, U = FNC.lufact(A)
print(FNC.backsub( U, FNC.forwardsub(L, b) ))

[0.99997788 1.        ]

Note that we have obtained only about 5 accurate digits for $x_1$ . We could make the result even more inaccurate by making ε even smaller:

ep = 1e-20;
A = array([[-ep, 1], [1, -1]])
b = A @ array([1, 1])
L, U = FNC.lufact(A)
print(FNC.backsub( U, FNC.forwardsub(L, b) ))

[-0.  1.]

This effect is not due to ill conditioning of the problem—a solution with PLU factorization works perfectly:

print(linalg.solve(A, b))

[1. 1.]

The factors of this $\mathbf{A}$ without pivoting are found to be

\mathbf{L} = \begin{bmatrix} 1 & 0 \\ -\epsilon^{-1} & 1 \end{bmatrix}, \qquad \mathbf{U} = \begin{bmatrix} -\epsilon & 1 \\ 0 & \epsilon^{-1}-1 \end{bmatrix}.

(2.6.3)

For reasons we will quantify in Conditioning of linear systems, the solution of $\mathbf{A}\mathbf{x}=\mathbf{b}$ is well-conditioned, but the problems of solving $\mathbf{L}\mathbf{z}=\mathbf{b}$ and $\mathbf{U}\mathbf{x}=\mathbf{z}$ have condition numbers essentially $1/\epsilon^2$ each. Thus, for small ε, solution of the original linear system by unpivoted LU factorization is highly unstable.

Somewhat surprisingly, solving $\mathbf{A}\mathbf{x}=\mathbf{b}$ via PLU factorization is technically also unstable. In fact, examples of unstable solutions are well-known, but they have been nonexistent in practice. While there is a lot of evidence and some reasoning about why this is the case, the situation is not completely understood. Yet PLU factorization remains the algorithm of choice for general linear systems.

2.6.5Exercises¶

✍ Perform by hand the pivoted LU factorization of each matrix.
(a) $\quad \displaystyle \begin{bmatrix} 2 & 3 & 4 \\ 4 & 5 & 10 \\ 4 & 8 & 2 \end{bmatrix},\qquad$ (b) $\quad \displaystyle \begin{bmatrix} 1 & 4 & 5 & -5 \\ -1 & 0 & -1 & -5 \\ 1 & 3 & -1 & 2 \\ 1 & -1 & 5 & -1 \end{bmatrix}$ .
✍ Let $\mathbf{A}$ be a square matrix and $\mathbf{b}$ be a column vector of compatible length. Here is correct Julia code to solve $\mathbf{A}\mathbf{x}=\mathbf{b}$ :
```
L,U,p = lu(A)
x = U \ (L\b[p])
```
Suppose instead you replace the last line above with
```
x = U \ L \ b[p]
```
Mathematically in terms of $\mathbf{L}$ , $\mathbf{U}$ , $\mathbf{p}$ , and $\mathbf{b}$ , what vector is found?
✍ Suppose that A is a $4\times 6$ matrix in Julia and you define
```
B = A[end:-1:1,end:-1:1]
```
Show that $\mathbf{B} = \mathbf{P} \mathbf{A} \mathbf{Q}$ for certain matrices $\mathbf{P}$ and $\mathbf{Q}$ .

✍ An $n\times n$ permutation matrix $\mathbf{P}$ is a reordering of the rows of an identity matrix such that $\mathbf{P} \mathbf{A}$ has the effect of moving rows $1,2,\ldots,n$ of $\mathbf{A}$ to new positions $i_1,i_2,\ldots,i_n$ . Then $\mathbf{P}$ can be expressed as
$\mathbf{P} = \mathbf{e}_{i_1}\mathbf{e}_1^T + \mathbf{e}_{i_2}\mathbf{e}_2^T + \cdots + \mathbf{e}_{i_n}\mathbf{e}_n^T.$
(2.6.4)
(a) For the case $n=4$ and $i_1=3$ , $i_2=2$ , $i_3=4$ , $i_4=1$ , write out separately, as matrices, all four of the terms in the sum. Then add them together to find $\mathbf{P}$ .
(b) Use the formula in the general case to show that $\mathbf{P}^{-1}=\mathbf{P}^T$ .

Footnotes¶

Because unpivoted LU factorization is not useful, in practice the term LU factorization mostly refers to pivoted LU.
↩

Preface

Efficiency of matrix computations

Preface

Vector and matrix norms