Skip to article frontmatterSkip to article content

Row pivoting

As mentioned in LU factorization, the A=LU\mathbf{A}=\mathbf{L}\mathbf{U} factorization is not stable for every nonsingular A\mathbf{A}. Indeed, the factorization does not always even exist.

In LU factorization we remarked that LU factorization is equivalent to Gaussian elimination with no row swaps. However, those swaps are necessary in situations like those encountered in Demo 2.6.1, in order to avoid division by zero. We will find a modification of the outer product procedure that allows us to do the same thing.

2.6.1Choosing a pivot

The diagonal element of U\mathbf{U} that appears in the denominator of line 17 of Function 2.4.1 is called the pivot element of its column. In order to avoid a zero pivot, we will use the largest available element in the column we are working on as the pivot. This technique is known as row pivoting.

We will return to the loss of triangularity in L\mathbf{L} momentarily. First, though, there is a question left to answer: what if at some stage, all the elements of the targeted column are zero, i.e., there are no available pivots? Fortunately that loose end ties up nicely, although a proof is a bit beyond our scope here.

A linear system with a singular matrix has either no solution or infinitely many solutions. Either way, a technique other than LU factorization is needed to handle it.

2.6.2Permutations

Even though the resulting L\mathbf{L} in Demo 2.6.2 is no longer of unit lower triangular form, it is close. In fact, all that is needed is to reverse the order of its rows.

In principle, if the permutation of rows implied by the pivot locations is applied all at once to the original A\mathbf{A}, no further pivoting is needed. In practice, this permutation cannot be determined immediately from the original A\mathbf{A}; the only way to find it is to run the algorithm. Having obtained it at the end, though, we can use it to state a simple relationship.

Function 2.6.2 shows our implementation of PLU factorization.[1]

Ideally, the PLU factorization takes 23n3\sim \frac{2}{3}n^3 flops asymptotically, just like LU without pivoting. The implementation in Function 2.6.2 does not achieve this optimal flop count, however. Like Function 2.4.1, it does unnecessary operations on structurally known zeros for the sake of being easier to understand.

2.6.3Linear systems

The output of Function 2.6.2 is a factorization of a row-permuted A\mathbf{A}. Therefore, given a linear system Ax=b\mathbf{A}\mathbf{x}=\mathbf{b}, we have to permute b\mathbf{b} the same way before applying forward and backward substitution. This is equivalent to changing the order of the equations in a linear system, which does not affect its solution.

The lu function from the built-in package LinearAlgebra returns the same three outputs as Function 2.6.2. If you only request one output, it will be a factorization object that can be used with a backslash. This is useful when you want to solve with multiple versions of b\mathbf{b} but do the factorization only once.

2.6.4Stability

There is one detail of the row pivoting algorithm that might seem arbitrary: why choose the pivot of largest magnitude in a column, rather than, say, the uppermost nonzero in the column? The answer is numerical stability.

The factors of this A\mathbf{A} without pivoting are found to be

L=[10ϵ11],U=[ϵ10ϵ11]. \mathbf{L} = \begin{bmatrix} 1 & 0 \\ -\epsilon^{-1} & 1 \end{bmatrix}, \qquad \mathbf{U} = \begin{bmatrix} -\epsilon & 1 \\ 0 & \epsilon^{-1}-1 \end{bmatrix}.

For reasons we will quantify in Conditioning of linear systems, the solution of Ax=b\mathbf{A}\mathbf{x}=\mathbf{b} is well-conditioned, but the problems of solving Lz=b\mathbf{L}\mathbf{z}=\mathbf{b} and Ux=z\mathbf{U}\mathbf{x}=\mathbf{z} have condition numbers essentially 1/ϵ21/\epsilon^2 each. Thus, for small ε, solution of the original linear system by unpivoted LU factorization is highly unstable.

Somewhat surprisingly, solving Ax=b\mathbf{A}\mathbf{x}=\mathbf{b} via PLU factorization is technically also unstable. In fact, examples of unstable solutions are well-known, but they have been nonexistent in practice. While there is a lot of evidence and some reasoning about why this is the case, the situation is not completely understood. Yet PLU factorization remains the algorithm of choice for general linear systems.

2.6.5Exercises

  1. ✍ Perform by hand the pivoted LU factorization of each matrix.

    (a) [2344510482],\quad \displaystyle \begin{bmatrix} 2 & 3 & 4 \\ 4 & 5 & 10 \\ 4 & 8 & 2 \end{bmatrix},\qquad (b) [1455101513121151]\quad \displaystyle \begin{bmatrix} 1 & 4 & 5 & -5 \\ -1 & 0 & -1 & -5 \\ 1 & 3 & -1 & 2 \\ 1 & -1 & 5 & -1 \end{bmatrix}.

  2. ✍ Let A\mathbf{A} be a square matrix and b\mathbf{b} be a column vector of compatible length. Here is correct Julia code to solve Ax=b\mathbf{A}\mathbf{x}=\mathbf{b}:

    L,U,p = lu(A)
    x = U \ (L\b[p])

    Suppose instead you replace the last line above with

    x = U \ L \ b[p]

    Mathematically in terms of L\mathbf{L}, U\mathbf{U}, p\mathbf{p}, and b\mathbf{b}, what vector is found?

  3. ✍ Suppose that A is a 4×64\times 6 matrix in Julia and you define

    B = A[end:-1:1,end:-1:1]

    Show that B=PAQ\mathbf{B} = \mathbf{P} \mathbf{A} \mathbf{Q} for certain matrices P\mathbf{P} and Q\mathbf{Q}.

  1. ✍ An n×nn\times n permutation matrix P\mathbf{P} is a reordering of the rows of an identity matrix such that PA\mathbf{P} \mathbf{A} has the effect of moving rows 1,2,,n1,2,\ldots,n of A\mathbf{A} to new positions i1,i2,,ini_1,i_2,\ldots,i_n. Then P\mathbf{P} can be expressed as

    P=ei1e1T+ei2e2T++einenT.\mathbf{P} = \mathbf{e}_{i_1}\mathbf{e}_1^T + \mathbf{e}_{i_2}\mathbf{e}_2^T + \cdots + \mathbf{e}_{i_n}\mathbf{e}_n^T.

    (a) For the case n=4n=4 and i1=3i_1=3, i2=2i_2=2, i3=4i_3=4, i4=1i_4=1, write out separately, as matrices, all four of the terms in the sum. Then add them together to find P\mathbf{P}.

    (b) Use the formula in the general case to show that P1=PT\mathbf{P}^{-1}=\mathbf{P}^T.

Footnotes
  1. Because unpivoted LU factorization is not useful, in practice the term LU factorization mostly refers to pivoted LU.