Computing with matrices - Fundamentals of Numerical Computation

At a reductive level, a matrix is a table of numbers that obeys certain algebraic laws. But matrices are pervasive in scientific computation, mainly because they represent linear operations on vectors. Moreover, vectors go far beyond the three-dimensional representations of physical quantities you learned about in calculus.

2.2.1Notation¶

We use capital letters in bold to refer to matrices, and lowercase bold letters for vectors. All named vectors in this book are column vectors. The bold symbol $\boldsymbol{0}$ may refer to a vector of all zeros or to a zero matrix, depending on context; we use 0 as the scalar zero only.

To refer to a specific element of a matrix, we use the uppercase name of the matrix without boldface, as in $A_{24}$ to mean the $(2,4)$ element of $\mathbf{A}$ .^[1] To refer to an element of a vector, we use just one subscript, as in $x_3$ . If you see a boldface character with one or more subscripts, then you know that it is a matrix or vector that belongs to a sequence or indexed collection.

We will have frequent need to refer to the individual columns of a matrix as vectors. Our convention is to use a lowercase bold version of the matrix name with a subscript to represent the column number. Thus, $\mathbf{a}_1,\mathbf{a}_2,\ldots,\mathbf{a}_n$ are the columns of the $m\times n$ matrix $\mathbf{A}$ . Conversely, whenever we define a sequence of vectors $\mathbf{v}_1,\ldots,\mathbf{v}_p$ , we can implicitly consider them to be columns of a matrix $\mathbf{V}$ . Sometimes we might write $\mathbf{V}=\bigl[ \mathbf{v}_j \bigr]$ to emphasize the connection.

The notation $\mathbf{A}^T$ is used for the transpose of a matrix, whether it is real or complex. In the case of complex matrices, it’s almost always more desirable to use the adjoint $\mathbf{A}^*$ , which is the transpose with the complex conjugate of each element.^[2] If $\mathbf{A}$ is real, then $\mathbf{A}^*=\mathbf{A}^T$ . A symmetric matrix is a square matrix such that $\mathbf{A}^T=\mathbf{A}$ .

The identity matrix of size $n$ is denoted $\mathbf{I}$ , or sometimes $\mathbf{I}_n$ if emphasizing the size is important in context. For columns of the identity we break with our usual naming convention and denote them by $\mathbf{e}_j$ .

2.2.2Block matrix expressions¶

We will often find it useful to break a matrix into separately named pieces. For example, we might write

\mathbf{A} = \begin{bmatrix} \mathbf{A}_{11} & \mathbf{A}_{12} & \mathbf{A}_{13} \\ \mathbf{A}_{21} & \mathbf{A}_{22} & \mathbf{A}_{23} \end{bmatrix}, \qquad \mathbf{B} = \begin{bmatrix} \mathbf{B}_1 \\ \mathbf{B}_2 \\ \mathbf{B}_3 \end{bmatrix}.

(2.2.1)

It’s understood that blocks that are on top of one another have the same number of columns, and blocks that are side by side have the same number of rows. Typically, if the blocks all have compatible dimensions, then they can be multiplied as though the blocks were scalars. For instance, continuing with the definitions above, we say that $\mathbf{A}$ is block- $2\times 3$ and $\mathbf{B}$ is block- $3\times 1$ , so we can write

\mathbf{A} \mathbf{B} = \begin{bmatrix} \mathbf{A}_{11}\mathbf{B}_1 + \mathbf{A}_{12}\mathbf{B}_2 + \mathbf{A}_{13}\mathbf{B}_3 \\ \mathbf{A}_{21}\mathbf{B}_1 + \mathbf{A}_{22}\mathbf{B}_2 + \mathbf{A}_{23}\mathbf{B}_3 \end{bmatrix},

(2.2.2)

provided that the individual block products are well-defined. For transposes we have, for example,

\mathbf{A}^T = \begin{bmatrix} \mathbf{A}_{11}^T & \mathbf{A}_{21}^T \\[2mm] \mathbf{A}_{12}^T & \mathbf{A}_{22}^T \\[2mm] \mathbf{A}_{13}^T & \mathbf{A}_{23}^T \end{bmatrix}.

(2.2.3)

2.2.3Vector and matrix basics¶

Vectors and matrices are integral to scientific computing. All modern languages provide ways to work with them beyond manipulation of individual elements.

Example 2.2.1 (Matrix operations)

Julia

MATLAB

Python

Example 2.2.1

In Julia, vectors and matrices are one-dimensional and two-dimensional arrays, respectively. Square brackets are used to enclose elements of a matrix or vector. Use spaces for horizontal concatenation, and semicolons or new lines to indicate vertical concatenation.

A = [ 1 2 3 4 5; 50 40 30 20 10
    π sqrt(2) exp(1) (1+sqrt(5))/2 log(3) ]

3×5 Matrix{Float64}:
  1.0       2.0       3.0       4.0       5.0
 50.0      40.0      30.0      20.0      10.0
  3.14159   1.41421   2.71828   1.61803   1.09861

m, n = size(A)

(3, 5)

A vector is not quite the same thing as a matrix: it has only one dimension, not two. Separate its elements by commas or semicolons:

x = [ 3, 3, 0, 1, 0 ]
size(x)

(5,)

For some purposes, however, an $n$ -vector in Julia is treated like having a column shape. Note the difference if we use spaces instead of commas inside the brackets:

y = [ 3 3 0 1 0 ]
size(y)

(1, 5)

This $1\times 5$ matrix is not equivalent to a vector.

Concatenated elements within brackets may be matrices or vectors for a block representation, as long as all the block sizes are compatible.

[ x  x ]

5×2 Matrix{Int64}:
 3  3
 3  3
 0  0
 1  1
 0  0

[ x; x ]

10-element Vector{Int64}:
 3
 3
 0
 1
 0
 3
 3
 0
 1
 0

The zeros and ones functions construct matrices with entries all zero or one, respectively.

B = [ zeros(3, 2) ones(3, 1) ]

3×3 Matrix{Float64}:
 0.0  0.0  1.0
 0.0  0.0  1.0
 0.0  0.0  1.0

A single quote ' after a matrix returns its adjoint. For real matrices, this is the transpose; for complex-valued matrices, the elements are also conjugated.

A'

5×3 adjoint(::Matrix{Float64}) with eltype Float64:
 1.0  50.0  3.14159
 2.0  40.0  1.41421
 3.0  30.0  2.71828
 4.0  20.0  1.61803
 5.0  10.0  1.09861

If x is simply a vector, then its transpose has a row shape.

x'

1×5 adjoint(::Vector{Int64}) with eltype Int64:
 3  3  0  1  0

There are many convenient shorthand ways of building vectors and matrices other than entering all of their entries directly or in a loop. To get a range with evenly spaced entries between two endpoints, you have two options. One is to use a colon :.

y = 1:4              # start:stop

1:4

z = 0:3:12           # start:step:stop

0:3:12

(Ranges are not strictly considered vectors, but they behave identically in most circumstances.) Instead of specifying the step size, you can give the number of points in the range if you use range.

s = range(-1, 1, 5)

-1.0:0.5:1.0

Accessing an element is done by giving one (for a vector) or two (for a matrix) index values within square brackets.

a = A[2, end-1]

20.0

x[2]

3

The indices can be vectors or ranges, in which case a block of the matrix is accessed.

A[1:2, end-2:end]    # first two rows, last three columns

2×3 Matrix{Float64}:
  3.0   4.0   5.0
 30.0  20.0  10.0

If a dimension has only the index : (a colon), then it refers to all the entries in that dimension of the matrix.

A[:, 1:2:end]        # all of the odd columns

3×3 Matrix{Float64}:
  1.0       3.0       5.0
 50.0      30.0      10.0
  3.14159   2.71828   1.09861

The matrix and vector senses of addition, subtraction, scalar multiplication, multiplication, and power are all handled by the usual symbols.

B = diagm( [-1, 0, -5] )   # create a diagonal matrix

3×3 Matrix{Int64}:
 -1  0   0
  0  0   0
  0  0  -5

@show size(A), size(B);
BA = B * A     # matrix product

(size(A), size(B)) = ((3, 5), (3, 3))

3×5 Matrix{Float64}:
  -1.0    -2.0       -3.0     -4.0      -5.0
   0.0     0.0        0.0      0.0       0.0
 -15.708  -7.07107  -13.5914  -8.09017  -5.49306

A * B causes an error here, because the dimensions aren’t compatible.

A * B    # throws an error

DimensionMismatch: matrix A has axes (Base.OneTo(3),Base.OneTo(5)), matrix B has axes (Base.OneTo(3),Base.OneTo(3))

Stacktrace:
 [1] _generic_matmatmul!(C::Matrix{Float64}, A::Matrix{Float64}, B::Matrix{Int64}, _add::LinearAlgebra.MulAddMul{true, true, Bool, Bool})
   @ LinearAlgebra ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:883
 [2] generic_matmatmul!
   @ ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:868 [inlined]
 [3] _mul!
   @ ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:287 [inlined]
 [4] mul!
   @ ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:285 [inlined]
 [5] mul!
   @ ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:253 [inlined]
 [6] *(A::Matrix{Float64}, B::Matrix{Int64})
   @ LinearAlgebra ~/.julia/juliaup/julia-1.11.2+0.aarch64.apple.darwin14/share/julia/stdlib/v1.11/LinearAlgebra/src/matmul.jl:114
 [7] top-level scope
   @ In[29]:1

A square matrix raised to an integer power is the same as repeated matrix multiplication.

B^3    # same as B*B*B

3×3 Matrix{Int64}:
 -1  0     0
  0  0     0
  0  0  -125

Sometimes one instead wants to treat a matrix or vector as a mere array and simply apply a single operation to each element of it. For multiplication, division, and power, the corresponding operators start with a dot.

C = -A;

Because both matrices are $3\times 5$ , A * C would be an error here, but elementwise operations are fine.

elementwise = A .* C

3×5 Matrix{Float64}:
    -1.0        -4.0    -9.0       -16.0       -25.0
 -2500.0     -1600.0  -900.0      -400.0      -100.0
    -9.8696     -2.0    -7.38906    -2.61803    -1.20695

The two operands of a dot operator have to have the same size—unless one is a scalar, in which case it is expanded or broadcast to be the same size as the other operand.

x_to_two = x .^ 2

5-element Vector{Int64}:
 9
 9
 0
 1
 0

two_to_x = 2 .^ x

5-element Vector{Int64}:
 8
 8
 1
 2
 1

Most of the mathematical functions, such as cos, sin, log, exp, and sqrt, expect scalars as operands. However, you can broadcast any function, including ones that you have defined, across a vector or array by using a special dot syntax.

show(cos.(π * x))    # broadcast to a function

[-1.0, -1.0, 1.0, -1.0, 1.0]

Rather than dotting multiple individual functions, you can use @. before an expression to broadcast everything within it.

show(@. cospi( (x + 1)^3) )    # broadcast an entire expression

[1.0, 1.0, -1.0, 1.0, -1.0]

Example 2.2.1

In MATLAB, every numerical value is treated like a matrix. A matrix with one row or one column is interpreted as a vector, and a $1\times 1$ matrix is interpreted as a scalar.

Square brackets are used to enclose elements of a matrix or vector. Use spaces for horizontal concatenation, and semicolons or new lines to indicate vertical concatenation.

A = [ 
    1       2      3             4      5; 
    50     40     30            20     10
    pi sqrt(2) exp(1) (1+sqrt(5))/2 log(3) 
    ]

m, n = size(A)

Unrecognized function or variable 'm'.

x = [ 3, 3, 0, 1, 0 ];   % row vector
size(x)

Concatenated elements within brackets may be matrices or vectors for a block representation, as long as all the block sizes are compatible.

[ x  x ]

[ x; x ]

The zeros and ones functions construct matrices with entries all zero or one, respectively.

B = [ zeros(3, 2) ones(3, 1) ]

A single quote ' after a matrix returns its adjoint. For real matrices, this is the transpose; for complex-valued matrices, the elements are also conjugated.

A'

y = 1:4              % start:stop

z = 0:3:12           % start:step:stop

Instead of specifying the step size, you can give the number of points in the range if you use linspace.

s = linspace(-1, 1, 5)    % row result

Accessing an element is done by giving one (for a vector) or two (for a matrix) index values within parentheses.

a = A(2, end-1)

x(2)

The indices can be vectors or ranges, in which case a block of the matrix is accessed.

A(1:2, end-2:end)    % first two rows, last three columns

If a dimension has only the index : (a colon), then it refers to all the entries in that dimension of the matrix.

A(:, 1:2:end)        % all of the odd columns

The matrix and vector senses of addition, subtraction, scalar multiplication, multiplication, and power are all handled by the usual symbols.

B = diag([-1, 0, -5])   % create a diagonal matrix

size(A)
size(B)

BA = B * A     % matrix product

A * B causes an error here, because the dimensions aren’t compatible.

A * B    % throws an error

Error using *
Incorrect dimensions for matrix multiplication. Check that the number of columns in the first matrix matches the number of rows in the second matrix. To operate on each element of the matrix individually, use TIMES (.*) for elementwise multiplication.

A square matrix raised to an integer power is the same as repeated matrix multiplication.

B^3    % same as B*B*B

C = -A;

Because both matrices are $3\times 5$ , A * C would be an error here, but elementwise operations are fine.

elementwise = A .* C

The two operands of a dot operator have to have the same size—unless one is a scalar, in which case it is expanded or broadcast to be the same size as the other operand.

x_to_two = x .^ 2

two_to_x = 2 .^ x

cos(pi * x)

Example 2.2.1

A vector is created using square brackets and commas to enclose and separate its entries.

x = array([3, 3, 0, 1, 0 ])
print(x.shape)

(5,)

To construct a matrix, you nest the brackets to create a “vector of vectors”. The inner vectors are the rows.

A = array([ 
    [1, 2, 3, 4, 5],
    [50, 40, 30, 20, 10], 
    [pi, sqrt(2), exp(1), (1+sqrt(5))/2, log(3)] 
    ])

print(A)
print(A.shape)

[[ 1.          2.          3.          4.          5.        ]
 [50.         40.         30.         20.         10.        ]
 [ 3.14159265  1.41421356  2.71828183  1.61803399  1.09861229]]
(3, 5)

In this text, we treat all vectors as equivalent to matrices with a single column. That isn’t true in NumPy, because even an $n \times 1$ array has two dimensions, unlike a vector.

array([[3], [1], [2]]).shape

(3, 1)

You can concatenate arrays with compatible dimensions using hstack and vstack.

print( hstack([A, A]) )

[[ 1.          2.          3.          4.          5.          1.
   2.          3.          4.          5.        ]
 [50.         40.         30.         20.         10.         50.
  40.         30.         20.         10.        ]
 [ 3.14159265  1.41421356  2.71828183  1.61803399  1.09861229  3.14159265
   1.41421356  2.71828183  1.61803399  1.09861229]]

print( vstack([A, A]) )

[[ 1.          2.          3.          4.          5.        ]
 [50.         40.         30.         20.         10.        ]
 [ 3.14159265  1.41421356  2.71828183  1.61803399  1.09861229]
 [ 1.          2.          3.          4.          5.        ]
 [50.         40.         30.         20.         10.        ]
 [ 3.14159265  1.41421356  2.71828183  1.61803399  1.09861229]]

Transposing a matrix is done by appending .T to it.

print(A.T)

[[ 1.         50.          3.14159265]
 [ 2.         40.          1.41421356]
 [ 3.         30.          2.71828183]
 [ 4.         20.          1.61803399]
 [ 5.         10.          1.09861229]]

For matrices with complex values, we usually want instead the adjoint or hermitian, which is .conj().T.

print((x + 1j).conj().T)

[3.-1.j 3.-1.j 0.-1.j 1.-1.j 0.-1.j]

There are many convenient shorthand ways of building vectors and matrices other than entering all of their entries directly or in a loop. To get a vector with evenly spaced entries between two endpoints, you have two options.

print(arange(1, 7, 2))   # from 1 to 7 (not inclusive), step by 2

[1 3 5]

print(linspace(-1, 1, 5))   # from -1 to 1 (inclusive), with 5 total values

[-1.  -0.5  0.   0.5  1. ]

The practical difference between these is whether you want to specify the step size in arange or the number of points in linspace.

Accessing an element is done by giving one (for a vector) or two index values in square brackets. In Python, indexing always starts with zero, not 1.

A = array([ 
    [1, 2, 3, 4, 5],
    [50, 40, 30, 20, 10], 
    linspace(-5, 5, 5) 
    ])
x = array([3, 2, 0, 1, -1 ])

print("row 2, col 3 of A:", A[1, 2])
print("first element of x:", x[0])

row 2, col 3 of A: 30.0
first element of x: 3

The indices can be ranges, in which case a slice or block of the matrix is accessed. You build these using a colon in the form start:stop. However, the last value of this range is stop-1, not stop.

print(A[1:3, 0:2])    # rows 2 and 3, cols 1 and 2

[[50.  40. ]
 [-5.  -2.5]]

If start or stop is omitted, the range extends to the first or last index.

print(x[1:])  # elements 2 through the end

[ 2  0  1 -1]

print(A[:2, 0])  # first two rows in column 1

[ 1. 50.]

Notice in the last case above that even when the slice is in the shape of a column vector, the result is just a vector with one dimension and neither row nor column shape.

There are more variations on the colon ranges. A negative value means to count from the end rather than the beginning. And a colon by itself means to include everything from the relevant dimension.

print(A[:-1, :])    # all rows up to the last, all columns

[[ 1.  2.  3.  4.  5.]
 [50. 40. 30. 20. 10.]]

Finally, start:stop:step means to step size or stride other than one. You can mix this with the other variations.

print(x[::2])  # all the odd indexes

[ 3  0 -1]

print(A[:, ::-1])  # reverse the columns

[[ 5.   4.   3.   2.   1. ]
 [10.  20.  30.  40.  50. ]
 [ 5.   2.5  0.  -2.5 -5. ]]

The matrix and vector senses of addition, subtraction, and scalar multiplication and division are all handled by the usual symbols. Two matrices of the same size (what NumPy calls shape) are operated on elementwise.

print(A - 2 * ones([3, 5]))  # subtract two from each element

[[-1.   0.   1.   2.   3. ]
 [48.  38.  28.  18.   8. ]
 [-7.  -4.5 -2.   0.5  3. ]]

If one operand has a smaller number of dimensions than the other, Python tries to broadcast it in the “missing” dimension(s), and the operation proceeds if the resulting shapes are identical.

print(A - 2)    # subtract two from each element

[[-1.   0.   1.   2.   3. ]
 [48.  38.  28.  18.   8. ]
 [-7.  -4.5 -2.   0.5  3. ]]

u = array([1, 2, 3, 4, 5])
print(A - u)    # repeat this row for every row of A

[[ 0.   0.   0.   0.   0. ]
 [49.  38.  27.  16.   5. ]
 [-6.  -4.5 -3.  -1.5  0. ]]

v = array([1, 2, 3])
print(A - v)  # broadcasting this would be 3x3, so it's an error

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[29], line 2
      1 v = array([1, 2, 3])
----> 2 print(A - v)  # broadcasting this would be 3x3, so it's an error

ValueError: operands could not be broadcast together with shapes (3,5) (3,)

print(A - v.reshape([3, 1]))    # broadcasts to each column of A

[[ 0.   1.   2.   3.   4. ]
 [48.  38.  28.  18.   8. ]
 [-8.  -5.5 -3.  -0.5  2. ]]

Matrix–matrix and matrix–vector products are computed using @ or matmul.

B = diag([-1, 0, -5])    # create a diagonal 3x3
print(B @ A)    # matrix product

[[ -1.   -2.   -3.   -4.   -5. ]
 [  0.    0.    0.    0.    0. ]
 [ 25.   12.5   0.  -12.5 -25. ]]

$AB$ is undefined for these matrix sizes.

print(A @ B)    # incompatible sizes

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[32], line 1
----> 1 print(A @ B)    # incompatible sizes

ValueError: matmul: Input operand 1 has a mismatch in its core dimension 0, with gufunc signature (n?,k),(k,m?)->(n?,m?) (size 3 is different from 5)

The multiplication operator * is reserved for elementwise multiplication. Both operands have to be the same size, after any potential broadcasts.

print(B * A)    # not the same size, so it's an error

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[33], line 1
----> 1 print(B * A)    # not the same size, so it's an error

ValueError: operands could not be broadcast together with shapes (3,3) (3,5)

print((A / 2) * A)    # elementwise

[[5.000e-01 2.000e+00 4.500e+00 8.000e+00 1.250e+01]
 [1.250e+03 8.000e+02 4.500e+02 2.000e+02 5.000e+01]
 [1.250e+01 3.125e+00 0.000e+00 3.125e+00 1.250e+01]]

To raise to a power elementwise, use a double star. This will broadcast as well.

print(B)
print(B**3)

[[-1  0  0]
 [ 0  0  0]
 [ 0  0 -5]]
[[  -1    0    0]
 [   0    0    0]
 [   0    0 -125]]

print(x)
print(2.0**x)

[ 3  2  0  1 -1]
[8.  4.  1.  2.  0.5]

Most of the mathematical functions, such as cos, sin, log, exp and sqrt, expecting scalars as operands will be broadcast to arrays.

print(cos(pi * x))

[-1.  1.  1. -1. -1.]

2.2.4Row and column operations¶

A critical identity in matrix multiplication is

\mathbf{A} \mathbf{e}_j = \mathbf{a}_j.

(2.2.4)

Furthermore, the expression

\mathbf{A} \begin{bmatrix} \mathbf{e}_1 & \mathbf{e}_3 & \mathbf{e}_5 \end{bmatrix}

(2.2.5)

reproduces three columns. An equivalent expression in Julia would be A[:,1:2:5].

We can extend the same idea to rows by using the general identity $(\mathbf{R}\mathbf{S})^T=\mathbf{S}^T\mathbf{R}^T$ . Let $\mathbf{B}=\mathbf{A}^T$ have columns $\bigl[ \mathbf{b}_j \bigr]$ , and note

(\mathbf{b}_j)^T = (\mathbf{B} \mathbf{e}_j)^T = \mathbf{e}_j^T \mathbf{B}^T = \mathbf{e}_j^T \mathbf{A}.

(2.2.6)

But $\mathbf{e}_j^T$ is the $j$ th row of $\mathbf{I}$ , and $\mathbf{b}_j^T$ is the transpose of the $j$ th column of $\mathbf{B}$ , which is the $j$ th row of $\mathbf{A}$ by $\mathbf{B}=\mathbf{A}^T$ . Thus, multiplication on the left by row $j$ of the identity extracts the $j$ th row. Extracting the single element $(i,j)$ from the matrix is, therefore, $\mathbf{e}_i^T \mathbf{A} \mathbf{e}_j$ .

Being able to extract specific rows and columns of a matrix via algebra makes it straightforward to do row- and column-oriented operations, such as linear combinations.

Example 2.2.2

Say that $\mathbf{A}$ has five columns. Adding twice the third column of $\mathbf{A}$ to its first column is done by

\mathbf{A}(\mathbf{e}_1+2\mathbf{e}_3).

(2.2.7)

Suppose we want to do this operation “in place,” meaning replacing the first column of $\mathbf{A}$ with this value and leaving the other four columns of $\mathbf{A}$ alone. We can replace $\mathbf{A}$ with

\mathbf{A} \begin{bmatrix} \mathbf{e}_1+2\mathbf{e}_3 & \mathbf{e}_2 & \mathbf{e}_3 & \mathbf{e}_4 & \mathbf{e}_5 \end{bmatrix}.

(2.2.8)

Julia

MATLAB

Python

The Julia equivalent is

A[:, 1] += 2A[:, 3]

The += operator means to increment the item on the left-hand side. There are similar interpretations for -= and *=.

2.2.5Exercises¶

✍ Suppose
$\mathbf{C} = \begin{bmatrix} \mathbf{I} & \mathbf{A} \\ -\mathbf{I} & \mathbf{B} \end{bmatrix}.$
(2.2.9)
Using block notation, find $\mathbf{C}^2$ and $\mathbf{C}^3$ .
⌨ Let
$\mathbf{A} = \begin{bmatrix} 2 & 1 & 1 & 0 \\ 0 & -1 & 4 & 1 \\ 2 & 2 & 0 & -2 \\ 1 & 3 & -1 & 5 \end{bmatrix}, \quad \mathbf{B} = \begin{bmatrix} 3 & -1 & 0 & 2 \\ 7 & 1 & 0 & 2 \end{bmatrix},$
(2.2.10)
$\mathbf{u} = \begin{bmatrix} 2 \\ -1 \\ 3 \\ 1 \end{bmatrix}, \quad \mathbf{v} = \begin{bmatrix} \pi \\ e \end{bmatrix}.$
(2.2.11)
(Do not round off the values in $\mathbf{v}$ —find them using native Julia commands.) For each expression below, use Julia to find the result, or explain why the result does not exist.
(a) $\mathbf{A}\mathbf{B},\quad$ (b) $\mathbf{B} \mathbf{A},\quad$ (c) $\mathbf{v}^T \mathbf{B},\quad$ (d) $\mathbf{B} \mathbf{u},\quad$ (e) $\bigl[ \, \mathbf{u}\:\: \mathbf{A}\mathbf{u} \:\: \mathbf{A}^2 \mathbf{u} \:\: \mathbf{A}^3 \mathbf{u} \bigr]$ .
⌨ Let
$\mathbf{u} = \begin{bmatrix} 1\\3\\5\\7\\9\\11 \end{bmatrix}, \qquad \mathbf{v} = \begin{bmatrix} -60 \\ -50 \\ -40 \\ -30 \\ -20 \\ -10 \end{bmatrix}.$
(2.2.12)
Find the inner products $\mathbf{u}^T\mathbf{v}$ and $\mathbf{v}^T\mathbf{u}$ and the outer products $\mathbf{u}\mathbf{v}^T$ and $\mathbf{v}\mathbf{u}^T$ .
⌨ In Julia, give a demonstration of the identity $(\mathbf{A}\mathbf{B})^T=\mathbf{B}^T\mathbf{A}^T$ for some arbitrarily chosen $3\times 4$ matrix $\mathbf{A}$ and $4\times 2$ matrix $\mathbf{B}$ .

✍ Prove that if $\mathbf{A}$ and $\mathbf{B}$ are invertible, then $(\mathbf{A}\mathbf{B})^{-1}=\mathbf{B}^{-1}\mathbf{A}^{-1}$ . (In producing the inverse, it follows that $\mathbf{A}\mathbf{B}$ is invertible as well.)
✍ Suppose $\mathbf{B}$ is an arbitrary $4\times 3$ matrix. In each part below a matrix $\mathbf{A}$ is described in terms of $\mathbf{B}$ . Express $\mathbf{A}$ as a product of $\mathbf{B}$ with one or more other matrices.
(a) $\mathbf{A}\in\mathbb{R}^{4 \times 1}$ is the result of adding the first column of $\mathbf{B}$ to -2 times the last column of $\mathbf{B}$ .
(b) The rows of $\mathbf{A}\in\mathbb{R}^{4 \times 3}$ are the rows of $\mathbf{B}$ in order 4,3,2,1.
(c) The first column of $\mathbf{A}\in\mathbb{R}^{4 \times 3}$ is 1 times the first column of $\mathbf{B}$ , the second column of $\mathbf{A}$ is 2 times the second column of $\mathbf{B}$ , and the third column of $\mathbf{A}$ is 3 times the third column of $\mathbf{B}$ .
(d) $A$ is the scalar sum of all elements of $\mathbf{B}$ .
(a) ✍ Prove that for real vectors $\mathbf{v}$ and $\mathbf{w}$ of the same length, the inner products $\mathbf{v}^T\mathbf{w}$ and $\mathbf{w}^T\mathbf{v}$ are equal.
(b) ✍ Prove true, or give a counterexample for, the equivalent statement about outer products, $\mathbf{v}\mathbf{w}^T$ and $\mathbf{w}\mathbf{v}^T$ .

Footnotes¶

This aspect of our notation is slightly unusual. More frequently one would see the lowercase $a_{24}$ in this context. We feel that our notation lends more consistency and clarity to expressions with mixed symbols, and it is more like how computer code is written.
↩
The conjugate of a complex number is found by replacing all references to the imaginary unit $i$ by $-i$ .
↩

Preface

Polynomial interpolation

Preface

Linear systems