Math 513 HW2 spring 2014, University of Wisconsin, Madison

Operations that acts on columns of \(B\) are implemented using a matrix which is post multiplied by \(B\), while operations that acts on rows of \(B\) are implemented by a matrix which is pre multiplied by \(B\).

Hence, to put it all together, the above operations are written in the other given, resulting in\[ r=R_{3}\times R_{2}\times R_{1}\times B\times C_{1}\times C_{2}\times C_{3}\times C_{4}\] where \(r\) is the ﬁnal transformation of \(B\). The question now asks to verify the above using Matlab. The following is the code used to verify the result

%script file name: problem_1_parta.m
%by Nasser M. Abbasi

%make a random B matrix to verify the method with
B  = randi(10,4,4);

C1 = zeros(size(B));
C1(logical(eye(size(C1)))) = 1;
C1(1,1) = 2;

C2 = zeros(size(B));
C2(logical(eye(size(C2)))) = 1;
C2(1,1)     = 0;
C2(1,end)   = 1;
C2(end,1)   = 1;
C2(end,end) =0;

C3 = zeros(size(B));
C3(logical(eye(size(C3)))) = 1;
C3(end,end) = 0;
C3(3,end)   = 1;

C4 = zeros(3);
C4(2,1) = 1;
C4(3,2) = 1;
C4(4,3) = 1;

R1 = zeros(size(B));
R1(logical(eye(size(R1)))) = 1;
R1(3,3) = 1/2;

R2 = zeros(size(B));
R2(logical(eye(size(R2)))) = 1;
R2(1,3) = 1;

R3 = zeros(size(B));
R3(logical(eye(size(R3)))) = 1;
R3(1,2) = -1;
R3(3,2) = -1;
R3(4,2) = -1;

fprintf('B is \n'); B
fprintf('step 1\n'); r = B*C1
fprintf('step 2\n'); r = R1*r
fprintf('step 3\n'); r = R2*r
fprintf('step 4\n'); r = r*C2
fprintf('step 5\n'); r = R3*r
fprintf('step 6\n'); r = r*C3
fprintf('step 7\n'); r = r*C4

EDU>> problem_1_part_a
B =
8     7    10     8
8     2     4     3
3     2     6     6
7     5     3     7
step 1
16     7    10     8
16     2     4     3
6     2     6     6
14     5     3     7
step 2
16     7    10     8
16     2     4     3
3     1     3     3
14     5     3     7
step 3
19     8    13    11
16     2     4     3
3     1     3     3
14     5     3     7
step 4
11     8    13    19
3     2     4    16
3     1     3     3
7     5     3    14
step 5
8     6     9     3
3     2     4    16
0    -1    -1   -13
4     3    -1    -2
step 6
8     6     9     9
3     2     4     4
0    -1    -1    -1
4     3    -1    -1
step 7
6     9     9
2     4     4
-1    -1    -1
3    -1    -1

1.2 part b

To write it as product \(A\times B\times C\), let \(A=R_{3}\times R_{2}\times R_{1}\) and \(C=C_{1}\times C_{2}\times C_{3}\times C_{4}\). The following matlab code veriﬁes this result. It uses the same \(B\) matrix used by part a above to verify that the same result is obtained

%script file name: problem_1_partb.m
%by Nasser M. Abbasi

%Using the same random B from part a to use to verify
B =  [8     7    10     8
     8     2     4     3
     3     2     6     6
     7     5     3     7];

C1 = zeros(size(B));
C1(logical(eye(size(C1)))) = 1;
C1(1,1) = 2;

C2 = zeros(size(B));
C2(logical(eye(size(C2)))) = 1;
C2(1,1)     = 0;
C2(1,end)   = 1;
C2(end,1)   = 1;
C2(end,end) =0;

C3 = zeros(size(B));
C3(logical(eye(size(C3)))) = 1;
C3(end,end) = 0;
C3(3,end)   = 1;

C4 = zeros(3);
C4(2,1) = 1;
C4(3,2) = 1;
C4(4,3) = 1;

R1 = zeros(size(B));
R1(logical(eye(size(R1)))) = 1;
R1(3,3) = 1/2;

R2 = zeros(size(B));
R2(logical(eye(size(R2)))) = 1;
R2(1,3) = 1;

R3 = zeros(size(B));
R3(logical(eye(size(R3)))) = 1;
R3(1,2) = -1;
R3(3,2) = -1;
R3(4,2) = -1;

fprintf('B is \n'); B
fprintf('A is \n'); A=R3*R2*R1
fprintf('C is \n'); C=C1*C2*C3*C4
fprintf('A*B*C is \n'); A*B*C

EDU>> problem_1_part_b
B is
8     7    10     8
8     2     4     3
3     2     6     6
7     5     3     7
A is
1.0000   -1.0000    0.5000         0
0    1.0000         0         0
0   -1.0000    0.5000         0
0   -1.0000         0    1.0000
C is
0     0     0
1     0     0
0     1     1
0     0     0
A*B*C is
6     9     9
2     4     4
-1    -1    -1
3    -1    -1

2 Problem 2.3 page 15

question: Do problem 2.3, page 15. For the latter, you may assume that the matrix is symmetric (i.e. \(A\) is real-values and \(A^{\prime }=A\)) and may examine there expressions of the form \(\left \langle Ax,y\right \rangle \)

2.1 part a

proof: Let \(x\) be an eigenvector of \(A\) with corresponding eigenvalue \(\lambda \), then \(Ax=\lambda x\), and taking the conjugate transpose of both sides \begin{align*} (Ax)^{\ast } & =(\lambda x)^{\ast }\\ x^{\ast }A^{\ast } & =\bar{\lambda }x^{\ast } \end{align*}

post multiply each side by \(x\) \begin{align*} x^{\ast }A^{\ast }x & =\bar{\lambda }x^{\ast }x\\ x^{\ast }(A^{\ast }x) & =\bar{\lambda }(x^{\ast }x) \end{align*}

And since \(A\) is Hermitian, then \(A=A^{\ast }\) and the above becomes \begin{align*} x^{\ast }(Ax) & =\bar{\lambda }(x^{\ast }x)\\ x^{\ast }(\lambda x) & =\bar{\lambda }(x^{\ast }x)\\ \lambda (x^{\ast }x) & =\bar{\lambda }(x^{\ast }x) \end{align*}

Since \(x\neq 0\) then the above implies \(\lambda =\bar{\lambda }\). This is only possible if \(\lambda \) is real. Hence all eiqenvalues of \(A\) must be real.

2.2 part b

Given: \(x,y\) are eigenvectors corresponding to distinct eigevalues. show that \(x,y\) are orthogonal

proof: Let \(\lambda _{x}\) be the eigenvalue corresponding to \(x\) and let \(\lambda _{y}\) be the eigenvalue corresponding to \(y\) and let. Then \begin{align*} Ax & =\lambda _{x}x\\ Ay & =\lambda _{y}y \end{align*}

Hence from the ﬁrst equation above, taking the complex conjugate \begin{align*} (Ax)^{\ast } & =(\lambda _{x}x)^{\ast }\\ x^{\ast }A^{\ast } & =\bar{\lambda }_{x}x^{\ast } \end{align*}

post multiply each side of the above by \(y\) gives \begin{align*} x^{\ast }A^{\ast }y & =\bar{\lambda }_{x}x^{\ast }y\\ x^{\ast }(A^{\ast }y) & =\bar{\lambda }_{x}x^{\ast }y\\ x^{\ast }(Ay) & =\bar{\lambda }_{x}x^{\ast }y\\ x^{\ast }(\lambda _{y}y) & =\bar{\lambda }_{x}x^{\ast }y\\ \lambda _{y}x^{\ast }y & =\bar{\lambda }_{x}x^{\ast }y \end{align*}

But \(\bar{\lambda }_{x}=\lambda _{x}\) from part a, hence \(\lambda _{y}x^{\ast }y=\lambda _{x}x^{\ast }y\) and since \(\lambda _{y}\neq \lambda _{x}\) since we assumed all eigenvalues are distinct, then the above implies that \[ x^{\ast }y=0 \] which means that \(\left \langle x,y\right \rangle =0\) which implies \(x\) and \(y\) are orthogonal.

3 problem 2

3.1 part (a)

given \(\left \Vert Qx\right \Vert =\left \Vert x\right \Vert \) shown that \(1\) is only eigenvalue of \(Q^{T}Q\).

By deﬁnition \begin{align*} \left \Vert Qx\right \Vert & =\left ( Qx\right ) ^{T}\left ( Qx\right ) \\ & =\left ( x^{T}Q^{T}\right ) \left ( Qx\right ) \\ & =x^{T}\left ( Q^{T}Q\right ) x \end{align*}

But we are told that \(\left \Vert Qx\right \Vert =\left \Vert x\right \Vert \) and since \(\left \Vert x\right \Vert =x^{T}x\,\) we can write

\begin{align*} x^{T}\left ( Q^{T}Q\right ) x & =\left \Vert x\right \Vert \\ & =x^{T}x \end{align*}

Therefore, for the LHS above to be equal to the RHS, it must be that \(Q^{T}Q=I\) where \(I\) is the identity matrix. But the only eigenvalue of \(I\) is \(1\), since \(Iv=v\) for any \(v\). Therefore \(1\) is the only eigenvalue of \(Q^{T}Q\) which can be written as \(\sigma \left \{ Q^{T}Q\right \} =\left \{ 1\right \} \)

3.2 part (b)

We need to show that \(Q^{T}Q\) is symmetric for any matrix \(Q\). By deﬁnition, a matrix \(A\) is symmetric if \(A^{T}=A\). But

We have shown that \(A^{T}=A\), where \(A\) happened to be \(Q^{T}Q\) in this case. Hence \(Q^{T}Q\) is symmetric for any \(Q\).

Now we need to use this property to show that \(\left \Vert Qx\right \Vert =\left \Vert x\right \Vert \) implies that \(Q\) is orthogonal as well.

A matrix is orthogonal if each one of its columns (or rows) is orthogonal to each other column (or row). In addition, the normal of each column (or row) is one.

The ﬁrst property above means that \(\left \langle q_{i},q_{j}\right \rangle =\delta _{ij}\) where \(\delta _{ij}=1\) if \(i=j\) and zero otherwise and where \(q_{i}\) means the \(i^{th}\) column (or row) of \(Q\) and \(q_{j}\) means the \(j^{th}\) column (or row) of \(Q\). But from part (a) above, we showed that \(Q^{T}Q=I\) which is the same as saying that \(\left \langle q_{i}^{T},q_{j}\right \rangle =\delta _{ij}\). Hence \(Q\) meets the ﬁrst propery of orthogonality. Now we need to show that the norm of each column (or row) of \(Q=1\).

Since \(\left \Vert q_{i}\right \Vert =\sqrt{q_{i}^{T}q_{i}}=\sqrt{\delta _{ii}}=\sqrt{1}=1\), then the norm is \(1\). Hence both properties are satisﬁed. Hence \(Q\) is unitary matrix (or orthogonal).