Classification of Bounded Operators
Here we define the notion of the adjoint operator and using this notion, define self-adjoint, unitary and normal operators. We discuss simple algebraic properties of operators in these classes. We also discuss projections.
The Adjoint Operator
Theorem 2.1: Let H be a Hilbert space and A∈B(H). There exists a unique operator A∗∈B(H) such that:
(Ax,y)=(x,A∗y), ∀x,y∈H. Proof: Let y∈H and define the linear functional fy:H→C as fy(x):=(Ax,y). By the Cauchy-Schwarz inequality, for every x∈H we have:
∣fy(x)∣≤∥Ax∥∥y∥≤∥A∥∥x∥∥y∥=(∥A∥∥y∥)∥x∥. So fy is a bounded linear functional on H and ∥f∥≤∥A∥∥y∥. According to the Riesz representation theorem there exists a unique zy∈H such that for all x∈H:
(Ax,y)=f(x)=(x,zy) and ∥zy∥=∥f∥≤∥A∥∥y∥. Define the operator A∗:H→H as A∗x=zy. One can verify that A∗ is linear. We have already proved that for all y∈H:
∥A∗y∥≤∥A∥∥y∥, in other words A∗ is bounded and:
∥A∗∥≤∥A∥. By construction 2.1 is satisfied and the uniqueness of zy guarantees that the constructed operator A∗ is the unique operator satisfying the inequality. ■
Definition 2.2: The operator A∗ from the previous theorem is called the adjoint of A.
The adjoint operator can also be defined in a more general situation when B∈B(X,Y) for Banach spaces X,Y; see Reed-Simon Section VI.2 for discussion. In this course we only consider the adjoint of operators on a Hilbert space.
Theorem 2.3: Let A,A1,A2∈B(H), and α∈C. Then:
- (αA1+A2)∗αA1∗+A2∗;
- (A1A2)∗=A2∗A1∗;
- A∗∗=A, where A∗∗=(A∗)∗;
- ∥A∗∥=∥A∥;
- ∥A∗A∥=∥AA∗∥=∥A∥2;
- If A is invertible then (A−1)∗=(A∗)−1.
Proof: In order to show that an operator is the adjoint of another, it is sufficient to show that it satisfies the adjoint equation (2.1) for all x,y∈H. Then, the uniqueness of the adjoint tells us that it has to be the adjoint.
- We compute for any x,y∈H:
((αA1+A2)x,y)=α(A1x,y)+(A2x,y)=α(x,A1∗y)+(x,A2∗y)=(x,(αA1∗+A2∗)y). - We compute again, for x,y∈H:
(A1A2x,y)=(A2x,A1∗y)=(x,A2∗A1∗y). - For every x,y∈H:
(A∗∗x,y)=(y,A∗∗x)=(A∗y,x)=(x,A∗y)=(Ax,y). Consequently for every x,y∈H:((A−A∗∗)x,y)=0. Since this is in particular true for y=(A−A∗∗)x we obtain that for all x∈H, (A−A∗∗)x=0. - This follows from 2.2 and (3):
∥A∥=∥A∗∗∥≤∥A∗∥≤∥A∥. - By (4) and Theorem 1.5:
∥A∗A∥≤∥A∗∥∥A∥=∥A∥2.
On the other hand:
∥A2∥=∥x∥=1sup∥Ax∥2=∥x∥=1sup(Ax,Ax)=∥x∥=1sup(x,A∗Ax)≤∥x∥=1sup∥x∥∥A∗Ax∥=∥A∗A∥. Thus ∥A∗A∥=∥A∥2. Using this equality with A∗ instead of A we derive from (3) and (4) that ∥AA∗∥=∥A∥2.
- It is sufficient to take the adjoint operators in the equality:
AA−1=I=A−1A. ■
Definition 2.4: An operator A∈B(H) is said to be:
- normal if AA∗=A∗A,
- self-adjoint if A∗=A, i.e. if for all (x,y)∈H, (Ax,y)=(x,Ay),
- unitary if A∗A=AA∗=I i.e. if A−1=A∗.
Note that any self-adjoint operator is normal. Any unitary operator U∈B(H) is normal as well.
Unitary operators can be defined more generally as operators from one Hilbert space to another one: U∈B(H1,H2) is called unitary if U∗U=IH1, UU∗=IH2.
Example 2.5: Consider the operator T of multiplication by a function t in L2(R). Then T is self-adjoint if and only if t is real-valued; T is unitary if and only if ∣t(x)∣=1 for all x; T is normal for any function t. Similar statements apply to the operator of multiplication by a sequence.
Example 2.6: An operator T given by the matrix {tij}i,j=1N in CN is self-adjoint if and only if tij=tji for all i,j. Similarly an integral operator with the integral kernel t(,y) is self-adjoint if t(x,y)=t(y,x) for all x,y. There are no known simple sufficient conditions for unitarity or normality of integral operators.
Theorem 2.7:
- A is self-adjoint, λ∈R⟹λA is self-adjoint.
- A1,A2 are self-adjoint ⟹A1+A2 is self-adjoint.
- Let A1,A2 be self-adjoint. Then A1A2 is self-adjoint if and only if A1 and A2 commute.
Proof: Exercise. ■
Theorem 2.8: Let A inB(H); then A is self-adjoint if and only if (Ax,x) is real for all x∈H.
Proof: If A is self-adjoint then for all x∈H:
(Ax,x)=(x,Ax)=(Ax,x), in other words, (Ax,x) is real.
Let us prove the converse. For any x,y∈H, we have:
4(Ax,y)=(A(x+y),x+y)−(A(x−y),x−y)+i(A(x+iy),x+iy)−i(A(x−iy),x−iy) (this is the polarization identity for operators) and similarly:
4(x,Ay)=(x+y,A(x+y))−(x−y,A(x−y))+i(x+iy,A(x+iy))−i(x−iy,A(x−iy)). For every z∈H we have:
(Bz,z)=(Bz,z)=(z,Bz). Since this is true for z∈{x+y,x−y,x+iy,x−iy}, the right hand sides of the two above equations are equal. This implies that:
(Ax,y)=(x,Ay) for all x,y∈H, in other words, A is self-adjoint. ■
Corollary 2.9: If Bn,n∈N are self-adjoint and Bn→B weakly, then B is self-adjoint.
Proof: We have (Anx,x)→(Ax,x) for all x. Since (Anx.x) are real, this implies that (Ax,x) is real, hence A is self-adjoint. ■