← Back to modules

Eigenvalues and Eigenvectors

Characteristic equation, diagonalization, and complex eigenvalues.

Eigenvalues and eigenvectors reveal the intrinsic skeleton of a linear transformation. Most vectors get both rotated and stretched when a matrix acts on them. But a few special vectors — the eigenvectors — only get stretched: their direction is preserved (or reversed), and the stretch factor is the eigenvalue.

Finding eigenvalues reduces to solving the characteristic equation det(AλI)=0\det(A-\lambda I)=0, a polynomial in λ\lambda. Each root is an eigenvalue; for each eigenvalue, the eigenvectors are the nonzero solutions to (AλI)v=0(A-\lambda I)\mathbf{v}=\mathbf{0}.

Diagonalization — writing A=PDP1A=PDP^{-1} — is the payoff: once you have eigenvectors and eigenvalues, complex operations like A100A^{100} become trivial. This chapter also covers the full theory of linear transformations, which underpins every subsequent result.

Linear transformations and their matrices

A function T:RnRmT:\mathbb{R}^n\to\mathbb{R}^m is linear if T(u+v)=T(u)+T(v)T(\mathbf{u}+\mathbf{v})=T(\mathbf{u})+T(\mathbf{v}) and T(cv)=cT(v)T(c\mathbf{v})=cT(\mathbf{v}) for all vectors and scalars. Together these imply T(cu+dv)=cT(u)+dT(v)T(c\mathbf{u}+d\mathbf{v}) = cT(\mathbf{u})+dT(\mathbf{v}) — linear maps preserve linear combinations.

Every linear transformation has a unique representing matrix: A=[T(e1)    T(e2)        T(en)]A = [T(\mathbf{e}_1)\;|\;T(\mathbf{e}_2)\;|\;\cdots\;|\;T(\mathbf{e}_n)], where the columns are the images of the standard basis vectors. Once you know what TT does to the basis, you know everything.

Geometric examples: rotations, reflections, projections, shears, and scalings are all linear. Non-linear examples: translations, squaring a vector, adding a constant.

Composition of transformations corresponds to matrix multiplication: (TATB)(x)=A(Bx)=(AB)x(T_A\circ T_B)(\mathbf{x}) = A(B\mathbf{x}) = (AB)\mathbf{x}. This is the geometric motivation for the matrix multiplication rule.

Helpful?

Kernel and image

The kernel of TT is ker(T)={x:T(x)=0}\ker(T) = \{\mathbf{x}: T(\mathbf{x})=\mathbf{0}\} — the set of inputs that TT collapses to zero. For TAT_A, this is exactly Null(A)\text{Null}(A).

TT is injective (one-to-one) iff ker(T)={0}\ker(T)=\{\mathbf{0}\} — only zero maps to zero. Equivalently, different inputs always produce different outputs.

The image of TT is Im(T)={T(x):xRn}=Col(A)\text{Im}(T) = \{T(\mathbf{x}): \mathbf{x}\in\mathbb{R}^n\} = \text{Col}(A). TT is surjective (onto) iff Im(T)=Rm\text{Im}(T)=\mathbb{R}^m iff rank(A)=m\text{rank}(A)=m.

The rank-nullity theorem for transformations: dim(ker(T))+dim(Im(T))=n\dim(\ker(T))+\dim(\text{Im}(T))=n. Independent of any choice of basis or matrix representation.

Helpful?

Eigenvalues and eigenvectors

A nonzero vector v\mathbf{v} is an eigenvector of AA with eigenvalue λ\lambda if Av=λvA\mathbf{v}=\lambda\mathbf{v}. The matrix merely scales v\mathbf{v} — it does not change its direction (or it reverses it when λ<0\lambda<0).

To find eigenvalues: rewrite as (AλI)v=0(A-\lambda I)\mathbf{v}=\mathbf{0}. For a nonzero solution to exist, the matrix AλIA-\lambda I must be singular: det(AλI)=0\det(A-\lambda I)=0. This is the characteristic equation.

The characteristic polynomial of an n×nn\times n matrix has degree nn and exactly nn roots in C\mathbb{C} (counting multiplicity). Real matrices may have complex eigenvalues in conjugate pairs.

For each eigenvalue λk\lambda_k, the eigenspace is Eλk=Null(AλkI)E_{\lambda_k}=\text{Null}(A-\lambda_k I). Any nonzero vector in this eigenspace is an eigenvector. The eigenspace always has dimension at least 11.

Trace and determinant shortcuts: iλi=tr(A)\sum_i \lambda_i = \text{tr}(A) and iλi=det(A)\prod_i \lambda_i = \det(A). For a 2×22\times 2 matrix, these two conditions fully determine the eigenvalues once you find the characteristic polynomial.

Helpful?

Diagonalization

A matrix AA is diagonalizable if A=PDP1A=PDP^{-1}, where the columns of PP are nn linearly independent eigenvectors and D=diag(λ1,,λn)D=\text{diag}(\lambda_1,\ldots,\lambda_n) contains the corresponding eigenvalues.

Criterion: AA is diagonalizable iff it has nn linearly independent eigenvectors. A sufficient condition: nn distinct eigenvalues. Repeated eigenvalues may or may not permit diagonalization.

Computing powers: Ak=PDkP1A^k = PD^kP^{-1}, and Dk=diag(λ1k,,λnk)D^k=\text{diag}(\lambda_1^k,\ldots,\lambda_n^k) is trivial. This turns computing A100A^{100} into three matrix multiplications.

Geometric interpretation: D=P1APD=P^{-1}AP means that in the coordinate system of eigenvectors, the transformation AA acts as pure diagonal scaling. Diagonalisation finds the 'natural' coordinate system for AA.

Non-diagonalisable matrices (defective matrices) require Jordan normal form — beyond this course but worth knowing exists.

Helpful?

Geometric transformations and their eigenvalues

Rotation by θ\theta (counterclockwise): Rθ=(cosθsinθsinθcosθ)R_\theta=\begin{pmatrix}\cos\theta&-\sin\theta\\\sin\theta&\cos\theta\end{pmatrix}. det(Rθ)=1\det(R_\theta)=1 (area preserved, orientation maintained). Eigenvalues e±iθe^{\pm i\theta} are complex for θ0,π\theta\neq 0,\pi — no real eigenvectors, consistent with the fact that rotations spin everything.

Reflection across the xx-axis: (1001)\begin{pmatrix}1&0\\0&-1\end{pmatrix}. Eigenvalues λ1=1\lambda_1=1 (vectors on xx-axis are fixed) and λ2=1\lambda_2=-1 (vectors on yy-axis are flipped). det=1\det=-1: orientation reversed.

Projection onto a line: eigenvalues 11 (vectors on the line are fixed by projection) and 00 (vectors orthogonal to the line collapse to zero). Idempotent: P2=PP^2=P.

Shear (1k01)\begin{pmatrix}1&k\\0&1\end{pmatrix}: repeated eigenvalue λ=1\lambda=1 with only one independent eigenvector. Not diagonalisable — it is a canonical example of a defective matrix.

Helpful?