Eigenvalues and eigenvectors reveal the intrinsic skeleton of a linear transformation. Most vectors get both rotated and stretched when a matrix acts on them. But a few special vectors — the eigenvectors — only get stretched: their direction is preserved (or reversed), and the stretch factor is the eigenvalue.
Finding eigenvalues reduces to solving the characteristic equation , a polynomial in . Each root is an eigenvalue; for each eigenvalue, the eigenvectors are the nonzero solutions to .
Diagonalization — writing — is the payoff: once you have eigenvectors and eigenvalues, complex operations like become trivial. This chapter also covers the full theory of linear transformations, which underpins every subsequent result.
Linear transformations and their matrices
A function is linear if and for all vectors and scalars. Together these imply — linear maps preserve linear combinations.
Every linear transformation has a unique representing matrix: , where the columns are the images of the standard basis vectors. Once you know what does to the basis, you know everything.
Geometric examples: rotations, reflections, projections, shears, and scalings are all linear. Non-linear examples: translations, squaring a vector, adding a constant.
Composition of transformations corresponds to matrix multiplication: . This is the geometric motivation for the matrix multiplication rule.
💡Explain it simply
A linear transformation reshapes space without bending, tearing, or moving the origin. Lines stay lines. Parallel lines stay parallel. The whole space gets rotated, stretched, or squished in a uniform way. Every such rule is exactly captured by a matrix.
Matrix of a 90° rotation
- Find the matrix for counterclockwise rotation in .
- (the -axis vector rotates to the -axis).
- .
- Matrix: .
- Apply to : . Rotated ✓.
Kernel and image
The kernel of is — the set of inputs that collapses to zero. For , this is exactly .
is injective (one-to-one) iff — only zero maps to zero. Equivalently, different inputs always produce different outputs.
The image of is . is surjective (onto) iff iff .
The rank-nullity theorem for transformations: . Independent of any choice of basis or matrix representation.
💡Explain it simply
The kernel is the transformation's blind spot — inputs it can't distinguish from zero. The image is its range — all possible outputs. An injective map has a trivial kernel (no blind spots). A surjective map has an image that fills the entire output space.
Eigenvalues and eigenvectors
A nonzero vector is an eigenvector of with eigenvalue if . The matrix merely scales — it does not change its direction (or it reverses it when ).
To find eigenvalues: rewrite as . For a nonzero solution to exist, the matrix must be singular: . This is the characteristic equation.
The characteristic polynomial of an matrix has degree and exactly roots in (counting multiplicity). Real matrices may have complex eigenvalues in conjugate pairs.
For each eigenvalue , the eigenspace is . Any nonzero vector in this eigenspace is an eigenvector. The eigenspace always has dimension at least .
Trace and determinant shortcuts: and . For a matrix, these two conditions fully determine the eigenvalues once you find the characteristic polynomial.
💡Explain it simply
Imagine pushing on a rubber sheet with a matrix. Most arrows move and spin. Eigenvectors are special arrows that only stretch or shrink — they never change direction. The eigenvalue is the stretch factor. If , the arrow triples in length. If , it flips around.
Eigenvalues and eigenvectors of a 2×2 matrix
- Find eigenvalues and eigenvectors of .
- Characteristic equation: . Eigenvalues: , .
- For : . Solution: .
- For : . Solution: .
- Check: ✓. ✓.
Diagonalization
A matrix is diagonalizable if , where the columns of are linearly independent eigenvectors and contains the corresponding eigenvalues.
Criterion: is diagonalizable iff it has linearly independent eigenvectors. A sufficient condition: distinct eigenvalues. Repeated eigenvalues may or may not permit diagonalization.
Computing powers: , and is trivial. This turns computing into three matrix multiplications.
Geometric interpretation: means that in the coordinate system of eigenvectors, the transformation acts as pure diagonal scaling. Diagonalisation finds the 'natural' coordinate system for .
Non-diagonalisable matrices (defective matrices) require Jordan normal form — beyond this course but worth knowing exists.
💡Explain it simply
Diagonalisation asks: is there a tilted coordinate system in which your transformation just stretches each axis independently? If you tilt your axes to align with the eigenvectors, the transformation becomes trivial. The tilt is ; the independent scalings are .
Geometric transformations and their eigenvalues
Rotation by (counterclockwise): . (area preserved, orientation maintained). Eigenvalues are complex for — no real eigenvectors, consistent with the fact that rotations spin everything.
Reflection across the -axis: . Eigenvalues (vectors on -axis are fixed) and (vectors on -axis are flipped). : orientation reversed.
Projection onto a line: eigenvalues (vectors on the line are fixed by projection) and (vectors orthogonal to the line collapse to zero). Idempotent: .
Shear : repeated eigenvalue with only one independent eigenvector. Not diagonalisable — it is a canonical example of a defective matrix.
💡Explain it simply
Each transformation type has its own eigenvalue signature. Rotations have complex eigenvalues (no real fixed directions). Reflections have eigenvalues (one axis is fixed, the other flips). Projections have eigenvalues and (one subspace is kept, the complementary subspace is erased).
Common Mistakes to Avoid
- Including the zero vector as an eigenvector. Eigenvectors must be nonzero by definition.
- Writing instead of . You cannot subtract a scalar from a matrix.
- Assuming every matrix is diagonalisable. A matrix with distinct eigenvalues is, but repeated eigenvalues can prevent it.
- Confusing eigenvalues (scalars ) and eigenvectors (nonzero vectors ). They are distinct objects that go together.
- Forgetting to check the trace/determinant as a sanity check after computing eigenvalues.