Matrix Diagonalization — Definition, Formula & Examples

Matrix diagonalization is the process of rewriting a square matrix as a product

A = PDP^{-1}

, where

D

is a diagonal matrix of eigenvalues and

P

is a matrix whose columns are the corresponding eigenvectors. It simplifies computations like raising a matrix to a power.

An $n \times n$ matrix $A$ is said to be diagonalizable if there exists an invertible matrix $P$ and a diagonal matrix $D$ such that $A = PDP^{-1}$ . Equivalently, $A$ is diagonalizable if and only if it has $n$ linearly independent eigenvectors, which form the columns of $P$ , with the diagonal entries of $D$ being the corresponding eigenvalues.

Key Formula

A = PDP^{-1}

Where:

$A$ = The original n × n square matrix
$P$ = Invertible matrix whose columns are linearly independent eigenvectors of A
$D$ = Diagonal matrix whose diagonal entries are the eigenvalues of A

How It Works

First, find all eigenvalues of

A

by solving the characteristic equation

\det(A - \lambda I) = 0

. Next, for each eigenvalue

\lambda_i

, find its eigenvector(s) by solving

(A - \lambda_i I)\mathbf{x} = \mathbf{0}

. Arrange the eigenvectors as columns of

P

and place the matching eigenvalues along the diagonal of

D

. If you end up with

n

linearly independent eigenvectors for an

n \times n

matrix, the matrix is diagonalizable. A key payoff:

A^k = PD^kP^{-1}

, where

D^k

is trivial to compute since you just raise each diagonal entry to the

k

th power.

Worked Example

Problem: Diagonalize the matrix A = [[4, 1], [2, 3]].

Find eigenvalues: Solve det(A − λI) = 0.

\det\begin{pmatrix} 4-\lambda & 1 \\ 2 & 3-\lambda \end{pmatrix} = (4-\lambda)(3-\lambda) - 2 = \lambda^2 - 7\lambda + 10 = (\lambda - 5)(\lambda - 2) = 0

Find eigenvectors: For λ = 5: solve (A − 5I)x = 0, giving the eigenvector (1, 1). For λ = 2: solve (A − 2I)x = 0, giving the eigenvector (1, −2).

\lambda_1 = 5: \mathbf{v}_1 = \begin{pmatrix}1\\1\end{pmatrix}, \quad \lambda_2 = 2: \mathbf{v}_2 = \begin{pmatrix}1\\-2\end{pmatrix}

Assemble P and D: Place the eigenvectors as columns of P and eigenvalues on the diagonal of D.

P = \begin{pmatrix}1 & 1\\1 & -2\end{pmatrix}, \quad D = \begin{pmatrix}5 & 0\\0 & 2\end{pmatrix}

Answer: A = PDP⁻¹ where P = [[1, 1], [1, −2]] and D = [[5, 0], [0, 2]].

Why It Matters

Diagonalization turns repeated matrix operations into simple arithmetic on eigenvalues. In physics and engineering, it decouples systems of differential equations into independent scalar equations. It also underpins the principal component analysis (PCA) technique widely used in data science.

Common Mistakes

Mistake: Assuming every square matrix can be diagonalized.

Correction: A matrix is diagonalizable only if it has n linearly independent eigenvectors. For example, a 2×2 matrix with a repeated eigenvalue that yields only one independent eigenvector is not diagonalizable.