- categories: Linear algebra, Matrix
Definition
A square matrix $A \in \mathbb{R}^{n \times n}$ (or $A \in \mathbb{C}^{n \times n}$) is positive definite if:
- $A$ is symmetric (or Hermitian if $A$ is complex): $A = A^T$ (or $A = A^*$).
- For all non-zero vectors $x \in \mathbb{R}^n$ (or $x \in \mathbb{C}^n$):
$$x^T A x > 0$$
(In the complex case, $x^* A x > 0$, where $x^*$ is the conjugate transpose of $x$.)
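As a minimal sketch, the two conditions of the definition can be checked numerically with NumPy (the helper name below is my own; for a symmetric matrix, positivity of the quadratic form is equivalent to all eigenvalues being positive):

```python
import numpy as np

def is_positive_definite(A, tol=1e-10):
    """Check the definition: symmetry plus positivity of the quadratic form."""
    A = np.asarray(A, dtype=float)
    if not np.allclose(A, A.T):  # symmetry: A == A^T
        return False
    # For symmetric A, x^T A x > 0 for all x != 0 iff all eigenvalues > 0.
    return bool(np.all(np.linalg.eigvalsh(A) > tol))

print(is_positive_definite([[2, 1], [1, 2]]))  # True
print(is_positive_definite([[1, 2], [2, 1]]))  # False (has a negative eigenvalue)
```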
Intuition
- A positive definite matrix $A$ corresponds to a quadratic form $f(x) = x^T A x$ that always produces positive values for non-zero $x$.
- It represents a multidimensional generalization of a convex parabola, where the energy (or quadratic form) reaches a unique minimum, often associated with optimization and stability.
In deep learning, a Hessian matrix (the matrix of second derivatives) of a loss function that is positive definite everywhere guarantees that the function is strictly convex, so any minimum it attains is its unique global minimum.
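As a small illustration of that claim (toy data of my own choosing), the least-squares loss $f(w) = \|Xw - y\|^2$ has Hessian $H = 2X^TX$, which is positive definite whenever $X$ has full column rank:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))   # toy design matrix (assumed full column rank)

# Hessian of the least-squares loss f(w) = ||Xw - y||^2 is constant: 2 X^T X.
H = 2 * X.T @ X
eigs = np.linalg.eigvalsh(H)

# All eigenvalues positive -> H is positive definite -> f is strictly
# convex and has a unique minimizer.
print(np.all(eigs > 0))
```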
Key Properties
- Eigenvalues:
All eigenvalues of $A$ are positive: $\lambda_i > 0$.
- Determinant and Leading Principal Minors:
- $\det(A) > 0$.
- All leading principal minors (determinants of the top-left $k \times k$ submatrices) are positive (Sylvester's criterion).
- Cholesky Decomposition:
A positive definite matrix can be decomposed as:
$$A = LL^T$$
where $L$ is a lower triangular matrix with positive diagonal entries.
- Matrix Inversion:
A positive definite matrix is always invertible, and its inverse is also positive definite.
- Norm Property:
$\|x\|_A = \sqrt{x^T A x}$ defines a norm on $\mathbb{R}^n$.
- Relation to Convexity:
- If $A$ is the Hessian of a scalar function $f$, i.e., $A = \nabla^2 f(x)$, and $A$ is positive definite everywhere, then $f$ is strictly convex.
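For instance, the Cholesky factorization above can be computed with NumPy (the matrix here is an illustrative choice; `np.linalg.cholesky` raises `LinAlgError` for non-positive-definite input, so it doubles as a definiteness test):

```python
import numpy as np

A = np.array([[4.0, 2.0],
              [2.0, 3.0]])  # symmetric, positive definite (minors: 4, 8)

# Cholesky factorization A = L L^T with L lower triangular.
L = np.linalg.cholesky(A)

print(np.allclose(L @ L.T, A))  # True: L L^T reconstructs A
print(np.all(np.diag(L) > 0))   # True: positive diagonal entries
```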
Applications in Deep Learning
- Optimization:
- Positive definite matrices arise in the Hessian of loss functions. If the Hessian is positive definite everywhere, the loss is strictly convex, and gradient descent (with a suitable step size) converges to the unique minimum.
- Convexity ensures every local minimum is a global minimum, making optimization simpler and more predictable.
- Energy Landscape:
In machine learning models, ensuring a positive definite structure means the “energy” (loss function) has a well-defined minimum, stabilizing training.
- Covariance Matrix:
Positive definite matrices often represent covariance matrices in probabilistic models and Gaussian distributions.
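As a sketch with synthetic data: a sample covariance matrix is always symmetric positive semi-definite, and with more samples than features (and non-degenerate data) it is positive definite, which is what allows a Gaussian density to use its inverse:

```python
import numpy as np

rng = np.random.default_rng(1)
data = rng.normal(size=(200, 3))     # 200 samples, 3 features (toy data)

# Sample covariance matrix (3 x 3); rowvar=False treats columns as features.
cov = np.cov(data, rowvar=False)

print(np.allclose(cov, cov.T))               # symmetric
print(np.all(np.linalg.eigvalsh(cov) > 0))   # positive definite here
```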
Examples
- Basic Example:
$$A = \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix}$$
$A$ is symmetric, and for any $x \neq 0$:
$$x^T A x = 2x_1^2 + 2x_1x_2 + 2x_2^2 = x_1^2 + (x_1 + x_2)^2 + x_2^2 > 0$$
- Non-positive Definite Example:
$$B = \begin{pmatrix} 1 & 2 \\ 2 & 1 \end{pmatrix}$$
Eigenvalues of $B$ are $\lambda_1 = 3$, $\lambda_2 = -1$. Since $\lambda_2 < 0$, $B$ is not positive definite.
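Sylvester's criterion from the properties above gives another quick numerical test; the two small matrices used here are illustrative choices of my own:

```python
import numpy as np

def sylvester_positive_definite(A):
    """Sylvester's criterion: a symmetric matrix is positive definite
    iff every leading principal minor (det of top-left k x k block) > 0."""
    A = np.asarray(A, dtype=float)
    return all(np.linalg.det(A[:k, :k]) > 0 for k in range(1, A.shape[0] + 1))

print(sylvester_positive_definite([[2, 1], [1, 2]]))  # True  (minors: 2, 3)
print(sylvester_positive_definite([[1, 2], [2, 1]]))  # False (det = -3)
```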