Definition

The nuclear norm of a matrix (also called the trace norm) is the sum of its singular values:

where:

The nuclear norm can be viewed as the matrix equivalent of the norm for vectors, promoting low-rank approximations and sparsity in singular values.

Key Properties

  1. Non-Negativity:
    , with equality if and only if .

  2. Relation to Frobenius Norm:
    The nuclear norm is always less than or equal to the Frobenius norm:

  3. Dual Norm:
    The nuclear norm is the dual norm of the Spectral Norm :

    where is the spectral norm.

  4. Unitary Invariance:
    The nuclear norm is invariant under orthogonal (or unitary) transformations:

    for any orthogonal/unitary matrices and .

  5. Convexity:
    The nuclear norm is a Convex Function, making it useful for optimization problems.


Applications

  1. Low-Rank Matrix Approximation:
    Minimizing the nuclear norm promotes low-rank solutions in problems like collaborative filtering and data compression.

  2. Compressed Sensing:
    Nuclear norm minimization is used for matrix completion problems, such as recovering missing entries in a matrix.

  3. Robust PCA:
    Separates a low-rank matrix from sparse noise in data analysis.

  4. Control Systems:
    Appears in optimal control problems for minimizing system ranks.


Examples

Example 1: Full Rank Matrix

Let:

The singular values of are , .
The nuclear norm:

Example 2: Low-Rank Matrix

Let:

The SVD of gives singular values (rank 1).
The nuclear norm:

Example 3: Relation to Frobenius Norm

Let:

Frobenius norm:

Nuclear norm:

In this case, , consistent with the properties.