Renjith Thazhathethil

Determinants

Motivation: The Algebra of Linear Systems

The concept of the determinant arises naturally through the study of systems of linear equations. It serves as a fundamental algebraic tool that characterizes the existence and uniqueness of solutions. Specifically, the determinant acts as a "gatekeeper": its value dictates whether a system possesses a unique solution or behaves singularly.

The2×22 \times 2Case

To understand the origin of this quantity, let us consider a system of two linear equations with two variables:

ax+by=pcx+dy=q\begin{align*} ax + by &= p \\ cx + dy &= q \end{align*}

Our objective is to isolatexxby eliminatingyy. To achieve this, we multiply the first equation byddand the second bybb:

adx+bdy=dpbcx+bdy=bq\begin{aligned} adx + bdy &= dp \\ bcx + bdy &= bq \end{aligned}

Subtracting the second equation from the first yields:

(adbc)x=dpbq(ad - bc)x = dp - bq

This result reveals that a unique solution forxxexists if and only if the coefficient(adbc)(ad - bc)is non-zero. Ifadbc=0ad - bc = 0, the equations are linearly dependent, and the system fails to yield a unique solution. Because this specific quantity determines the fundamental nature of the system's solution, it is aptly named the determinant.

The3×33 \times 3Case

Generalizing this logic to a3×33 \times 3system introduces greater algebraic complexity but follows the same principle of elimination. Consider the following system:

a11x1+a12x2+a13x3=b1a21x1+a22x2+a23x3=b2a31x1+a32x2+a33x3=b3\begin{align} a_{11}x_1 + a_{12}x_2 + a_{13}x_3 &= b_1 \\ a_{21}x_1 + a_{22}x_2 + a_{23}x_3 &= b_2 \\ a_{31}x_1 + a_{32}x_2 + a_{33}x_3 &= b_3 \end{align}

Assuminga330a_{33} \neq 0, we can eliminatex3x_3from Equations (1) and (2) by performing the row operationsa33×(1)a13×(3)a_{33} \times (1) - a_{13} \times (3)anda33×(2)a23×(3)a_{33} \times (2) - a_{23} \times (3). This reduction results in a2×22 \times 2system inx1x_1andx2x_2:

(a11a33a31a13)x1+(a12a33a32a13)x2=b1a33b3a13(a21a33a31a23)x1+(a22a33a32a23)x2=b2a33b3a23\begin{align*} (a_{11}a_{33} - a_{31}a_{13})x_1 + (a_{12}a_{33} - a_{32}a_{13})x_2 &= b_1a_{33} - b_3a_{13} \\ (a_{21}a_{33} - a_{31}a_{23})x_1 + (a_{22}a_{33} - a_{32}a_{23})x_2 &= b_2a_{33} - b_3a_{23} \end{align*}

Applying the2×22 \times 2elimination logic to this reduced system, we isolatex1x_1. The resulting coefficient ofx1x_1is:

(a11a33a31a13)(a22a33a32a23)(a12a33a32a13)(a21a33a31a23)=a33(a11a22a33a11a32a23a12a21a33+a12a31a23+a13a21a32a13a22a31)\begin{align*} &(a_{11}a_{33} - a_{31}a_{13})(a_{22}a_{33} - a_{32}a_{23}) - (a_{12}a_{33} - a_{32}a_{13})(a_{21}a_{33} - a_{31}a_{23}) \\ &= a_{33}(a_{11}a_{22}a_{33} - a_{11}a_{32}a_{23} - a_{12}a_{21}a_{33} + a_{12}a_{31}a_{23} + a_{13}a_{21}a_{32} - a_{13}a_{22}a_{31}) \end{align*}

For a unique solution to exist, this expression must be non-zero. Sincea330a_{33} \neq 0, the condition simplifies to the following non-vanishing requirement:

a11a22a33a11a32a23a12a21a33+a12a31a23+a13a21a32a13a22a310\begin{align*} a_{11}a_{22}a_{33} - a_{11}a_{32}a_{23} - a_{12}a_{21}a_{33} + a_{12}a_{31}a_{23} + a_{13}a_{21}a_{32} - a_{13}a_{22}a_{31} \neq 0 \end{align*}

This quantity defines the determinant of the3×33 \times 3matrixAA:

A=(a11a12a13a21a22a23a31a32a33)A=\begin{pmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{pmatrix}

The explicit expansion ofdet(A)\det(A)is given by:

det(A)=(a11a22a33+a12a23a31+a13a21a32)(a11a23a32+a12a21a33+a13a22a31)\begin{align*} \det(A) = &(a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32}) \\ &- (a_{11}a_{23}a_{32} + a_{12}a_{21}a_{33} + a_{13}a_{22}a_{31}) \tag{4} \end{align*}

Fundamental Properties

The determinant possesses several key properties that facilitate both its calculation and its theoretical application. These properties can be verified directly from the algebraic expansion:

  1. Row/Column Interchanges: Interchanging any two rows or any two columns reverses the sign of the determinant.
  2. Identical Rows/Columns: If a matrix contain two identical rows or two identical columns, its determinant is zero.
  3. Scalar Multiplication: Multiplying a single row or column by a scalarccscales the entire determinant bycc.
  4. Row/Column Addition: Adding a multiple of one row (or column) to another row (or column) leaves the determinant unchanged.

General Form: The Leibniz Formula

If we examine the3×33 \times 3expansion in Equation (4), a clear structural pattern emerges:

  1. Each term is a product of exactly one element from each row and each column.
  2. The column indices(j,k,l)(j, k, l)constitute a permutation of the set{1,2,3}\{1, 2, 3\}.
  3. The expansion consists of3!=63! = 6terms, representing every possible permutation.

This observation generalizes to anyn×nn \times nmatrix. The determinant of ann×nn \times nmatrixAA, known as the Leibniz formula, is defined as the sum over all permutations:

det(A)=σSnsgn(σ)a1σ(1)a2σ(2)anσ(n)\begin{align*} \det(A) = \sum_{\sigma \in S_n} \text{sgn}(\sigma) a_{1\sigma(1)} a_{2\sigma(2)} \dots a_{n\sigma(n)} \end{align*}

where:

The Sign of a Permutation

The sign of a permutation is determined by its decomposition into disjoint cycles.

Note: A cycle(i1,i2,,ik)(i_1, i_2, \dots, i_k)describes a mapping wherei1i2iki1i_1 \to i_2 \to \dots \to i_k \to i_1. Elements not included in a cycle are considered fixed (cycles of length 1).

Letnnbe the order of the permutation andkkbe the total number of disjoint cycles (including fixed elements). The sign is defined as:

sgn(σ)=(1)nk\text{sgn}(\sigma) = (-1)^{n - k}

This formal definition provides a unified framework that consistently yields the2×22 \times 2and3×33 \times 3formulas derived earlier.

While the Leibniz formula generalizes the determinant to anyn×nn \times nmatrix, its formal justification requires demonstrating that it satisfies the essential property of the determinant: it is non-zero if and only if the matrix is invertible. This proof involves algebraic machinery that is beyond the scope of this course.

Historical Context

The concept of the determinant is unique in mathematical history because it predates the formal definition of a "matrix" by nearly two centuries. It was initially developed as a specialized algorithmic tool for solving linear systems.

Chronology of Development

Key Figure Contributions