Article and note

A Series Learning through the MacWilliams Identity: Part 10 of 12

An Introduction to Heat Kernels and Trace Formulas through the MacWilliams Identity

Among the five proof systems for the MacWilliams identity, this note focuses on the proof that appears through heat kernels and trace formulas, and introduces kernels and traces on finite sets, heat operators and heat kernels on finite graphs, Hamming graphs, translation actions, and trace formulas on quotient spaces.

Published: Jun 14, 2026
Updated: Jun 14, 2026
Reading time: 38 min (about 8,278 words)

Tags: coding theory, MacWilliams identity, heat kernel, trace formula, Hamming graph, spectral graph theory, finite fields, weight enumerator, expository note

Introduction

One of the fundamental theorems in coding theory is the MacWilliams identity. Let $E$ be the coordinate set, let $n \coloneqq \card{E}$ , and, for a linear code $C \leq \F_{q}^{E}$ over the finite field $\F_{q}$ , consider its dual code

C^{\perp} \coloneqq \{ u \in \F_{q}^{E} : u \cdot c = 0 \text{ for all } c \in C \}

where

u \cdot c = \sum_{e \in E} u_{e} c_{e}.

For a codeword $c \in \F_{q}^{E}$ , write its support and Hamming weight as

\supp(c) \coloneqq \{ e \in E : c_{e} \neq 0 \}, \qquad \wt(c) \coloneqq \card{\supp(c)}

respectively. Define the weight enumerator of the linear code $C$ by

W_{C}(X, Y) = \sum_{c \in C} X^{n - \wt(c)} Y^{\wt(c)}.

The MacWilliams identity states that the weight enumerator of the dual code can be computed from the weight enumerator of $C$ as follows:

W_{C^{\perp}}(X, Y) = \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr).
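To make the statement concrete, the following Python sketch checks the identity by brute force on one small example. The code, its generator, and all function names are illustrative assumptions rather than part of the note, and the arithmetic is done modulo a prime $q$ so that $\F_{q}$ can be modelled by integers mod $q$. Evaluating both sides at a grid of integer points is only a sanity check, not a proof.

```python
from itertools import product

def span(generators, q, n):
    """All F_q-linear combinations of the generator rows (q prime)."""
    words = set()
    for coeffs in product(range(q), repeat=len(generators)):
        word = tuple(
            sum(a * g[i] for a, g in zip(coeffs, generators)) % q
            for i in range(n)
        )
        words.add(word)
    return words

def dual(code, q, n):
    """All u in F_q^n with u . c = 0 for every c in the code."""
    return {
        u for u in product(range(q), repeat=n)
        if all(sum(a * b for a, b in zip(u, c)) % q == 0 for c in code)
    }

def weight(word):
    """Hamming weight: the number of non-zero coordinates."""
    return sum(1 for a in word if a != 0)

def enumerator(code, n, X, Y):
    """Evaluate W_C(X, Y) at integer values X, Y."""
    return sum(X ** (n - weight(c)) * Y ** weight(c) for c in code)

q, n = 3, 4
C = span([(1, 2, 0, 1)], q, n)   # hypothetical ternary code of dimension 1
C_perp = dual(C, q, n)

def macwilliams_holds(X, Y):
    """Compare |C| * W_{C^perp}(X, Y) with W_C(X + (q-1)Y, X - Y) exactly."""
    lhs = len(C) * enumerator(C_perp, n, X, Y)
    rhs = enumerator(C, n, X + (q - 1) * Y, X - Y)
    return lhs == rhs

print(len(C), len(C_perp), macwilliams_holds(2, 1))
```

Multiplying through by $\card{C}$ keeps everything in integers, so the comparison is exact rather than approximate.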

In this series, we use proofs of the MacWilliams identity as a guide to neighbouring areas and concepts. The series is therefore aimed at readers with basic familiarity with elementary coding theory, namely with

what a (finite) field is,
what a linear code over a finite field is,
what the Hamming weight is,
what the dual code is.

(It is not necessary to know a proof of the MacWilliams identity.)

This note does not presuppose Parts 1–9. In the series as a whole, we compare several proofs of the MacWilliams identity, but the content of the previous parts is not needed in order to read this note. The necessary linear operators, kernels, traces, heat operators and heat kernels on finite graphs, Hamming graphs, translation actions, and character eigenfunctions are introduced in the text to the extent needed.

In this series, we view the proof methods for the MacWilliams identity as falling roughly into the following five families. However, this classification is a map of the whole series, and is not required for reading the proof in this note:

Fourier, character, and Poisson-type proofs.
Möbius inversion, lattice-theoretic, shortening-puncturing proofs.
Orthogonal-polynomial and association-scheme proofs.
Matroid and Tutte-polynomial proofs.
Moment and double-counting proofs.

The proof treated in this note is written, on the surface, in the language of

Hamming graphs, heat operators, Markov operators, and trace formulas.

Placed among the five families above, it belongs to the Fourier, character, and Poisson family. This is because, when the heat operator on the Hamming graph is diagonalised, characters of the finite abelian group $\F_{q}^{E}$ eventually appear as eigenfunctions. That said, character theory itself is not the main actor in this note. The aim here is to use a proof of the MacWilliams identity as a guide to heat operators, heat kernels, and trace formulas.

Roughly speaking, a heat operator is an operator that describes how heat spreads on a graph or a space. Its matrix entries are called the heat kernel. On a finite set, a heat operator is a finite matrix, and the heat kernel is its matrix entry. A trace formula is an identity obtained by computing the trace of a single operator in two ways. In one computation, the operator is decomposed into eigenspaces and the eigenvalues are summed. In the other computation, the diagonal entries of the kernel are summed directly. The equality of these two computations gives a non-trivial identity. The trace formulas considered in this note are not deep trace formulas involving infinite-dimensional analysis or convergence issues, but finite trace formulas obtained by computing the trace of a finite-dimensional matrix in two ways.

The MacWilliams identity considered in this note appears as the following trace formula. Write $\Kop_{z}$ for the heat operator on the Hamming graph on $V = \F_{q}^{E}$ , and write its kernel as $\Kop_{z}(x,y)$ . We take the trace of $\Kop_{z}$ on the quotient space $V/C$ by the translation action of the code $C$ . More precisely, we are not defining a new quotient graph or a heat operator on that quotient graph separately. Rather, we identify the function space $\C^{V/C}$ on the quotient set $V/C$ with the space $\Hspace^{C}$ of $C$ -invariant functions on $V$ , and compute the trace of the restriction $\res{\Kop_{z}}{\Hspace^{C}}$ of the original heat operator $\Kop_{z}$ . Computing this trace from the spectral side produces the weight distribution of $C^{\perp}$ . On the other hand, averaging along the translation action and then summing the diagonal entries produces the weight distribution of $C$ . Comparing the two gives the MacWilliams identity.

The flow of the note is as follows.

operators on finite sets → kernels and traces → heat operators and heat kernels on finite graphs → heat operators and heat kernels on Hamming graphs → traces on quotient spaces → the MacWilliams identity

The key point of the proof, stated in advance, is as follows. In one coordinate, the zero translation produces $1+(q-1)z$, and a non-zero translation produces $1-z$. In several coordinates, this zero/non-zero distinction occurs coordinate by coordinate, and so records the Hamming weight. Averaging further over the elements of $C$ produces the weight enumerator of $C$. On the other hand, from the viewpoint of eigenfunctions, the characters that remain on the quotient space are precisely the $C$-invariant characters, and these correspond exactly to $C^{\perp}$.

For the spectral theory of finite graphs, Chung [Chu97] and Godsil–Royle [GR01] are standard references. For heat kernels on graphs, see for instance Chung–Yau [CY99]. For Fourier analysis on finite groups and its applications to graph and coding theory, see also Terras [Ter99]. For the classical treatment of the MacWilliams identity in coding theory, see MacWilliams–Sloane [MS77]. In this note, we do not develop all this general theory, but instead extract only the part needed for the ordinary Hamming-weight MacWilliams identity for linear codes over finite fields.

Function Spaces, Kernels, and Traces on Finite Sets

Before treating heat operators and heat kernels, we prepare the language for viewing linear operators on finite sets as matrices. All spaces considered here are finite-dimensional, so no analytic convergence issues arise.

For a finite set $\Omega$ , let $\C^{\Omega}$ be the set of all complex-valued functions on $\Omega$ . In this note, an italic $C$ denotes a code, whereas the blackboard bold $\C$ denotes the field of complex numbers. For each $x \in \Omega$ , define $\delta_{x} \in \C^{\Omega}$ by

\delta_{x}(y) \coloneqq \begin{cases} 1, & y = x,\\ 0, & y \neq x \end{cases}

Then $\{ \delta_x : x \in \Omega \}$ is a basis of $\C^{\Omega}$ .

A linear operator $T \colon \C^{\Omega} \to \C^{\Omega}$ can be written as a matrix whose rows and columns are indexed by $\Omega$ . In this note, we write its matrix entry as $K_{T}(x, y)$ and call it the kernel of $T$ . Thus

(Tf)(x) = \sum_{y \in \Omega} K_{T}(x, y) f(y).

On a finite set, a kernel is simply another name for a matrix entry. However, when we later call it a heat kernel, we think of this matrix entry as the amount of heat transmitted from the point $y$ to the point $x$.

The basic convention in this note is that operators act on functions. As a matrix entry, $K_{T}(x,y)$ can be read as the contribution from the column index $y$ to the row index $x$. On the other hand, when the same matrix is read intuitively as an expectation operator for a Markov process, it is read as the probability of moving from the current point $x$ to the next point $y$. In general, the transpose matrix appears in the convention where one acts on probability distributions. The heat kernels actually used in this note are symmetric, so both readings use the same numerical matrix, and only the interpretation of the direction differs.

The word kernel appears in two senses in this note. One is the matrix-entry sense just defined, written as $K_{T}(x,y)$ or $\Kop_{z}(x,y)$. The other is the linear-algebraic subspace sent to zero, which is always written as $\Ker(T)$ to avoid confusion.

Also note the following notation. $K_{T}$ denotes the kernel of a general operator $T$. The symbol $K_{q}$ appearing later denotes the complete graph on $q$ vertices, and $\Kop_{z}$ denotes the heat operator on the Hamming graph. Its heat kernel is written $\Kop_{z}(x,y)$. We use the same letter for the operator and its kernel, but when $(x,y)$ is attached it denotes a matrix entry.

Definition 2.1.

The trace of a linear operator $T \colon \C^{\Omega} \to \C^{\Omega}$ is defined by

\Tr(T) \coloneqq \sum_{x \in \Omega} K_{T}(x, x).

This is the usual trace of a matrix. That is, it is the sum of the diagonal entries. If $T$ is diagonalisable and its eigenvalues, counted with multiplicity, are $\lambda_{1}, \lambda_{2}, \dots, \lambda_{m}$ , then

\Tr(T) = \lambda_{1} + \lambda_{2} + \dots + \lambda_{m}.

Thus the trace has at least two viewpoints.

summing diagonal entries ⇔ summing eigenvalues

Using these two viewpoints for the same operator is the basic idea of a trace formula.

In this note, we also use traces on subspaces. For that purpose we prepare a simple lemma using projections.

Lemma 2.2.

Let $S \subseteq \C^{\Omega}$ be a subspace, and let $P \colon \C^{\Omega} \to \C^{\Omega}$ be a projection onto $S$ . That is, assume $P^{2} = P$ and $\Image(P) = S$ . Suppose that the linear operator $T$ commutes with $P$ . Then $\Tr(\res{T}{S})=\Tr(PT)$ .

Proof

Since $P$ is a projection, $\C^{\Omega}$ decomposes as

\C^{\Omega} = \Image(P) \oplus \Ker(P).

Since $T$ commutes with $P$ , the operator $T$ preserves both $\Image(P)$ and $\Ker(P)$ . With respect to this decomposition, the matrix of $T$ is block diagonal, and $P$ has the form

\begin{pmatrix} I & 0\\ 0 & 0 \end{pmatrix}.

Therefore the trace of $PT$ is equal to the trace of the block of $T$ on $\Image(P) = S$ . Thus $\Tr(PT) = \Tr(\res{T}{S})$ .
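As a small sanity check of Lemma 2.2, the following Python sketch works on the four-point set $\{0,1,2,3\}$ with addition mod $4$. The projection `P`, the operator `T`, and all helper names are illustrative choices made for this example only. Here `P` averages over the shift by $2$, and `T` is a circulant matrix, so it commutes with `P`.

```python
from fractions import Fraction

SIZE = 4  # Omega = {0, 1, 2, 3} with addition mod 4 (an illustrative choice)

def shift(k):
    """Matrix of f -> f(x - k): row x has a 1 in column x - k."""
    return [[Fraction(int((x - k) % SIZE == y)) for y in range(SIZE)]
            for x in range(SIZE)]

def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(s, A):
    return [[s * a for a in row] for row in A]

def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(SIZE)) for j in range(SIZE)]
            for i in range(SIZE)]

def trace(A):
    return sum(A[i][i] for i in range(SIZE))

def apply_matrix(A, f):
    """(Af)(x) = sum_y A(x, y) f(y)."""
    return [sum(A[x][y] * f[y] for y in range(SIZE)) for x in range(SIZE)]

identity = shift(0)
# Projection onto S = {f : f(x + 2) = f(x)}.
P = scale(Fraction(1, 2), add(identity, shift(2)))
# A circulant operator; circulant matrices commute with every shift, hence with P.
T = add(scale(2, identity), add(shift(1), shift(3)))

# Basis of S: e_0 = delta_0 + delta_2, e_1 = delta_1 + delta_3.
basis = [[1, 0, 1, 0], [0, 1, 0, 1]]

# T e_j lies in S, so its coefficient on e_i is its value at the point i (i = 0, 1).
restricted = [[apply_matrix(T, basis[j])[i] for j in range(2)] for i in range(2)]
trace_restricted = restricted[0][0] + restricted[1][1]
trace_PT = trace(mul(P, T))

print(trace_restricted, trace_PT)
```

Both numbers come out equal, while the trace of `T` on the whole space is twice as large, which illustrates that the restriction counts only the degrees of freedom inside $S$.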

We will later apply this lemma to the space of functions invariant under translations by $C$ . This is because we identify functions on the quotient space $\F_{q}^{E}/C$ with $C$ -invariant functions on $\F_{q}^{E}$ , and compute traces using the projection onto that space.

Heat Operators and Heat Kernels on Finite Graphs

We next introduce the idea of heat operators and heat kernels on finite graphs. Rather than developing general graph theory, we keep in mind the finite regular graphs needed in this note, especially Hamming graphs.

Let $\Omega$ be the vertex set of a finite graph. The function space on the graph is $\C^{\Omega}$ . The graph Laplacian is an operator that measures the difference between a function and its surrounding values. The standard combinatorial Laplacian is defined by $L \coloneqq D - A$ . Here $A$ is the adjacency matrix, and $D$ is the diagonal matrix whose diagonal entries are the degrees. For a regular graph, $D$ is a constant multiple of the identity matrix.

The finite-graph version of the heat equation can be written using the matrix exponential as $H_{t} = \exp(-tL)$ . This $H_{t}$ is called the heat operator, and its kernel $H_{t}(x, y)$ is called the heat kernel. On a finite set, this is just an ordinary matrix exponential. Here the matrix exponential is the matrix defined by

\exp(-tL) = \sum_{r=0}^{\infty} \frac{(-tL)^{r}}{r!}.

In the situations actually used in this note, the eigenspace decomposition gives concrete formulas, so there is no need to compute this series directly.

In this note, however, we use a scalar multiple of the Laplacian in order to simplify the calculations. Multiplying the Laplacian by a positive constant $\alpha$ amounts to rescaling time in the heat operator: replacing $L$ by $\alpha L$ gives $\exp(-t\alpha L)=\exp(-(\alpha t)L)$. Thus the structure of eigenfunctions and trace formulas is unchanged; only the time parameter appearing in the eigenvalues is reparametrised.

In this note, we use the parameter $z = e^{-t}$ instead of time $t$. Strictly speaking, for finite $t \geq 0$ we have $0 < z \leq 1$, and the value $z=0$ corresponds to the limit $t \to \infty$. The time $t = 0$ corresponds to $z = 1$, and then the heat operator is the identity operator. As $t$ becomes larger, $z$ approaches $0$, and the heat operator moves towards averaging. In the one-coordinate case, at the limiting value $z = 0$ the operator $M_{z}$ defined below becomes the averaging projection $\Pi_{0}$. Thus $z$ can be viewed as a parameter measuring how much of the original value is retained.

The heat kernels and traces actually computed below are all expressed as polynomials in $z$. Therefore substituting $z=0$ also makes sense as a polynomial. At the stage where the MacWilliams identity is obtained, $z$ is read not as an analytic time parameter but as a formal variable. Thus we first understand the operators as heat operators for $0 \leq z \leq 1$, and in the end read the same formulas as polynomial identities. The later substitution $z = Y/X$ and homogenisation use precisely this polynomial viewpoint.

Heat operators and heat kernels on finite graphs have the following two basic viewpoints.

Spectral side: Decompose $H_{t}$ into eigenspaces and sum the eigenvalues $e^{-t\lambda}$ .
Geometric side: Sum the diagonal entries $H_{t}(x, x)$ of the heat kernel over the vertices $x$ .

Both compute the same trace $\Tr(H_{t})$ . In this note, we also consider the trace of an operator obtained by composing a heat operator with a translation action. This operation of composing with a translation and then taking the trace is what extracts the weight distribution of the code $C$ .

The Heat Operator and Heat Kernel of the One-Coordinate Complete Graph

A Hamming graph is the Cartesian product of complete graphs, one for each coordinate. Two multi-coordinate vertices are adjacent precisely when they differ in exactly one coordinate; in that coordinate the two values are distinct, and hence adjacent in the one-coordinate complete graph. We therefore begin with the one-coordinate complete graph.

Let the one-coordinate vertex set be $\F_{q}$ , and let the function space be $U \coloneqq \C^{\F_{q}}$ . Define the averaging projection onto the one-dimensional subspace of $U$ consisting of constant functions by

\Pi_{0} f \coloneqq \frac{1}{q}\left(\sum_{a \in \F_{q}} f(a) \right) \one.

Here $\one$ is the function that is identically $1$ . The kernel of this projection in the linear-algebraic sense, namely the subspace on which $\Pi_{0}f=0$ , is

U_{1} = \left\{ f \in U : \sum_{a \in \F_{q}} f(a) = 0 \right\}.

Thus $U = \C \one \oplus U_{1}$ . Here the subscript $1$ indicates that this is the space corresponding to the eigenvalue $1$ for the one-coordinate Laplacian $\Delta_{1}$ defined immediately below; it does not mean that $U_{1}$ is one-dimensional. In fact, $\dim_{\C} U_{1}=q-1$ .

In this note, we define the one-coordinate Laplacian by

\Delta_{1} \coloneqq \Id - \Pi_{0}.

This is a scalar multiple of the combinatorial Laplacian of the complete graph $K_{q}$. Indeed, let $J$ be the $q \times q$ matrix all of whose entries are $1$. The adjacency matrix of $K_{q}$ is $J - I$, every vertex has degree $q - 1$, and so the combinatorial Laplacian is

(q - 1)I - (J - I) = qI - J.

On the other hand, since $\Pi_{0} = J/q$ ,

\Delta_{1} = I - J/q = \frac{1}{q}(qI - J).

Thus we are simply using the usual Laplacian multiplied by $1/q$ .

Writing $z = e^{-t}$ , the one-coordinate heat operator is

M_{z} \coloneqq \exp(-t\Delta_{1}) = \Pi_{0} + z(\Id - \Pi_{0}).

It acts with eigenvalue $1$ in the constant direction and with eigenvalue $z$ in the direction where the sum is $0$ .
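The closed form follows because $\Delta_{1}$ is itself a projection, but it can also be checked numerically against the defining power series. The sketch below is a floating-point illustration only: the values of `q` and `t`, the truncation at 40 terms, and the helper names are assumptions made for the example.

```python
import math

def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def exp_series(A, terms=40):
    """Truncated power series sum_{r < terms} A^r / r!."""
    n = len(A)
    result = [[0.0] * n for _ in range(n)]
    power = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]  # A^0
    for r in range(terms):
        for i in range(n):
            for j in range(n):
                result[i][j] += power[i][j] / math.factorial(r)
        power = mat_mul(power, A)
    return result

q, t = 4, 0.7            # illustrative values
z = math.exp(-t)
identity = [[1.0 if i == j else 0.0 for j in range(q)] for i in range(q)]
pi0 = [[1.0 / q] * q for _ in range(q)]

# The matrix -t * Delta_1 with Delta_1 = Id - Pi_0.
minus_t_delta1 = [[-t * (identity[i][j] - pi0[i][j]) for j in range(q)]
                  for i in range(q)]

series = exp_series(minus_t_delta1)
closed_form = [[pi0[i][j] + z * (identity[i][j] - pi0[i][j]) for j in range(q)]
               for i in range(q)]

# Largest entrywise difference between the series and Pi_0 + z (Id - Pi_0).
max_gap = max(abs(series[i][j] - closed_form[i][j])
              for i in range(q) for j in range(q))
print(max_gap)
```

The gap is at the level of rounding error, consistent with $\exp(-t\Delta_{1}) = \Pi_{0} + e^{-t}(\Id - \Pi_{0})$.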

Lemma 4.1.

The kernel of the one-coordinate heat operator $M_{z}$ is given by

M_{z}(a, b) = \begin{cases} \dfrac{1 + (q - 1)z}{q}, & a = b,\\[6pt] \dfrac{1 - z}{q}, & a \neq b. \end{cases}

Proof

The kernel of $\Pi_{0}$ is $1/q$ for all $a$ and $b$ . The kernel of $\Id$ is $1$ when $a = b$ and $0$ when $a \neq b$ . Therefore, writing $M_{z} = z\Id + (1 - z)\Pi_{0}$ , when $a = b$ we get

z + \frac{1 - z}{q} = \frac{1 + (q - 1)z}{q},

and when $a \neq b$ we get

\frac{1 - z}{q}.
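Lemma 4.1 can be confirmed with exact rational arithmetic. In the sketch below, the vertices of the one-coordinate graph are simply labelled $0, \dots, q-1$ (only equality of labels matters for the kernel), and the values of `q` and `z` are arbitrary illustrative choices.

```python
from fractions import Fraction

def heat_kernel_one_coordinate(q, z):
    """Matrix of M_z = z*Id + (1 - z)*Pi_0 on C^{F_q}, with exact fractions."""
    return [[(z if a == b else 0) + (1 - z) * Fraction(1, q) for b in range(q)]
            for a in range(q)]

q, z = 5, Fraction(2, 7)          # illustrative values
M = heat_kernel_one_coordinate(q, z)

diagonal = (1 + (q - 1) * z) / q  # claimed value for a = b
off_diagonal = (1 - z) / q        # claimed value for a != b

matches_lemma = all(
    M[a][b] == (diagonal if a == b else off_diagonal)
    for a in range(q) for b in range(q)
)
row_sums = [sum(row) for row in M]
print(matches_lemma, row_sums)
```

Each row sums to $1$, which matches the reading of $M_{z}$ as an operator that preserves constant functions.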

This formula also has a probabilistic reading. The operator $M_{z}$ is the operation that keeps the current value with probability $z$ , and replaces it by a uniformly chosen value of $\F_{q}$ with probability $1 - z$ . In this note, we view $M_{z}$ not as acting on probability distributions, but as an operator acting on functions, that is, observables. In other words, it returns the average value of $f$ when the next state is chosen at random by this rule. In particular, when $0 \leq z \leq 1$ , the operator $M_{z}$ sends non-negative functions to non-negative functions and preserves constant functions, so it can be read as a Markov operator on the function side. If the same symmetric matrix is made to act on distributions, it can also be read as an operator sending probability distributions to probability distributions. In the convention where one acts on distributions, the transpose matrix appears, but the heat kernel in the present case is symmetric, so we are looking at the same numerical matrix. However, in this note we use it not for the probabilistic interpretation but for trace computations.

The most important point in one coordinate is the trace after composing with a translation. For $a \in \F_{q}$ , define the translation operator $\tau_{a} \colon U \to U$ by

(\tau_{a} f)(x) \coloneqq f(x - a).

With this definition, $\tau_{a}$ acts as pullback on functions. On basis vectors, $\tau_{a} \delta_{b} = \delta_{a + b}$ . The points themselves are moved by $x \mapsto x+a$ , but because the action on functions is by pullback, the formula contains $x-a$ . Later we write $C$ -invariance as $f(x+c)=f(x)$ . Since $C$ is an additive subgroup, as $c$ runs through all of $C$ , so does $-c$ . Thus the condition $f(x+c)=f(x)$ is equivalent to the condition $\tau_{c}f=f$ .

Lemma 4.2.

For every $a \in \F_{q}$ ,

\Tr(\tau_{a} M_{z}) = \begin{cases} 1 + (q - 1)z, & a = 0,\\ 1 - z, & a \neq 0 \end{cases}

holds.

Proof

Write the kernel of $M_{z}$ as $M_{z}(x, y)$ . The kernel of $\tau_{a} M_{z}$ is

(\tau_{a} M_{z})(x, y) = M_{z}(x - a, y).

Hence

\Tr(\tau_{a} M_{z}) = \sum_{x \in \F_{q}} M_{z}(x - a, x).

If $a = 0$ , all these entries are diagonal entries, so

\sum_{x \in \F_{q}} M_{z}(x, x) = q \cdot \frac{1 + (q - 1)z}{q} = 1 + (q - 1)z.

On the other hand, if $a \neq 0$ , then $x - a \neq x$ for every $x$ , so all entries are off-diagonal entries. Therefore

\sum_{x \in \F_{q}} M_{z}(x - a, x) = q \cdot \frac{1 - z}{q} = 1 - z.
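The two cases of Lemma 4.2 can be seen directly by summing the shifted diagonal. This sketch assumes that $q$ is prime, so that the additive group of $\F_{q}$ is the integers mod $q$; the values of `q` and `z` and the function names are illustrative.

```python
from fractions import Fraction

def kernel(q, z, a, b):
    """One-coordinate heat kernel M_z(a, b) from Lemma 4.1."""
    return (1 + (q - 1) * z) / q if a == b else (1 - z) / q

def trace_translated(q, z, a):
    """Tr(tau_a M_z) = sum_x M_z(x - a, x), with arithmetic mod a prime q."""
    return sum(kernel(q, z, (x - a) % q, x) for x in range(q))

q, z = 7, Fraction(1, 3)          # illustrative values
traces = [trace_translated(q, z, a) for a in range(q)]
print(traces)
```

Only the entry for $a = 0$ reads the diagonal; every other translation reads $q$ off-diagonal entries and gives the same value $1 - z$.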

This lemma records the source of the change of variables in the MacWilliams identity. In the ordinary trace, one sums $M_{z}(x,x)$ over $x$ . After composing with $\tau_{a}$ , however, one reads $M_{z}(x-a,x)$ as the diagonal entry. In other words, translation shifts the diagonal entries of the heat kernel by $a$ . The zero translation produces $1 + (q - 1)z$ , and a non-zero translation produces $1 - z$ . This is because, when $a=0$ , only diagonal entries are read, whereas when $a\neq 0$ , only off-diagonal entries are read. In several coordinates, this distinction occurs coordinate by coordinate, and the Hamming weight is recorded in the trace. After later substituting $z = Y/X$ and homogenising, this becomes

X + (q - 1)Y, \qquad X - Y.

The Heat Operator and Heat Kernel of the Hamming Graph

We now move to several coordinates. Put $V \coloneqq \F_{q}^{E}$ . This is the vertex set of the Hamming graph. Define the Hamming distance between two vertices $x, y \in V$ by

\dist(x, y) \coloneqq \wt(x - y).

In the Hamming graph, two vertices at distance $1$ are joined by an edge. This graph is the Cartesian product of $n$ copies of the one-coordinate complete graph.

Write the function space as $\Hspace \coloneqq \C^{V}$ . Using the one-coordinate space $U = \C^{\F_{q}}$ , this can be viewed as

\Hspace \cong \bigotimes_{e \in E} U.

For each coordinate $e \in E$ , let $\Delta_{e}$ be the operator that applies the one-coordinate Laplacian $\Delta_{1}$ only in the $e$ coordinate and applies the identity operator in all other coordinates. Define the Laplacian on the Hamming graph by

\Delta \coloneqq \sum_{e \in E} \Delta_{e}.

This is the usual combinatorial Laplacian of the Hamming graph multiplied by $1/q$ . Indeed, the usual combinatorial Laplacian in the $e$ -coordinate direction is $q\Delta_{e}$ . Thus, if $L_{\mathrm{Ham}}$ denotes the usual combinatorial Laplacian of the whole Hamming graph, then

L_{\mathrm{Ham}} = \sum_{e \in E} q\Delta_{e} = q\Delta.

Therefore the $\Delta$ used in the text is the usual combinatorial Laplacian multiplied by $1/q$ , with the same normalisation as in the one-coordinate case. Since the operators $\Delta_{e}$ commute with one another, putting $z = e^{-t}$ gives

\exp(-t\Delta) = \prod_{e \in E} \exp(-t\Delta_{e}) \cong \bigotimes_{e \in E} \exp(-t\Delta_{1}) = \bigotimes_{e \in E} M_{z}.

This tensor product is therefore the natural heat operator on the Hamming graph. Under this identification, we define the heat operator on the Hamming graph by

\Kop_{z} \coloneqq \bigotimes_{e \in E} M_{z}.

This applies the same one-coordinate heat operator $M_{z}$ independently in all coordinates. Although we use tensor-product notation here, the only concrete meaning needed in this note is the following. For $x=(x_{e})_{e \in E}$ and $y=(y_{e})_{e \in E}$ , set

\Kop_{z}(x,y) = \prod_{e \in E} M_{z}(x_{e},y_{e})

and define its action on a function $f \in \C^{V}$ by

(\Kop_{z}f)(x) = \sum_{y \in V} \Kop_{z}(x,y) f(y).

Thus even without knowing the general theory of tensor products, all trace computations below can be read using only this product formula. In particular, when $0 \leq z \leq 1$ , $\Kop_{z}$ can be viewed as a Markov operator that independently, in each coordinate, either keeps the value or replaces it by a uniformly chosen value. This is an explanation as an expectation operator acting on functions, that is, observables. The same symmetric matrix can also be made to act on distributions. The probabilistic interpretation here is for intuition; in the proof, we use trace computations and polynomial identities.

Lemma 5.1.

The kernel of $\Kop_{z}$ is given by

\Kop_{z}(x,y) = q^{-n} \bigl( 1 + (q - 1)z \bigr)^{n - \dist(x, y)} (1 - z)^{\dist(x, y)}. \tag{5.1}

Proof

By the product formula just above, the kernel of $\Kop_{z}$ is the product of the one-coordinate kernels. Namely,

\Kop_{z}(x, y) = \prod_{e \in E} M_{z}(x_{e}, y_{e}).

A coordinate with $x_{e} = y_{e}$ contributes $\dfrac{1 + (q - 1)z}{q}$ , while a coordinate with $x_{e} \neq y_{e}$ contributes $\dfrac{1 - z}{q}$ . Since the number of coordinates in which $x$ and $y$ differ is $\dist(x, y)$ , we obtain (5.1).

This formula shows that $\Kop_{z}(x, y)$ depends not on the particular values of $x$ and $y$ , but only on the Hamming distance $\dist(x,y)$ . Thus $\Kop_{z}$ is an operator compatible with the symmetries of the Hamming graph. In the language of association schemes, $\Kop_{z}$ belongs to the Bose–Mesner algebra of the Hamming scheme. However, in this note we do not use the general theory of association schemes; we proceed only by computations with heat operators, heat kernels, and traces.

Translations on $V$ are defined in the same way. For $a \in V$ , define $\tau_{a} \colon \Hspace \to \Hspace$ by

(\tau_{a} f)(x) = f(x - a).

This is the tensor product of the one-coordinate translations.

Proposition 5.2.

For every $a \in V$ ,

\Tr(\tau_{a} \Kop_{z}) = \bigl( 1 + (q - 1)z \bigr)^{n - \wt(a)}(1 - z)^{\wt(a)} \tag{5.2}

holds.

Proof

Using the kernel of $\Kop_{z}$ , we have

\Tr(\tau_{a} \Kop_{z}) = \sum_{x \in V} \Kop_{z}(x - a, x).

The coordinates in which $x - a$ and $x$ differ are precisely the coordinates with $a_{e} \neq 0$ . Therefore $\dist(x - a, x) = \wt(a)$ , and this is independent of $x$ . By Lemma 5.1, the contribution from each $x$ is

q^{-n} \bigl( 1 + (q - 1)z \bigr)^{n - \wt(a)}(1 - z)^{\wt(a)}.

Since $V$ has $q^{n}$ elements, summing gives (5.2).
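Proposition 5.2 can be verified exhaustively for small parameters by summing the product kernel along the shifted diagonal. The sketch assumes a prime $q$ (so that coordinates can be taken mod $q$) and uses illustrative values for `q`, `n`, and `z`; the function names are not from the note.

```python
from fractions import Fraction
from itertools import product

def one_coordinate_kernel(q, z, a, b):
    return (1 + (q - 1) * z) / q if a == b else (1 - z) / q

def hamming_kernel(q, z, x, y):
    """K_z(x, y) as the product of the one-coordinate kernels."""
    value = Fraction(1)
    for xe, ye in zip(x, y):
        value *= one_coordinate_kernel(q, z, xe, ye)
    return value

def trace_translated(q, n, z, a):
    """Tr(tau_a K_z) = sum over x in V of K_z(x - a, x), for prime q."""
    total = Fraction(0)
    for x in product(range(q), repeat=n):
        shifted = tuple((xe - ae) % q for xe, ae in zip(x, a))
        total += hamming_kernel(q, z, shifted, x)
    return total

def predicted(q, n, z, a):
    """Right-hand side of (5.2)."""
    w = sum(1 for ae in a if ae != 0)
    return (1 + (q - 1) * z) ** (n - w) * (1 - z) ** w

q, n, z = 3, 3, Fraction(2, 5)    # illustrative values
checks = {a: trace_translated(q, n, z, a) == predicted(q, n, z, a)
          for a in product(range(q), repeat=n)}
print(all(checks.values()))
```

The check runs over all $q^{n}$ translations, so every possible weight from $0$ to $n$ is covered.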

The meaning of this proposition is transparent. If we take the trace of the heat operator $\Kop_{z}$ after composing with the translation $a$ , the zero and non-zero coordinates of $a$ produce different factors. Thus the Hamming weight of $a$ is recorded in the trace. This is why the weight enumerator of $C$ appears later.

Traces on Quotient Spaces

Next, consider the action of the code $C \leq V = \F_{q}^{E}$ by translations. As an additive group, $C$ acts on $V$ . Write the quotient set as $V/C$ . Functions on the quotient set can be identified with $C$ -invariant functions on $V$ .

Definition 6.1.

The subspace

\Hspace^{C} \coloneqq \{ f \in \Hspace : f(x + c) = f(x) \text{ for all } x \in V,\, c \in C \}

of $\Hspace = \C^{V}$ is called the space of $C$ -invariant functions.

The space $\Hspace^{C}$ can be identified with the function space $\C^{V/C}$ on $V/C$ . Indeed, a $C$ -invariant function is constant on cosets, so it is the same thing as a function with one value for each coset. Concretely, given $g \in \C^{V/C}$ , define a function $\widetilde{g}$ on $V$ by

\widetilde{g}(x) \coloneqq g(x + C).

Then $\widetilde{g}$ is $C$ -invariant. Conversely, if $f \in \Hspace^{C}$ , then $f$ is constant on each coset $x+C$ , and hence a function $g$ on $V/C$ is defined by

g(x + C) \coloneqq f(x).

Through these two correspondences, we identify $\C^{V/C}$ with $\Hspace^{C}$ . What we are doing here is not constructing a new quotient graph or its Laplacian separately. Rather, we identify the subspace of $C$ -invariant functions inside the original function space $\Hspace = \C^{V}$ with the function space on the quotient set $V/C$ , and compute the trace of the restriction $\res{\Kop_{z}}{\Hspace^{C}}$ of the original heat operator $\Kop_{z}$ . If one writes the kernel on the quotient set explicitly, then for cosets $x+C$ and $y+C$ it is

\overline{\Kop}_{z}(x+C, y+C) = \sum_{c \in C}\Kop_{z}(x, y+c).

This expression is independent of the choice of representatives $x$ and $y$. This is because $\Kop_{z}$ is translation-invariant, and, as $c$ ranges over all of $C$, the points $y+c$ range over the same coset. Taking the diagonal sum of this kernel on the quotient set is the same content as taking the trace of $P_{C}\Kop_{z}$ on $V$. In the text, however, we do not define a quotient graph anew; instead, we compute the trace using the restriction to $\Hspace^{C}$ and the averaging projection.

This trace does not count the points of $V$ individually. It is the trace in which each coset is counted as one degree of freedom. In other words, we are not simply summing diagonal entries over all $q^{n}$ points of $V$; rather, we are taking the trace corresponding to the degrees of freedom of the $C$-invariant function space $\Hspace^{C}$, namely to the number of cosets. This distinction is important when we later rewrite the trace by using the averaging projection $P_{C}$ as

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \Tr(P_{C}\Kop_{z}).

Here the condition $f(x + c)=f(x)$ appearing in the definition is the same as $\tau_{c}f=f$ . Indeed, $\tau_{c}f=f$ means $f(x - c)=f(x)$ . Since $C$ is an additive subgroup, as $c$ ranges over all of $C$ , so does $-c$ . Therefore $f(x + c)=f(x)$ and $\tau_{c}f=f$ are equivalent.

Define the averaging projection from $\Hspace$ to $\Hspace^{C}$ by

P_{C} \coloneqq \frac{1}{\card{C}}\sum_{c \in C} \tau_{c}. \tag{6.1}

This operator averages over translations by $C$ . Acting on functions, it is

(P_{C}f)(x) = \frac{1}{\card{C}}\sum_{c \in C} f(x - c).

This means that it averages the values on the coset $x + C$ . As $c$ ranges over all of $C$ , both $x-c$ and $x+c$ range over the same coset $x+C$ , so the sign difference does not affect the average. The operator $P_{C}$ averages the values of a function on each coset $x+C$ , and then extends that average value over the whole coset. Thus, once a function has been averaged, averaging it again does not change it. This is the intuition for $P_{C}^{2}=P_{C}$ , namely for the fact that it is a projection. The coefficient $1/\card{C}$ appears because we normalise the average in the translation directions by $C$ . A point of the quotient set $V/C$ appears inside $V$ as a coset consisting of $\card{C}$ points. By averaging the values along that coset direction, we extract from a function on $V$ only its $C$ -invariant part.

Lemma 6.2.

The operator $P_{C}$ is the projection onto $\Hspace^{C}$ . Moreover, $\Kop_{z}$ commutes with every translation $\tau_{a}$ , and in particular with $P_{C}$ .

Proof

First, $P_{C} f$ is $C$ -invariant. Indeed, for $d \in C$ ,

\begin{aligned} \tau_{d} P_{C} f &= \frac{1}{\card{C}} \sum_{c \in C} \tau_{d} \tau_{c} f = \frac{1}{\card{C}} \sum_{c \in C}\tau_{c+d} f = \frac{1}{\card{C}} \sum_{c^{\prime} \in C} \tau_{c^{\prime}} f = P_{C} f. \end{aligned}

Conversely, if $f \in \Hspace^{C}$ , then $\tau_{c} f = f$ for every $c \in C$ , and so $P_{C} f = f$ . Directly, we also compute

\begin{aligned} P_{C}^{2} &= \frac{1}{\card{C}^{2}}\sum_{c, d \in C} \tau_{c}\tau_{d} \\ &= \frac{1}{\card{C}^{2}}\sum_{c, d \in C} \tau_{c + d} \\ &= \frac{1}{\card{C}}\sum_{h \in C} \tau_{h} = P_{C}. \end{aligned}

In the last equality we used the fact that, for each fixed $h \in C$ , there are exactly $\card{C}$ pairs $(c,d)$ with $c+d=h$ . Hence $P_{C}$ is the projection onto $\Hspace^{C}$ .

Next, since the kernel of $\Kop_{z}$ depends only on $\dist(x, y)$ , it is invariant under translations. In other words, for all $a, x, y \in V$ ,

\Kop_{z}(x + a, y + a) = \Kop_{z}(x, y).

Checking this with the sign convention included, we have

\begin{aligned} (\tau_{a}\Kop_{z}f)(x) &= \sum_{y \in V}\Kop_{z}(x - a, y)f(y), \\ (\Kop_{z}\tau_{a}f)(x) &= \sum_{y \in V}\Kop_{z}(x, y)f(y - a) \\ &= \sum_{y^{\prime} \in V}\Kop_{z}(x, y^{\prime} + a)f(y^{\prime}). \end{aligned}

By translation-invariance of Hamming distance, $\Kop_{z}(x - a, y^{\prime})=\Kop_{z}(x, y^{\prime} + a)$ , and hence $\tau_{a}\Kop_{z}=\Kop_{z}\tau_{a}$ . Therefore $\Kop_{z}$ commutes with each $\tau_{a}$ . In particular, it also commutes with $P_{C}$ .
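The two claims of Lemma 6.2 can be checked on explicit matrices. The sketch below builds $P_{C}$ and $\Kop_{z}$ for a small binary code; the code `C`, the parameters, and the helper names are hypothetical choices for illustration, and $q$ is assumed prime.

```python
from fractions import Fraction
from itertools import product

q, n = 2, 3                               # illustrative parameters, q prime
V = list(product(range(q), repeat=n))     # vertex set F_q^n
index = {x: i for i, x in enumerate(V)}
C = [(0, 0, 0), (1, 1, 0)]                # a small binary linear code (example)

def translation(a):
    """Matrix of tau_a: (tau_a f)(x) = f(x - a), so row x has a 1 in column x - a."""
    size = len(V)
    T = [[Fraction(0)] * size for _ in range(size)]
    for x in V:
        shifted = tuple((xe - ae) % q for xe, ae in zip(x, a))
        T[index[x]][index[shifted]] = Fraction(1)
    return T

def averaging_projection(code):
    """P_C = (1/|C|) * sum of tau_c over c in C, as in (6.1)."""
    size = len(V)
    P = [[Fraction(0)] * size for _ in range(size)]
    for c in code:
        T = translation(c)
        for i in range(size):
            for j in range(size):
                P[i][j] += T[i][j] / len(code)
    return P

def heat_matrix(z):
    """Matrix of K_z using the distance formula (5.1)."""
    def entry(x, y):
        d = sum(1 for xe, ye in zip(x, y) if xe != ye)
        return (1 + (q - 1) * z) ** (n - d) * (1 - z) ** d / q ** n
    return [[entry(x, y) for y in V] for x in V]

def mat_mul(A, B):
    size = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

P = averaging_projection(C)
K = heat_matrix(Fraction(1, 3))
is_projection = mat_mul(P, P) == P
commutes = mat_mul(P, K) == mat_mul(K, P)
# The trace of a projection is the dimension of its image.
rank_of_P = sum(P[i][i] for i in range(len(V)))
print(is_projection, commutes, rank_of_P)
```

The trace of `P` equals the number of cosets $q^{n}/\card{C}$, in line with the identification of $\Hspace^{C}$ with $\C^{V/C}$.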

By this commutativity, if $f$ is $C$ -invariant, then $\Kop_{z}f$ is also $C$ -invariant. Thus $\Kop_{z}$ preserves $\Hspace^{C}$ , and we can consider the restricted operator $\res{\Kop_{z}}{\Hspace^{C}}$ . Therefore, by Lemma 2.2,

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \Tr(P_{C} \Kop_{z}).

Computing the right-hand side produces the weight enumerator of $C$ . From here, we compute the same left-hand side $\Tr(\res{\Kop_{z}}{\Hspace^{C}})$ by averaging over translations and then summing the diagonal entries of the kernel.

Theorem 6.3 (Geometric Side of the Trace).

For a code $C \leq \F_{q}^{E}$ ,

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \frac{1}{\card{C}} W_{C} \bigl( 1 + (q - 1)z, 1 - z \bigr) \tag{6.2}

holds.

Proof

By Lemma 2.2 and Lemma 6.2,

\begin{aligned} \Tr(\res{\Kop_{z}}{\Hspace^{C}}) &= \Tr(P_{C} \Kop_{z}) \\ &= \frac{1}{\card{C}} \sum_{c \in C} \Tr(\tau_{c} \Kop_{z}). \end{aligned}

Substituting Proposition 5.2 gives

\begin{aligned} \Tr(\res{\Kop_{z}}{\Hspace^{C}}) &= \frac{1}{\card{C}} \sum_{c \in C} \bigl( 1 + (q - 1)z \bigr)^{n - \wt(c)} (1 - z)^{\wt(c)} \\ &= \frac{1}{\card{C}} W_{C} \bigl( 1 + (q - 1)z, 1 - z \bigr). \end{aligned}

We call this computation the geometric side. We composed the heat operator with the translations $\tau_{c}$ for $c \in C$ , took their traces, and averaged over $c$ . Since the trace of each $\tau_{c}\Kop_{z}$ depends on $c$ only through its weight $\wt(c)$ , the weight enumerator of $C$ appeared.

As a check on the coefficients, let us look at $z=1$ and $z=0$ . When $z=1$ , we have $\Kop_{1}=\Id$ , so the left-hand side is

\dim \Hspace^{C} = \card{V/C} = \frac{q^{n}}{\card{C}}.

The right-hand side is

\frac{1}{\card{C}}W_{C}(q,0)=\frac{q^{n}}{\card{C}},

so the two agree. When $z=0$ , the one-coordinate heat operator becomes the averaging projection $\Pi_{0}$ , and in several coordinates $\Kop_{0}$ averages a function over all of $V$ and returns a constant function. In terms of heat time, $z=0$ corresponds not to any finite time but to the limit $t \to \infty$ . Nevertheless, the formulas here are polynomials in $z$ , so we may substitute $z=0$ . The image of $\Kop_{0}$ is the line of constant functions, which lies in $\Hspace^{C}$ , so only the constant direction contributes and the left-hand side is $1$ . The right-hand side is also

\frac{1}{\card{C}}W_{C}(1,1)=1.

This is not a proof, but a check that the coefficients in the formula fit naturally.
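The same kind of check can be run for arbitrary $z$ by brute force. The sketch below computes $\Tr(P_{C}\Kop_{z}) = \frac{1}{\card{C}}\sum_{c \in C}\sum_{x \in V}\Kop_{z}(x - c, x)$ directly from the kernel and compares it with the right-hand side of (6.2). It assumes $q$ is prime so that arithmetic in $\F_{q}$ is arithmetic modulo $q$ . The ternary code spanned by $(1,1,2)$ and the function names are illustrative choices, not taken from the text.

```python
from fractions import Fraction
from itertools import product


def weight(v):
    """Hamming weight of a vector."""
    return sum(1 for a in v if a != 0)


def heat_kernel(x, y, q, z):
    """K_z(x, y) as a product of entries of M_z = Pi_0 + z (Id - Pi_0)."""
    z = Fraction(z)
    value = Fraction(1)
    for a, b in zip(x, y):
        value *= (1 + (q - 1) * z) / q if a == b else (1 - z) / q
    return value


def geometric_trace(code, q, n, z):
    """Tr(P_C K_z) = (1/|C|) * sum over c and x of K_z(x - c, x)."""
    total = Fraction(0)
    for c in code:
        for x in product(range(q), repeat=n):
            shifted = tuple((xe - ce) % q for xe, ce in zip(x, c))
            total += heat_kernel(shifted, x, q, z)
    return total / len(code)


def enumerator_side(code, q, n, z):
    """(1/|C|) * W_C(1 + (q-1) z, 1 - z), the right-hand side of (6.2)."""
    z = Fraction(z)
    big_x, big_y = 1 + (q - 1) * z, 1 - z
    terms = (big_x ** (n - weight(c)) * big_y ** weight(c) for c in code)
    return sum(terms) / len(code)


# Assumed example: the ternary code {lambda * (1, 1, 2)} of length 3.
TERNARY = [(0, 0, 0), (1, 1, 2), (2, 2, 1)]

for z in (Fraction(2, 7), 0, 1):
    lhs = geometric_trace(TERNARY, 3, 3, z)
    rhs = enumerator_side(TERNARY, 3, 3, z)
    print(z, lhs, rhs, lhs == rhs)
```

The values $z=1$ and $z=0$ reproduce the two coefficient checks above, with $q^{n}/\card{C} = 9$ and $1$ respectively for this code.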

Spectral Side: Characters as Eigenfunctions

Next, we compute the same trace as a sum of eigenvalues. Here we use the eigenfunctions of the heat operator on the Hamming graph. These eigenfunctions are additive characters of the finite abelian group $V = \F_{q}^{E}$ .

Fix a non-trivial additive character

\psi \colon \F_{q} \to \C^{\times}

of $\F_{q}$ . That is, assume that $\psi(a + b) = \psi(a)\psi(b)$ holds and that $\psi$ is not identically $1$ . If $q = p^{m}$ , such a character can be constructed, for example, using the trace map in the form

\psi(a)= \exp\left(\frac{2\pi\sqrt{-1}}{p}\operatorname{tr}_{\F_{q}/\F_{p}}(a)\right).

In this expression, elements of $\F_{p}$ are read as the representatives $0,1,\dots,p-1$ . The symbol $\operatorname{tr}_{\F_{q}/\F_{p}}$ denotes the trace map of finite fields and is different from $\Tr$ , which in this text always denotes the trace of an operator. The property needed in this note is the following one-coordinate orthogonality relation.

Lemma 7.1.

For $b \in \F_{q}$ ,

\sum_{a \in \F_{q}} \psi(ab) = \begin{cases} q, & b = 0,\\ 0, & b \neq 0 \end{cases}

holds.

Proof

If $b = 0$ , then every term is $1$ , and so the sum is $q$ . Suppose $b \neq 0$ . Then the map $a \mapsto ab$ is a bijection of $\F_{q}$ , and hence

\sum_{a \in \F_{q}} \psi(ab) = \sum_{t \in \F_{q}} \psi(t).

Denote the right-hand side by $S$ . Since $\psi$ is non-trivial, there exists $d \in \F_{q}$ such that $\psi(d) \neq 1$ . As $t$ ranges over all of $\F_{q}$ , so does $t + d$ , and therefore

S = \sum_{t \in \F_{q}} \psi(t + d) = \psi(d) \sum_{t \in \F_{q}} \psi(t) = \psi(d)S.

Since $\psi(d) \neq 1$ , we have $S = 0$ .
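Lemma 7.1 is easy to observe numerically. The following is a minimal sketch, assuming $q=p$ is prime so that the trace map is the identity and $\psi(a)=\exp(2\pi\sqrt{-1}\,a/p)$ . The function names are hypothetical.

```python
import cmath


def psi(a, p):
    """Additive character of F_p (p prime): psi(a) = exp(2*pi*i*a/p)."""
    return cmath.exp(2j * cmath.pi * (a % p) / p)


def character_sum(b, p):
    """Sum of psi(a*b) over all a in F_p."""
    return sum(psi(a * b, p) for a in range(p))


# The modulus of the sum should be p for b = 0 and numerically zero otherwise.
for b in range(5):
    print(b, round(abs(character_sum(b, 5)), 9))
```

Because the sum is computed in floating point, the vanishing for $b \neq 0$ shows up only to within rounding error.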

For $u \in V$ , define a function $\chi_{u} \in \C^{V}$ by

\chi_{u}(x) \coloneqq \psi(u \cdot x) = \psi\left( \sum_{e \in E} u_{e} x_{e} \right).

We call this the character corresponding to $u$ . Here $x$ is the spatial variable, while $u$ may be regarded as an index representing a frequency on that space. The heat operator acts with a different multiplier for each such frequency $u$ .

Lemma 7.2.

The family of functions $\{ \chi_{u} : u \in V \}$ is an orthogonal basis of $\C^{V}$ .

Proof

Put the standard inner product

\langle f, g\rangle \coloneqq \sum_{x \in V} f(x) \overline{g(x)}

on $\C^{V}$ . We compute this standard inner product for $u, v \in V$ . Character values are roots of unity in the complex numbers, so $\overline{\psi(a)} = \psi(-a)$ . Therefore, using complex conjugation, we obtain

\begin{aligned} \langle \chi_{u}, \chi_{v} \rangle &= \sum_{x \in V} \chi_{u}(x) \overline{\chi_{v}(x)} \\ &= \sum_{x \in V} \psi((u - v) \cdot x) \\ &= \prod_{e \in E} \left( \sum_{a \in \F_{q}} \psi((u_{e} - v_{e})a) \right). \end{aligned}

By Lemma 7.1, if $u = v$ this value is $q^{n}$ , while if $u \neq v$ , then in at least one coordinate the inner sum is $0$ , and hence the whole product is $0$ . Thus $\{ \chi_{u} : u \in V \}$ is a family of mutually orthogonal non-zero vectors. Its cardinality is $q^{n} = \dim_{\C} \C^{V}$ , so it is a basis.

The basis obtained here is an orthogonal basis, not an orthonormal basis. The norm of each $\chi_{u}$ is $q^{n/2}$ . However, the trace is the sum of the eigenvalues in an eigenspace decomposition, and it does not depend on the normalisation of eigenvectors. Thus there is no problem for the trace computations below even without replacing this by an orthonormal basis.

Next, we see that these functions are eigenfunctions of the heat operator.

Lemma 7.3.

For every $u \in V$ ,

\Kop_{z} \chi_{u} = z^{\wt(u)} \chi_{u}

holds.

Proof

Work in one coordinate. For $b \in \F_{q}$ , define a function $\varphi_{b} \in U = \C^{\F_{q}}$ by

\varphi_{b}(a) \coloneqq \psi(ba).

If $b = 0$ , then $\varphi_{b}$ is a constant function, so $M_{z}$ acts on it with eigenvalue $1$ . If $b \neq 0$ , then by Lemma 7.1,

\sum_{a \in \F_{q}} \varphi_{b}(a) = 0,

and hence $\varphi_{b} \in U_{1}$ . Therefore $M_{z}$ acts on $\varphi_{b}$ with eigenvalue $z$ .

In several coordinates,

\chi_{u}(x) = \prod_{e \in E} \psi(u_{e} x_{e}),

and $\Kop_{z} = \bigotimes_{e \in E} M_{z}$ . A coordinate with $u_{e} = 0$ contributes the eigenvalue $1$ , and a coordinate with $u_{e} \neq 0$ contributes the eigenvalue $z$ . Hence the total eigenvalue is $z^{\wt(u)}$ .
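The eigenvalue equation of Lemma 7.3 can also be checked pointwise. The sketch below applies the kernel of $\Kop_{z}$ to each character $\chi_{u}$ and measures the largest deviation from $z^{\wt(u)}\chi_{u}$ . It assumes $q=3$ (prime), $n=2$ , and $z=0.4$ . These parameters and the helper names are illustrative only.

```python
import cmath
from itertools import product

Q, N, Z = 3, 2, 0.4   # assumed toy parameters (Q prime)


def psi(a):
    """Additive character of F_Q for prime Q."""
    return cmath.exp(2j * cmath.pi * (a % Q) / Q)


def chi(u, x):
    """Character chi_u(x) = psi(u . x)."""
    return psi(sum(ue * xe for ue, xe in zip(u, x)))


def heat_kernel(x, y):
    """K_z(x, y) as a product of entries of M_z."""
    value = 1.0
    for a, b in zip(x, y):
        value *= (1 + (Q - 1) * Z) / Q if a == b else (1 - Z) / Q
    return value


def apply_heat(u, x):
    """(K_z chi_u)(x) = sum over y of K_z(x, y) * chi_u(y)."""
    return sum(heat_kernel(x, y) * chi(u, y)
               for y in product(range(Q), repeat=N))


def weight(u):
    """Hamming weight of u."""
    return sum(1 for a in u if a != 0)


def max_eigen_error():
    """Largest |K_z chi_u (x) - z^wt(u) chi_u(x)| over all u and x."""
    points = list(product(range(Q), repeat=N))
    return max(abs(apply_heat(u, x) - Z ** weight(u) * chi(u, x))
               for u in points for x in points)


print(max_eigen_error() < 1e-12)
```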

Finally, we determine which characters remain inside the space $\Hspace^{C}$ of $C$ -invariant functions.

Lemma 7.4.

For $u \in V$ , the character $\chi_{u}$ is $C$ -invariant if and only if $u \in C^{\perp}$ .

Proof

The statement that $\chi_{u}$ is $C$ -invariant means that

\chi_{u}(x + c) = \chi_{u}(x)

holds for every $x \in V$ and $c \in C$ . But

\chi_{u}(x + c) = \psi(u \cdot x + u \cdot c) = \chi_{u}(x) \psi(u \cdot c),

so this is equivalent to $\psi(u \cdot c) = 1$ for every $c \in C$ .

If $u \in C^{\perp}$ , then $u \cdot c = 0$ for every $c \in C$ , and hence clearly $\psi(u \cdot c) = 1$ . Conversely, suppose that $\psi(u \cdot c) = 1$ for all $c \in C$ . Assume that for some $c_{0} \in C$ we have $b \coloneqq u \cdot c_{0} \neq 0$ . Since $C$ is $\F_{q}$ -linear, $\lambda c_{0} \in C$ for every $\lambda \in \F_{q}$ . Therefore $\psi(\lambda b) = 1$ must hold for every $\lambda \in \F_{q}$ . But if $b \neq 0$ , the map $\lambda \mapsto \lambda b$ is a bijection of $\F_{q}$ , so this would mean that $\psi$ is identically $1$ . This contradicts the non-triviality of $\psi$ . Hence $u \cdot c = 0$ for every $c \in C$ , and so $u \in C^{\perp}$ .

The reverse implication in the proof essentially uses the fact that $C$ is $\F_{q}$ -linear. The equality $u \cdot c=0$ does not follow immediately from $\psi(u \cdot c)=1$ , because a non-trivial additive character $\psi$ may send a non-zero element to $1$ . For example, for $q=4$ the character constructed above from the trace map satisfies $\psi(1)=1$ , since $\operatorname{tr}_{\F_{4}/\F_{2}}(1)=1+1=0$ . What rescues the argument is that $\lambda c_{0} \in C$ for every $\lambda \in \F_{q}$ whenever $c_{0} \in C$ , so the condition $\psi(\lambda (u \cdot c_{0}))=1$ is available for all $\lambda$ at once and not only for $\lambda=1$ .
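The role of linearity can be seen concretely over $\F_{4}$ , where $\psi(a) = (-1)^{\operatorname{tr}_{\F_{4}/\F_{2}}(a)}$ takes the value $1$ at the non-zero element $1$ . The sketch below compares, for a subgroup $S \leq \F_{4}^{2}$ , the set of $u$ with $\chi_{u}$ invariant under $S$ against the set of $u$ orthogonal to $S$ . The encoding of $\F_{4}$ as $\{0,1,2,3\}$ (with $2$ standing for a root $\omega$ of $x^{2}+x+1$ ), the two example subgroups, and the function names are all assumptions made for illustration.

```python
from itertools import product

# F_4 = {0, 1, w, w^2} encoded as 0, 1, 2, 3; addition is bitwise XOR.
MUL = [[0, 0, 0, 0], [0, 1, 2, 3], [0, 2, 3, 1], [0, 3, 1, 2]]
# psi(a) = (-1)^tr(a) with tr(a) = a + a^2, so psi(1) = 1 although 1 != 0.
PSI = [1, 1, -1, -1]


def dot(u, c):
    """Standard inner product u . c computed in F_4."""
    total = 0
    for ue, ce in zip(u, c):
        total ^= MUL[ue][ce]
    return total


def invariant_characters(subgroup, n):
    """All u with psi(u . c) = 1 for every c in the subgroup."""
    return {u for u in product(range(4), repeat=n)
            if all(PSI[dot(u, c)] == 1 for c in subgroup)}


def annihilator(subgroup, n):
    """All u with u . c = 0 for every c in the subgroup."""
    return {u for u in product(range(4), repeat=n)
            if all(dot(u, c) == 0 for c in subgroup)}


LINEAR = [(0, 0), (1, 2), (2, 3), (3, 1)]   # F_4-span of (1, w)
ADDITIVE = [(0, 0), (1, 2)]                 # closed under +, not under scaling

print(invariant_characters(LINEAR, 2) == annihilator(LINEAR, 2))
print(len(invariant_characters(ADDITIVE, 2)), len(annihilator(ADDITIVE, 2)))
```

For the $\F_{4}$ -linear code the two sets coincide, as Lemma 7.4 asserts. For the subgroup that is only closed under addition, the invariant characters form a strictly larger set than the annihilator, which is the failure that the linearity hypothesis rules out.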

We can now perform the spectral-side computation.

Theorem 7.5 (Spectral Side of the Trace).

For a code $C \leq \F_{q}^{E}$ ,

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \sum_{u \in C^{\perp}} z^{\wt(u)} = W_{C^{\perp}}(1, z) \tag{7.1}

holds.

Proof

By Lemma 7.2, every $f \in \Hspace$ has a unique expansion

f = \sum_{u \in V} \alpha_{u}\chi_{u}.

For $c \in C$ ,

(\tau_{c}\chi_{u})(x) = \chi_{u}(x-c) = \psi(-u\cdot c)\chi_{u}(x).

If $f \in \Hspace^{C}$ , then $\tau_{c}f = f$ for every $c \in C$ , and hence

\sum_{u \in V}\alpha_{u}\bigl(\psi(-u\cdot c)-1\bigr)\chi_{u}=0.

Since the characters $\chi_{u}$ are linearly independent, if $\alpha_{u} \neq 0$ , then $\psi(-u\cdot c)=1$ for every $c \in C$ . By Lemma 7.4, this is equivalent to $u \in C^{\perp}$ . Conversely, if $u \in C^{\perp}$ , then $\chi_{u}$ is $C$ -invariant. Therefore $\Hspace^{C}$ has $\{ \chi_{u} : u \in C^{\perp} \}$ as a basis. Indeed, the dimensions also agree. We have $\dim \Hspace^{C}=\card{V/C}=q^{n}/\card{C}$ , and by linear algebra over finite fields, $\card{C^{\perp}}=q^{n}/\card{C}$ . This follows from the non-degeneracy of the standard inner product, which gives $\dim_{\F_{q}} C+\dim_{\F_{q}} C^{\perp}=n$ . Hence the number of degrees of freedom of functions on the quotient space agrees with the number of remaining characters.

Also, by Lemma 7.3, the operator $\Kop_{z}$ has eigenvalue $z^{\wt(u)}$ on $\chi_{u}$ . Therefore the trace of $\res{\Kop_{z}}{\Hspace^{C}}$ is

\sum_{u \in C^{\perp}} z^{\wt(u)}.

This is exactly $W_{C^{\perp}}(1, z)$ .

We call this computation the spectral side. When the trace is computed using eigenfunctions of the heat operator, only the $C$ -invariant eigenfunctions remain, and these are indexed precisely by $C^{\perp}$ . Their eigenvalues are $z^{\wt(u)}$ , so the weight enumerator of the dual code appears.

The MacWilliams Identity as a Trace Formula

We have now computed the same trace $\Tr(\res{\Kop_{z}}{\Hspace^{C}})$ in two ways. The important point is that the spectral side and the geometric side are not computing different quantities; they are computing the trace of exactly the same operator $\res{\Kop_{z}}{\Hspace^{C}}$ in two ways. In one computation $C^{\perp}$ appears, and in the other computation $C$ appears. On the spectral side,

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = W_{C^{\perp}}(1, z),

while on the geometric side,

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \frac{1}{\card{C}} W_{C} \bigl(1 + (q - 1)z, 1 - z \bigr).

Thus we obtain the following.

Theorem 8.1 (Inhomogeneous Form of the MacWilliams Identity).

For every linear code $C \leq \F_{q}^{E}$ ,

W_{C^{\perp}}(1, z) = \frac{1}{\card{C}} W_{C} \bigl( 1 + (q - 1)z, 1 - z \bigr) \tag{8.1}

holds.

Proof

It is enough to equate the right-hand sides of Theorem 7.5 and Theorem 6.3.

Equation (8.1) is to be read as a polynomial identity in $z$ . To return to the usual two-variable form, we homogenise. The weight enumerator $W_{B}(X, Y)$ of a length $n$ code $B$ is a homogeneous polynomial whose terms all have total degree $n$ . Therefore, if $X \neq 0$ , then

W_{B}(X, Y) = X^{n} W_{B}(1, Y/X)

holds. Apply this property to $B=C^{\perp}$ and $B=C$ , put $z = Y/X$ , and multiply both sides by $X^{n}$ . Then

\begin{aligned} W_{C^{\perp}}(X, Y) &= X^{n} W_{C^{\perp}}(1, Y/X) \\ &= \frac{X^{n}}{\card{C}} W_{C}\left( 1 + (q - 1)\frac{Y}{X}, 1 - \frac{Y}{X} \right) \\ &= \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr) \end{aligned}

is obtained. In the last equality, we used the fact that $W_{C}$ is a homogeneous polynomial of total degree $n$ : scaling both arguments by $X$ multiplies $W_{C}$ by $X^{n}$ , so the outer factor $X^{n}$ can be absorbed into the arguments. That is,

X^{n} W_{C}\left( 1 + (q - 1)\frac{Y}{X}, 1 - \frac{Y}{X} \right) = W_{C}\bigl( X + (q - 1)Y, X - Y \bigr).

Both sides are polynomials in $X,Y$ , and they agree on the range $X \neq 0$ . Hence they agree as a polynomial identity, and the same equality also holds when $X = 0$ . This is the MacWilliams identity.

Theorem 8.2 (MacWilliams Identity).

For a linear code $C \leq \F_{q}^{E}$ over the finite field $\F_{q}$ ,

W_{C^{\perp}}(X, Y) = \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr)

holds.

In one sentence, the proof can be summarised as follows.

The MacWilliams identity is a trace formula saying that, when the heat operator on the Hamming graph is viewed on the quotient space $\F_{q}^{E}/C$ , its spectral side gives the weight enumerator of $C^{\perp}$ , while its geometric side gives the MacWilliams transform of the weight enumerator of $C$ .
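In terms of weight distributions, Theorem 8.1 says that the coefficients of $\frac{1}{\card{C}}\sum_{i} A_{i}(1+(q-1)z)^{n-i}(1-z)^{i}$ , where $A_{i}$ counts the codewords of weight $i$ in $C$ , are the weight distribution of $C^{\perp}$ . The following sketch expands this polynomial with integer coefficients and compares it with a brute-force enumeration of the dual. The binary $[7,4]$ Hamming code in the systematic form below is an example chosen here, not one used in the text, and the function names are hypothetical.

```python
from itertools import product


def span(generators, n):
    """All F_2-linear combinations of the given generator rows."""
    words = set()
    for coeffs in product(range(2), repeat=len(generators)):
        word = [0] * n
        for coeff, row in zip(coeffs, generators):
            if coeff:
                word = [(a + b) % 2 for a, b in zip(word, row)]
        words.add(tuple(word))
    return words


def weight_distribution(code, n):
    """dist[i] = number of binary codewords of Hamming weight i."""
    dist = [0] * (n + 1)
    for c in code:
        dist[sum(c)] += 1
    return dist


def poly_mul(f, g):
    """Product of two polynomials given as coefficient lists."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out


def poly_pow(f, k):
    """k-th power of a polynomial given as a coefficient list."""
    out = [1]
    for _ in range(k):
        out = poly_mul(out, f)
    return out


def macwilliams_transform(dist, n, q):
    """Coefficients in z of (1/|C|) sum_i A_i (1+(q-1)z)^(n-i) (1-z)^i."""
    size = sum(dist)
    total = [0] * (n + 1)
    for i, a_i in enumerate(dist):
        term = poly_mul(poly_pow([1, q - 1], n - i), poly_pow([1, -1], i))
        total = [t + a_i * c for t, c in zip(total, term)]
    assert all(t % size == 0 for t in total)   # division by |C| is exact
    return [t // size for t in total]


# Assumed example: generator matrix [I_4 | A] of the binary Hamming code.
GENERATORS = [(1, 0, 0, 0, 1, 1, 0), (0, 1, 0, 0, 1, 0, 1),
              (0, 0, 1, 0, 0, 1, 1), (0, 0, 0, 1, 1, 1, 1)]
HAMMING = span(GENERATORS, 7)
DUAL = {u for u in product(range(2), repeat=7)
        if all(sum(a * b for a, b in zip(u, c)) % 2 == 0 for c in HAMMING)}

print(weight_distribution(HAMMING, 7))
print(macwilliams_transform(weight_distribution(HAMMING, 7), 7, 2))
print(weight_distribution(DUAL, 7))
```

The last two printed lists should coincide. Working with coefficient lists checks (8.1) as a polynomial identity and not only at sample values of $z$ .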

A Small Example: The Binary Repetition Code of Length Three

In the general theory above, the key point was that the same trace on the quotient space was computed in two ways, with $C$ appearing on the geometric side and $C^{\perp}$ appearing on the spectral side. Below, for the binary repetition code of length three, we directly check that the geometric side and the spectral side give the same polynomial. Let $E = \{ 1, 2, 3 \}$ and $q = 2$ , and consider the binary repetition code

C = \{ 000, 111 \} \leq \F_{2}^{3}.

Then

W_{C}(X, Y) = X^{3} + Y^{3}.

The dual code is the even-weight code

C^{\perp} = \{ 000, 110, 101, 011 \},

and

W_{C^{\perp}}(X, Y) = X^{3} + 3XY^{2}.

The matrix of the one-coordinate heat operator, that is, its heat kernel, is

M_{z} = \frac{1}{2} \begin{pmatrix} 1 + z & 1 - z \\ 1 - z & 1 + z \end{pmatrix}.

Therefore, for $a \in \F_{2}^{3}$ ,

\Tr(\tau_{a}\Kop_{z}) = (1 + z)^{3 - \wt(a)}(1 - z)^{\wt(a)}.

The elements of $C$ are $000$ and $111$ , so the trace on the geometric side is

\begin{aligned} \Tr(\res{\Kop_{z}}{\Hspace^{C}}) &= \frac{1}{2}\bigl( (1 + z)^{3} + (1 - z)^{3} \bigr) \\ &= 1 + 3z^{2}. \end{aligned}

On the other hand, on the spectral side the weights of the elements of $C^{\perp}$ are $0$ , $2$ , $2$ , and $2$ , so

\sum_{u \in C^{\perp}} z^{\wt(u)} = 1 + 3z^{2}.

Indeed, the two trace computations agree. After homogenising, we get

\begin{aligned} W_{C^{\perp}}(X, Y) &= \frac{1}{2} W_{C}(X + Y, X - Y) \\ &= \frac{1}{2} \bigl((X + Y)^{3} + (X - Y)^{3} \bigr) \\ &= X^{3} + 3XY^{2}, \end{aligned}

which confirms the MacWilliams identity.

What this example shows is as follows. On the geometric side, each of the two translations $000$ and $111$ in $C$ shifts the kernel of the heat operator before its diagonal entries are read off. Each coordinate where the shift is zero contributes a factor $1 + z$ , and each coordinate where it is non-zero contributes a factor $1 - z$ . On the spectral side, the eigenfunctions on the quotient space are precisely the $C$ -invariant characters, and these correspond to $C^{\perp}$ . Because these two readings compute the same trace, an identity of weight enumerators is produced.
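For this example the trace on the quotient space can also be computed without passing through $P_{C}$ . The space $\Hspace^{C}$ has the indicator functions of the four cosets of $C$ as a basis, and in that basis the diagonal entry of $\Kop_{z}$ at a coset $D$ is $\sum_{y \in D}\Kop_{z}(x_{D}, y)$ for any representative $x_{D} \in D$ . The sketch below sums these entries in exact arithmetic. The coset-indicator basis and the function names are choices made here for illustration and are not part of the argument in the text.

```python
from fractions import Fraction
from itertools import product

N = 3
CODE = [(0, 0, 0), (1, 1, 1)]   # binary repetition code of length three


def heat_kernel(x, y, z):
    """K_z(x, y) for q = 2: each coordinate gives (1+z)/2 or (1-z)/2."""
    value = Fraction(1)
    for a, b in zip(x, y):
        value *= (1 + z) / 2 if a == b else (1 - z) / 2
    return value


def cosets():
    """The cosets x + C of the code, each as a sorted list of points."""
    seen, result = set(), []
    for x in product(range(2), repeat=N):
        if x not in seen:
            coset = sorted(tuple((a + b) % 2 for a, b in zip(x, c))
                           for c in CODE)
            seen.update(coset)
            result.append(coset)
    return result


def quotient_trace(z):
    """Trace of K_z on C-invariant functions via coset indicators."""
    z = Fraction(z)
    return sum(heat_kernel(coset[0], y, z)
               for coset in cosets() for y in coset)


for z in (Fraction(1, 2), Fraction(-2, 3), 1):
    print(z, quotient_trace(z), 1 + 3 * Fraction(z) ** 2)
```

Each printed pair should agree, matching the polynomial $1 + 3z^{2}$ obtained on both sides above.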

What the Heat Operator, Heat Kernel, and Trace Formula Did in This Proof

Let us review the proof in this note.

First, we read the Hamming weight from the diagonal entries of the kernel of the heat operator after composing with a translation. When the one-coordinate heat operator $M_{z}$ is composed with a translation and we take the trace, depending on whether the translation amount is zero or non-zero, the factors

1 + (q - 1)z, \qquad 1 - z

appear. In several coordinates, these are multiplied coordinate by coordinate, and the Hamming weight is recorded.

Second, we used the code $C$ as a translation group. Using the averaging projection by $C$ ,

P_{C} = \frac{1}{\card{C}} \sum_{c \in C} \tau_{c},

we pass to the function space on $V/C$ , that is, to the space of $C$ -invariant functions. Computing the trace through this projection averages the contributions of the translations corresponding to codewords in $C$ . This is the geometric side.

Third, we diagonalised the heat operator on the Hamming graph using eigenfunctions. The eigenfunctions are characters of the form $\chi_{u}(x) = \psi(u \cdot x)$ , and the corresponding eigenvalues are $z^{\wt(u)}$ . On the quotient space, only the $C$ -invariant characters remain. These correspond exactly to $u \in C^{\perp}$ . This is the spectral side.

Fourth, we computed the same trace in two ways. On the spectral side we sum the eigenvalues indexed by $C^{\perp}$ , while on the geometric side we average the contributions of translations by elements of $C$ . The result is the equality

\Tr(\res{\Kop_{z}}{\Hspace^{C}}) = \sum_{u \in C^{\perp}} z^{\wt(u)} = \frac{1}{\card{C}} \sum_{c \in C}(1 + (q - 1)z)^{n - \wt(c)}(1 - z)^{\wt(c)}.

This equation is the core of the proof in this note.

Thus the MacWilliams identity can be read as the following trace formula for a heat operator.

The dual code $C^{\perp}$ appears on the spectral side of the quotient space $V/C$ , while the original code $C$ appears on the geometric side, where the trace on the same quotient space is computed as a translation average.

This viewpoint has the same underlying principle as the usual character-theoretic proof, while emphasising the analytic form of “spectral side versus geometric side”.

Concepts Seen in This Part

In this part, while aiming at a proof of the MacWilliams identity, we introduced the basic tools of heat operators, heat kernels, and trace formulas. They can be organised as follows.

Kernels on finite sets

These are the matrix entries $K_{T}(x, y)$ of a linear operator $T \colon \C^{\Omega} \to \C^{\Omega}$ on a finite set $\Omega$ . On a finite set, a kernel is the matrix itself.

Trace

This is the sum of the diagonal entries of an operator. It can also be computed as a sum of eigenvalues. The equality of these two viewpoints is the entrance to trace formulas.

Heat operators and heat kernels

The operator constructed from a Laplacian $\Delta$ as $\exp(-t\Delta)$ is the heat operator, and its matrix entries form the heat kernel. In this note, putting $z = e^{-t}$ , we wrote the one-coordinate heat operator as

M_{z} = \Pi_{0} + z(\Id - \Pi_{0}).

Hamming graph

This is the graph with vertex set $\F_{q}^{E}$ in which two vertices at distance $1$ are joined by an edge. It can be viewed as the coordinatewise Cartesian product of one-coordinate complete graphs.

Translation action

This is the action that moves a point by $x \mapsto x + a$ for $a \in \F_{q}^{E}$ . On the function space, we let it act by $(\tau_{a} f)(x) = f(x - a)$ .

$C$ -invariant functions

These are functions unchanged by translations by the code $C$ . They are the same thing as functions on the quotient set $\F_{q}^{E}/C$ .

Averaging projection

This is the projection onto $C$ -invariant functions,

P_{C} = \frac{1}{\card{C}} \sum_{c \in C} \tau_{c}.

Using this projection, we brought the trace on the quotient space back to a trace on the original space.

Spectral side

This is the side where the heat operator is diagonalised by eigenfunctions and the trace is computed. In this note, because the $C$ -invariant eigenfunctions were indexed by $C^{\perp}$ , the weight enumerator of the dual code appeared.

Geometric side

This is the side where the diagonal entries of the kernel of the heat operator composed with translations are summed directly. In this note, according to the Hamming weight of $c \in C$ , the factors $1 + (q - 1)z$ and $1 - z$ appeared coordinate by coordinate, and the weight enumerator of the original code $C$ appeared.

Looking Back at This Proof Family

From here on, we are no longer in the proof itself, but are placing this proof within the map of the series as a whole. This section is supplementary; it may be skipped without affecting the understanding of the proof in this note.

On the surface, the proof in this note is a proof using heat operators, heat kernels, and trace formulas. We constructed the heat operator $\Kop_{z}$ on the Hamming graph and computed the trace obtained by viewing it on the quotient space $\F_{q}^{E}/C$ in two ways. The spectral side produced $C^{\perp}$ , and the geometric side produced $C$ .

Compressed into the five families, this proof belongs to the

Fourier, character, and Poisson family.

The reason is that the eigenfunctions diagonalising the heat operator on the Hamming graph are characters of the finite abelian group $\F_{q}^{E}$ . Indeed, once the eigenfunctions

\chi_{u}(x) = \psi(u \cdot x)

are used, finite Fourier analysis is already in the background.

However, the viewpoint in this note has a different appearance from the usual character-theoretic proof. In the usual character-theoretic proof, one directly uses the orthogonality relation

\sum_{c \in C} \psi(u \cdot c) = \begin{cases} \card{C}, & u \in C^{\perp},\\ 0, & u \notin C^{\perp}. \end{cases}

In contrast, this note constructs the heat operator on the Hamming graph and computes its trace on the quotient space in two ways:

as a sum of eigenvalues and as a diagonal sum of kernels after composing with translations.

Thus the Fourier principle is repackaged in the analytic form of a trace formula.

The following supplement is not needed for reading this note, but for readers who know the language of association schemes, the heat operator $\Kop_{z}$ here is an element of the Bose–Mesner algebra of the Hamming scheme. For that reason, this proof also touches the orthogonal-polynomial and association-scheme family. However, in the proof in this note, we placed in the foreground the viewpoint of computing the trace of a heat operator in two ways, rather than the general theory of Bose–Mesner algebras.

This difference is the phenomenon that this series aims to highlight: the same theorem can be seen in the languages of different fields. Even for the same MacWilliams identity,

one can see it as character orthogonality, or as a trace formula for a heat operator.

The proof in this note extracts that analytic form.

Next Time

At this point, the derivation of the MacWilliams identity from heat operators, heat kernels, and trace formulas, which is the main claim of this note, is complete. What follows is a preview within the series.

Next time, we look at the MacWilliams identity from the side of lattices and theta functions. In the proof in this note, we used heat operators and heat kernels on the finite set $\F_{q}^{E}$ and derived the MacWilliams identity as a finite trace formula. Next time, through Construction A, which constructs a lattice from a code, we embed weight enumerators into theta functions. Then we will view the MacWilliams identity through the transformation formula for lattice theta functions, namely the continuous version of the Poisson summation formula.

The main actors next time are

Construction A → lattices → dual lattices → theta functions → the Poisson summation formula → the MacWilliams identity

Even though the proof belongs to the same Fourier, character, and Poisson family, next time the MacWilliams identity will appear not as a heat operator and heat kernel on a finite set, but as a transformation formula for continuous lattices and theta functions.

References

[Chu97] Fan R. K. Chung. Spectral graph theory. Published for the Conference Board of the Mathematical Sciences, Washington, DC, by the American Mathematical Society, Providence, RI, vol. 92, pp. xii+207, 1997.
[GR01] Chris Godsil and Gordon Royle. Algebraic graph theory. Springer-Verlag, New York, vol. 207, pp. xx+439, 2001. doi:10.1007/978-1-4613-0163-9
[CY99] Fan Chung and S.-T. Yau. Coverings, heat kernels and spanning trees. Electron. J. Combin., vol. 6, Research Paper 12, 21 pp., 1999. doi:10.37236/1444
[Ter99] Audrey Terras. Fourier analysis on finite groups and applications. Cambridge University Press, Cambridge, vol. 43, pp. x+442, 1999. doi:10.1017/CBO9780511626265
[MS77] F. J. MacWilliams and N. J. A. Sloane. The theory of error-correcting codes. II. North-Holland Publishing Co., Amsterdam-New York-Oxford, vol. 16, pp. i–ix and 370–762, 1977.

For the spectral theory of finite graphs, Chung [Chu97] and Godsil–Royle [GR01] are standard references. For heat kernels on graphs, see for instance Chung–Yau [CY99]. For Fourier analysis on finite groups and its applications to graph and coding theory, see also Terras [Ter99]. For the classical treatment of the MacWilliams identity in coding theory, see MacWilliams–Sloane [MS77]. In this note, we do not develop all this general theory, but instead extract only the part needed for the ordinary Hamming-weight MacWilliams identity for linear codes over finite fields.

This series

A Series Learning through the MacWilliams Identity · Part 10 of 12

Previous: An Introduction to Factor Graphs and Partition Functions through the MacWilliams Identity

Disclaimer

Articles on this site are based on the operator's personal understanding, investigation, and research notes. I try to keep the content accurate, but it may contain errors or incomplete explanations. I do not guarantee its accuracy, completeness, usefulness, or currentness.

Please use the information on this site at your own judgment and responsibility. To the extent permitted by law, the operator is not liable for damages, losses, or disadvantages arising from using, or being unable to use, information on this site.

If you notice an error, unclear explanation, broken link, or insufficient citation, please contact the operator. I will review the content and, when appropriate, correct, update, or remove it.
