Article and note
A Series Learning through the MacWilliams IdentityPart 10 of 12
An Introduction to Heat Kernels and Trace Formulas through the MacWilliams Identity
- Published:
- Updated:
- Reading time:
- 38 min (about 8,278 words)
Introduction
One of the fundamental theorems in coding theory is the MacWilliams identity. Let be the coordinate set, let , and, for a linear code over the finite field , consider its dual code
where
For a codeword , write its support and Hamming weight as
respectively. Define the weight enumerator of the linear code by
The MacWilliams identity is the formula saying that the weight enumerator of the dual code can be computed from the weight enumerator of as follows:
This is the MacWilliams identity.
In this series, we use proofs of the MacWilliams identity as a guide to introductions to neighbouring areas and concepts. For that reason, the series is aimed at readers with the following background:
We assume basic familiarity with elementary coding theory, namely with
what a (finite) field is,
what a linear code over a finite field is,
what the Hamming weight is,
what the dual code is.
(It is not necessary to know a proof of the MacWilliams identity.)
This note does not presuppose Parts 1–9. In the series as a whole, we compare several proofs of the MacWilliams identity, but the content of the previous parts is not needed in order to read this note. The necessary linear operators, kernels, traces, heat operators and heat kernels on finite graphs, Hamming graphs, translation actions, and character eigenfunctions are introduced in the text to the extent needed.
In this series, we view the proof methods for the MacWilliams identity as falling roughly into the following five families. However, this classification is a map of the whole series, and is not required for reading the proof in this note:
Fourier, character, and Poisson-type proofs.
Möbius inversion, lattice-theoretic, shortening-puncturing proofs.
Orthogonal-polynomial and association-scheme proofs.
Matroid and Tutte-polynomial proofs.
Moment and double-counting proofs.
The proof treated in this note is written, on the surface, in the language of
Hamming graphs, heat operators, Markov operators, and trace formulas.
Compressed into the five families above, it belongs to the Fourier, character, and Poisson family. This is because, when the heat operator on the Hamming graph is diagonalised, characters of the finite abelian group eventually appear as eigenfunctions. That said, character theory itself is not the main actor in this note. The aim here is to use a proof of the MacWilliams identity as a guide to an introduction to heat operators and heat kernels and to trace formulas.
Roughly speaking, a heat operator is an operator that describes how heat spreads on a graph or a space. Its matrix entries are called the heat kernel. On a finite set, a heat operator is a finite matrix, and the heat kernel is its matrix entry. A trace formula is an identity obtained by computing the trace of a single operator in two ways. In one computation, the operator is decomposed into eigenspaces and the eigenvalues are summed. In the other computation, the diagonal entries of the kernel are summed directly. The equality of these two computations gives a non-trivial identity. The trace formulas considered in this note are not deep trace formulas involving infinite-dimensional analysis or convergence issues, but finite trace formulas obtained by computing the trace of a finite-dimensional matrix in two ways.
The MacWilliams identity considered in this note appears as the following trace formula. Write for the heat operator on the Hamming graph on , and write its kernel as . We take the trace of on the quotient space by the translation action of the code . More precisely, we are not defining a new quotient graph or a heat operator on that quotient graph separately. Rather, we identify the function space on the quotient set with the space of -invariant functions on , and compute the trace of the restriction of the original heat operator . Computing this trace from the spectral side produces the weight distribution of . On the other hand, averaging along the translation action and then summing the diagonal entries produces the weight distribution of . Comparing the two gives the MacWilliams identity.
The flow of the note is as follows.
operators on finite sets → kernels and traces → heat operators and heat kernels on finite graphs → heat operators and heat kernels on Hamming graphs → traces on quotient spaces → the MacWilliams identity
Stating only the key point of the proof in advance, it is as follows. In one coordinate, the zero translation produces , and a non-zero translation produces . In several coordinates, this zero/non-zero distinction occurs coordinate by coordinate, and so records the Hamming weight. Averaging further over the elements of produces the weight enumerator of . On the other hand, from the viewpoint of eigenfunctions, the characters that remain on the quotient space are precisely the -invariant characters, and these correspond exactly to .
For the spectral theory of finite graphs, Chung [Chu97] and Godsil–Royle [GR01] are standard references. For heat kernels on graphs, see for instance Chung–Yau [CY99]. For Fourier analysis on finite groups and its applications to graph and coding theory, see also Terras [Ter99]. For the classical treatment of the MacWilliams identity in coding theory, see MacWilliams–Sloane [MS77]. In this note, we do not develop all this general theory, but instead extract only the part needed for the ordinary Hamming-weight MacWilliams identity for linear codes over finite fields.
Function Spaces, Kernels, and Traces on Finite Sets
Before treating heat operators and heat kernels, we prepare the language for viewing linear operators on finite sets as matrices. All spaces considered here are finite-dimensional, so no analytic convergence issues arise.
For a finite set , let be the set of all complex-valued functions on . In this note, an italic denotes a code, whereas the blackboard bold denotes the field of complex numbers. For each , define by
Then is a basis of .
A linear operator can be written as a matrix whose rows and columns are indexed by . In this note, we write its matrix entry as and call it the kernel of . Thus
On a finite set, a kernel is simply another name for a matrix entry. However, when we later call it a heat kernel, we think of this matrix entry as the amount of heat transmitted from the point to the point . Here the basic convention in this note is that operators act on functions. As a matrix entry, can be read as the contribution from the column index to the row index . On the other hand, when the same matrix is read intuitively as an expectation operator for a Markov process, it is read as the probability of moving from the current point to the next point . In general, the transpose matrix appears in the convention where one acts on probability distributions. The heat kernels actually used in this note are symmetric, so both readings use the same numerical matrix, and only the interpretation of the direction differs. The word kernel appears in two senses in this note. One is the matrix-entry sense just defined, written as or . The other is the linear-algebraic subspace sent to zero, written as . To avoid confusion, kernels as matrix entries are denoted by symbols of the form , whereas linear-algebraic kernels are written as . Also note the following notation. denotes the kernel of a general operator . The symbol appearing later denotes the complete graph on vertices, and denotes the heat operator on the Hamming graph. Its heat kernel is written . We use the same letter for the operator and its kernel, but when is attached it denotes a matrix entry.
Definition 2.1.
The trace of a linear operator is defined by
This is the usual trace of a matrix. That is, it is the sum of the diagonal entries. If is diagonalisable and its eigenvalues, counted with multiplicity, are , then
Thus the trace has at least two viewpoints.
summing diagonal entries ⇔ summing eigenvalues
Using these two viewpoints for the same operator is the basic idea of a trace formula.
In this note, we also use traces on subspaces. For that purpose we prepare a simple lemma using projections.
Lemma 2.2.
Let be a subspace, and let be a projection onto . That is, assume and . Suppose that the linear operator commutes with . Then .
Proof
Since is a projection, decomposes as
Since commutes with , the operator preserves both and . With respect to this decomposition, the matrix of is block diagonal, and has the form
Therefore the trace of is equal to the trace of the block of on . Thus .
We will later apply this lemma to the space of functions invariant under translations by . This is because we identify functions on the quotient space with -invariant functions on , and compute traces using the projection onto that space.
Heat Operators and Heat Kernels on Finite Graphs
We next introduce the idea of heat operators and heat kernels on finite graphs. Rather than developing general graph theory, we keep in mind the finite regular graphs needed in this note, especially Hamming graphs.
Let be the vertex set of a finite graph. The function space on the graph is . The graph Laplacian is an operator that measures the difference between a function and its surrounding values. The standard combinatorial Laplacian is defined by . Here is the adjacency matrix, and is the diagonal matrix whose diagonal entries are the degrees. For a regular graph, is a constant multiple of the identity matrix.
The finite-graph version of the heat equation can be written using the matrix exponential as . This is called the heat operator, and its kernel is called the heat kernel. On a finite set, this is just an ordinary matrix exponential. Here the matrix exponential is the matrix defined by
In the situations actually used in this note, the eigenspace decomposition gives concrete formulas, so there is no need to compute this series directly.
In this note, however, we use a scalar multiple of the Laplacian in order to simplify the calculations. Multiplying the Laplacian by a positive constant amounts to viewing the heat operator after rescaling time. Indeed, replacing by gives . Thus the structure of eigenfunctions and trace formulas is unchanged. What changes is only the reparametrisation of the time parameter appearing in the eigenvalues. In this note, we use the parameter instead of time . Strictly speaking, for finite we have . The value corresponds to the limit . The time corresponds to , and then the heat operator is the identity operator. As becomes larger, approaches , and the heat operator moves towards averaging. In the one-coordinate case, at the limiting value the operator becomes the averaging projection . Thus can be viewed as a parameter measuring how much of the original value is retained. The heat kernels and traces actually computed below are all expressed as polynomials in . Therefore substituting also makes sense as a polynomial. At the stage where the MacWilliams identity is obtained, is read not as an analytic time parameter but as a formal variable. Thus we first understand the operators as heat operators for , and in the end read the same formulas as polynomial identities. The later substitution and homogenisation use precisely this polynomial viewpoint.
Heat operators and heat kernels on finite graphs have the following two basic viewpoints.
- Spectral side
Decompose into eigenspaces and sum the eigenvalues .
- Geometric side
Sum the diagonal entries of the heat kernel over the vertices .
Both compute the same trace . In this note, we also consider the trace of an operator obtained by composing a heat operator with a translation action. This operation of composing with a translation and then taking the trace is what extracts the weight distribution of the code .
The Heat Operator and Heat Kernel of the One-Coordinate Complete Graph
A Hamming graph is the Cartesian product, coordinate by coordinate, of complete graphs. Here the graph product is meant in the Cartesian sense. Two multi-coordinate vertices are adjacent precisely when they differ in exactly one coordinate, and in that coordinate they are adjacent in the one-coordinate complete graph. We therefore begin with the one-coordinate complete graph.
Let the one-coordinate vertex set be , and let the function space be . Define the averaging projection onto the one-dimensional subspace of consisting of constant functions by
Here is the function that is identically . The kernel of this projection in the linear-algebraic sense, namely the subspace on which , is
Thus . Here the subscript indicates that this is the space corresponding to the eigenvalue for the one-coordinate Laplacian defined immediately below; it does not mean that is one-dimensional. In fact, .
In this note, we define the one-coordinate Laplacian by
This is a scalar multiple of the combinatorial Laplacian of the complete graph . Here is the matrix all of whose entries are . Indeed, the adjacency matrix of is , and the combinatorial Laplacian is
On the other hand, since ,
Thus we are simply using the usual Laplacian multiplied by .
Writing , the one-coordinate heat operator is
It acts with eigenvalue in the constant direction and with eigenvalue in the direction where the sum is .
Lemma 4.1.
The kernel of the one-coordinate heat operator is given by
Proof
The kernel of is for all and . The kernel of is when and when . Therefore, writing , when we get
and when we get
This formula also has a probabilistic reading. The operator is the operation that keeps the current value with probability , and replaces it by a uniformly chosen value of with probability . In this note, we view not as acting on probability distributions, but as an operator acting on functions, that is, observables. In other words, it returns the average value of when the next state is chosen at random by this rule. In particular, when , the operator sends non-negative functions to non-negative functions and preserves constant functions, so it can be read as a Markov operator on the function side. If the same symmetric matrix is made to act on distributions, it can also be read as an operator sending probability distributions to probability distributions. In the convention where one acts on distributions, the transpose matrix appears, but the heat kernel in the present case is symmetric, so we are looking at the same numerical matrix. However, in this note we use it not for the probabilistic interpretation but for trace computations.
The most important point in one coordinate is the trace after composing with a translation. For , define the translation operator by
With this definition, acts as pullback on functions. On basis vectors, . The points themselves are moved by , but because the action on functions is by pullback, the formula contains . Later we write -invariance as . Since is an additive subgroup, as runs through all of , so does . Thus the condition is equivalent to the condition .
Lemma 4.2.
For every ,
holds.
Proof
Write the kernel of as . The kernel of is
Hence
If , all these entries are diagonal entries, so
On the other hand, if , then for every , so all entries are off-diagonal entries. Therefore
This lemma records the source of the change of variables in the MacWilliams identity. In the ordinary trace, one sums over . After composing with , however, one reads as the diagonal entry. In other words, translation shifts the diagonal entries of the heat kernel by . The zero translation produces , and a non-zero translation produces . This is because, when , only diagonal entries are read, whereas when , only off-diagonal entries are read. In several coordinates, this distinction occurs coordinate by coordinate, and the Hamming weight is recorded in the trace. After later substituting and homogenising, this becomes
The Heat Operator and Heat Kernel of the Hamming Graph
We now move to several coordinates. Put . This is the vertex set of the Hamming graph. Define the Hamming distance between two vertices by
In the Hamming graph, two vertices at distance are joined by an edge. This graph is the Cartesian product of copies of the one-coordinate complete graph.
Write the function space as . Using the one-coordinate space , this can be viewed as
For each coordinate , let be the operator that applies the one-coordinate Laplacian only in the coordinate and applies the identity operator in all other coordinates. Define the Laplacian on the Hamming graph by
This is the usual combinatorial Laplacian of the Hamming graph multiplied by . Indeed, the usual combinatorial Laplacian in the -coordinate direction is . Thus, if denotes the usual combinatorial Laplacian of the whole Hamming graph, then
Therefore the used in the text is the usual combinatorial Laplacian multiplied by , with the same normalisation as in the one-coordinate case. Since the operators commute with one another, putting gives
Thus the following operator is the natural heat operator on the Hamming graph. Under this identification, define the heat operator on the Hamming graph by
This applies the same one-coordinate heat operator independently in all coordinates. Although we use tensor-product notation here, the only concrete meaning needed in this note is the following. For and , set
and define its action on a function by
Thus even without knowing the general theory of tensor products, all trace computations below can be read using only this product formula. In particular, when , can be viewed as a Markov operator that independently, in each coordinate, either keeps the value or replaces it by a uniformly chosen value. This is an explanation as an expectation operator acting on functions, that is, observables. The same symmetric matrix can also be made to act on distributions. The probabilistic interpretation here is for intuition; in the proof, we use trace computations and polynomial identities.
Lemma 5.1.
The kernel of is given by
Proof
By the product formula just above, the kernel of is the product of the one-coordinate kernels. Namely,
A coordinate with contributes , while a coordinate with contributes . Since the number of coordinates in which and differ is , we obtain (5.1).
This formula shows that depends not on the particular values of and , but only on the Hamming distance . Thus is an operator compatible with the symmetries of the Hamming graph. In the language of association schemes, belongs to the Bose–Mesner algebra of the Hamming scheme. However, in this note we do not use the general theory of association schemes; we proceed only by computations with heat operators, heat kernels, and traces.
Translations on are defined in the same way. For , define by
This is the tensor product of the one-coordinate translations.
Proposition 5.2.
For every ,
holds.
Proof
The meaning of this proposition is transparent. If we take the trace of the heat operator after composing with the translation , the zero and non-zero coordinates of produce different factors. Thus the Hamming weight of is recorded in the trace. This is why the weight enumerator of appears later.
Traces on Quotient Spaces
Next, consider the action of the code by translations. As an additive group, acts on . Write the quotient set as . Functions on the quotient set can be identified with -invariant functions on .
Definition 6.1.
The subspace
of is called the space of -invariant functions.
The space can be identified with the function space on . Indeed, a -invariant function is constant on cosets, so it is the same thing as a function with one value for each coset. Concretely, given , define a function on by
Then is -invariant. Conversely, if , then is constant on each coset , and hence a function on is defined by
Through these two correspondences, we identify with . What we are doing here is not constructing a new quotient graph or its Laplacian separately. Rather, we identify the subspace of -invariant functions inside the original function space with the function space on the quotient set , and compute the trace of the restriction of the original heat operator . If one writes the kernel on the quotient set explicitly, then for cosets and it is
This expression is independent of the choice of representatives and . This is because is translation-invariant, and, as ranges over all of , the points range over the same coset. Taking the diagonal sum of this kernel on the quotient set is the same content as taking the trace of on . In the text, however, we do not define a quotient graph anew; instead, we compute the trace using the restriction to and the averaging projection. This trace does not count the points of individually. It is the trace in which each coset is counted as one degree of freedom. In other words, we are not simply summing diagonal entries over all points of ; rather, we are taking the trace corresponding to the degrees of freedom of the -invariant function space , namely to the number of cosets. This distinction is important when we later rewrite the trace by using the averaging projection as
Here the condition appearing in the definition is the same as . Indeed, means . Since is an additive subgroup, as ranges over all of , so does . Therefore and are equivalent.
Define the averaging projection from to by
This operator averages over translations by . Acting on functions, it is
This means that it averages the values on the coset . As ranges over all of , both and range over the same coset , so the sign difference does not affect the average. The operator averages the values of a function on each coset , and then extends that average value over the whole coset. Thus, once a function has been averaged, averaging it again does not change it. This is the intuition for , namely for the fact that it is a projection. The coefficient appears because we normalise the average in the translation directions by . A point of the quotient set appears inside as a coset consisting of points. By averaging the values along that coset direction, we extract from a function on only its -invariant part.
Lemma 6.2.
The operator is the projection onto . Moreover, commutes with every translation , and in particular with .
Proof
First, is -invariant. Indeed, for ,
Conversely, if , then for every , and so . Directly, we also compute
In the last equality we used the fact that, for each fixed , there are exactly pairs with . Hence is the projection onto .
Next, since the kernel of depends only on , it is invariant under translations. In other words, for all ,
Checking this with the sign convention included, we have
By translation-invariance of Hamming distance, , and hence . Therefore commutes with each . In particular, it also commutes with .
By this commutativity, if is -invariant, then is also -invariant. Thus preserves , and we can consider the restricted operator . Therefore, by Lemma 2.2,
Computing the right-hand side produces the weight enumerator of . From here, we compute the same left-hand side by averaging over translations and then summing the diagonal entries of the kernel.
Theorem 6.3 (Geometric Side of the Trace).
For a code ,
holds.
Proof
We call this computation the geometric side. We composed the heat operator with translations , took their traces, and averaged over . Since the weight of appeared inside the trace, the weight enumerator of appeared.
As a check on the coefficients, let us look at and . When , we have , so the left-hand side is
The right-hand side is
so the two agree. When , the one-coordinate heat operator becomes the averaging projection , and in several coordinates it corresponds to averaging over all of and producing a constant function. As heat time, this is not a finite time but the limit . On the other hand, the formulas here are polynomials in , so we may substitute . Thus on the quotient space only the constant direction contributes, and the left-hand side is . The right-hand side is also
This is not a proof, but a check that the coefficients in the formula fit naturally.
Spectral Side: Characters as Eigenfunctions
Next, we compute the same trace as a sum of eigenvalues. Here we use the eigenfunctions of the heat operator on the Hamming graph. These eigenfunctions are additive characters of the finite abelian group .
Fix a non-trivial additive character
of . That is, assume that holds and that is not identically . If , such a character can be constructed, for example, using the trace map in the form
In this expression, elements of are read as the representatives . The symbol appearing here is the trace map of finite fields, and is different from , which denotes the trace of an operator. In this text, the trace of an operator is written as , while the trace map of finite fields is written as . The property needed in this note is the following one-coordinate orthogonality relation.
Lemma 7.1.
For ,
holds.
Proof
If , then every term is , and so the sum is . Suppose . Then the map is a bijection of , and hence
Denote the right-hand side by . Since is non-trivial, there exists such that . As ranges over all of , so does , and therefore
Since , we have .
For , define a function by
We call this the character corresponding to . Here is the spatial variable, while may be regarded as an index representing a frequency on that space. The heat operator acts with a different multiplier for each such frequency .
Lemma 7.2.
The family of functions is an orthogonal basis of .
Proof
Put the standard inner product
on . We compute this standard inner product for . Character values are roots of unity in the complex numbers, so . Therefore, using complex conjugation, we obtain
By Lemma 7.1, if this value is , while if , then in at least one coordinate the inner sum is , and hence the whole product is . Thus is a family of mutually orthogonal non-zero vectors. Its cardinality is , so it is a basis.
The basis obtained here is an orthogonal basis, not an orthonormal basis. The norm of each is . However, the trace is the sum of the eigenvalues in an eigenspace decomposition, and it does not depend on the normalisation of eigenvectors. Thus there is no problem for the trace computations below even without replacing this by an orthonormal basis.
Next, we see that these functions are eigenfunctions of the heat operator.
Lemma 7.3.
For every ,
holds.
Proof
Work in one coordinate. For , define a function by
If , then is a constant function, so acts on it with eigenvalue . If , then by Lemma 7.1,
and hence . Therefore acts on with eigenvalue .
In several coordinates,
and . A coordinate with contributes the eigenvalue , and a coordinate with contributes the eigenvalue . Hence the total eigenvalue is .
Finally, we determine which characters remain inside the space of -invariant functions.
Lemma 7.4.
For , the character is -invariant if and only if .
Proof
The statement that is -invariant means that
holds for every and . But
so this is equivalent to for every .
If , then for every , and hence clearly . Conversely, suppose that for all . Assume that for some we have . Since is -linear, for every . Therefore must hold for every . But if , the map is a bijection of , so this would mean that is identically . This contradicts the non-triviality of . Hence for every , and so .
The reverse implication in the proof essentially uses the fact that is -linear. It is not the case that follows immediately from . A non-trivial additive character may send a non-zero element to . However, since is -linear, if , then all also lie in . Thus, if one assumes , then holds for all . The elements range over all of , which would mean that is identically , contradicting non-triviality.
We can now perform the spectral-side computation.
Theorem 7.5 (Spectral Side of the Trace).
For a code ,
holds.
Proof
By Lemma 7.2, every has a unique expansion
For ,
If , then for every , and hence
Since the characters are linearly independent, if , then for every . By Lemma 7.4, this is equivalent to . Conversely, if , then is -invariant. Therefore has as a basis. Indeed, the dimensions also agree. We have , and by linear algebra over finite fields, . This follows from the non-degeneracy of the standard inner product, which gives . Hence the number of degrees of freedom of functions on the quotient space agrees with the number of remaining characters.
Also, by Lemma 7.3, the operator has eigenvalue on . Therefore the trace of is
This is exactly .
We call this computation the spectral side. When the trace is computed using eigenfunctions of the heat operator, only the -invariant eigenfunctions remain, and these are indexed precisely by . Their eigenvalues are , so the weight enumerator of the dual code appears.
The MacWilliams Identity as a Trace Formula
We have now computed the same trace in two ways. The important point is that the spectral side and the geometric side are not computing different quantities; they are computing the trace of exactly the same operator in two ways. In one computation appears, and in the other computation appears. On the spectral side,
while on the geometric side,
Thus we obtain the following.
Theorem 8.1 (Inhomogeneous Form of the MacWilliams Identity).
For every linear code ,
holds.
Proof
It is enough to equate the right-hand sides of Theorem 7.5 and Theorem 6.3.
The equality used here has already been read as a polynomial identity in . To return to the usual two-variable form, we homogenise. The weight enumerator of a length code is a homogeneous polynomial whose terms all have total degree . Therefore, if , then
holds. Apply this property to and , put , and multiply both sides by . Then
is obtained. In the last equality, we used the fact that is a homogeneous polynomial of total degree . In other words, by multiplying both arguments simultaneously by the outer factor , we get
Both sides are polynomials in , and they agree on the range . Hence they agree as a polynomial identity, and the same equality also holds when . This is the MacWilliams identity.
Theorem 8.2 (MacWilliams Identity).
For a linear code over the finite field ,
holds.
In one sentence, the proof can be summarised as follows.
The MacWilliams identity is a trace formula saying that, when the heat operator on the Hamming graph is viewed on the quotient space , its spectral side gives the weight enumerator of , while its geometric side gives the MacWilliams transform of the weight enumerator of .
A Small Example: The Binary Repetition Code of Length Three
In the general theory above, the key point was that the same trace on the quotient space was computed in two ways, with appearing on the geometric side and appearing on the spectral side. Below, for the binary repetition code of length three, we directly check that the geometric side and the spectral side give the same polynomial. Let and , and consider the binary repetition code
Then
The dual code is the even-weight code
and
The matrix of the one-coordinate heat operator, that is, its heat kernel, is
Therefore, for ,
The elements of are and , so the trace on the geometric side is
On the other hand, on the spectral side the weights of the elements of are , , , and , so
Indeed, the two trace computations agree. After homogenising, we get
which confirms the MacWilliams identity.
What this example shows is as follows. The two translations and in shift the diagonal entries of the kernel of the heat operator and then read them. When the shift is zero, appears; when it is non-zero, appears. On the other hand, the eigenfunctions on the quotient space are precisely the characters invariant on , and these correspond to . Because these two readings compute the same trace, an identity of weight enumerators is produced.
What the Heat Operator, Heat Kernel, and Trace Formula Did in This Proof
Let us review the proof in this note.
First, we read the Hamming weight from the diagonal entries of the kernel of the heat operator after composing with a translation. When the one-coordinate heat operator is composed with a translation and we take the trace, depending on whether the translation amount is zero or non-zero, the factors
appear. In several coordinates, these are multiplied coordinate by coordinate, and the Hamming weight is recorded.
Second, we used the code as a translation group. Using the averaging projection by ,
we pass to the function space on , that is, to the space of -invariant functions. Computing the trace through this projection averages the contributions of the translations corresponding to codewords in . This is the geometric side.
Third, we diagonalised the heat operator on the Hamming graph using eigenfunctions. The eigenfunctions are characters of the form , and the corresponding eigenvalues are . On the quotient space, only the -invariant characters remain. These correspond exactly to . This is the spectral side.
Fourth, we computed the same trace in two ways. On the spectral side we sum the eigenvalues indexed by , while on the geometric side we average the contributions of translations by elements of . The result is the equality
This equation is the core of the proof in this note.
Thus the MacWilliams identity can be read as the following trace formula for a heat operator.
The dual code appears on the spectral side of the quotient space , while the original code appears on the geometric side, where the trace on the same quotient space is computed as a translation average.
This viewpoint has the same underlying principle as the usual character-theoretic proof, while emphasising the analytic form of “spectral side versus geometric side”.
Concepts Seen in This Part
In this part, while aiming at a proof of the MacWilliams identity, we introduced the basic tools of heat operators, heat kernels, and trace formulas. They can be organised as follows.
- Kernels on finite sets
These are the matrix entries of a linear operator on a finite set . On a finite set, a kernel is the matrix itself.
- Trace
This is the sum of the diagonal entries of an operator. It can also be computed as a sum of eigenvalues. The equality of these two viewpoints is the entrance to trace formulas.
- Heat operators and heat kernels
The operator constructed from a Laplacian as is the heat operator, and its matrix entries form the heat kernel. In this note, putting , we wrote the one-coordinate heat operator as
- Hamming graph
This is the graph with vertex set in which two vertices at distance are joined by an edge. It can be viewed as the coordinatewise Cartesian product of one-coordinate complete graphs.
- Translation action
This is the action that moves a point by for . On the function space, we let it act by .
- -invariant functions
These are functions unchanged by translations by the code . They are the same thing as functions on the quotient set .
- Averaging projection
This is the projection onto -invariant functions,
Using this projection, we brought the trace on the quotient space back to a trace on the original space.
- Spectral side
This is the side where the heat operator is diagonalised by eigenfunctions and the trace is computed. In this note, because the -invariant eigenfunctions were indexed by , the weight enumerator of the dual code appeared.
- Geometric side
This is the side where the diagonal entries of the kernel of the heat operator composed with translations are summed directly. In this note, according to the Hamming weight of , the factors and appeared coordinate by coordinate, and the weight enumerator of the original code appeared.
Looking Back at This Proof Family
From here on, we are no longer in the proof itself, but are placing this proof within the map of the series as a whole. This section is supplementary; it may be skipped without affecting the understanding of the proof in this note.
On the surface, the proof in this note is a proof using heat operators, heat kernels, and trace formulas. We constructed the heat operator on the Hamming graph and computed the trace obtained by viewing it on the quotient space in two ways. The spectral side produced , and the geometric side produced .
Compressed into the five families, this proof belongs to the
Fourier, character, and Poisson family.
The reason is that the eigenfunctions diagonalising the heat operator on the Hamming graph are characters of the finite abelian group . Indeed, once the eigenfunctions
are used, finite Fourier analysis is already in the background.
However, the viewpoint in this note has a different appearance from the usual character-theoretic proof. In the usual character-theoretic proof, one directly uses the orthogonality relation
In contrast, this note constructs the heat operator on the Hamming graph and computes its trace on the quotient space in two ways:
as a sum of eigenvalues and as a diagonal sum of kernels after composing with translations.
Thus the Fourier principle is repackaged in the analytic form of a trace formula.
The following supplement is not needed for reading this note, but for readers who know the language of association schemes, the heat operator here is an element of the Bose–Mesner algebra of the Hamming scheme. For that reason, this proof also touches the orthogonal-polynomial and association-scheme family. However, in the proof in this note, we placed in the foreground the viewpoint of computing the trace of a heat operator in two ways, rather than the general theory of Bose–Mesner algebras.
This difference is the phenomenon that this series aims to highlight: the same theorem can be seen in the languages of different fields. Even for the same MacWilliams identity,
one can see it as character orthogonality, or as a trace formula for a heat operator.
The proof in this note extracts that analytic form.
Next Time
At this point, the derivation of the MacWilliams identity from heat operators, heat kernels, and trace formulas, which is the main claim of this note, is complete. What follows is a preview within the series.
Next time, we look at the MacWilliams identity from the side of lattices and theta functions. In the proof in this note, we used heat operators and heat kernels on the finite set and derived the MacWilliams identity as a finite trace formula. Next time, through Construction A, which constructs a lattice from a code, we embed weight enumerators into theta functions. Then we will view the MacWilliams identity through the transformation formula for lattice theta functions, namely the continuous version of the Poisson summation formula.
The main actors next time are
Construction A → lattices → dual lattices → theta functions → the Poisson summation formula → the MacWilliams identity
Even though the proof belongs to the same Fourier, character, and Poisson family, next time the MacWilliams identity will appear not as a heat operator and heat kernel on a finite set, but as a transformation formula for continuous lattices and theta functions.
References
- [Chu97] Fan R. K. Chung. Spectral graph theory. Conference Board of the Mathematical Sciences, Washington, DC; by the American Mathematical Society, Providence, RI, vol. 92, pp. xii+207, 1997 ↩
- [GR01] Chris Godsil and Gordon Royle. Algebraic graph theory. Springer-Verlag, New York, vol. 207, pp. xx+439, 2001. doi:10.1007/978-1-4613-0163-9 ↩
- [CY99] Fan Chung and S.-T. Yau. Coverings, heat kernels and spanning trees. Electron. J. Combin., vol. 6, pp. Research Paper 12, 21, 1999. doi:10.37236/1444 ↩
- [Ter99] Audrey Terras. Fourier analysis on finite groups and applications. Cambridge University Press, Cambridge, vol. 43, pp. x+442, 1999. doi:10.1017/CBO9780511626265 ↩
- [MS77] F. J. MacWilliams and N. J. A. Sloane. The theory of error-correcting codes. II. North-Holland Publishing Co., Amsterdam-New York-Oxford, vol. Vol. 16, pp. i–ix and 370–762, 1977 ↩
This series
A Series Learning through the MacWilliams Identity · Part 10 of 12
Disclaimer
Articles on this site are based on the operator's personal understanding, investigation, and research notes. I try to keep the content accurate, but it may contain errors or incomplete explanations. I do not guarantee its accuracy, completeness, usefulness, or currentness.
Please use the information on this site at your own judgment and responsibility. To the extent permitted by law, the operator is not liable for damages, losses, or disadvantages arising from using, or being unable to use, information on this site.
If you notice an error, unclear explanation, broken link, or insufficient citation, please contact the operator. I will review the content and, when appropriate, correct, update, or remove it.