Article and note

A Series Learning through the MacWilliams IdentityPart 6 of 12

An Introduction to Pless Moments through the MacWilliams Identity

Among the five proof systems for the MacWilliams identity, this note focuses on the moment and double-counting approach, and introduces moments of the weight distribution, binomial moments, Pless power moment identities, double counting, and the recovery of a distribution from its moments.

Published:: Jun 1, 2026
Updated:: Jun 1, 2026
Reading time:: 20 min (about 4,251 words)

Tagscoding theoryMacWilliams identityPless power momentsmomentsdouble countingweight distributionfinite fieldsexpository note

Download PDF

Introduction

One of the fundamental theorems in coding theory is the MacWilliams identity. Let $E$ be the coordinate set, let $n = \card{E}$ , and let $C \leq \F_{q}^{E}$ be a linear code over the finite field $\F_{q}$ . For $C$ , consider its dual code

C^{\perp} \coloneqq \{ u \in \F_q^E : u \cdot c = 0 \text{ for all } c \in C \}

where

u \cdot c = \sum_{e \in E} u_e c_e

is the standard inner product. In this note, for a word $c \in \F_q^E$ , write

\supp(c)=\{e\in E: c_e\neq 0\}, \qquad \wt(c)=\card{\supp(c)}.

Define the weight enumerator of the linear code $C$ by

W_{C}(X, Y) \coloneqq \sum_{c \in C} X^{n - \wt(c)} Y^{\wt(c)}.

The MacWilliams identity is the formula saying that the weight enumerator of the dual code can be computed from the weight enumerator of $C$ as follows:

W_{C^{\perp}}(X,Y) = \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr).

This is the MacWilliams identity.

In this series, I use proofs of the MacWilliams identity as a guide to an introduction to neighbouring areas and concepts. Accordingly, this series is aimed at readers who already know the following basic material in coding theory:

We assume familiarity, at least at a basic level, with

what a (finite) field is,
what a linear code over a finite field is,
what the Hamming weight is,
what the dual code is.

(There is no problem if you do not know a proof of the MacWilliams identity.)

This note does not assume Parts 1–5. Krawtchouk polynomials and association schemes appear in the background at some points, but this note does not use their theory. The only tools needed here are weight distributions, moments, binomial coefficients, double counting, and finite-dimensional linear algebra.

In this series, I look at proof methods for the MacWilliams identity under the following five broad families.

Fourier, character, and Poisson methods.
Möbius inversion, lattice-theoretic, and shortening/puncturing methods.
Orthogonal-polynomial and association-scheme methods.
Matroid and Tutte-polynomial methods.
Moment and double-counting methods.

In this note, I focus on the last of these: the moment and double-counting approach. The aim of this note is to use a proof of the MacWilliams identity as a guide to an introduction to Pless moments, especially moments of the weight distribution and double counting.

Write the weight distribution as

A_{w}(C) = \card{\{ c \in C: \wt(c) = w \}} \qquad (0 \leq w \leq n).

Then the weight enumerator is

W_{C}(X, Y) = \sum_{w = 0}^{n} A_{w}(C) X^{n-w} Y^{w}.

From another point of view, using Krawtchouk polynomials and Hamming schemes, one can regard the MacWilliams identity as a transform of the weight distribution itself. Here a Hamming scheme is, roughly speaking, the combinatorial structure obtained by classifying two words according to their Hamming distance. However, this note does not use its theoretical definition or general theory. Here, instead of transforming the weight distribution directly, we first look at

\sum_{w = 0}^{n} A_{w}(C) w^{h} \quad\text{and}\quad \sum_{w=0}^{n} A_{w}(C) \binom{w}{r}

as moments.

In probability theory, one studies means, variances, and higher moments in order to understand a distribution. We shall do the same here. However, the moments used in this note are not expectations obtained by normalising to a probability distribution. They are moments as sums over all codewords. If one wants to view them as probabilistic means or expectations, one should divide these quantities by $\card{C}$ . That said, in coding theory, moments involving binomial coefficients $\binom{w}{r}$ are more natural than moments involving ordinary powers $w^{h}$ . The reason is that $\binom{\wt(c)}{r}$ is the number of ways to choose $r$ coordinates from the non-zero coordinates of the codeword $c$ . Therefore

\sum_{w=0}^{n}A_{w}(C)\binom{w}{r}

counts pairs consisting of a codeword and a coordinate set. By counting this quantity in two ways, Pless moment identities appear.

The flow of this note is as follows.

weight distribution $\to$ binomial moments $\to$ moment generating function $\to$ double counting $\to$ Pless moment identities $\to$ MacWilliams identity

In Pless's classical paper [Ple63], power moment identities are derived from the MacWilliams identity, and applications are also given. In this note, I do not take the MacWilliams identity as known. Instead, I compute the binomial moments directly by double counting and explain how the MacWilliams identity can be recovered from sufficiently many moments. For the classical treatment of the weight distribution of the dual code and the MacWilliams equations, see MacWilliams–Sloane [MS77, Chapter 5]. For equivalent formulations including the MacWilliams equations and Pless moments, Huffman–Pless [HP03, Sections 7.1–7.2] is a standard reference. For a textbook treatment of Pless moments, see also Pless [Ple98, Chapter 8].

Weight Distributions and Moments

First, view the weight distribution as a sequence. Let $C \leq \F_{q}^{E}$ be a linear code, and write

A_{w} \coloneqq A_{w}(C) = \card{\{ c \in C : \wt(c) = w \}} \qquad (0\leq w\leq n).

This sequence $(A_{0}, A_{1}, \dots, A_{n})$ is the weight distribution of $C$ .

In order to collect the weight distribution into a one-variable polynomial, set

P_{C}(t) \coloneqq W_{C}(1, t) = \sum_{w=0}^{n} A_{w} t^{w}.

This $P_{C}(t)$ is the dehomogenisation of the weight enumerator $W_{C}(X,Y)$ . Indeed, we may read

W_C(X, Y) = X^{n} P_{C}(Y/X).

Here $Y/X$ is a formal substitution, and in the end both sides are compared as polynomials.

Definition 2.1.

For a non-negative integer $h$ , define the $h$ -th power moment of $C$ by

M_{h}^{\mathrm{pow}}(C) \coloneqq \sum_{w = 0}^{n} A_{w}(C) w^{h}.

When $h = 0$ , we use the convention $0^{0} = 1$ , and hence

M_{0}^{\mathrm{pow}}(C) = \sum_{w=0}^{n} A_{w}(C) = \card{C}.

When $h = 1$ , this is the total weight of all codewords,

\sum_{c \in C} \wt(c).

When $h=2$ , it is the sum of the squares of the weights.

In coding theory, however, the following binomial moments are easier to handle than power moments.

Definition 2.2.

For $0 \leq r \leq n$ , define the $r$ -th binomial moment of $C$ by

M_{r}(C) \coloneqq \sum_{w = 0}^{n} A_{w}(C) \binom{w}{r}.

Here we use the convention $\binom{w}{r}=0$ for $w < r$ . The meaning of a binomial moment is very concrete. If we fix a codeword $c \in C$ , then $\binom{\wt(c)}{r}$ is the number of ways to choose $r$ coordinates from $\supp(c)$ . Thus the $r$ -th binomial moment $M_{r}(C)$ counts pairs of the following form:

(c, S) \quad\text{where}\quad c \in C, \quad S \subseteq \supp(c), \quad \card{S} = r.

This viewpoint of counting pairs is the entrance to the Pless moment identity.

The Binomial-Moment Generating Function

Binomial moments appear as the coefficients when the polynomial $P_{C}(t)$ is expanded around $t = 1$ .

Theorem 3.1.

Let $P_{C}(t) = \sum_{w = 0}^{n} A_{w}(C) t^{w}$ . Then

P_{C}(1 + z) = \sum_{r = 0}^{n} M_{r}(C) z^{r}

holds.

Proof

By the binomial theorem,

(1 + z)^{w} = \sum_{r = 0}^{w} \binom{w}{r} z^{r}.

Therefore

\begin{aligned} P_{C}(1 + z) &= \sum_{w = 0}^{n} A_{w}(C)(1 + z)^{w} \\ &= \sum_{w = 0}^{n} A_{w}(C) \sum_{r = 0}^{w} \binom{w}{r} z^{r} \\ &= \sum_{r = 0}^{n} \left( \sum_{w = 0}^{n} A_{w}(C) \binom{w}{r} \right)z^r \\ &= \sum_{r = 0}^{n} M_{r}(C) z^{r}. \end{aligned}

This theorem shows that the binomial moments determine the distribution completely.

Theorem 3.2.

For a sequence $a_{0}, a_{1}, \dots, a_{n}$ , set

m_{r} = \sum_{w=0}^{n} a_{w} \binom{w}{r} \qquad(0 \leq r\leq n).

Then $a_{0}, a_{1}, \dots, a_{n}$ can be recovered uniquely from $m_{0}, m_{1}, \dots, m_{n}$ . Explicitly,

a_{w} = \sum_{r=w}^{n}(-1)^{r-w}\binom{r}{w} m_{r} \qquad(0\leq w\leq n).

Proof

Put $P(t) = \sum_{w=0}^{n} a_{w} t^{w}$ . The assumption is precisely that

P(1 + z) = \sum_{r = 0}^{n} m_{r} z^{r}

holds. Substituting $z = t - 1$ , we get

P(t) = \sum_{r = 0}^{n} m_{r}(t - 1)^{r}.

Taking the coefficient of $t^{w}$ on the right-hand side gives

a_{w} = \sum_{r=w}^{n} m_{r} \binom{r}{w}(-1)^{r-w}.

This theorem is important in the proof in this note. The MacWilliams identity is a formula relating entire weight distributions. However, even without handling the whole weight distribution directly, one can recover the weight distribution once all binomial moments are known. Therefore, to find the weight distribution of the dual code, it is enough to find all binomial moments of the dual code.

Counting Binomial Moments in Two Ways

We now count binomial moments. Let $C$ be an $[n,k]$ linear code over $\F_{q}$ , that is, let $\dim_{\F_{q}} C = k$ . Thus $\card{C} = q^{k}$ . We write the number of dual codewords of weight $j$ simply as $A_{j}(C^{\perp})$ .

The binomial moment $\displaystyle M_{r}(C) = \sum_{w = 0}^{n} A_{w}(C) \binom{w}{r}$ counts pairs

(c, S), \qquad c \in C, \quad S \subseteq \supp(c), \quad \card{S} = r.

We first count these pairs with $S$ fixed.

Definition 4.1.

For $S \subseteq E$ , define

N_{C}(S) \coloneqq \card{\{ c \in C: c_{e} \neq 0 \text{ for all } e \in S\}}.

With this notation,

M_{r}(C) = \sum_{\substack{S \subseteq E \\\card{S} = r}} N_{C}(S). \tag{4.1}

For fixed $S$ , the number $N_{C}(S)$ is the number of codewords for which all coordinates in $S$ are non-zero. Notice that no condition is imposed on the coordinates outside $S$ . Thus $N_{C}(S)$ is not the number of codewords whose support is exactly $S$ ; it is the number of codewords which are non-zero at least on $S$ . The condition of being non-zero is not a linear condition, so we first rewrite it in terms of the linear condition of specifying which coordinates are zero.

For $T \subseteq S$ , consider all codewords whose coordinates in $T$ are all zero. This is a linear subspace. Consider the restriction map to the coordinates in $T$ ,

\rho_{T} \colon C \to \F_{q}^{T}, \qquad c \mapsto c|_{T}.

Its kernel is

\Ker(\rho_{T}) = \{ c \in C : c_{e} = 0 \text{ for all } e \in T\}.

If $Z_{e} = \{ c \in C : c_{e} = 0\}$ , then $N_{C}(S)$ is the number of codewords which do not belong to any of the $Z_{e}$ with $e \in S$ . Therefore, by inclusion–exclusion,

N_{C}(S) = \sum_{T \subseteq S}(-1)^{\card{T}}\card{\Ker(\rho_{T})}. \tag{4.2}

Next, we write $\card{\Ker(\rho_{T})}$ using information from the dual code. Let

C^{\perp}(T) \coloneqq \{ u \in C^{\perp}:\supp(u) \subseteq T\}

denote the dual codewords whose support is contained in $T$ . This is not the shortened code of $C^{\perp}$ on $T$ itself. It is the subspace consisting of those codewords on $E$ whose support is contained in $T$ .

Lemma 4.2.

For any $T \subseteq E$ ,

\card{\Ker(\rho_{T})} = q^{k - \card{T}} \card{C^{\perp}(T)}

holds.

Proof

Write $\rho_{T}(C)$ for the image of the restriction map $\rho_{T} \colon C \to \F_{q}^{T}$ . By the first isomorphism theorem,

\card{\Ker(\rho_{T})} = \frac{\card{C}}{\card{\rho_{T}(C)}} = \frac{q^{k}}{\card{\rho_{T}(C)}}.

On the other hand, with respect to the standard inner product on $\F_{q}^{T}$ , the space $C^{\perp}(T)$ can be identified with the orthogonal complement of $\rho_{T}(C)$ . Indeed, if $u \in \F_{q}^{T}$ is extended by zero to $E$ , and this extension is denoted by $\widetilde{u}$ , then

\begin{aligned} u \in \rho_{T}(C)^{\perp} &\Longleftrightarrow u \cdot c|_{T} = 0 \quad\text{for all } c \in C \\ &\Longleftrightarrow \widetilde{u} \cdot c = 0 \quad\text{for all } c \in C \\ &\Longleftrightarrow \widetilde u \in C^{\perp}. \end{aligned}

Since $\widetilde{u}$ is zero outside $T$ , this is equivalent to $\widetilde{u} \in C^{\perp}(T)$ . In general, for a subspace $D$ of the finite-dimensional space $\F_{q}^{T}$ , we have $\dim D + \dim D^{\perp} = \card{T}$ . Hence $\card{D}\card{D^{\perp}} = q^{\card{T}}$ . Applying this to $D = \rho_{T}(C)$ gives

\card{\rho_{T}(C)}\card{C^{\perp}(T)} = q^{\card{T}}.

Substituting this into the equation above, we obtain

\card{\Ker(\rho_{T})} = q^{k} \frac{\card{C^{\perp}(T)}}{q^{\card{T}}} = q^{k-\card{T}}\card{C^{\perp}(T)}.

This lemma is the linear-algebraic core of the double counting. The size of the kernel of the coordinate restriction map of $C$ is expressed by the number of codewords in $C^{\perp}$ whose support is contained in $T$ . The right-hand side contains $q^{k-\card{T}}$ , so when $\card{T} > k$ it may look as if a fraction has appeared. However, this is just a rewriting of $\card{\Ker(\rho_{T})}=\card{C}/\card{\rho_{T}(C)}$ using the size of an orthogonal complement, and in the end it is always an integer. Equivalently, in that case $\card{C^{\perp}(T)}$ contains the missing power of $q$ .

Pless's Binomial Moment Identity

From the preparation in the previous section, we obtain a formula for binomial moments. This is the binomial-moment version of the Pless moment identity.

Theorem 5.1 (Pless's binomial moment identity).

Let $C \leq \F_{q}^{E}$ be an $[n, k]$ linear code. Then, for $0 \leq r \leq n$ ,

\sum_{w=0}^{n} A_{w}(C)\binom{w}{r} = q^{k - r} \sum_{j = 0}^{r} (-1)^{j}(q - 1)^{r-j} \binom{n - j}{r - j} A_{j}(C^{\perp}) \tag{5.1}

holds.

Proof

Before entering the proof, let us organise the roles of the three sets. The set $S$ is the set of $r$ coordinates on which the codeword $c$ is required to be non-zero. The set $T$ is the set of coordinates which are specified to be zero in the inclusion–exclusion argument. The set $U$ is the support of a dual codeword $u \in C^{\perp}$ . The condition $U \subseteq T \subseteq S$ means that a dual codeword with support $U$ is counted in $C^{\perp}(T)$ , and that this $T$ is used in the inclusion–exclusion argument.

Write the left-hand side as $M_{r}(C)$ . By (4.1) and (4.2),

\begin{aligned} M_{r}(C) &= \sum_{\substack{S \subseteq E\\\card{S} = r}} \sum_{T \subseteq S}(-1)^{\card{T}}\card{\Ker(\rho_{T})}. \end{aligned}

Substituting Lemma 4.2, we get

M_{r}(C) = \sum_{\substack{S \subseteq E\\\card{S} = r}} \sum_{T \subseteq S} (-1)^{\card{T}}q^{k-\card{T}}\card{C^{\perp}(T)}.

Now split the codewords of $C^{\perp}$ according to their supports. For $U \subseteq E$ , set

A_{C^{\perp}}(U) \coloneqq \card{\{ u \in C^{\perp}:\supp(u) = U \}}.

This is an auxiliary notation for the number of codewords with the support $U$ itself fixed, not the number $A_j(C^{\perp})$ with only the weight fixed. Then

\card{C^{\perp}(T)} = \sum_{U \subseteq T}A_{C^{\perp}}(U).

Therefore

\begin{aligned} M_{r}(C) &= \sum_{U \subseteq E}A_{C^{\perp}}(U) \sum_{\substack{S \subseteq E,\ \card{S} = r}} \sum_{\substack{T: U \subseteq T \subseteq S}} (-1)^{\card{T}}q^{k-\card{T}} \end{aligned}

follows.

Fix $U$ and put $\card{U} = j$ . From now on, count the contribution of all $S,T$ satisfying $U \subseteq T \subseteq S$ . If $j > r$ , then there is no $S$ with $U \subseteq S$ and $\card{S} = r$ , so the contribution is $0$ . Hence assume $j \leq r$ . Once $S$ satisfies $U\subseteq S$ and $\card{S}=r$ , the set $T$ runs through all sets with $U \subseteq T \subseteq S$ . Writing $T = U \cup L$ , where $L \subseteq S \setminus U$ , the inner sum is

\begin{aligned} \sum_{L \subseteq S \setminus U} (-1)^{j + \card{L}} q^{k - j - \card{L}} &= (-1)^{j} q^{k - j} \sum_{L \subseteq S \setminus U}(-q^{-1})^{\card{L}} \\ &= (-1)^{j} q^{k-j} \left( 1 - \frac{1}{q} \right)^{r-j} \\ &= (-1)^{j} q^{k - r}(q - 1)^{r - j}. \end{aligned}

The number of $r$ -element subsets $S$ containing $U$ is $\binom{n-j}{r-j}$ . Therefore the total contribution of dual codewords with support $U$ is

A_{C^{\perp}}(U) (-1)^{j} q^{k-r}(q-1)^{r-j}\binom{n-j}{r-j}.

Finally, summing over all $U$ with $\card{U} = j$ , we have

\sum_{\substack{U \subseteq E \\ \card{U} = j}}A_{C^{\perp}}(U) = A_{j}(C^{\perp}).

Hence

M_{r}(C) = q^{k-r} \sum_{j = 0}^{r} (-1)^{j}(q - 1)^{r - j} \binom{n - j}{r - j}A_{j}(C^{\perp})

follows.

This formula has quite a meaningful form. The left-hand side is the $r$ -th binomial moment of $C$ . The right-hand side involves only the numbers of codewords in $C^{\perp}$ of weights $0, 1, \dots, r$ . Thus the low moments of $C$ are controlled by the low-weight part of the dual code.

For example, when $r = 0$ , the formula is

\sum_{w = 0}^{n} A_{w}(C) = q^{k} A_{0}(C^{\perp}) = q^{k}.

This is simply $\card{C} = q^{k}$ . When $r = 1$ , we get

\sum_{w = 0}^{n} w A_{w}(C) = q^{k - 1}\bigl( (q - 1)n - A_{1}(C^{\perp}) \bigr).

If $C^{\perp}$ has no codeword of weight $1$ , then

\sum_{w = 0}^{n} wA_{w}(C) = q^{k - 1}(q - 1)n.

The absence of a dual codeword of weight $1$ means that the map $c \mapsto c_{e}$ to each coordinate is not the zero map. A non-zero linear map $C \to \F_{q}$ is surjective, so at each coordinate $0$ appears $q^{k - 1}$ times, and non-zero values appear altogether $(q - 1)q^{k - 1}$ times. Summing this over all coordinates gives the expression $q^{k - 1}(q - 1)n$ above. This agrees with the intuition that, on average, each coordinate is non-zero in the ratio $q - 1$ to $1$ .

When $r = 2$ , the formula is

\begin{aligned} \sum_{w = 0}^{n} A_{w}(C)\binom{w}{2} &= q^{k-2}\Bigl( (q-1)^2\binom{n}{2}\\ &\qquad - (q - 1)(n - 1)A_{1}(C^{\perp}) + A_{2}(C^{\perp}) \Bigr). \end{aligned}

Here the codewords of weights $1$ and $2$ in the dual code appear as correction terms for the second moment.

Pless Identities as Power Moment Identities

Pless identities are often written not in terms of binomial moments, but in the form of power moments $\sum_{w = 0}^{n} A_{w}(C) w^{h}$ . To pass from binomial moments to power moments, we use Stirling numbers.

Definition 6.1.

For non-negative integers $h$ and $r$ , define the Stirling number of the second kind $\Stirling{h}{r}$ by

x^{h} = \sum_{s=0}^{h}\Stirling{h}{s} x(x - 1) \dotsm (x - s + 1).

The term with $s = 0$ is read as $1$ when $h = 0$ , and as $0$ when $h > 0$ .

Since

x(x - 1)\dotsm(x - r + 1) = r! \binom{x}{r},

we have

x^{h} = \sum_{r=0}^{h} r! \Stirling{h}{r} \binom{x}{r}. \tag{6.1}

Theorem 6.2 (Pless power moment identities).

Let $C \leq \F_{q}^{E}$ be an $[n,k]$ linear code. Then, for any non-negative integer $h$ ,

\begin{aligned} \sum_{w = 0}^{n} A_{w}(C) w^{h} = \sum_{r=0}^{\min(h,n)} r!\Stirling{h}{r} q^{k-r} \sum_{j=0}^{r} (-1)^{j}(q - 1)^{r - j} \binom{n - j}{r - j} A_{j}(C^{\perp}) \tag{6.2} \end{aligned}

holds.

Proof

Substitute $x = w$ into (6.1) and take the weighted sum with respect to the weight distribution. This gives

\begin{aligned} \sum_{w = 0}^{n} A_{w}(C) w^{h} &= \sum_{w=0}^{n} A_{w}(C) \sum_{r=0}^{h} r!\Stirling{h}{r}\binom{w}{r} \\ &= \sum_{r=0}^{\min(h,n)} r! \Stirling{h}{r} \sum_{w=0}^{n} A_{w}(C)\binom{w}{r}. \end{aligned}

Now substitute Theorem 5.1 (Pless's binomial moment identity)Let $C \leq \F_{q}^{E}$ be an $[n, k]$ linear code. Then, for $0 \leq r \leq n$ , $\sum_{w=0}^{n} A_{w}(C)\binom{w}{r} = q^{k - r} \sum_{j = 0}^{r} (-1)^{j}(q - 1)^{r-j} \binom{n - j}{r - j} A_{j}(C^{\perp}) \tag{5.1}$ holds.Theorem 5.1.

This is the classical power moment identity originating in Pless [Ple63]. Its appearance is a little complicated, but its structure is simple. First, expand the power $w^{h}$ as a linear combination of binomial coefficients $\binom{w}{r}$ . Then compute the binomial moments by double counting.

What is especially important is that only the low-weight distribution of the dual code appears on the right-hand side. When $h \leq n$ , it is enough to know the numbers of codewords in $C^{\perp}$ of weights $0, 1, \dots, h$ . In general, the required weights are $0, 1, \dots, \min(h,n)$ . This property is very useful when determining the weight distribution of concrete codes.

Recovering the MacWilliams Identity from Moments

So far, we have proved Pless's binomial moment identity. Next, we recover the MacWilliams identity from this identity. The key point is to collect the binomial moments into a generating function.

If $C$ is an $[n,k]$ code, then $C^{\perp}$ is an $[n,n-k]$ code. Apply Theorem 5.1 (Pless's binomial moment identity)Let $C \leq \F_{q}^{E}$ be an $[n, k]$ linear code. Then, for $0 \leq r \leq n$ , $\sum_{w=0}^{n} A_{w}(C)\binom{w}{r} = q^{k - r} \sum_{j = 0}^{r} (-1)^{j}(q - 1)^{r-j} \binom{n - j}{r - j} A_{j}(C^{\perp}) \tag{5.1}$ holds.Theorem 5.1 to $C^{\perp}$ . Since $(C^{\perp})^{\perp} = C$ , if we write $A_{w} = A_{w}(C)$ , then the binomial moments of $C^{\perp}$ are

M_{r}(C^{\perp}) = q^{n - k - r} \sum_{w = 0}^{r} (-1)^{w}(q - 1)^{r - w} \binom{n - w}{r - w} A_{w}. \tag{7.1}

On the other hand, by Theorem 3.1,

P_{C^{\perp}}(1 + z) = \sum_{r=0}^{n} M_{r}(C^{\perp})z^{r}.

Substituting (7.1), we obtain

\begin{aligned} P_{C^{\perp}}(1 + z) &= \sum_{r = 0}^{n} q^{n - k - r} \sum_{w = 0}^{r} (-1)^{w} (q - 1)^{r - w} \binom{n - w}{r - w} A_{w} z^{r} \\ &= \sum_{w = 0}^{n} A_{w}(-1)^{w} \sum_{r = w}^{n} q^{n - k - r}(q - 1)^{r - w} \binom{n - w}{r - w} z^{r}. \end{aligned}

Putting $r = w + s$ , the inner sum becomes

\begin{aligned} &\sum_{s = 0}^{n - w} q^{n - k - w - s}(q - 1)^{s} \binom{n-w}{s} z^{w + s} \\ ={}& q^{-k} z^{w} \sum_{s = 0}^{n - w} \binom{n - w}{s} q^{n - w - s}(q - 1)^{s} z^{s} \\ ={}& q^{-k} z^{w} \bigl( q + (q - 1)z \bigr)^{n - w}. \end{aligned}

Therefore

P_{C^{\perp}}(1 + z) = \frac{1}{\card{C}} \sum_{w = 0}^{n} A_{w}(-z)^{w} \bigl( q + (q - 1)z \bigr)^{n - w}. \tag{7.2}

Now set $t \coloneqq 1 + z$ . Then

-z = 1 - t, \qquad q + (q - 1)z = 1 + (q - 1)t,

so (7.2) becomes

P_{C^{\perp}}(t) = \frac{1}{\card{C}} \sum_{w=0}^{n} A_{w}(C) \bigl( 1 + (q - 1)t \bigr)^{n-w}(1 - t)^{w}. \tag{7.3}

Finally, we homogenise this one-variable identity. Since $P_{C^{\perp}}(t) = W_{C^{\perp}}(1, t)$ , formally putting $t = Y/X$ and multiplying both sides by $X^{n}$ gives

\begin{aligned} W_{C^{\perp}}(X, Y) &= \frac{1}{\card{C}} \sum_{w = 0}^{n} A_{w}(C) \bigl( X + (q - 1)Y \bigr)^{n - w}(X - Y)^{w} \\ &= \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr). \end{aligned}

This is a calculation as a polynomial identity. Thus the MacWilliams identity has been obtained.

Theorem 7.1 (MacWilliams identity).

Let $C \leq \F_{q}^{E}$ be a linear code. Then

W_{C^{\perp}}(X, Y) = \frac{1}{\card{C}} W_{C}\bigl( X + (q - 1)Y, X - Y \bigr)

holds.

In this proof, we did not use Krawtchouk polynomials by name. However, in the process of collecting binomial moments into a generating function, essentially the same transform coefficients appear in a different guise. The difference is that, in the present viewpoint, we do not transform the weight distribution all at once. Instead, we first compute all moments and then recover the distribution from them.

A Small Example: The Binary Repetition Code of Length $3$

Let us recover the weight distribution of a dual code from moments in a small example. Consider the repetition code over $\F_{2}^{3}$

C = \{ 000, 111 \}.

This is a $[3,1]$ code, and its weight distribution is

(A_{0}, A_{1}, A_{2}, A_{3}) = (1, 0, 0, 1).

The dual code $C^{\perp}$ is the set of all words of even weight,

C^{\perp} = \{ 000, 011, 101, 110 \},

so the answer should be

(1, 0, 3, 0).

Here we recover this from Pless moments.

The code $C^{\perp}$ is a $[3,2]$ code. Applying Pless's binomial moment identity to $C^{\perp}$ and using $(C^{\perp})^{\perp} = C$ , we get

M_{r}(C^{\perp}) = 2^{2-r} \sum_{w=0}^{r}(-1)^{w} \binom{3-w}{r-w}A_{w}(C).

Here $q = 2$ , so $(q - 1)^{r - w} = 1$ .

We compute these in order. For $r = 0$ ,

M_{0}(C^{\perp}) = 2^{2} A_{0}(C) = 4.

For $r = 1$ ,

M_{1}(C^{\perp}) = 2\left(\binom{3}{1}A_{0}(C) - A_{1}(C)\right) = 2 \cdot 3 = 6.

For $r = 2$ ,

M_{2}(C^{\perp}) = \binom{3}{2} A_{0}(C) - \binom{2}{1} A_{1}(C) + A_{2}(C) =3.

For $r = 3$ ,

M_{3}(C^{\perp}) = 2^{-1}\left(A_{0}(C) - A_{1}(C) + A_{2}(C) - A_{3}(C) \right) = 0.

Therefore

P_{C^{\perp}}(1 + z) = 4 + 6z + 3z^{2}.

Putting $z = t - 1$ , we get

P_{C^{\perp}}(t) = 4 + 6(t - 1) + 3(t - 1)^{2} = 1 + 3t^{2}.

Thus

P_{C^{\perp}}(t) = 1 + 3t^{2},

and the weight distribution is recovered as

(A_{0}(C^{\perp}), A_{1}(C^{\perp}), A_{2}(C^{\perp}), A_{3}(C^{\perp})) = (1, 0, 3, 0).

In this example, we confirmed the answer by listing the elements of the dual code directly. From the viewpoint of Pless moments, however, we first computed $M_{0}(C^{\perp})$ , $M_{1}(C^{\perp})$ , $M_{2}(C^{\perp})$ , $M_{3}(C^{\perp})$ , then recovered $P_{C^{\perp}}(t)$ , and finally read off the weight distribution. The flow is not to guess the distribution directly, but to determine it through its moments.

What Were Pless Moments Doing in This Proof?

Let us organise the role played by Pless moments in the proof.

First, we introduced the viewpoint of studying the weight distribution through moments. Instead of handling the weight distribution $(A_{0}, A_{1}, \dots, A_{n})$ directly, we looked at

\sum_{w=0}^{n} A_{w} w^{h} \quad\text{and}\quad \sum_{w=0}^{n} A_{w} \binom{w}{r}.

This is the idea of studying a distribution through sum-type quantities which, after normalisation, correspond to means and higher means, and through binomial moments.

Second, binomial moments naturally became counting problems. The quantity $\displaystyle \sum_{w=0}^{n} A_{w}(C) \binom{w}{r}$ counts pairs consisting of a codeword $c \in C$ and an $r$ -element subset $S$ contained in its non-zero coordinates. For this reason, binomial moments are well suited to double counting.

Third, during the double counting, the low-weight distribution of the dual code appeared. To count codewords whose coordinates in $S$ are all non-zero, we used inclusion–exclusion, and expressed the kernel of a coordinate restriction map using the part of the dual code with small support. As a result, the $r$ -th binomial moment involved only the numbers of codewords in $C^{\perp}$ of weights $0, 1, \dots, r$ .

Fourth, collecting all binomial moments allowed us to recover the whole weight distribution. Using the generating function

P_{C}(1 + z) = \sum_{r = 0}^{n} M_{r}(C) z^{r},

the moment sequence is precisely the sequence of coefficients in the expansion of the weight distribution polynomial around $t = 1$ . Thus sufficiently many moments determine the distribution itself.

The point of the proof in this note can be summarised in the following sentence.

The MacWilliams identity is not only a formula which directly transforms the weight distribution; it is also the generating-function version of the Pless moment identity saying that the low-weight distribution of the dual code controls the moments of the original code.

Concepts Seen in This Part

In this part, while aiming at a proof of the MacWilliams identity, we introduced the basic tools of Pless moments. They may be organised as follows.

Weight distribution

For a code $C$ , this is the sequence defined by

A_{w}(C) = \card{\{ c \in C : \wt(c) = w \}}.

The weight enumerator collects this sequence into a two-variable polynomial.

Power moment

This is a quantity of the form

\sum_{w = 0}^{n} A_{w}(C) w^{h}.

It measures the weight distribution using the $h$ -th powers of the weights.

Binomial moment

This is the quantity defined by

M_{r}(C) = \sum_{w = 0}^{n} A_{w}(C) \binom{w}{r}.

It counts pairs consisting of a codeword and an $r$ -element subset of its non-zero coordinates.

Moment generating function

For the dehomogenised weight enumerator $P_{C}(t) = W_{C}(1, t)$ , we have

P_{C}(1 + z) = \sum_{r = 0}^{n} M_{r}(C) z^{r}.

Binomial moments are the coefficients in the expansion of $P_{C}(t)$ around $t = 1$ .

Double counting

We viewed $M_{r}(C)$ in two ways: by counting from codewords, and by first fixing a coordinate set. In the latter method, inclusion–exclusion and coordinate restriction maps appear.

Pless's binomial moment identity

If $C$ is an $[n,k]$ code, then

\sum_{w=0}^{n} A_{w}(C)\binom{w}{r} = q^{k-r} \sum_{j=0}^{r} (-1)^{j} (q - 1)^{r - j} \binom{n - j}{r - j} A_{j}(C^{\perp})

holds. The $r$ -th binomial moment is determined only by the distribution of the dual code from weight $0$ through weight $r$ .

Pless power moment identities

By expanding the power $w^{h}$ as a linear combination of binomial coefficients $\binom{w}{r}$ , one obtains a formula for power moments from the binomial moment identity. The Stirling numbers of the second kind appear as the conversion coefficients.

From the viewpoint of Pless moments, the MacWilliams identity is not merely a formula which changes the variables of the weight enumerator. It also summarises the counting structure by which each moment of the weight distribution of a code is controlled by low-weight codewords in the dual code.

Review of This Proof Family

As mentioned at the beginning, the proof in this note belongs to the

moment and double-counting approach

family. On the surface, it is a proof which computes moments of the weight distribution. At a deeper level, however, it does not transform the distribution itself directly. Instead, it counts the quantities defined by that distribution in two ways, and then recovers the distribution from sufficiently many moments.

The point of this proof is the following sentence.

Reinterpret the weight distribution as a moment sequence, and connect those moments to the low-weight distribution of the dual code by double counting.

In the Fourier and character approach, the dual code appears from the orthogonality relations of characters. In a proof using the language of association schemes, one treats the relations classified by Hamming distance as matrices and reads off the transform of the weight distribution from their simultaneous eigenspace decomposition. This note did not use that matrix algebra; instead, it extracted the same transform from counting moments. In the present moment approach, the dual code appears in the process of counting kernels of coordinate restriction maps. Even for the same MacWilliams identity, the form looks quite different because the quantities being observed are different.

Next Time

Next time, we shall look at the MacWilliams identity from the side of matroids and the Tutte polynomial.

In the proof in this note, we studied moments of the weight distribution by double counting. Next time, we shall consider the combinatorial structure determined by the columns of a generator matrix of a linear code, namely a matroid. The weight enumerator of the code is expressed as a specialisation of the Tutte polynomial of the corresponding matroid. The dual code corresponds to the dual matroid, and the MacWilliams identity emerges from the duality of the Tutte polynomial.

Thus the protagonist next time will be

generator matrix $\to$ vector matroid $\to$ Tutte polynomial $\to$ dual matroid $\to$ MacWilliams identity

The same MacWilliams identity will appear, this time as the duality of a matroid invariant.

References

[Ple63] Vera Pless. Power moment identities on weight distributions in error correcting codes. Information and Control, vol. 6, no. 2, pp. 147–152, 1963. doi:10.1016/S0019-9958(63)90189-X Citation contextIn Pless's classical paper [Ple63], power moment identities are derived from the MacWilliams identity, and applications are also given. In this note, I do not take the MacWilliams identity as known. Instead, I compute the binomial moments directly by double counting and explain how the MacWilliams identity can be recovered from sufficiently many moments. For the classical treatment of the weight distribution of the dual code and the MacWilliams equations, see MacWilliams–Sloane [MS77, Chapter 5]. For equivalent formulations including the MacWilliams equations and Pless moments, Huffman–Pless [HP03, Sections 7.1–7.2] is a standard reference. For a textbook treatment of Pless moments, see also Pless [Ple98, Chapter 8].↩1 ↩2
[MS77] F. J. MacWilliams and N. J. A. Sloane. The theory of error-correcting codes. I. North-Holland Publishing Co., Amsterdam-New York-Oxford, vol. Vol. 16, pp. i–xv and 1–369, 1977 Citation contextIn Pless's classical paper [Ple63], power moment identities are derived from the MacWilliams identity, and applications are also given. In this note, I do not take the MacWilliams identity as known. Instead, I compute the binomial moments directly by double counting and explain how the MacWilliams identity can be recovered from sufficiently many moments. For the classical treatment of the weight distribution of the dual code and the MacWilliams equations, see MacWilliams–Sloane [MS77, Chapter 5]. For equivalent formulations including the MacWilliams equations and Pless moments, Huffman–Pless [HP03, Sections 7.1–7.2] is a standard reference. For a textbook treatment of Pless moments, see also Pless [Ple98, Chapter 8].↩
[HP03] W. Cary Huffman and Vera Pless. Fundamentals of Error-Correcting Codes. Cambridge University Press, 2003. doi:10.1017/CBO9780511807077 Citation contextIn Pless's classical paper [Ple63], power moment identities are derived from the MacWilliams identity, and applications are also given. In this note, I do not take the MacWilliams identity as known. Instead, I compute the binomial moments directly by double counting and explain how the MacWilliams identity can be recovered from sufficiently many moments. For the classical treatment of the weight distribution of the dual code and the MacWilliams equations, see MacWilliams–Sloane [MS77, Chapter 5]. For equivalent formulations including the MacWilliams equations and Pless moments, Huffman–Pless [HP03, Sections 7.1–7.2] is a standard reference. For a textbook treatment of Pless moments, see also Pless [Ple98, Chapter 8].↩
[Ple98] Vera Pless. Introduction to the theory of error-correcting codes. John Wiley & Sons, Inc., New York, pp. xiv+207, 1998. doi:10.1002/9781118032749 Citation contextIn Pless's classical paper [Ple63], power moment identities are derived from the MacWilliams identity, and applications are also given. In this note, I do not take the MacWilliams identity as known. Instead, I compute the binomial moments directly by double counting and explain how the MacWilliams identity can be recovered from sufficiently many moments. For the classical treatment of the weight distribution of the dual code and the MacWilliams equations, see MacWilliams–Sloane [MS77, Chapter 5]. For equivalent formulations including the MacWilliams equations and Pless moments, Huffman–Pless [HP03, Sections 7.1–7.2] is a standard reference. For a textbook treatment of Pless moments, see also Pless [Ple98, Chapter 8].↩

This series

A Series Learning through the MacWilliams Identity · Part 6 of 12

PreviousAn Introduction to Association Schemes through the MacWilliams Identity NextAn Introduction to Matroids and Tutte Polynomials through the MacWilliams Identity

Back to series list

Disclaimer

Articles on this site are based on the operator's personal understanding, investigation, and research notes. I try to keep the content accurate, but it may contain errors or incomplete explanations. I do not guarantee its accuracy, completeness, usefulness, or currentness.

Please use the information on this site at your own judgment and responsibility. To the extent permitted by law, the operator is not liable for damages, losses, or disadvantages arising from using, or being unable to use, information on this site.

If you notice an error, unclear explanation, broken link, or insufficient citation, please contact the operator. I will review the content and, when appropriate, correct, update, or remove it.

§1Introduction

§2Weight Distributions and Moments

§3The Binomial-Moment Generating Function

§4Counting Binomial Moments in Two Ways

§5Pless's Binomial Moment Identity

§6Pless Identities as Power Moment Identities

§7Recovering the MacWilliams Identity from Moments

§8A Small Example: The Binary Repetition Code of Length 333

§9What Were Pless Moments Doing in This Proof?

§10Concepts Seen in This Part

§11Review of This Proof Family

§12Next Time

References

This series

Disclaimer

Introduction

Weight Distributions and Moments

The Binomial-Moment Generating Function

Counting Binomial Moments in Two Ways

Pless's Binomial Moment Identity

Pless Identities as Power Moment Identities

Recovering the MacWilliams Identity from Moments

A Small Example: The Binary Repetition Code of Length $3$

What Were Pless Moments Doing in This Proof?

Concepts Seen in This Part

Review of This Proof Family

Next Time