Article and note

A Series Learning through the MacWilliams IdentityPart 4 of 12

An Introduction to Krawtchouk Polynomials through the MacWilliams Identity

Among the five proof systems for the MacWilliams identity, this note focuses on the orthogonal-polynomial and association-scheme approach, and introduces weight distributions, Krawtchouk polynomials, generating functions, orthogonality, and the Krawtchouk transform.

Published:: May 28, 2026
Updated:: Jun 6, 2026
Reading time:: 22 min (about 4,790 words)

Tagscoding theoryMacWilliams identityKrawtchouk polynomialsorthogonal polynomialsweight distributionfinite fieldsexpository note

Download PDF

Introduction

In coding theory, one of the basic theorems is the MacWilliams identity. Let $E$ be the coordinate set, let $n=\card{E}$ , and let $C \leq \mathbb{F}_{q}^{E}$ be a linear code over the finite field $\mathbb{F}_{q}$ . For its dual code

C^{\perp} = \{ u \in \mathbb{F}_{q}^{E} : u \cdot c = 0 \text{ for all } c \in C \},

where

u\cdot c=\sum_{e\in E}u_e c_e,

the weight enumerator of the dual code can be computed from the weight enumerator of $C$ as follows:

W_{C^{\perp}}(X,Y) = \frac{1}{\card{C}} W_{C}\bigl(X + (q - 1)Y, X - Y \bigr).

This is the MacWilliams identity.

In this series, I use proofs of the MacWilliams identity as a guide to an introduction to neighbouring areas and concepts. Accordingly, this series is aimed at readers who already know at least the following basic material:

We assume that the reader already knows, at least at a basic level, the following notions:

what a (finite) field is,
what a linear code over a finite field is,
what the Hamming weight is,
what the dual code is.

(There is no problem if you do not know a proof of the MacWilliams identity.)

In this series, I look at proof methods for the MacWilliams identity under the following five broad families.

Fourier, character, and Poisson methods.
Möbius inversion, lattice theory, and shortening/puncturing methods.
Orthogonal polynomials and association schemes.
Matroids and Tutte polynomials.
Moments and double-counting methods.

In this note, I focus on the third of these: the orthogonal-polynomial and association-scheme approach. However, this note does not yet enter the full theory of association schemes. Using character sums only as a minimal tool, we reread the MacWilliams identity as a Krawtchouk transform on weight distributions. This is an entrance to the orthogonal-polynomial and association-scheme viewpoint. The aim of this note is to use a proof of the MacWilliams identity as a guide to an introduction to Krawtchouk polynomials.

The MacWilliams identity is usually written as the polynomial identity

W_{C^{\perp}}(X,Y) = \frac{1}{\card{C}} W_C\bigl(X+(q-1)Y,X-Y\bigr).

In this note, we look at this identity at the coefficient level. Write the weight distribution of a code $C$ as

A_{w}(C) = \card{\{ c \in C: \wt(c) = w \}} \qquad (0 \leq w \leq n).

Then the weight enumerator is

W_{C}(X, Y) = \sum_{w = 0}^{n} A_{w}(C) X^{n - w} Y^w.

The MacWilliams identity is also a transform which computes the weight distribution vector

(A_{0}(C^{\perp}), A_{1}(C^{\perp}), \dots, A_{n}(C^{\perp}))

of the dual code from the weight distribution vector

(A_{0}(C), A_{1}(C), \dots, A_{n}(C))

of $C$ . The coefficients appearing in this transform are the Krawtchouk polynomials.

The route in this note is as follows.

Look at weight distributions.
Define Krawtchouk polynomials by a generating function.
Become familiar with the explicit formula and small examples.
See their meaning as character sums.
Check orthogonality and the Krawtchouk transform.
Derive the coefficient-level MacWilliams identity.
Return to the usual polynomial form.

Krawtchouk polynomials are a representative example of discrete orthogonal polynomials. In the classical world of orthogonal polynomials, Hermite polynomials, Laguerre polynomials, Jacobi polynomials, and so on are well known. These have orthogonality with respect to integrals over continuous intervals. On the other hand, Krawtchouk polynomials are orthogonal with respect to sums over the finite set $\{0,1,\dots,n\}$ . This feature, being “orthogonal polynomials on a finite set”, fits very well with the Hamming weight.

The original paper on Krawtchouk polynomials is Krawtchouk [Kra29]. For their systematic place as orthogonal polynomials, Koekoek–Lesky–Swarttouw [KLS10, Chapter 9] is a standard reference. For Krawtchouk polynomials and the MacWilliams identity in coding theory, MacWilliams–Sloane [MS77, Chapter 5] is the classical standard reference.

From weight enumerators to weight distributions

First, we reinterpret the weight enumerator as a coefficient vector. Let $E$ be a finite set and let $n=\card{E}$ . For a word $c\in\mathbb{F}_{q}^{E}$ , write its Hamming weight as

\wt(c)=\card{\{e\in E:c_e\neq 0\}}.

Definition 2.1.

For a linear code $C\leq \mathbb{F}_{q}^{E}$ , define

A_w(C) = \card{\{c\in C:\wt(c)=w\}} \qquad (0\leq w\leq n).

The sequence

(A_0(C),A_1(C),\dots,A_n(C))

is called the weight distribution of $C$ .

Using this notation, the weight enumerator can be written as

W_C(X,Y) = \sum_{w=0}^{n}A_w(C)X^{n-w}Y^w.

The weight enumerator packages the weight distribution as a single polynomial in two variables.

In order to write the MacWilliams identity at the coefficient level, write the weight distribution of the dual code as

B_j \coloneqq A_j(C^{\perp}) \qquad (0\leq j\leq n).

The aim is to express $B_j$ in terms of the numbers $A_w(C)$ . The form will be

B_{j} = \frac{1}{\card{C}} \sum_{w=0}^{n}A_w(C)\Kraw_j(w).

Here $\Kraw_j(w)$ is a value of a Krawtchouk polynomial.

Thus, in the proof in this note, we view the MacWilliams identity as

a linear transform on the weight distribution vector.

The entries of the matrix describing this linear transform are the Krawtchouk polynomials.

Definition of Krawtchouk polynomials

We now define Krawtchouk polynomials. In this note, we use the normalisation adapted to the $q$ -ary Hamming space which naturally appears in coding theory. There are several conventions in the literature for the placement of indices and for normalisation, but here we use the following definition.

Definition 3.1.

Fix $n$ and $q$ . For $0\leq w\leq n$ , define the numbers $\Kraw_0(w),\Kraw_1(w),\dots,\Kraw_n(w)$ by the generating function

\sum_{j=0}^{n}\Kraw_{j}(w)z^{j} = \bigl( 1 + (q - 1)z \bigr)^{n - w}(1 - z)^{w}. \tag{3.1}

That is, $\Kraw_j(w)$ is the coefficient of $z^j$ when the right-hand side is expanded as a polynomial in $z$ . By the explicit formula, this value is realised by substituting $w$ into a polynomial. That polynomial is called the degree- $j$ $q$ -ary Krawtchouk polynomial $\Kraw_j(x)=\Kraw_j^{(n,q)}(x)$ .

Here the subscript $j$ means “which Krawtchouk polynomial”, while the $w$ inside the parentheses means the Hamming weight substituted into that polynomial. Thus $\Kraw_j(w)$ is the value of the $j$ -th polynomial $\Kraw_j(x)$ at $x=w$ . In coding theory, one mainly uses the values for $w=0,1,\dots,n$ .

With this definition, we do not first read the right-hand side with $x$ as a general variable. Instead, we first define the values for the weights $w=0,1,\dots,n$ . The next explicit formula shows that $\Kraw_j(x)$ is genuinely a polynomial in $x$ of degree at most $j$ , and that substituting the integer $w$ satisfies 3.1. In coding theory, one mainly uses it by substituting weights $x=w$ .

Expanding 3.1 gives the following explicit formula.

Theorem 3.2.

For $0\leq j\leq n$ ,

\Kraw_j(x) = \sum_{\ell=0}^{j} (-1)^{\ell}(q-1)^{j-\ell} \binom{x}{\ell}\binom{n-x}{j-\ell} \tag{3.2}

holds.

Proof

First fix $0\leq w\leq n$ , and expand the right-hand side of the generating function by the binomial theorem:

\begin{aligned} &\bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w \\ ={}& \left( \sum_{a\geq 0}\binom{n-w}{a}(q-1)^a z^a \right) \left( \sum_{b\geq 0}\binom{w}{b}(-1)^b z^b \right) . \end{aligned}

Taking the coefficient of $z^j$ , we have $a+b=j$ , so putting $b=\ell$ gives

\sum_{\ell=0}^{j} (-1)^{\ell}(q-1)^{j-\ell} \binom{w}{\ell}\binom{n-w}{j-\ell}.

This is equal to the coefficient $\Kraw_j(w)$ of $z^j$ on the left-hand side of the generating function. Replacing $w$ in the right-hand side by the variable $x$ gives the polynomial in (3.2).

Here $\binom{x}{\ell}$ is read as the polynomial in $x$ defined by

\binom{x}{\ell} = \frac{x(x-1)\cdots(x-\ell+1)}{\ell!}.

Similarly, $\binom{n-x}{j-\ell}$ is also a polynomial in $x$ . When an integer $x=w$ is substituted, it agrees with the usual value of the binomial coefficient. In this sense, (3.2) gives $\Kraw_j(x)$ as a polynomial in $x$ .

This formula already shows a coding-theoretic meaning. Fix one word of weight $w$ . Among the $n$ coordinates, $w$ coordinates are non-zero and $n-w$ coordinates are zero. When considering another word of weight $j$ , one encounters choices in which $\ell$ of its non-zero coordinates overlap with the non-zero coordinates of the fixed word, and the remaining $j-\ell$ lie on the zero-coordinate side. Krawtchouk polynomials arise when a signed sum is inserted into this counting. The point to notice is that this is not merely a sum of cardinalities. We shall explain it later as a character sum using a non-trivial additive character $\psi$ : at the positions overlapping with the non-zero coordinates of the fixed word, instead of simply counting the non-zero value in $q-1$ ways, the oscillating sum

\sum_{a\in\mathbb{F}_{q}^{\times}}\psi(ac_r)=-1

appears. Therefore the contribution of the overlapping coordinates is not $(q-1)^{\ell}$ but $(-1)^{\ell}$ . Viewing this value as a function of $w$ gives $\Kraw_j(w)$ , and the explicit formula expresses it as the polynomial $\Kraw_j(x)$ . This signed counting leads to the later interpretation as a character sum.

First examples

To become familiar with the definition, let us compute a few low-degree polynomials. The constant term of the generating function

\sum_{j=0}^{n}\Kraw_j(w)z^j = \bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w

for weight $w$ is $1$ , so

\Kraw_0(x)=1.

Next, looking at the coefficient of $z$ , for weight $w$ we get

\Kraw_1(w) = (q-1)(n-w)-w.

Written as a polynomial in $x$ , this is

\Kraw_1(x) = (q-1)(n-x)-x = (q-1)n-qx.

Thus $\Kraw_1(x)$ is a linear polynomial in $x$ .

In particular, in the binary case, that is, when $q=2$ , we have

\Kraw_1(x)=n-2x.

This is a quantity which appears very often in the binary Hamming space. A word of weight $w$ has $n-w$ zero coordinates and $w$ non-zero coordinates. In the binary case, a non-zero coordinate contributes $-1$ with sign, while a zero coordinate contributes $+1$ , so the difference

(n-w)-w=n-2w

appears.

Example 4.1 (The binary case with $n=3$ ).

Let $q=2$ and $n=3$ . The generating function is

\sum_{j=0}^{3}\Kraw_j(w)z^j = (1+z)^{3-w}(1-z)^w.

The values for $x=0,1,2,3$ are as follows:

\begin{array}{c|rrrr} & x=0 & x=1 & x=2 & x=3 \\ \hline \Kraw_0(x) & 1 & 1 & 1 & 1 \\ \Kraw_1(x) & 3 & 1 & -1 & -3 \\ \Kraw_2(x) & 3 & -1 & -1 & 3 \\ \Kraw_3(x) & 1 & -1 & 1 & -1 \end{array}

We shall use this table later in the example of the binary repetition code. Here, notice that after fixing $j$ , we get a function of $x$ . For instance, $\Kraw_2(x)$ takes the values $3,-1,-1,3$ at $x=0,1,2,3$ .

Combinatorial meaning of Krawtchouk polynomials

Krawtchouk polynomials are not merely polynomials defined by a generating function. In the Hamming space, they appear as the following natural sums.

Here, fix one non-trivial additive character of the finite field $\mathbb{F}_{q}$ ,

\psi\colon \mathbb{F}_{q}\to \mathbb{C}^{\times}.

That is, $\psi(a+b)=\psi(a)\psi(b)$ holds, and $\psi$ is not identically $1$ . Finite fields have such additive characters. Indeed, if $q=p^m$ and $L\colon \mathbb{F}_{q}\to\mathbb{F}_{p}$ is a non-zero $\mathbb{F}_{p}$ -linear map, then

\psi(a)=\exp(2\pi i L(a)/p)

is a non-trivial additive character. Here $L(a)\in\mathbb{F}_{p}$ is read through the representative $0,1,\dots,p-1$ . Character theory is not the main subject of this note, but in order to see how Krawtchouk polynomials arise from the Hamming space, we use just this one tool. We do not assume the general theory of characters, and prove only the properties needed below. Concretely, we use only the one-coordinate character sum proved here and the formula used later to extract the indicator function of the dual code. Thus the proof in this section is arranged so that it can be followed without systematic knowledge of character theory.

The needed property of character sums is the following single fact.

Lemma 5.1.

For $b\in\mathbb{F}_{q}$ ,

\sum_{a\in\mathbb{F}_{q}}\psi(ab) = \begin{cases} q, & b=0,\\ 0, & b\neq 0 \end{cases}

holds.

Proof

If $b=0$ , then $\psi(ab)=\psi(0)=1$ for every $a$ , so the sum is $q$ . Suppose $b\neq 0$ . Then $a\mapsto ab$ is a bijection of $\mathbb{F}_{q}$ , so

\sum_{a\in\mathbb{F}_{q}}\psi(ab) = \sum_{t\in\mathbb{F}_{q}}\psi(t).

Put the right-hand side equal to $S$ . Since $\psi$ is non-trivial, there is some $d\in\mathbb{F}_{q}$ such that $\psi(d)\neq 1$ . As $t$ runs through all of $\mathbb{F}_{q}$ , so does $t+d$ , hence

S = \sum_{t\in\mathbb{F}_{q}}\psi(t+d) = \sum_{t\in\mathbb{F}_{q}}\psi(t)\psi(d) = \psi(d)S.

Since $\psi(d)\neq 1$ , we get $S=0$ .

Theorem 5.2.

Let $c\in\mathbb{F}_{q}^{E}$ be a word of weight $w$ . Then, for every $0\leq j\leq n$ ,

\sum_{\substack{u\in\mathbb{F}_{q}^{E}\\ \wt(u)=j}} \psi(u\cdot c) = \Kraw_j(w) \tag{5.1}

holds.

Proof

Compute the generating function obtained by collecting the left-hand side over all $j$ :

\begin{aligned} \sum_{j=0}^{n} \left( \sum_{\substack{u\in\mathbb{F}_{q}^{E}\\ \wt(u)=j}} \psi(u\cdot c) \right)z^j &= \sum_{u\in\mathbb{F}_{q}^{E}} \psi(u\cdot c)z^{\wt(u)} \\ &= \sum_{u\in\mathbb{F}_{q}^{E}} \prod_{r\in E}\psi(u_r c_r)z^{\mathbf{1}_{u_r\neq 0}} \\ &= \prod_{r\in E} \left( \sum_{a\in\mathbb{F}_{q}} \psi(a c_r)z^{\mathbf{1}_{a\neq 0}} \right). \end{aligned}

Here $\mathbf{1}_{u_r\neq 0}$ is $1$ when $u_r\neq 0$ , and $0$ otherwise.

For a coordinate $r$ , if $c_r=0$ , then

\sum_{a\in\mathbb{F}_{q}} \psi(a c_r)z^{\mathbf{1}_{a\neq 0}} = 1+(q-1)z.

On the other hand, if $c_r\neq 0$ , then $a\mapsto ac_r$ is a bijection of $\mathbb{F}_{q}$ , and by Lemma 5.1,

\sum_{a\in\mathbb{F}_{q}}\psi(ac_r)=0.

Therefore

\sum_{a\in\mathbb{F}_{q}^{\times}}\psi(ac_r)=-1.

Hence

\sum_{a\in\mathbb{F}_{q}} \psi(a c_r)z^{\mathbf{1}_{a\neq 0}} = 1-z.

Since $c$ has weight $w$ , there are $w$ coordinates with $c_r\neq 0$ and $n-w$ coordinates with $c_r=0$ . Hence the generating function above is

\bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w.

This is equal to the generating function of the Krawtchouk polynomials,

\sum_{j=0}^{n}\Kraw_j(w)z^j.

Comparing the coefficients of $z^j$ gives the claim.

This theorem expresses well the coding-theoretic meaning of Krawtchouk polynomials. $\Kraw_j(w)$ is the sum of the character values $\psi(u\cdot c)$ over the words $u$ of weight $j$ , after fixing a word $c$ of weight $w$ . In other words, Krawtchouk polynomials are

oscillating sums obtained by viewing the weight- $j$ layer of the Hamming space from a word of weight $w$ .

Here we have explained this using character sums, but if one only looks at (3.2), it is also a sum of binomial coefficients counting the ways in which coordinates overlap. Krawtchouk polynomials as orthogonal polynomials bridge these two viewpoints.

Krawtchouk polynomials as orthogonal polynomials

We now see why Krawtchouk polynomials are called “orthogonal polynomials”. Orthogonality means, roughly, that the inner product of polynomials of different degrees is $0$ . For Krawtchouk polynomials, the inner product is defined by a sum over the finite set

\{0,1,\dots,n\}.

For $0\leq w\leq n$ , put

v_w = \binom{n}{w}(q-1)^w.

This is the number of words of weight $w$ in $\mathbb{F}_{q}^{E}$ . Indeed, there are $\binom{n}{w}$ ways to choose the $w$ coordinates which are non-zero, and each non-zero coordinate has $q-1$ possible values.

Definition 6.1.

For functions $f,g\colon \{0,1,\dots,n\}\to\mathbb{C}$ , define

\langle f,g\rangle = \sum_{w=0}^{n}\binom{n}{w}(q-1)^w f(w)\overline{g(w)}.

This inner product has as weights the sizes of the weight layers of the Hamming space. Krawtchouk polynomials are orthogonal with respect to this inner product. Since the values of Krawtchouk polynomials are real, the orthogonality formula is the same whether or not one writes complex conjugation.

Theorem 6.2 (Orthogonality of Krawtchouk polynomials).

For $0\leq r,s\leq n$ ,

\sum_{w=0}^{n} \binom{n}{w}(q-1)^w \Kraw_r(w)\Kraw_s(w) = q^n \binom{n}{r}(q-1)^r \delta_{r,s} \tag{6.1}

holds. Here $\delta_{r,s}$ is the Kronecker delta.

Proof

Compute the generating function in two variables:

\begin{aligned} &H(z,t) \\ \coloneqq{}& \sum_{w=0}^{n}\binom{n}{w}(q-1)^w \left(\sum_{r=0}^{n}\Kraw_r(w)z^r\right) \left(\sum_{s=0}^{n}\Kraw_s(w)t^s\right) \\ &= \sum_{w=0}^{n}\binom{n}{w}(q-1)^w \bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w \bigl(1+(q-1)t\bigr)^{n-w}(1-t)^w \\ &= \sum_{w=0}^{n}\binom{n}{w} \left((1+(q-1)z)(1+(q-1)t)\right)^{n-w} \left((q-1)(1-z)(1-t)\right)^w \\ &= \left( (1+(q-1)z)(1+(q-1)t) +(q-1)(1-z)(1-t) \right)^n . \end{aligned}

Simplifying the expression inside the parentheses gives

\begin{aligned} &(1+(q-1)z)(1+(q-1)t) +(q-1)(1-z)(1-t) \\ &\qquad= q\bigl(1+(q-1)zt\bigr). \end{aligned}

Therefore

H(z,t) = q^n\bigl(1+(q-1)zt\bigr)^n = q^n\sum_{r=0}^{n}\binom{n}{r}(q-1)^r z^r t^r.

On the other hand, by the definition of $H(z,t)$ , the coefficient of $z^r t^s$ is the left-hand side

\sum_{w=0}^{n}\binom{n}{w}(q-1)^w\Kraw_r(w)\Kraw_s(w).

Comparing coefficients gives (6.1).

This theorem shows that $\Kraw_0,\Kraw_1,\dots,\Kraw_n$ form an orthogonal basis for the function space on the finite set $\{0,1,\dots,n\}$ . This is the important point. Krawtchouk polynomials are not merely convenient coefficients; they are the orthogonal basis naturally attached to the weight layers of the Hamming space.

The Krawtchouk transform

Using Krawtchouk polynomials, one can define a transform on vectors of length $n+1$ . This is the Krawtchouk transform.

Definition 7.1.

For a vector $a=(a_0,a_1,\dots,a_n)$ , define its Krawtchouk transform by

(\mathcal{K}a)_j = \sum_{w=0}^{n}a_w\Kraw_j(w) \qquad (0\leq j\leq n).

In matrix form, the Krawtchouk transform is the operation of multiplying by the $(n+1)\times(n+1)$ matrix

K=(\Kraw_j(w))_{0\leq j,w\leq n}.

This matrix is sometimes called the Krawtchouk matrix.

The orthogonality shown just above concerns the inner product with weights

v_w=\binom{n}{w}(q-1)^w,

and the sum has the form

\sum_w v_w\Kraw_r(w)\Kraw_s(w).

On the other hand, the self-inverse property shown here concerns multiplying the Krawtchouk matrix itself twice, and the sum has the form

\sum_j \Kraw_r(j)\Kraw_j(w).

The two are closely related, but the summation index and the placement of the weights differ, so we keep them separate here. Orthogonality is a formula for the weighted inner product of the functions $\Kraw_r,\Kraw_s$ , while self-inverseness is a formula for the product of the Krawtchouk matrix $K=(\Kraw_j(w))$ .

The Krawtchouk transform is its own inverse, up to normalisation.

Theorem 7.2.

For every vector $a=(a_0,\dots,a_n)$ ,

\mathcal{K}^{2}a=q^n a

holds. That is, applying the Krawtchouk transform twice multiplies the vector by $q^n$ .

Proof

It is enough to show that, for all $0\leq r,w\leq n$ ,

\sum_{j=0}^{n}\Kraw_r(j)\Kraw_j(w) = q^n\delta_{r,w} \tag{7.1}

holds. For fixed $w$ , collect the left-hand side into a generating function in $r$ :

\begin{aligned} \sum_{r=0}^{n} \left( \sum_{j=0}^{n}\Kraw_r(j)\Kraw_j(w) \right)t^r &= \sum_{j=0}^{n}\Kraw_j(w) \left( \sum_{r=0}^{n}\Kraw_r(j)t^r \right) \\ &= \sum_{j=0}^{n}\Kraw_j(w) \bigl(1+(q-1)t\bigr)^{n-j}(1-t)^j \\ &= \bigl(1+(q-1)t\bigr)^n \sum_{j=0}^{n}\Kraw_j(w) \left(\frac{1-t}{1+(q-1)t}\right)^j . \end{aligned}

Substitute

s=\frac{1-t}{1+(q-1)t}

into the generating function of the Krawtchouk polynomials:

\begin{aligned} \sum_{j=0}^{n}\Kraw_j(w)s^j &= \bigl(1+(q-1)s\bigr)^{n-w}(1-s)^w. \end{aligned}

Here

1+(q-1)s = \frac{q}{1+(q-1)t}, \qquad 1-s = \frac{qt}{1+(q-1)t}.

Therefore

\begin{aligned} &\bigl(1+(q-1)t\bigr)^n \sum_{j=0}^{n}\Kraw_j(w)s^j \\ ={}& \bigl(1+(q-1)t\bigr)^n \left(\frac{q}{1+(q-1)t}\right)^{n-w} \left(\frac{qt}{1+(q-1)t}\right)^w \\ ={}& q^n t^w. \end{aligned}

Hence the coefficient of $t^r$ is $q^n$ when $r=w$ , and $0$ otherwise. This is (7.1), and the claim follows.

This self-inverse property corresponds to the symmetry of the MacWilliams identity. Indeed, taking the dual of $C$ once more gives $(C^{\perp})^{\perp}=C$ . Also, for a linear code over a finite field,

\card{C}\card{C^{\perp}}=q^n.

This follows from $\dim C+\dim C^{\perp}=n$ . Thus the fact that the transform on weight distributions becomes multiplication by $q^n$ after applying it twice is compatible with returning to the original code after taking the dual twice.

The weight distribution of the dual code and the Krawtchouk transform

We now prove the MacWilliams identity using Krawtchouk polynomials. First we prove the coefficient-level formula.

In this section too, fix one non-trivial additive character $\psi$ of the finite field $\mathbb{F}_{q}$ . We do not develop character theory itself in detail, but use only the following orthogonal-complement extraction formula.

Lemma 8.1.

For every $u\in\mathbb{F}_{q}^{E}$ ,

\frac{1}{\card{C}} \sum_{c\in C}\psi(u\cdot c) = \begin{cases} 1, & u\in C^{\perp},\\ 0, & u\notin C^{\perp} \end{cases} \tag{8.1}

holds.

Proof

If $u\in C^{\perp}$ , then $u\cdot c=0$ for every $c\in C$ , so each term is $\psi(0)=1$ , and the sum is $\card{C}$ .

Suppose $u\notin C^{\perp}$ . Then there exists $c_0\in C$ such that $d=u\cdot c_0\neq 0$ . Since $d\neq 0$ , the map $a\mapsto ad$ is a bijection of $\mathbb{F}_{q}$ . Since $\psi$ is non-trivial, there is some $a\in\mathbb{F}_{q}$ such that $\psi(ad)\neq 1$ .

Put

S=\sum_{c\in C}\psi(u\cdot c).

Since $C$ is a linear code, $ac_0\in C$ , and as $c$ runs through all of $C$ , $c+ac_0$ also runs through all of $C$ exactly once. Therefore

\begin{aligned} S &= \sum_{c\in C}\psi(u\cdot(c+ac_0)) \\ &= \sum_{c\in C}\psi(u\cdot c)\psi(a(u\cdot c_0)) \\ &= \psi(ad)S. \end{aligned}

Since $\psi(ad)\neq 1$ , we have $S=0$ .

This lemma lets us convert sums over $C^{\perp}$ into sums over $C$ . We then substitute the Krawtchouk-polynomial sum (5.1).

Theorem 8.2 (Coefficient-level MacWilliams identity).

Let $C\leq\mathbb{F}_{q}^{E}$ be a linear code. Put $A_w=A_w(C)$ and $B_j=A_j(C^{\perp})$ . Then, for every $0\leq j\leq n$ ,

B_j = \frac{1}{\card{C}} \sum_{w=0}^{n}A_w\Kraw_j(w) \tag{8.2}

holds.

Proof

Writing the number of words of weight $j$ in the dual code using the indicator function, we have

B_j = \sum_{\substack{u\in\mathbb{F}_{q}^{E}\\\wt(u)=j}} \mathbf{1}_{C^{\perp}}(u).

Substituting Lemma 8.1, we get

\begin{aligned} B_j &= \frac{1}{\card{C}} \sum_{\substack{u\in\mathbb{F}_{q}^{E}\\\wt(u)=j}} \sum_{c\in C}\psi(u\cdot c) \\ &= \frac{1}{\card{C}} \sum_{c\in C} \sum_{\substack{u\in\mathbb{F}_{q}^{E}\\\wt(u)=j}} \psi(u\cdot c). \end{aligned}

By Theorem 5.2, the inner sum is equal to

\Kraw_j(\wt(c)).

Therefore

B_j = \frac{1}{\card{C}} \sum_{c\in C}\Kraw_j(\wt(c)).

Since there are $A_w$ codewords of weight $w$ , this becomes

B_j = \frac{1}{\card{C}} \sum_{w=0}^{n}A_w\Kraw_j(w).

This theorem is the coefficient version of the MacWilliams identity through Krawtchouk polynomials. In other words, the weight distribution of the dual code is the Krawtchouk transform of the weight distribution of the original code, divided by $\card{C}$ :

(A_0(C^{\perp}),\dots,A_n(C^{\perp})) = \frac{1}{\card{C}}\mathcal{K}(A_0(C),\dots,A_n(C)).

Returning to the polynomial form of the MacWilliams identity

From the coefficient-level formula, we derive the usual polynomial form of the MacWilliams identity.

Theorem 9.1 (MacWilliams identity).

Let $C\leq\mathbb{F}_{q}^{E}$ be a linear code. Then

W_{C^{\perp}}(X,Y) = \frac{1}{\card{C}} W_C\bigl(X+(q-1)Y,X-Y\bigr)

holds.

Proof

Put $A_w=A_w(C)$ and $B_j=A_j(C^{\perp})$ . By the coefficient-level formula (8.2),

\begin{aligned} W_{C^{\perp}}(X,Y) &= \sum_{j=0}^{n}B_jX^{n-j}Y^j \\ &= \frac{1}{\card{C}} \sum_{j=0}^{n} \sum_{w=0}^{n}A_w\Kraw_j(w)X^{n-j}Y^j \\ &= \frac{1}{\card{C}} \sum_{w=0}^{n}A_w \sum_{j=0}^{n}\Kraw_j(w)X^{n-j}Y^j. \end{aligned}

Now use the generating function of the Krawtchouk polynomials. The expression $Y/X$ here is a formal calculation, and ultimately means that both sides agree as polynomials in $X,Y$ .

\begin{aligned} \sum_{j=0}^{n}\Kraw_j(w)X^{n-j}Y^j &= X^n\sum_{j=0}^{n}\Kraw_j(w)\left(\frac{Y}{X}\right)^j \\ &= X^n \left(1+(q-1)\frac{Y}{X}\right)^{n-w} \left(1-\frac{Y}{X}\right)^w \\ &= \bigl(X+(q-1)Y\bigr)^{n-w}(X-Y)^w. \end{aligned}

Therefore

\begin{aligned} W_{C^{\perp}}(X,Y) &= \frac{1}{\card{C}} \sum_{w=0}^{n}A_w \bigl(X+(q-1)Y\bigr)^{n-w}(X-Y)^w \\ &= \frac{1}{\card{C}} W_C\bigl(X+(q-1)Y,X-Y\bigr). \end{aligned}

This proves the MacWilliams identity.

In this proof, the variable substitution in the MacWilliams identity,

X\mapsto X+(q-1)Y, \qquad Y\mapsto X-Y,

appeared from the generating function of the Krawtchouk polynomials. Thus the MacWilliams identity can be viewed as

the Krawtchouk transform repackaged by the generating function called the weight enumerator.

A small example: the binary repetition code of length $3$

Let us check the coefficient-level formula in a small example. Consider the repetition code over $\mathbb{F}_{2}^{3}$ ,

C=\{ 000, 111 \}.

Its weight distribution is

(A_0,A_1,A_2,A_3)=(1,0,0,1).

The dual code is

C^{\perp} = \{u\in\mathbb{F}_{2}^{3}:u_1+u_2+u_3=0\},

namely the set of all words of even weight. Therefore

C^{\perp}=\{000,011,101,110\},

and its weight distribution is

(B_0,B_1,B_2,B_3)=(1,0,3,0).

We check that this comes from the Krawtchouk transform.

By Example 4.1 (The binary case with n=3)Let $q=2$ and $n=3$ . The generating function is $\sum_{j=0}^{3}\Kraw_j(w)z^j = (1+z)^{3-w}(1-z)^w.$ The values for $x=0,1,2,3$ are as follows: $\begin{array}{c|rrrr} & x=0 & x=1 & x=2 & x=3 \\ \hline \Kraw_0(x) & 1 & 1 & 1 & 1 \\ \Kraw_1(x) & 3 & 1 & -1 & -3 \\ \Kraw_2(x) & 3 & -1 & -1 & 3 \\ \Kraw_3(x) & 1 & -1 & 1 & -1 \end{array}$ Example 4.1, the Krawtchouk table for $q=2$ and $n=3$ is

\begin{array}{c|rrrr} & w=0 & w=1 & w=2 & w=3 \\ \hline \Kraw_0(w) & 1 & 1 & 1 & 1 \\ \Kraw_1(w) & 3 & 1 & -1 & -3 \\ \Kraw_2(w) & 3 & -1 & -1 & 3 \\ \Kraw_3(w) & 1 & -1 & 1 & -1 \end{array}

The coefficient-level MacWilliams identity says

B_j = \frac{1}{\card{C}} \sum_{w=0}^{3}A_w\Kraw_j(w).

Here $\card{C}=2$ , and $A_0=A_3=1$ , $A_1=A_2=0$ , so

B_j = \frac{1}{2}\bigl(\Kraw_j(0)+\Kraw_j(3)\bigr).

From the table,

\begin{aligned} B_0 &= \frac{1}{2}(1+1)=1,\\ B_1 &= \frac{1}{2}(3-3)=0,\\ B_2 &= \frac{1}{2}(3+3)=3,\\ B_3 &= \frac{1}{2}(1-1)=0. \end{aligned}

Therefore

(B_0,B_1,B_2,B_3)=(1,0,3,0),

which indeed agrees with the weight distribution of $C^{\perp}$ .

In polynomial form,

W_C(X,Y)=X^3+Y^3,

and the right-hand side of the MacWilliams identity is

\begin{aligned} \frac{1}{2}W_C(X+Y,X-Y) &= \frac{1}{2}\left((X+Y)^3+(X-Y)^3\right)\\ &= X^3+3XY^2. \end{aligned}

This agrees with

W_{C^{\perp}}(X,Y)=X^3+3XY^2.

In this example, multiplying each row $j$ of the Krawtchouk table by the weight distribution $(1,0,0,1)$ gives the $j$ -th coefficient of the weight distribution of the dual code.

What Krawtchouk polynomials were doing in this proof

Let us organise the roles played by Krawtchouk polynomials in the proof.

First, Krawtchouk polynomials were the transform coefficients for weight distributions. Writing the weight distribution of $C$ as $A_w=A_w(C)$ and that of $C^{\perp}$ as $B_j=A_j(C^{\perp})$ , we had

B_j = \frac{1}{\card{C}} \sum_{w=0}^{n}A_w\Kraw_j(w).

This is the MacWilliams identity viewed at the coefficient level.

Second, Krawtchouk polynomials were oscillating sums over weight layers of the Hamming space. After fixing a word $c$ of weight $w$ , we had

\sum_{\substack{u\in\mathbb{F}_{q}^{E}\\\wt(u)=j}} \psi(u\cdot c) = \Kraw_j(w).

Thus $\Kraw_j(w)$ is the sum obtained by viewing the layer of weight $j$ from a word of weight $w$ .

Third, Krawtchouk polynomials formed an orthogonal basis. On the finite set $\{0,\dots,n\}$ , with weights

\binom{n}{w}(q-1)^w,

the polynomials $\Kraw_0,\dots,\Kraw_n$ are mutually orthogonal. This shows that Krawtchouk polynomials are not merely computational coefficients, but orthogonal polynomials attached to the distance structure of the Hamming space.

Fourth, the generating function produced the MacWilliams variable substitution. Formally substitute $z=Y/X$ into

\sum_{j=0}^{n}\Kraw_j(w)z^j = \bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w.

The $Y/X$ here is a formal substitution, and the final result is an equality of polynomials in $X,Y$ :

\sum_{j=0}^{n}\Kraw_j(w)X^{n-j}Y^j = \bigl(X+(q-1)Y\bigr)^{n-w}(X-Y)^w.

This is the variable substitution in the polynomial form of the MacWilliams identity.

Thus, in the proof in this note, Krawtchouk polynomials played four roles:

transform coefficients for computing the weight distribution of the dual code,
oscillating sums over weight layers of the Hamming space,
orthogonal polynomials on a finite set,
the generating function which produces the MacWilliams variable substitution.

Concepts seen in this part

In this part, while aiming at a proof of the MacWilliams identity, we introduced the basic tools around Krawtchouk polynomials. Here is the summary.

Weight distribution

This is the sequence defined for a code $C$ by

A_w(C)=\card{\{c\in C:\wt(c)=w\}}.

The weight enumerator packages the weight distribution as a polynomial in two variables.

Krawtchouk polynomials

First define their values at weights $w=0,1,\dots,n$ by the generating function

\sum_{j=0}^{n}\Kraw_j(w)z^j = \bigl(1+(q-1)z\bigr)^{n-w}(1-z)^w.

By the explicit formula, these values come from polynomials $\Kraw_j(x)$ in $x$ . In coding theory, one mainly uses them by substituting weights $0,1,\dots,n$ for $x$ .

Explicit formula

Krawtchouk polynomials can be written as

\Kraw_j(x) = \sum_{\ell=0}^{j} (-1)^{\ell}(q-1)^{j-\ell} \binom{x}{\ell}\binom{n-x}{j-\ell}.

This can be interpreted as a sum in which binomial coefficients counting the ways coordinates overlap are combined with signed contributions coming from character values.

Orthogonality

$\Kraw_0,\dots,\Kraw_n$ are orthogonal on the finite set $\{0,\dots,n\}$ with weights

\binom{n}{w}(q-1)^w.

This weight is the number of words of weight $w$ in the Hamming space.

Krawtchouk transform

This is the transform defined for a vector $a=(a_0,\dots,a_n)$ by

(\mathcal{K}a)_j=\sum_{w=0}^{n}a_w\Kraw_j(w).

Applying this transform twice multiplies the vector by $q^n$ .

Coefficient-level MacWilliams identity

If $A_w=A_w(C)$ and $B_j=A_j(C^{\perp})$ , then

B_j = \frac{1}{\card{C}} \sum_{w=0}^{n}A_w\Kraw_j(w).

This means that the weight distribution of the dual code is obtained by applying the Krawtchouk transform to the weight distribution of the original code.

At first, orthogonal polynomials may look like analytic objects. However, Krawtchouk polynomials are highly discrete orthogonal polynomials: their orthogonality is expressed entirely by sums over a finite set. This is why they fit so well with Hamming spaces and coding theory.

Looking back at the proof family of this note

As stated at the beginning, the proof in this note can be read as an entrance to

the orthogonal-polynomial and association-scheme approach.

On the surface, it is a coefficient calculation using Krawtchouk polynomials. We also used character sums as a minimal tool to prove the coefficient-level MacWilliams identity. But at a deeper level, the orthogonal polynomials attached to the distance structure of the Hamming space control the transform of weight distributions.

The main point of the proof in this note can be summarised in one sentence:

The MacWilliams identity is the Krawtchouk transform on weight distributions.

From the Fourier viewpoint, one handles a transform on the whole of $\mathbb{F}_{q}^{E}$ . In the viewpoint of this note, that information has been compressed down to “weights only”. Instead of distinguishing all words, we look only at the coefficients for weights $0,1,\dots,n$ . In this compressed world, Krawtchouk polynomials appear as the shadow of the Fourier-type transform.

This viewpoint naturally leads, in a more advanced direction, to association schemes. In the Hamming space, pairs of words can be divided according to whether their distance is $0,1,\dots,n$ . The commutative algebra built from these relations is the Bose–Mesner algebra of the Hamming association scheme. The eigenvalues which appear there are precisely the Krawtchouk polynomials. Delsarte [Del73] is the classical starting point for Delsarte's association-scheme approach to coding theory, and Delsarte–Levenshtein [DL98] surveys the relationship between association schemes and coding theory. Levenshtein [Lev95] is also a representative reference for the appearance of Krawtchouk polynomials in bounds for codes and designs in the Hamming space.

Further direction: towards association schemes

We have now completed the viewpoint on the MacWilliams identity using Krawtchouk polynomials, which was the aim of this note. From here, as a more advanced viewpoint, let us introduce the direction of reinterpreting the Krawtchouk polynomials appearing in this note in the language of association schemes.

In the proof in this note, we defined Krawtchouk polynomials by a generating function and used them as transform coefficients for weight distributions. But why are these polynomials naturally attached to the Hamming space? The answer lies in the matrices built from distance relations in the Hamming space.

Take the set of all words of length $n$ as the vertex set, and write $M_j$ for the matrix which records whether the distance between two words is $j$ . Here $M_j$ is the adjacency matrix of the distance relation. These matrices form a mutually commuting algebra. When this algebra is simultaneously diagonalised, Krawtchouk polynomials appear as its eigenvalues.

Thus, in the more advanced viewpoint, the main flow is

Hamming distance → adjacency matrices → Bose–Mesner algebra → Krawtchouk polynomials as eigenvalues

The MacWilliams identity appears in this eigenvalue theory as the dual transform of weight distributions.

Even for the same Krawtchouk polynomials, the view changes substantially depending on whether one sees them, as in this note, as “orthogonal polynomials defined by a generating function”, or more advancedly as “eigenvalues of the Hamming scheme”. This difference is the natural entrance to association schemes.

References

[Kra29] M. Krawtchouk. Sur une généralisation des polynômes d'Hermite. Comptes rendus hebdomadaires des séances de l'Académie des sciences, vol. 189, pp. 620–622, 1929 ↩
[KLS10] Roelof Koekoek, Peter A. Lesky, and René F. Swarttouw. Hypergeometric orthogonal polynomials and their q-analogues. Springer-Verlag, Berlin, pp. xx+578, 2010. doi:10.1007/978-3-642-05014-5 ↩
[MS77] F. J. MacWilliams and N. J. A. Sloane. The theory of error-correcting codes. I. North-Holland Publishing Co., Amsterdam-New York-Oxford, vol. Vol. 16, pp. i–xv and 1–369, 1977 ↩
[Del73] P. Delsarte. An algebraic approach to the association schemes of coding theory. Philips Res. Rep. Suppl., no. 10, pp. vi+97, 1973 Citation contextThis viewpoint naturally leads, in a more advanced direction, to association schemes. In the Hamming space, pairs of words can be divided according to whether their distance is $0,1,\dots,n$ . The commutative algebra built from these relations is the Bose–Mesner algebra of the Hamming association scheme. The eigenvalues which appear there are precisely the Krawtchouk polynomials. Delsarte [Del73] is the classical starting point for Delsarte's association-scheme approach to coding theory, and Delsarte–Levenshtein [DL98] surveys the relationship between association schemes and coding theory. Levenshtein [Lev95] is also a representative reference for the appearance of Krawtchouk polynomials in bounds for codes and designs in the Hamming space.↩
[DL98] Philippe Delsarte and Vladimir I. Levenshtein. Association schemes and coding theory. IEEE Trans. Inform. Theory, vol. 44, no. 6, pp. 2477–2504, 1998. doi:10.1109/18.720545 Citation contextThis viewpoint naturally leads, in a more advanced direction, to association schemes. In the Hamming space, pairs of words can be divided according to whether their distance is $0,1,\dots,n$ . The commutative algebra built from these relations is the Bose–Mesner algebra of the Hamming association scheme. The eigenvalues which appear there are precisely the Krawtchouk polynomials. Delsarte [Del73] is the classical starting point for Delsarte's association-scheme approach to coding theory, and Delsarte–Levenshtein [DL98] surveys the relationship between association schemes and coding theory. Levenshtein [Lev95] is also a representative reference for the appearance of Krawtchouk polynomials in bounds for codes and designs in the Hamming space.↩
[Lev95] Vladimir I. Levenshtein. Krawtchouk polynomials and universal bounds for codes and designs in Hamming spaces. IEEE Trans. Inform. Theory, vol. 41, no. 5, pp. 1303–1321, 1995. doi:10.1109/18.412678 Citation contextThis viewpoint naturally leads, in a more advanced direction, to association schemes. In the Hamming space, pairs of words can be divided according to whether their distance is $0,1,\dots,n$ . The commutative algebra built from these relations is the Bose–Mesner algebra of the Hamming association scheme. The eigenvalues which appear there are precisely the Krawtchouk polynomials. Delsarte [Del73] is the classical starting point for Delsarte's association-scheme approach to coding theory, and Delsarte–Levenshtein [DL98] surveys the relationship between association schemes and coding theory. Levenshtein [Lev95] is also a representative reference for the appearance of Krawtchouk polynomials in bounds for codes and designs in the Hamming space.↩

This series

A Series Learning through the MacWilliams Identity · Part 4 of 12

PreviousAn Introduction to Tensor Products through the MacWilliams Identity NextAn Introduction to Association Schemes through the MacWilliams Identity

Back to series list

Disclaimer

Articles on this site are based on the operator's personal understanding, investigation, and research notes. I try to keep the content accurate, but it may contain errors or incomplete explanations. I do not guarantee its accuracy, completeness, usefulness, or currentness.

Please use the information on this site at your own judgment and responsibility. To the extent permitted by law, the operator is not liable for damages, losses, or disadvantages arising from using, or being unable to use, information on this site.

If you notice an error, unclear explanation, broken link, or insufficient citation, please contact the operator. I will review the content and, when appropriate, correct, update, or remove it.

§1Introduction

§2From weight enumerators to weight distributions

§3Definition of Krawtchouk polynomials

§4First examples

§5Combinatorial meaning of Krawtchouk polynomials

§6Krawtchouk polynomials as orthogonal polynomials

§7The Krawtchouk transform

§8The weight distribution of the dual code and the Krawtchouk transform

§9Returning to the polynomial form of the MacWilliams identity

§10A small example: the binary repetition code of length 333

§11What Krawtchouk polynomials were doing in this proof

§12Concepts seen in this part

§13Looking back at the proof family of this note

§14Further direction: towards association schemes

References

This series

Disclaimer

Introduction

From weight enumerators to weight distributions

Definition of Krawtchouk polynomials

First examples

Combinatorial meaning of Krawtchouk polynomials

Krawtchouk polynomials as orthogonal polynomials

The Krawtchouk transform

The weight distribution of the dual code and the Krawtchouk transform

Returning to the polynomial form of the MacWilliams identity

A small example: the binary repetition code of length $3$

What Krawtchouk polynomials were doing in this proof

Concepts seen in this part

Looking back at the proof family of this note

Further direction: towards association schemes