Article and note
A Series Learning through the MacWilliams IdentityPart 2 of 12
An Introduction to Character Theory through the MacWilliams Identity
- Published:
- Updated:
- Reading time:
- 31 min (about 6,683 words)
Introduction
In coding theory, one of the basic theorems is the MacWilliams identity. For a linear code over a finite field with coordinate set , the weight enumerator of its dual code
can be computed from the weight enumerator of as follows:
This is the MacWilliams identity.
In this series, I use proofs of the MacWilliams identity as a guide to an introduction to neighbouring areas and concepts. Accordingly, this series is aimed at readers who already know at least the following basic material:
We assume that the reader already knows, at least at a basic level, the following notions:
what a (finite) field is,
what a linear code over a finite field is,
what the Hamming weight is,
what the dual code is.
(There is no problem if you do not know a proof of the MacWilliams identity.)
In this series, I look at proof methods for the MacWilliams identity under the following five broad families.
Fourier, character, and Poisson methods.
Möbius inversion, lattice theory, and shortening/puncturing methods.
Orthogonal polynomials and association schemes.
Matroids and Tutte polynomials.
Moments and double-counting methods.
This note focuses on the first of these: the Fourier, character-theoretic, and Poisson approach. However, rather than putting terms such as the Fourier transform or the Poisson summation formula centre stage from the outset, I begin with the most basic building block, namely characters of finite abelian groups. In other words, the aim of this second note is to use a proof of the MacWilliams identity as a guide to an introduction to character theory on finite abelian groups.
Last time, I proved the MacWilliams identity using inclusion among supports and Möbius inversion. This time, by contrast, I prove the same identity using characters of finite abelian groups and their orthogonality relations. Where the main actor last time was “inclusion among supports”, the main actors this time are “group duality” and “orthogonality relations of characters”.
By character theory here I do not mean the whole of representation theory of finite groups. What is needed in this note is a homomorphism from a finite abelian group to the multiplicative group of non-zero complex numbers:
Such a map is called a character of . Characters of finite abelian groups send group elements to roots of unity in , and turn addition into multiplication. Moreover, characters satisfy very strong orthogonality relations. Through a fixed identification, these orthogonality relations pick out the condition corresponding to the dual code .
The route in this note is as follows:
viewing codes as additive groups → characters → character groups → orthogonality relations → annihilators → the finite Poisson summation formula → the MacWilliams identity
If one looks only at the proof of the MacWilliams identity, the number of necessary formulae is not so large. Essentially, everything is condensed into the single orthogonality relation
However, if one uses this formula only as an isolated computation, it becomes harder to see the landscape of character theory as a subject. So in this note I first introduce characters of finite abelian groups, character groups, and annihilators, and then apply them to the MacWilliams identity.
For characters of finite groups and finite Fourier analysis, standard references include Terras [Ter99] and Stein–Shakarchi [SS03]. For trace maps over finite fields and additive characters, see Lidl–Niederreiter [LN97].
Viewing codes as additive groups
Let us begin by changing the point of view on the objects of coding theory slightly. The space is a vector space over a finite field, but if one forgets about scalar multiplication1 and concentrates only on addition, it is also a finite abelian group.
Definition 2.1.
Let be a set equipped with an addition and a zero element . We say that is an abelian group if the following hold for all :
.
.
For each , there exists an element such that .
.
The element in (G3) is called the (additive) inverse of . It is immediate that is uniquely determined for each 2, so we write this inverse as . The finite field is a finite abelian group with respect to addition. Hence its direct product is also a finite abelian group. A linear code is therefore both a vector subspace and a subgroup of this additive group.
The character-theoretic proof uses this additive-group structure. In other words, rather than starting with scalar multiplication in the vector space, we first focus on as an additive group and on its subgroup .
However, the dual code
is defined using the inner product. What is interesting in the character-theoretic proof is that this orthogonal complement with respect to the inner product appears naturally as the characters that are trivial on . For that reason, I begin by introducing characters of finite abelian groups.
Characters of finite abelian groups
The groups considered in this note are abelian, so I write their operation additively. By contrast, the set of non-zero complex numbers
forms a group under multiplication. A character is a map that carries addition to multiplication.
Definition 3.1.
Let be a finite abelian group. A character of is a group homomorphism
In other words, it is a map satisfying
for all .
A group homomorphism preserves the identity element3, so one always has .
Definition 3.2.
A character of is said to be trivial if
for every . I write the trivial character as .
Characters of finite groups always take values among roots of unity. Indeed, let be the order4 of . Then , so
Thus a character sends elements of a finite group to finitely many points on the unit circle in the complex plane.
Example 3.3 (Characters of ).
Consider the cyclic group as an additive group. For , define
This is a character of . Here and are taken to be integers representing their residue classes. The value does not change if one chooses different representatives, so this definition is well-defined. In particular, the complex numbers whose th power is are exactly of the form
In fact, these are all the characters of . Indeed, a character is determined by the value of the generator , and since , one has
Hence is a complex number whose th power is , and there are possible choices.
For example, when , that is, when , the character corresponding to is given by
Example 3.4 (Additive characters of ).
View as an additive group. Then the map
defines a character . Indeed, and , and for the addition on one has
Definition 3.5.
A character of the additive group of the finite field is called an additive character of .
Let , where is a prime. Using the trace map of the finite field
one can write additive characters explicitly. For example, define an additive character by
Here is an element of , so when putting it into the exponential function, we identify and use the representatives . This value depends only on the residue class modulo , so it does not depend on the choice of representative. This is an additive character because the trace map is additive. Indeed,
Moreover, finite field extensions are separable, so is not the zero map. In this note I use this fact as a basic fact from finite field theory. Since is a non-zero -linear map, its image is all of . Therefore there exists such that
and for such an we have
So is non-trivial.
More generally, for define
Then is also an additive character. When , the character is trivial. When , the map is a bijection of , so if were trivial, then would also be trivial. Hence is non-trivial whenever .
Note that when with , the map is a non-zero linear map from the -dimensional -vector space to the -dimensional space , so its kernel is non-trivial. Therefore, for an additive character coming from the trace, it can happen that
even for a non-zero . So in general one does not have
This point becomes important later, when comparing and .
Lemma 3.6.
Let be a non-trivial additive character of . Then every additive character of can be written in the form
for a unique .
Proof
As already noted, for each the function
is an additive character.
Next I show that if , then . Indeed,
and since , the map is a bijection of . Hence the right-hand side is a non-trivial additive character, so and cannot coincide. Thus there are at least distinct additive characters.
Conversely, I show that there are at most additive characters. Write , and let be an -basis of . For any additive character and any , we have in the additive group, so
Thus each is a complex number whose th power is .
Moreover, every can be written uniquely as
Here, when writing as an exponent, I understand the element of to be represented by one of . Therefore
In other words, is determined by its values on the basis . Since each has at most possibilities, the total number of additive characters is at most .
It follows that there are exactly additive characters, and that
is the full set of additive characters. Uniqueness follows from the fact already proved that when .
From this point on, I fix one non-trivial additive character of and write it simply as . In the character-theoretic proof of the MacWilliams identity, the final result is the same no matter which non-trivial additive character one chooses.
Character groups
The full set of characters itself forms a group.
Definition 4.1.
Let be a finite abelian group. Write
for the set of all characters of . This is called the character group of .
The group structure on is given by pointwise multiplication of characters. That is, for , define
The identity element is the trivial character , and the inverse of is given by
Theorem 4.2.
For a finite abelian group ,
holds.
Proof
Here I use the fundamental theorem of finite abelian groups as a black box. By that theorem, is isomorphic to a direct product of finitely many cyclic groups
As seen in Example 3.3, the group has exactly characters. Moreover, characters of a direct product are obtained as products of characters on each factor. Indeed, given , define
Then
Conversely, if and are given, then
defines a character of , since
Thus the characters of a direct product correspond bijectively to tuples of characters on the factors. Hence the number of characters also multiplies under direct products. Therefore follows.
This theorem means that a finite abelian group has as many characters as it has elements. However, one cannot in general identify with naturally. To construct such an identification, one needs an additional choice. There is a natural correspondence between and , but in general the passage from to itself involves a choice. In this note, I later connect with its character group by choosing a non-trivial additive character and the standard inner product. In coding theory, such a concrete identification is obtained by choosing the standard inner product and a non-trivial additive character.
Note that Theorem 4.2 is a background theorem for general finite abelian groups, and its proof used the fundamental theorem of finite abelian groups as a black box. However, in the case that is actually needed in this note, one can prove the result directly from the concrete description of additive characters of finite fields, without using that general theorem. In the next section, I use that direct proof as the main route.
Characters of
From this point on, let be a finite set, and write . Here I fix a non-trivial additive character of and the standard inner product
For , define a map
by
This is a character of the additive group . Indeed, for ,
Theorem 5.1.
The map
is a group isomorphism.
Proof
is a group homomorphism
For ,
so is a group homomorphism.
is surjective
Take any character . For each , let be the vector whose th coordinate is and whose other coordinates are , and define
Then is an additive character of . Indeed,
By Lemma 3.6, for each there exists a unique such that
Put .
For any , one has
and therefore
Hence , so is surjective.
is injective
Suppose that . Then for any and any ,
By the uniqueness in Lemma 3.6, it follows that . This holds for every , so . Therefore is injective.
Hence is a bijective group homomorphism, and therefore a group isomorphism.
There is also another proof using the general cardinality theorem for finite abelian groups. First, as above, one proves that is a group homomorphism, and then one checks, as in the older argument, that if is the trivial character, then . Indeed, if , then for some one has . Since is non-trivial, there exists such that . Define by
Then , and therefore
So is not trivial. Thus is injective. Since and Theorem 4.2 gives
any injective map between these finite sets is also surjective. This alternative proof uses the general cardinality theorem for finite abelian groups, whereas the main proof above does not.
This isomorphism depends on the choice of the non-trivial additive character and the standard inner product. Under that choice, one can identify an element with the character . In fact, for the proof of the MacWilliams identity itself, it is not necessary to know that every character of is of this form. What is needed is only that each gives rise to a character , and that this character is trivial on if and only if . However, since this note is intended as an introduction to character theory, I also explain that all characters of are of this form.
Orthogonality relations for characters
The most basic fact in character theory is that the sum of a non-trivial character vanishes.
Theorem 6.1.
Let be a finite abelian group, and let . Then
holds.
Proof
If , then for all , so the sum is .
Next assume that . Then there exists such that . Put
As runs through all of , so does exactly once, and hence
Therefore . Since , it follows that .
This theorem is the orthogonality relation for the whole group. In the MacWilliams identity, however, the sum is taken over a subgroup . The same argument still works in that setting.
Corollary 6.2 (Orthogonality relation on a subgroup).
Let be a finite abelian group, and let be a subgroup. For ,
holds.
Proof
Let me say a little more in words about the meaning of this orthogonality relation. If a character is identically on , then the sum is simply added times, so it equals . On the other hand, if oscillates even slightly on , its values cancel one another in the complex plane, and the sum becomes .
Annihilators: characters that are trivial on a subgroup
We give a name to the characters that are trivial on a subgroup.
Definition 7.1.
Let be a finite abelian group, and let be a subgroup. The annihilator of 5 is defined by
The set is a subgroup of . Indeed, the trivial character lies in because for every . Also, if , then for every one has
and similarly
So the subgroup conditions are satisfied. Using annihilators, Corollary 6.2 can be rewritten as
Now apply this formula to and . Let me first organise where the objects under discussion live.
This is a subspace of , consisting of all vectors orthogonal to under the inner product.
This is the full set of characters of the additive group .
This is a subgroup of , consisting of all characters that are trivial on .
This is an isomorphism depending on the fixed choice of and the standard inner product, and it connects with .
So, strictly speaking, and live in different places. The former is a subspace of , whereas the latter is a subgroup of . Accordingly, whenever I say that they “correspond” below, I mean correspondence through the fixed isomorphism .
I connect and its character group by the isomorphism from Theorem 5.1
Through this isomorphism, the annihilator of corresponds to .
As already noted, even for a non-trivial additive character , one does not in general have
In particular, when with , the standard additive character built from the trace has a non-trivial kernel. Therefore, from one cannot conclude immediately that . In the proof below, one uses the -linearity of at this point.
Here we use the following auxiliary fact. If is a non-trivial additive character of and satisfies , then
is also a non-trivial additive character. Indeed, is a bijection of , and if held for every , then would be identically on all of .
Theorem 7.2.
Under the isomorphism above, corresponds to . In other words, for ,
holds.
Proof
()
Suppose that . Then for every one has , so
Hence .
()
Assume that . Suppose, for a contradiction, that . Then there exists such that . Put , so , and the map is a non-trivial additive character of . Hence there exists such that
But is a linear code, so , and therefore
This contradicts the assumption that is trivial on . Hence .
Thus and are equivalent.
This theorem says that, through the fixed isomorphism , the condition is equivalent to saying that the corresponding character is trivial on . This is the central viewpoint in the character-theoretic proof.
Corollary 7.3.
For every ,
holds.
Proof
Apply Corollary 6.2 with , , and , and then use Theorem 7.2.
This formula is extremely important. The left-hand side is a character sum over . The right-hand side is the indicator function of whether lies in . That is,
In the character-theoretic proof, this formula is used to convert sums over into sums over .
Character sums as a finite Poisson summation formula
Before entering the MacWilliams identity itself, I write the formula in a slightly more general form. Let , and let be any function.
Definition 8.1.
For a function , define its character-sum transform by
Here I use the form without the coefficient . The placement of normalising factors and signs varies from one reference to another, but I use this form because it fits the later MacWilliams identity. Also, in this note I do not use the Fourier inversion formula itself. Rather, I use this transform to move sums over to sums over .
This transform is usually called the Fourier transform on a finite abelian group. However, to keep the viewpoint of an introduction to character theory, I treat it simply as an operation of taking weighted sums with characters.
The finite Poisson summation formula is a formula that, instead of summing directly over , converts the sum into a character sum over . The name may sound somewhat grand, but in the range used in this note it is not a deep analytic theorem. What one actually does is only to express as a character sum and interchange the order of two finite sums. Although the word Poisson appears in the name, no integrals or limits appear here; everything takes place entirely over finite sums.
Theorem 8.2 (Finite Poisson summation formula).
For any function ,
holds.
Proof
To check the normalisation, let us look at the case . Then the left-hand side is . On the right-hand side,
and if , then the map is a non-trivial character by Theorem 5.1, so
by Theorem 6.1. Hence the right-hand side becomes
This agrees with
which follows from the usual dimension formula
This is not a fact used in the proof of the finite Poisson summation formula itself; it is only a check that the coefficient has the right normalisation.
This theorem is almost all of the MacWilliams identity. Above I stated it for complex-valued functions . The function used below takes values in , but one can simply apply the complex-valued case coefficientwise, so the same proof works unchanged. Thus I also apply this formula to polynomial-valued functions. The only remaining task is to choose a function corresponding to the weight enumerator and compute its character-sum transform .
The character sum in one coordinate
The function used in the MacWilliams identity decomposes by coordinates. So I begin with a one-coordinate calculation.
Let and be indeterminates, and define a function
by
This is the function that returns when one coordinate is and when it is non-zero.
Let us first look at the case of . If we take the non-trivial additive character , then for ,
while for ,
This is the prototype of the general expressions and .
Theorem 9.1.
For ,
holds.
Proof
First suppose that . Then , so
Next suppose that . Then the map is a permutation of , and in particular a permutation of . Hence
Since is a non-trivial character, Theorem 6.1 gives
Therefore
Hence
From this one-coordinate calculation, the variable change
emerges. At a zero coordinate one gets , while at a non-zero coordinate one gets .
The character-sum transform of the weight function
Let . The weight enumerator can be written using the function
Then for any code ,
Moreover, can be written as a product over coordinates:
This decomposition reduces the computation of the character-sum transform to the one-coordinate calculation.
Theorem 10.1.
For ,
holds.
Proof
By definition,
Since is an additive character,
Therefore
In the last equality, I used that a sum over a finite direct product factors into the product of sums over the coordinates. For example, if , then
By Theorem 9.1, the factor for each coordinate is
Since the number of zero coordinates of is and the number of non-zero coordinates is , we obtain
This theorem is exactly the variable transformation in the MacWilliams identity. The variables and arise from the one-coordinate character sum of the weight function.
Proof of the MacWilliams identity
We are now ready to prove the MacWilliams identity. Everything follows simply by combining the preparations above.
Let , and put . Then
Applying Theorem 8.2 with gives
By Theorem 10.1,
so
This proves the MacWilliams identity.
The flow of this proof may be summarised in the following sentence:
Through the fixed isomorphism , whenever the corresponding character is trivial on , and the character-sum transform of the weight function yields the MacWilliams variable transformation.
A small example: the binary repetition code of length
Let us look at a small example to see how the character sum picks out the dual code. Consider the code over
This is a one-dimensional repetition code. As a non-trivial additive character of , take
This code is self-dual. Indeed, the condition for to be orthogonal to the element of is
which means that or . Hence .
Corollary 7.3 asserts that for ,
holds. Indeed, since , the left-hand side is
This equals when , and equals when . So the character sum really does produce the indicator function of .
The weight enumerator is
The right-hand side of the MacWilliams identity is
which agrees with .
In this example, one can see that every calculation comes from the orthogonality relation for the character
In this example, since , the non-trivial additive character has the property
However, when with , it can happen that even for a non-zero , and for such an , the trace character satisfies . That is why, in the general proof, one does not conclude directly from , but instead uses the -linearity of .
What character theory was doing in this proof
Let me now organise the role played by character theory in the proof.
First, character theory changed the meaning of the dual code. Usually, the dual code is defined through the inner product by
Through the fixed isomorphism , the condition becomes equivalent to saying that the corresponding character is trivial on . That is,
Secondly, the orthogonality relation for characters allowed us to write the indicator function of as
This made it possible to convert sums over into sums over .
Thirdly, the character-sum transform of the weight function
split into one-coordinate character sums. From the one-coordinate calculation
the variable transformation in the MacWilliams identity emerged.
Thus the key points of the character-theoretic proof are the following three.
View each element of as the character .
Through the fixed isomorphism , make correspond to the characters that are trivial on .
Use character orthogonality to convert the weight sum over into a character sum over .
The MacWilliams identity is obtained by combining these three ideas.
Concepts introduced in this note
In this note, with the proof of the MacWilliams identity as the goal, I introduced the basic tools of character theory on finite abelian groups. To summarise:
- Finite abelian group
A finite set equipped with addition satisfying associativity, a zero element, inverses, and commutativity. The space is a finite abelian group.
- Character
A group homomorphism from a finite abelian group to ,
It is a function that turns addition into multiplication.
- Trivial character
The character sending every element to .
- Character group
The set of all characters of ,
With pointwise multiplication, it is itself a finite abelian group.
- Additive character
A character of the additive group of a finite field . Once a non-trivial additive character is fixed, each gives a character
- Orthogonality relation
The sum of a non-trivial character over the whole group is . Likewise, on a subgroup, the sum of a character that is non-trivial there is also .
- Annihilator
For a subgroup , the set of all characters that are trivial on :
In coding theory, through the fixed isomorphism , the annihilator of corresponds to .
- Finite Poisson summation formula
For a function , the formula
converts a sum over the dual code into a sum over the original code.
In this proof, character theory allowed us to reinterpret the relation between and from “orthogonal complement” to “correspondence with an annihilator”. Through this reinterpretation, the dual code arises naturally from orthogonality relations of characters.
Looking back at this family of proofs
As mentioned at the beginning, the present proof belongs to the family of
Fourier, character, and Poisson methods.
On the surface, it is a proof using characters of finite abelian groups. At a deeper level, however, it uses Fourier analysis on finite abelian groups.
More concretely, the following correspondences appeared.
the character a basis function in Fourier analysis
the characters trivial on , namely , corresponding through the fixed isomorphism
the MacWilliams variable transformation the one-coordinate character-sum transform
From this point of view, the MacWilliams identity is a special case of the Poisson summation formula on a finite abelian group. Unlike the Poisson summation formula in continuous analysis, however, every sum here is finite. So issues of convergence and integration do not arise.
The main point of the proof may be summarised in the following sentence:
Through the fixed isomorphism , the dual code corresponds to the characters trivial on , namely . Using this correspondence together with character orthogonality, one can convert the weight sum over into a character sum over the original code .
This way of thinking appears not only in the classical MacWilliams identity, but also in more general MacWilliams-type identities for complete weight enumerators, additive codes on finite abelian groups, and codes over Frobenius rings.
In the MacWilliams identity over Frobenius rings proved by Wood [Woo99], the identification between the ring and its character group through a generating character plays a central role. As a somewhat more concrete reference on “MacWilliams-type identities over finite Frobenius rings”, one may mention [HL01], which develops a unified treatment of such identities for several kinds of weight distribution on linear codes over finite Frobenius rings.
Next time
Next time, I revisit the character-theoretic proof given here using the language of matrices and tensor products.
In the present proof, I used a non-trivial additive character of to compute, in each coordinate, the sum
This one-coordinate transform can in fact be written as a matrix. Then the transformation for a code of length can be written as the -fold tensor product of that matrix.
In other words, the proof in this note can be translated into the form
character sums → matrices → tensor products
If one pushes the character-theoretic viewpoint into the background, the MacWilliams identity begins to look like “a formula saying that the tensor product of a certain Hadamard-type matrix acts on weight enumerators”.
Even within the same Fourier, character, and Poisson family of proofs, the picture changes considerably depending on whether one places characters or matrices in the foreground. Next time, by examining this difference, I will use it as a guide to an introduction to tensor-product linear algebra.
Footnotes
- Even so, it is not the case that one can forget it completely and still carry out every argument. Much of the discussion proceeds using only the additive-group structure, but when we later relate the annihilator to the usual dual code , we use the -linearity of , namely that if and , then . ↩
- If and are both inverses of , then . (This proof shows that commutativity (G4) is not needed.) ↩
- Here we consider a homomorphism from a group written additively to a group written multiplicatively. Let be the identity element of , and let be the identity element of . Then . ↩
- The order of is the smallest positive integer such that . (If one uses multiplicative notation instead, it is the smallest positive integer such that .) ↩
- Despite the name, this does not mean that the value of the character becomes . It means that the value becomes the multiplicative identity , that is, that the character is trivial on the subgroup. The terminology is borrowed by analogy with annihilators in module theory. ↩
References
- [Ter99] Audrey Terras. Fourier analysis on finite groups and applications. Cambridge University Press, Cambridge, vol. 43, pp. x+442, 1999. doi:10.1017/CBO9780511626265 ↩
- [SS03] Elias M. Stein and Rami Shakarchi. Fourier analysis. Princeton University Press, Princeton, NJ, vol. 1, pp. xvi+311, 2003 ↩
- [LN97] Rudolf Lidl and Harald Niederreiter. Finite fields. Cambridge University Press, Cambridge, vol. 20, pp. xiv+755, 1997 ↩
- [Woo99] Jay A. Wood. Duality for modules over finite rings and applications to coding theory. Amer. J. Math., vol. 121, no. 3, pp. 555–575, 1999. http://muse.jhu.edu/journals/american_journal_of_mathematics/v121/121.3wood.pdf ↩
- [HL01] Thomas Honold and Ivan Landjev. MacWilliams identities for linear codes over finite Frobenius rings. Finite fields and applications (Augsburg, 1999), pp. 276–292, 2001 ↩
This series
A Series Learning through the MacWilliams Identity · Part 2 of 12
Disclaimer
Articles on this site are based on the operator's personal understanding, investigation, and research notes. I try to keep the content accurate, but it may contain errors or incomplete explanations. I do not guarantee its accuracy, completeness, usefulness, or currentness.
Please use the information on this site at your own judgment and responsibility. To the extent permitted by law, the operator is not liable for damages, losses, or disadvantages arising from using, or being unable to use, information on this site.
If you notice an error, unclear explanation, broken link, or insufficient citation, please contact the operator. I will review the content and, when appropriate, correct, update, or remove it.