Sumsets and fixed points of substitutions

By F. Michel Dekking

Abstract

In this paper we introduce a technique to determine the sumset , where is the indicator function of the 0’s occurring in a fixed point of a substitution on the alphabet .

1. Introduction

Let be the sequence given by for , where is the golden mean. In the recent paper Reference 5, it was proved that the sumset is equal to the natural numbers with exception of the numbers 1 and 3. Here the sumset is defined by

It was also shown that if is defined by , then the set has a complement that is infinite. The determination of takes more than 8 pages, and the complicated structure of is described as containing “some fractal patterns”. The goal of the present paper is to elucidate what the source of these fractal patterns is, and at the same time to introduce a vast generalization of these types of sumsets.

Interestingly, Shallit in Reference 7 also positioned the sumset problem of the paper Reference 5 in the combinatorics on words context, but in a completely different way. The focus there is on the Fibonacci representation (also known as Zeckendorf representation) of the natural numbers, and the proofs are computer assisted. See also the paper Reference 9.

The crucial observation that we make is that the sequences and give the positions of 0’s, respectively the positions of 1’s in the infinite Fibonacci word , fixed point of the substitution , (see, e.g., Reference 6). Let be any substitution on the monoid , admitting a fixed point . Then, let give the positions of 0 in , and let give the positions of 1 in .

Problem.

Determine , and .

The proofs in Reference 5 are purely arithmetical. Our approach, inspired by Reference 3, is to analyse by studying the product set . This amounts to passing from fixed points of substitutions to fixed points of two-dimensional substitutions. See, e.g., Reference 4 for an overview of the theory of two-dimensional substitutions. Our application of two-dimensional substitutions to the sumset problem is self-contained.

In Section 3 we give a very simple proof of the result of Reference 5 using a two-dimensional substitution which has as fixed point. A proof of the result from Reference 5, analogously to the way this is done in Theorem 4, is very complex. The numbers of columns on the right side of the 2D fixed point of the product substitution that one has to use is unbounded for the Fibonacci substitution, whereas it is 15 for the von Neumann substitution.

In Section 4 we solve the , and problem for the Thue-Morse word , fixed point of the substitution , . We remark that our result is equivalent to Theorem 2 of the paper Reference 9, which is proved in a completely different, computer-assisted way.

In Section 5 we solve the and problem for the von Neumann word. The interest of this example lies in the fact that this example is one of a large family for which the method of Reference 7 does not work Reference 8.

In Section 6 we present the classical ‘sum of two squares’ problem in our 2D substitution context. This example also illustrates the fact that our technique extends from fixed points of substitutions to 0-1-valued morphic sequences, i.e., fixed points of substitutions on arbitrary alphabets which are mapped to by a letter-to-letter map.

2. Substitutions and their products

A substitution is a homomorphism of the monoid of words over an alphabet , that is, consists of concatenations with from for , and for all . Since we are interested in the characteristic functions of 0-1-words, we simplify the presentation and take .

Let on be a substitution given by

for two natural numbers and . We then define the direct product substitution on the alphabet by

where .

Example 1.

Let be the Fibonacci substitution given by , then, abbreviating to for the direct product substitution is given by

Graphic without alt text

Suppose that has prefix 0. Then converges to a fixed point of as , and so converges to the 2D fixed point of .

Thus the element of with index contains the symbol if and only if if and only if and .

Let the diagonal words be defined for by

Then if and only if contains a symbol . So we have obtained the following theorem.

Theorem 1.

Let be a substitution on , such that has prefix 0, and let be the fixed point of with prefix . Let be the sequence of positions of in . Let be the diagonal words occurring in , for . Then .

Note that we also have that if is the sequence of positions of in , then , and .

3. The Fibonacci word

The Fibonacci word is the unique fixed point of the substitution , . Let be the direct product of .

It is convenient to code , and to assign colors to these four letters so that one obtains more easily an idea of the structure of . The direct product substitution on this new alphabet is given by

Graphic without alt text

Here are some examples of 2D words obtained by iterating .

Graphic without alt text

Note that

We are now in a position to give a completely different proof of Theorem 3.1. of Reference 5.

Theorem 2 (Reference 5).

Let , , , , , , , , , , … be the sequence , where is the golden mean. Then

Proof.

Recall from the introduction that the sequence is equal to the sequence of positions of 0 in the infinite Fibonacci word .

We use Theorem 1. Trivially, 1 is not in , and we see that does not occur in . Further we see that does occur in , and . So it remains to show that all contain a symbol for . To see this, note that the first five columns on the left border of are a concatenation of the 2D words and for all . So these five columns are composed of blocks on top of , on top of , and on top of :

Graphic without alt text

In all three cases, the diagonal words starting at the left border in the top or will cross a square with a symbol in one of the four left most columns, which finishes the proof.

Remark.

The proof shows that if , with from , then can always be chosen from the set .

4. The Thue-Morse word

The Thue-Morse word is the fixed point of the substitution , . Although it is general practice (as in Reference 5) to index Beatty sequences starting from , and similarly for fixed points of substitutions, this is not the case for the Thue-Morse sequence. The Thue-Morse word is indexed starting from . Thus , , , , … gives the positions of 0 in , and , , , , … the positions of 1 in .

Let , be the symmetry operator on . Note that is symmetric, i.e., for , .

The direct product substitution of is the 2D substitution defined on the symbols for (written as ) by

When we code , and color the squares of the symbols and , then the substitution takes the form

Graphic without alt text.

Since we start at index 0, we have to reconsider the definition of the diagonal words. It turns out that it is convenient to index these by their lengths. So , etc. The iterate with the diagonal words , , …, indicated by lines is given by

Graphic without alt text

Our goal is to describe the diagonal words , for , . We see that in particular

Proposition 1.

Let be the morphism given by for . Then

Let be the 2-to-1 morphism given by for . Then

Proof.

Let the lines correspond to the diagonal words . The line cuts through the diagonals of the -blocks , …, , and these -blocks are images under of the symbols , …,  on the line . Thus now follows from the observation that the are exactly the symbols on the diagonals of the blocks .

For the second statement, we look at the successive blocks , …, cut by the line . The line cuts through the lower left squares of these blocks. This implies that the symbols at the odd indices of the word can be read off directly from the successive pairs of symbols of by mapping the diagonals of the -blocks to the symbols at the lower left corner of these blocks. This gives , which is an instance of .

For the symbols at the even indices of the word we have to do more work: we have to determine the symbol at the upper right corner of the -block directly below a pair of -blocks from the pair of symbols situated at lower left and the upper right corner of these blocks. The situation is as in the following drawing:

Graphic without alt text

This gives: , which happens to be a version of .

Theorem 3.

Let , , , , , , … give the positions of ’s in Thue Morse sequence, and , , , , , , … the positions of in the Thue Morse sequence. Then

Proof.

We only prove the result. The other two can be proved in a similar way.

According to Theorem 1, if and only if contains a symbol . Note that we had to shift the diagonal words, as they are redefined for this section.

Let be the morphism from Proposition 1 in -coding, , , , . Then is given by

It follows from this and Equation Equation 3 that , , for .

This takes care of the part of the theorem. Obviously 2 and 4 are not in . It remains to prove that all other numbers are in , which is equivalent to proving that a symbol occurs in all diagonal words with and not equal to a power of 2. It looks attractive to use Proposition 1 to accomplish this. Indeed, let be the set consisting of the twelve 2-blocks , , …, with two different symbols. Then one may check that all elements of occur in the length 4 blocks from , and also in the length 3 blocks from . This implies that a diagonal word in which all elements from occur propagates this property to and . However, there are many diagonal words in which not all blocks from occur, in fact all of the form for some non-negative integers and . This makes an induction proof very complex, and so we will give a completely different proof.

In Figure 1 we depict the -squares. Here the red lines indicate diagonal words without the symbol , and the green lines indicate diagonal words with a symbol .

We also indicated (parts of) the diagonal words above the main diagonal. These are denoted by , …, , starting from the main diagonal. For instance the block has the two red diagonal lines and .

Figure 2 gives the red and green line structure for the -blocks inherited from the -blocks.

Observe that any diagonal that is partly red, partly green, actually should be a green diagonal. This leads to Figure 3.

We finish the proof with induction. Starting from , the blocks , , , have the following properties:

(1)

There are some red diagonals with .

(2)

There are some red diagonals with .

(3)

There are some red diagonals and with .

(4)

All other diagonals and are green.

One checks that these four properties hold for , and then using the induction hypothesis makes the step from to in the same way as the step from to is made as in Figures 1, 2 and 3.

5. The von Neumann word

The von Neumann word was introduced in the paper Reference 1. One has , fixed point of the substitution , .

The 2D von Neumann substitution is given by

Graphic without alt text

Before we continue, we formulate a simple lemma on the language of the substitution , i.e., on the collection of words that can occur in .

Lemma 1.

The words and do not occur in .

Using Lemma 1 one simply derives the next lemma.

Lemma 2.

The word occurs uniquely as suffix of . The word occurs uniquely as suffix of . The word occurs uniquely as suffix of .

We split the possible occurrences of words of in the von Neumann word according to a tree of prefixes, dividing these in six possible cases. This prefix tree is given in Figure 4, where is the empty word. Note that we used Lemma 1 again to determine the offspring of the node labeled 10.

Theorem 4.

Let , , , , , … give the positions of ’s in the von Neumann sequence, and , , , , , , … the positions of in the von Neumann sequence. Then

Proof.

The proof of is almost trivial.

The proof for is based on Theorem 1, with the trivial change that we consider the fixed point starting with 1. The proof runs as the proof for the Fibonacci word, except that here we need the 15 left most columns.

First one checks easily that the numbers are not in .

To handle the diagonal words for , we split into the six cases we introduced in Figure 4. In Case 2, 3 and 4 we consider the three diagonals starting at the left border in the blocks , in Case 1, 5 and 6 the single diagonal starting at the left border in . See Figure 5 for the three cases 00, 11 and 001. This means that in Case 1 the top two blocks are , in Case 2 they are , and in Case 3 they are .

In Case 2 a block must precede the top blocks , by Lemma 2. In a similar way there are forced blocks in Case 4 and Case 6.

The other blocks are then inserted following the von Neumannn words or . In all cases, except the last, a row has been added at the bottom. This is allowed because both and have the letter as prefix.

The red diagonals are those parts of the three diagonals starting in the top block , respectively the single diagonal starting in , that do not cross a square with label , i.e., the red diagonal ends just before the first -square. See Figure 6 for the three remaining cases.

The fact that in all six cases a square with label is encountered finishes the proof.

6. The sum of squares

Here we give a classical example of a sumset. Let . It is well known (see, e.g., Reference 2), that the characteristic function of , as a word, is a letter-to-letter substitution of the fixed point with prefix 0 of the morphism

The letter-to-letter map is given by , , .

The corresponding 2D substitution is the morphism given by

Graphic without alt text

The numbers that are a sum of two squares occur as sums of the indices of the squares with the symbols in the fixed point of , limit of as .

At first sight it is surprising that this set has a fractal structure, but due to the fact that the morphism is not primitive (i.e., the incidence matrix of the substitution is reducible), there is no exponential scaling structure in the 2D word , but rather a polynomial one. The latter simply amounts to the recursion . This recursion is clearly visible in Figure 7, which displays the two-dimensional word .

Acknowledgments

I am grateful to Jean-Paul Allouche and Jeffrey Shallit for their useful comments on this paper.

Figures

Figure 1.

Diagonals in -squares

Graphic without alt text
Figure 2.

Diagonals in -squares inherited from -blocks

Graphic without alt text
Figure 3.

Final color assignment to the diagonals in -blocks

Graphic without alt text
Figure 4.

The prefix tree for the von Neumann words

Graphic without alt text
Figure 5.

Diagonals for 2D von Neumann words in Cases 1, 2, 3

Graphic without alt text
Figure 6.

Diagonals for 2D von Neumann words in Cases 4, 5, 6

Graphic without alt text
Figure 7.

The 2D word

Graphic without alt text

Mathematical Fragments

Theorem 1.

Let be a substitution on , such that has prefix 0, and let be the fixed point of with prefix . Let be the sequence of positions of in . Let be the diagonal words occurring in , for . Then .

Equation (3)
Proposition 1.

Let be the morphism given by for . Then

Let be the 2-to-1 morphism given by for . Then

Lemma 1.

The words and do not occur in .

Theorem 4.

Let , , , , , … give the positions of ’s in the von Neumann sequence, and , , , , , , … the positions of in the von Neumann sequence. Then

References

Reference [1]
J.-P. Allouche, J. Bétréma, and J. O. Shallit, Sur des points fixes de morphismes d’un monoïde libre (French, with English summary), RAIRO Inform. Théor. Appl. 23 (1989), no. 3, 235–249, DOI 10.1051/ita/1989230302351. MR1020473,
Show rawAMSref \bib{All-Ber-Shallit}{article}{ author={Allouche, J.-P.}, author={B\'{e}tr\'{e}ma, J.}, author={Shallit, J. O.}, title={Sur des points fixes de morphismes d'un mono\"{\i }de libre}, language={French, with English summary}, journal={RAIRO Inform. Th\'{e}or. Appl.}, volume={23}, date={1989}, number={3}, pages={235--249}, issn={0988-3754}, review={\MR {1020473}}, doi={10.1051/ita/1989230302351}, }
Reference [2]
Alan Cobham, Uniform tag sequences, Math. Systems Theory 6 (1972), 164–192, DOI 10.1007/BF01706087. MR457011,
Show rawAMSref \bib{Cobham}{article}{ author={Cobham, Alan}, title={Uniform tag sequences}, journal={Math. Systems Theory}, volume={6}, date={1972}, pages={164--192}, issn={0025-5661}, review={\MR {457011}}, doi={10.1007/BF01706087}, }
Reference [3]
Michel Dekking, Károly Simon, and Balázs Székely, The algebraic difference of two random Cantor sets: the Larsson family, Ann. Probab. 39 (2011), no. 2, 549–586, DOI 10.1214/10-AOP558. MR2789506,
Show rawAMSref \bib{DSS-AP}{article}{ author={Dekking, Michel}, author={Simon, K\'{a}roly}, author={Sz\'{e}kely, Bal\'{a}zs}, title={The algebraic difference of two random Cantor sets: the Larsson family}, journal={Ann. Probab.}, volume={39}, date={2011}, number={2}, pages={549--586}, issn={0091-1798}, review={\MR {2789506}}, doi={10.1214/10-AOP558}, }
Reference [4]
Natalie Priebe Frank, A primer of substitution tilings of the Euclidean plane, Expo. Math. 26 (2008), no. 4, 295–326, DOI 10.1016/j.exmath.2008.02.001. MR2462439,
Show rawAMSref \bib{Frank-primer}{article}{ author={Frank, Natalie Priebe}, title={A primer of substitution tilings of the Euclidean plane}, journal={Expo. Math.}, volume={26}, date={2008}, number={4}, pages={295--326}, issn={0723-0869}, review={\MR {2462439}}, doi={10.1016/j.exmath.2008.02.001}, }
Reference [5]
Sutasinee Kawsumarng, Tammatada Khemaratchatakumthorn, Passawan Noppakaew, and Prapanpong Pongsriiam, Sumsets associated with Wythoff sequences and Fibonacci numbers, Period. Math. Hungar. 82 (2021), no. 1, 98–113, DOI 10.1007/s10998-020-00343-0. MR4219408,
Show rawAMSref \bib{Kawsumarng}{article}{ author={Kawsumarng, Sutasinee}, author={Khemaratchatakumthorn, Tammatada}, author={Noppakaew, Passawan}, author={Pongsriiam, Prapanpong}, title={Sumsets associated with Wythoff sequences and Fibonacci numbers}, journal={Period. Math. Hungar.}, volume={82}, date={2021}, number={1}, pages={98--113}, issn={0031-5303}, review={\MR {4219408}}, doi={10.1007/s10998-020-00343-0}, }
Reference [6]
M. Lothaire, Algebraic combinatorics on words, Encyclopedia of Mathematics and its Applications, vol. 90, Cambridge University Press, Cambridge, 2002. A collective work by Jean Berstel, Dominique Perrin, Patrice Seebold, Julien Cassaigne, Aldo De Luca, Steffano Varricchio, Alain Lascoux, Bernard Leclerc, Jean-Yves Thibon, Veronique Bruyere, Christiane Frougny, Filippo Mignosi, Antonio Restivo, Christophe Reutenauer, Dominique Foata, Guo-Niu Han, Jacques Desarmenien, Volker Diekert, Tero Harju, Juhani Karhumaki and Wojciech Plandowski; With a preface by Berstel and Perrin, DOI 10.1017/CBO9781107326019. MR1905123,
Show rawAMSref \bib{Lothaire}{book}{ author={Lothaire, M.}, title={Algebraic combinatorics on words}, series={Encyclopedia of Mathematics and its Applications}, volume={90}, note={A collective work by Jean Berstel, Dominique Perrin, Patrice Seebold, Julien Cassaigne, Aldo De Luca, Steffano Varricchio, Alain Lascoux, Bernard Leclerc, Jean-Yves Thibon, Veronique Bruyere, Christiane Frougny, Filippo Mignosi, Antonio Restivo, Christophe Reutenauer, Dominique Foata, Guo-Niu Han, Jacques Desarmenien, Volker Diekert, Tero Harju, Juhani Karhumaki and Wojciech Plandowski; With a preface by Berstel and Perrin}, publisher={Cambridge University Press, Cambridge}, date={2002}, pages={xiv+504}, isbn={0-521-81220-8}, review={\MR {1905123}}, doi={10.1017/CBO9781107326019}, }
Reference [7]
Jeffrey Shallit, Sumsets of Wythoff sequences, Fibonacci representation, and beyond, Period. Math. Hungar. 84 (2022), no. 1, 37–46, DOI 10.1007/s10998-021-00390-1. MR4381748,
Show rawAMSref \bib{Shallit-sumset}{article}{ author={Shallit, Jeffrey}, title={Sumsets of Wythoff sequences, Fibonacci representation, and beyond}, journal={Period. Math. Hungar.}, volume={84}, date={2022}, number={1}, pages={37--46}, issn={0031-5303}, review={\MR {4381748}}, doi={10.1007/s10998-021-00390-1}, }
Reference [8]
J. Shallit, Personal communication, March 2021.
Reference [9]
Aayush Rajasekaran, Jeffrey Shallit, and Tim Smith, Additive number theory via automata theory, Theory Comput. Syst. 64 (2020), no. 3, 542–567, DOI 10.1007/s00224-019-09929-9. MR4081148,
Show rawAMSref \bib{Shallit-additive}{article}{ author={Rajasekaran, Aayush}, author={Shallit, Jeffrey}, author={Smith, Tim}, title={Additive number theory via automata theory}, journal={Theory Comput. Syst.}, volume={64}, date={2020}, number={3}, pages={542--567}, issn={1432-4350}, review={\MR {4081148}}, doi={10.1007/s00224-019-09929-9}, }

Article Information

MSC 2020
Primary: 11B13 (Additive bases, including sumsets), 68R10 (Graph theory (including graph drawing) in computer science), 05A17 (Combinatorial aspects of partitions of integers)
Author Information
F. Michel Dekking
CWI, Amsterdam; and Delft University of Technology, Faculty EEMCS, Delft, The Netherlands
Michel.Dekking@cwi.nl, F.M.Dekking@tudelft.nl
Communicated by
Amanda Folsom
Journal Information
Proceedings of the American Mathematical Society, Series B, Volume 9, Issue 37, ISSN 2330-1511, published by the American Mathematical Society, Providence, Rhode Island.
Publication History
This article was received on , revised on , and published on .
Copyright Information
Copyright 2022 by the author under Creative Commons Attribution-NonCommercial 3.0 License (CC BY NC 3.0)
Article References
  • Permalink
  • Permalink (PDF)
  • DOI 10.1090/bproc/132
  • MathSciNet Review: 4503109
  • Show rawAMSref \bib{4503109}{article}{ author={Dekking, F.}, title={Sumsets and fixed points of substitutions}, journal={Proc. Amer. Math. Soc. Ser. B}, volume={9}, number={37}, date={2022}, pages={393-403}, issn={2330-1511}, review={4503109}, doi={10.1090/bproc/132}, }

Settings

Change font size
Resize article panel
Enable equation enrichment

Note. To explore an equation, focus it (e.g., by clicking on it) and use the arrow keys to navigate its structure. Screenreader users should be advised that enabling speech synthesis will lead to duplicate aural rendering.

For more information please visit the AMS MathViewer documentation.