Notices of the American Mathematical Society

Welcome to the Notices of the American Mathematical Society.
With support from AMS membership, we are pleased to share the journal with the global mathematical community.

Wiener-Wintner Ergodic Theorem, in Brief

Idris Assani

Communicated by Notices Associate Editor Steven Sam

Article cover

In 1941, N. Wiener and A. Wintner introduced a strengthening of the Birkhoff-Khinchin Ergodic Theorem, which initiated the study of a general phenomenon in ergodic theory where samplings are “good” for an uncountable number of systems. While this result was interesting on its own, it took mathematicians a few decades to realize that these “good” samplings play key roles in various other types of ergodic averages.

In this short note, we will briefly introduce the theorem and discuss how it played a key role in the study of Furstenberg averages, averages along the cubes, and the Return Times Theorem.

1. What is Ergodic Theory?

The results that we are presenting here are part of ergodic theory. We do not pretend to present an exhaustive list on the topic defined in the introduction. Our goal is to introduce the interested reader and particularly graduate students to this topic without requiring extensive background.⁠Footnote1

1

We will use a to indicate references which are not listed at the end of the article. The complete list of references is available at https://idrisassani.web.unc.edu/wp-content/uploads/sites/21419/2021/09/Complete-bibliography-WWET.pdf.

The word “ergodic,” which was coined by L. Boltzmann, comes from the Greek words (energy) and (path). Ergodic theory is a branch of mathematics which has its origin in statistical mechanics, and can be dated back to the 1870s when J. C. Maxwell and Boltzmann were working on formulating an “ergodic hypothesis” which studied the conditions for which the “time average” of the system equals the “space average.”

For instance, consider a box with gas molecules inside it. We can think of the box as a unit cube in . Suppose one is interested in studying how often these molecules, which are moving freely, visit the first octant of the box on average between a large enough time interval. One could study the path of each molecule, but that becomes hopeless as soon as we realize the number of variables are enormous (perhaps on the order of Avogadro’s number, which is around ). But if we assume that the molecules move freely enough so that they are not particularly trapped in a certain part of the box, then the frequency of their visit should equal the proportion of the entire space that our molecules occupy (if we are correct in the hypothesis that the time average equals the space average). As long as we can make this assumption, we conclude that the molecules are in the first octant of the box of the time on average. We recommend C. Moore’s note 17 for those interested in the historical aspect of the initial development of ergodic theory.

The mathematical formulation of ergodic theory starts with a measure space such that there exists an action by a measurable map The set is then called a dynamical system. If the underlying -algebra is clear, we sometimes omit it and simply write . Ergodic theory is a subfield of the field of dynamical systems which studies the statistical behavior of such systems. The map is said to be measure preserving if for each we have The dynamical system is then called a measure-preserving system. Measure-preserving systems are one of the central objects of study of ergodic theory. Here are some examples of measure-preserving systems:

a)

Rotations on , the one-dimensional torus, with Lebesgue measure . They are defined by the map , where The measure-preserving property is a simple consequence of the fact that Lebesgue measure is invariant under translations. This makes a measure-preserving system.

b)

The map is defined by , where is again the Lebesgue measure. We sketch the graph of this map in Figure 1. Here the measure-preserving property can be derived by checking it on open intervals . The slope of the lines show that where and .

Figure 1.

The doubling map defined by .

Graphic for Figure 1.  without alt text

We remark that the map has some properties quite distinct from the rotation map above. First of all, is invertible, while is not. Second of all, is forward measure preserving, i.e., , while is not. Lastly, the system under the map displays more chaotic behavior, in the sense that trajectories between two points in may differ quite a bit, no matter how close the original points are, whereas those trajectories under will always have fixed distance. The former phenomenon is called sensitivity to initial conditions, or some may call it the butterfly effect. While this notion is not too important in this article, interested readers can learn more details about this in any textbook on dynamical systems.

c)

One can obtain a measure-preserving system in a fairly general situation. Let be a compact metric space and a continuous function from to By a theorem of Krylov-Bogoliubov*, there exists a measure defined on the Borel subsets of which makes the system measure preserving.

We give now the definition of an ergodic measure-preserving system.

Definition 1.1.

The measure-preserving system is said to be ergodic if implies or

We will discuss the importance of this notion later, but an intuitive way to describe this is that an ergodic system contains no nontrivial invariant set (modulo a set of measure zero).

It is simple to see that if in example a) the number is rational, then is periodic. This periodicity prevents the system from being ergodic. However, if is irrational this system is ergodic. This can be seen by using Weyl’s criteria of uniform distribution. Example b) is also ergodic.

Furthermore, is ergodic if and only if every invariant function is a.e. constant (i.e., if for some , then is a constant function a.e.).

We will assume in this paper that In other words is a probability space.

One of the nice things about studying ergodic theory is that all measure-preserving systems employ valuable statistical properties, including those that display more chaotic behavior (like the doubling map example above) that are typically challenging to study at the level of individual orbits. Here we present three classical results that demonstrate these statistical qualities.

We will first discuss possibly the oldest result in ergodic theory, which is due to Poincaré.

Theorem 1.2 (Poincaré Recurrence Theorem).

Let be a measure-preserving system where we assume that Consider with positive measure. Then Furthermore if the system is ergodic, then if is another set with positive measure we have

In essence this theorem says that for “most” points , its iterates under , namely , return infinitely often to the set . When the system is ergodic, then for most points its iterates will “visit” any other set with positive measure infinitely often (an intuitive explanation for this follows from the fact that an ergodic system lacks nontrivial invariant sets).

The following two classical results are considered to be the “heart” of ergodic theory. In particular, the results by von Neumann and Birkhoff (later reformulated by Khinchin) from the early 1930s provided a rationale for the hypothesis that time averages can be equal to space averages, which was a fundamental problem in statistical mechanics.

Figure 2.

G. Birkhoff (left), A. Khinchin (middle), and J. von Neumann (right).

Graphic for Figure 2.  without alt text

First we discuss the result by von Neumann.

Theorem 1.3 (von Neumann’s mean ergodic theorem).

Let be a measure-preserving system, with Let denote a function in for some , . Then the averages converge in norm to a function such that If the system is ergodic, then

One of the consequences of this theorem is the decomposition of the space into the sum of two subspaces: , and the closure of the set of functions of the form Functions of this form are called coboundaries. Thus we have (see Yoshida-Kakutani* or 3, §1.1 for more details).

Next we discuss the result by Birkhoff and Khinchin. Birkhoff originally obtained this result for continuous flows on manifolds. Later Khinchin extended that result to hold for abstract measure-preserving dynamical systems.

Theorem 1.4 (Birkhoff-Khinchin pointwise ergodic theorem).

Let be a measure-preserving system, with Let denote a function in for some , . Then the averages converge a.e. to , where the function satisfies If the system is ergodic, then

One classical way to prove Theorem 1.4 is by using the following maximal inequality: for each and for each we have

By the Banach principle, this inequality shows that the set of functions for which pointwise convergence holds is closed in . This reduces the study of the pointwise convergence to a dense set which is given by

Perhaps some readers may have noticed that Birkhoff-Khinchin’s pointwise result implies that of von Neumann (on a finite measure space, a.e. convergence and boundedness implies norm convergence by the Lebesgue Dominated Convergence Theorem). While that is indeed the case, it should be noted that von Neumann studied these averages from an operator theory perspective. We should also point out some history regarding Birkhoff and von Neumann, which is nicely summarized in the short note by C. Moore 17:

According to Birkhoff and Koopman, von Neumann communicated his result personally to both of them on October 22, 1931, and pointed out to them that his result raised the important question of whether a pointwise result might be valid. Birkhoff then went to work and, by different methods, quickly established his pointwise ergodic theorem. He submitted his paper to PNAS on December 1, 1931, for appearance in the December 1931 issue. One presumes that he sent copies to Koopman and von Neumann, who would have noticed that Birkhoff had not given von Neumann adequate credit and recognition for his result. von Neumann evidently planned to include his ergodic theorem and its proof in a much longer paper he was writing for the Annals of Mathematics, but he then apparently quickly drafted a short paper for PNAS with his proof of the mean ergodic theorem and submitted it to PNAS on December 10, 1931. It appeared in the January 1932 issue. One suspects that these events led Koopman and Birkhoff to write and publish their paper in PNAS 2 months later, which set matters straight and clearly acknowledged von Neumann’s priority. It should also be noted that E. Hopf presented a slightly different proof of the mean ergodic theorem and some improvements on the Birkhoff theorem in a paper, which appeared in the January 1932 issue of PNAS. For whatever reason, the Birkhoff paper and its result has over time become the better known of the two papers, but in light of these historical details, the von Neumann paper deserves at least equal billing.

The averages are called “time averages” and is called the space average. Another statement of the pointwise ergodic theorem for ergodic systems is that for almost any point , the limit of the time averages is the space average.

Given a set of positive measure the sequence is composed of zeros and ones. By the pointwise ergodic theorem the averages converge a.e. to if is ergodic.

Each time we have . We introduce the following definition that we will use later.

Definition 1.5.

Let be a measure-preserving system. Given , we denote by , the th visit time to the set (see Figure 3). More precisely, we define and, for ,

Due to the Poincaré Recurrence Theorem, we know that is defined for every for -a.e. . If is ergodic, the return times can be defined on -a.e. .

Figure 3.

Visit times, and to the set .

Graphic for Figure 3.  without alt text

2. Wiener-Wintner Ergodic Theorem

We would like to discuss an important strengthening of the Birkhoff-Khinchin Pointwise Ergodic Theorem, which was announced by N. Wiener and A. Wintner*. This result will be central to our discussion.

Figure 4.

N. Wiener (left) and A. Wintner (right).

Graphic for Figure 4.  without alt text
Theorem 2.1 (Wiener-Wintner Ergodic Theorem).

Let be a probability measure-preserving system, and let . Then there exists , such that for every , the averages

converge for every .

Though the original proof had a gap, fortunately the gap has been filled, and now there are at least three different proofs of this theorem 3.

We note that if the value of is fixed, one can show that the convergence of the averages above is an immediate corollary of the Birkhoff-Khinchin Theorem. Indeed, let be any measure-preserving system, and let be the rotation system on the -torus by , i.e., , . Let , and let such that . Then

and these averages converge by applying the Birkhoff-Khinchin Theorem on the product space .

The novelty of the Wiener-Wintner result, however, is that the set of full measure is independent of the value of . Since is uncountable, one cannot simply apply one of the standard tricks of measure and integration where we intersect countably many sets of full measure to show that the result holds.

It appears that the Wiener-Wintner Theorem is of interest to some applied mathematicians as well, as communicated to the author by I. Mezić. See a paper, for instance, by M. Budišić, R. Mohr, and I. Mezić*.

More can be said about the Wiener-Wintner Ergodic Theorem if we assume the system to be ergodic. In particular, it becomes easier to identify the limit of Wiener-Wintner averages.

For an ergodic measure-preserving transformation , we say that is an eigenfunction for if -a.e. for some and . The Kronecker factor is a sub--algebra of generated by the eigenfunctions of (i.e., the smallest -algebra that makes all of the eigenfunctions of measurable). In this situation, is the closed linear span of eigenfunctions of . We note that if is an eigenvalue of , then since is measure preserving.

For example, let be a -torus, and let be an irrational number. Let be a map such that . The system is called the skew-product system (where is the normalized Lebesgue measure). We consider functions of the form , where . We note that if and only if , or in other words . This fact can be used to show that the system is an ergodic system (where is the normalized Lebesgue measure).⁠Footnote2 Furthermore, if is an eigenfunction of , i.e., for some , then we must have . This implies that is generated by the set of functions that only depend on the first coordinate.

2

A formal proof can be derived using the Fourier expansion of a function in Furthermore, on a topological system like the skew-product one, ergodicity can be demonstrated by showing every -invariant continuous function is a.e. constant.

We are now ready to state the uniform version of Theorem 2.1. This result was announced in the work of Bourgain on the Double Recurrence Theorem 6, and applied to prove this result (we will discuss this later). A detailed proof can be found in, for instance, 3, Theorem 2.4.

Theorem 2.2 (Uniform Wiener-Wintner Ergodic Theorem).

Let be an ergodic measure-preserving system, the Kronecker factor, and suppose that is orthogonal to . Then for -a.e. , we have

To illustrate this theorem, we present the following example which shows convergence of averages that are fairly difficult to prove directly. Recall the skew-product example above. By the definition of , we have

We also know from the discussion above that the orthogonal complement of is spanned by the set . The Uniform Wiener-Wintner Theorem asserts that for -a.e. , we have

or equivalently,

The Uniform Wiener-Wintner Theorem tells us about the limiting behavior of these averages. As a consequence of this result, one can show that if is a separable ergodic system and , the limit of Wiener-Wintner averages is a “projection” onto the eigenspace of if is one of the eigenvalues of (the averages converge to otherwise).

We remark however that the Uniform Wiener-Wintner Theorem is false without the assumption of ergodicity. Let be the same skew-product system that we saw earlier. Then is a measure-preserving system that is not ergodic (for instance, is invariant under , even though it is not a constant function). The map can be explicitly written as

It is known that the closed linear span of eigenfunctions for is given by the closed linear span of products of the characters . In other words, it is given by the functions depending only on and .

Consider the functions , where . First, note that is orthogonal to the closed linear span⁠Footnote3 of the eigenfunctions of . But the Wiener-Wintner averages for this function are

3

While we do not discuss the notion of factor here, it may be worth mentioning that if the system is not ergodic, then the closed linear span of eigenfunctions does not form a factor. This is why we are not calling this the Kronecker factor.

Thus,

(the supremum is obtained when ), which shows that

so the uniformity result does not hold.

In the next few sections, we will see some of the applications of the Uniform Wiener-Wintner Theorem in other ergodic averages.

3. The Double Recurrence Theorem

Let be a measure-preserving system. Consider a set with positive measure. The sequence takes the value 1 if and both belong to (see Figure 5). One can observe that the gap between and goes to infinity even if itself goes to infinity which adds another level of difficulty.

Figure 5.

Double recurrence to the set : and are in .

Graphic for Figure 5.  without alt text

The Furstenberg averages for the pair of functions are equal to

More generally for functions that are in , we have the following definition.

Definition 3.1.

The Furstenberg averages of the functions , are defined as

Since the powers of commute, one can generalize this concept further and define the averages

when are commuting transformations acting on

It should be noted that the a.e. convergence of Furstenberg averages do not follow from the Birkhoff-Khinchin Theorem; the latter tells us that if

then . But this does not guarantee that for -a.e. , and this is because the set of the diagonal elements may be a -null set, so the Birkhoff-Khinchin Theorem provides no information on the convergence of the averages on . Similarly, the norm convergence of the Furstenberg averages do not follow from the von Neumann Theorem either.

The study of Furstenberg averages stems from the proof of Szemerédi’s Theorem provided by H. Furstenberg in 1977 14. Szemerédi’s Theorem is stated as follows: Let be a set with positive upper density, i.e.,

where denotes the cardinality of the set inside. Then the set has arbitrary long arithmetic progressions, i.e., given , there exist and such that are all in .

Furstenberg translated this problem in number theory into ergodic theory, and proved the following result, which can be used to prove Szemerédi’s Theorem: Given a measure-preserving system and a set such that , we have

for every . Note that the averages above can be rewritten as

Figure 6.

J. Bourgain (left) and H. Furstenberg (right).

Graphic for Figure 6.  without alt text

In 1981, Furstenberg asked if the multiple ergodic averages converge pointwise. J. Bourgain 6 proved the following result in 1990.

Theorem 3.2.

Let be two functions. Then the averages

converge a.e.

The norm convergence of the averages in this theorem had been established earlier by Furstenberg. At the time of Bourgain’s result, not much was known about the convergence of Furstenberg averages, aside from the norm convergence results obtained by J.-P. Conze and E. Lesigne* in 1994 (for two functions with commuting transformations and three functions with different powers of a single transformation). Bourgain’s proof is quite technical and not easy to read, although one of the key elements of his proof was the uniform Wiener-Wintner Ergodic Theorem (Theorem 2.2). To see this, we consider two cases: the first when , and the second when is orthogonal to . The first case essentially narrows down to showing the convergence holds if is an eigenfunction of , which is quite straightforward to do. We then consider the second case. Define

By following strings of simple (yet clever) arguments presented in the first two pages of Bourgain’s proof, one can show that, by using the inequalities of Parseval and Cauchy-Schwarz, for every , we have

for some constant that may depend on and .⁠Footnote4 This estimate gives an upper bound of the -norm in terms of the Wiener-Wintner averages. The remainder of Bourgain’s paper gets very technical, but perhaps it would be nice to work with this estimate directly at the expense of imposing an additional assumption. And that key assumption is the following rate of convergence on the Wiener-Wintner averages: let us consider to be a function for which

4

The value of the constant may differ in the subsequent lines, but it will always depend only on and , and the exact value of will be irrelevant since we will see that the upper estimate will vanish as .

for some and a positive constant that may depend on (it should be noted that one can use the Cauchy-Schwarz inequality to show that is indeed an upper bound for the set of possible values of for this estimate to hold). If we make this assumption, we get

If we set (where is the floor function), one would get

This estimate alone already grants the norm convergence of to zero, but one can go further. Consider the sequence where is a real number. This implies that

for every . We sum both sides over to get

Following this fact and the Monotone Convergence Theorem, we get

which tells us that the function is bounded for -a.e. . This immediately implies that

for -a.e. . To show the a.e. convergence of , first let such that . Then we compute that

We already know the first term of the right-hand side converges to . The second term is bounded above by

As we let , we can also let , so the above expression goes to . And this proves the Double Recurrence Theorem for this special case where we can prescribe the rate of convergence to the Wiener-Wintner averages.

Based on this observation, we felt it was sensible to define a class of ergodic dynamical systems where one can obtain a rate of convergence in the Uniform Wiener-Wintner Theorem. In this way, one can prove the a.e. Double Recurrence Theorem in a much simpler fashion for this particular class. We called them Wiener-Wintner dynamical systems and some of their properties, including the full proof of the Double Recurrence Theorem on this class of systems, can be found in 3.

Definition 3.3.

Let and . An ergodic dynamical system is said to be a Wiener-Wintner dynamical system of power class in if there exists a dense class of functions in the orthogonal complement of its Kronecker factor such that for each one of these functions one can find a finite constant such that for each

If a function satisfies the inequality above, we call a Wiener-Wintner function of type .

There are numerous examples of Wiener-Wintner dynamical systems. For instance:

Theorem 3.4 (Cf. 3, Proposition 6.1).

Let be a skew-product system as discussed above, where . There exists a set of irrational numbers that has full Lebesgue measure such that for each , the system is a Wiener-Wintner dynamical system of power type in .

As we saw earlier, the skew-product system is not weakly mixing, so one may not conclude the pointwise convergence of double recurrence averages by Theorem 3.6.

Another class of examples of Wiener-Wintner dynamical systems are the so-called K-systems, which are dynamical systems with K automorphisms (the letter K stands for Kolmogorov). For example, let and such that and is a discrete probability measure on . Then consider , , and if , then we define to be a shift map. Then is called a Bernoulli shift, and one can verify that it is a K-system. While all Bernoulli shifts are K-systems, not all K-systems are Bernoulli shifts, which was demonstrated by D. Ornstein*.

Theorem 3.5 (Cf. 3, Theorem 6.1).

A K-system is a Wiener-Wintner dynamical system of power type in for (where the positive constant may depend on ).

It should be noted that the pointwise convergence of the Double Recurrence Theorem on a K-system was proven by J. Derrien and E. Lesigne in 1995.

Remark.

Not all ergodic systems are Wiener-Wintner dynamical systems of power class. Examples obtained independently by E. Lesigne and the author are mentioned in the author’s paper* published in 2004.

Using Bourgain’s result and combining it with Host’s* result on pairwise independent joining we were able to prove the following result on a weakly mixing system (we say an ergodic system with additional conditions is weakly mixing if the product space is ergodic). A joining of the measure spaces and is a measure on such that its projections on and are respectively and We refer to Eli Glasner’s* book for more on the joinings.

Theorem 3.6.

Let be a weakly mixing system such that its restriction to its Pinsker algebra⁠Footnote5 has singular spectrum. Then the averages

5

It is the maximum -algebra such that restricted on it has zero entropy.

converge a.e. for every and for each

This result was published in 1998. In the same paper we indicated our belief that the general result for functions should be true.

We conclude this section by mentioning some results related to Furstenberg averages and the Wiener-Wintner Ergodic Theorem. For Wiener-Wintner type results, extensions of the Wiener-Wintner Ergodic Theorem have been given for the Double Recurrence Theorem by D. Duncan, R. Moore, and myself*, and for nilsequences independently by myself* and P. Zorin-Kranich*.

Furthermore, partial results on the pointwise convergence of the Furstenberg averages have been obtained by Huang, Shao, and Ye 16 and for commuting transformations by S. Donoso and W. Sun 13. More recently B. Krause, M. Mirek, and T. Tao* have announced the pointwise convergence of the averages

while M. Christ, M. Curcik, V. Kovač, and J. Roos 10 announced a pointwise convergence of a continuous counterpart of these averages. Let us mention that the best possible result at the pointwise level would be the pointwise analog of the norm convergence obtained for polynomial nilpotent actions by M. Walsh 19.

4. Pointwise Convergence of Averages Along the Cubes

In 2001, another proof of Szemerédi’s Theorem was obtained by T. Gowers*. In his work, he introduced certain seminorms on . These seminorms were later generalized by B. Host and B. Kra 15 to be defined on arbitrary measure-preserving systems, which was used to prove norm convergence of Furstenberg averages (see §3). Furthermore, their proof developed the concept of averages along cubes, introduced for the case of three functions by V. Bergelson*, which we will discuss in this section.

Let be a measure-preserving system. The averages along the cubes of order 2 are given by

where

The averages along the cubes of order 3 are defined as

More generally for functions all in the averages along the cubes of order are defined as

In 15, Host and Kra proved the norm convergence of these averages. In 2004, we proved the pointwise case.

Theorem 4.1.

The averages along the cubes of any order converge a.e.

To our knowledge, this was the first complete pointwise convergence result obtained in the theory of nonconventional ergodic averages. The paper was published in 2010 2. Other proofs of Theorem 4.1 have since been given by several authors, such as W. Huang, S. Shao, and X. Ye 16, as well as S. Donoso and W. Sun 13.

Interestingly, we can prove this result by using the Wiener-Wintner Theorem, as we discuss below. We will discuss the second-order averages along the cubes for simplicity. In order to make the Wiener-Wintner averages appear, we have the following estimate on the (deterministic) averages along the cubes.

Lemma 4.2 (2, Lemma 5).

Let , and , , be three complex bounded sequences. Then for each positive integer , there exists such that

In order to prove Theorem 4.1, we utilize the uniform Wiener-Wintner Theorem. We illustrate this for the case of order 2. We denote

where . By Lemma 4.2, we have

where is a positive constant that depends on and . Thus, if is orthogonal to the Kronecker factor , the averages converge to a.e. by the Uniform Wiener-Wintner Theorem. If is an eigenfunction of with the corresponding eigenvalue, one sees that

The pointwise convergence follows from the Birkhoff-Khinchin Theorem applied on the product of and the rotation by (or the Wiener-Wintner Theorem). The general case where follows from a relatively simple approximation argument.

The higher-order case follows from a (not so simple) induction argument. It uses two main elements. One is based on the structure of the Host-Kra-Ziegler factors and the Host-Kra seminorms. The second is based on Wiener-Wintner estimates derived from the structure of the averages along the cubes. Both are needed to prove the higher-order case. Depending on the measure-preserving system one of these two elements will carry more weight. For instance, if the measure system is weakly mixing, then the Host-Kra-Ziegler factors are trivial and the bulk of the proof is based entirely on the second element.

We proved also in 2 that the pointwise convergence could be obtained for measure-preserving maps not necessarily commuting and acting on the same measure space. More precisely we proved the following result in the same paper.

Theorem 4.3.

Let be measure-preserving transformations acting on the same measure space Consider the functions Then the averages

converge a.e.

This theorem can also be proven from Lemma 4.2. In particular, the lemma shows how weighted averages become useful to bound such averages.

This shows that averages along the cubes behave very differently from Furstenberg averages, since the averages

do not converge -a.e. in general if and are non-commuting, as it was shown by D. Berend*.

Later, we extended Theorem 4.3 to the averages of six functions. Q. Chu and N. Frantzikinakis 11 used our method to extend this result to averages along cubes of arbitrary order.

5. The Return Times Theorem

We have seen earlier that given a set with positive measure and off a single null set one could consider the sequence of return times to the set (see Figure 3). A. Brunel in his thesis from 1968 introduced the study of the averages

with a “twist.” We fix the probability measure-preserving system and a set with positive measure. One seeks for a universal set of full measure off which, once is fixed in this set, the averages

would converge a.e. for all other measure-preserving systems and .

Return times averages can be classified as weighted ergodic averages, where one studies the averages of the form

where is a complex-valued sequence acting as the “weight.” We refer to a classical survey paper by A. Bellow and V. Losert* for more on this topic. What makes return times averages interesting is that the sequence is random as opposed to deterministic.

We also remark that the return times averages above can be viewed as a problem on averages along a subsequence, which can be phrased as follows: given a measure-preserving system , , and a strictly increasing sequence of natural numbers , one studies convergence of the averages

For the case of the return times averages, we can set to be the th return time as defined in Definition 1.5 so that we have

There are also some deep results in the case the sequence is deterministic. For instance, suppose that for some integer-valued polynomial . Then J. Bourgain 7 showed that for every , , the averages

converge for -a.e. However, if and , the pointwise convergence need not hold; this result was later shown by Z. Buczolich and D. Mauldin 8.

Partial results on return times averages were obtained by A. Bellow and V. Losert*. The problem was solved first by J. Bourgain in an unpublished paper. Then a shorter proof was provided in 7, jointly with H. Furstenberg, Y. Katznelson, and D. Ornstein (this version of the proof is sometimes called the “BFKO argument”). This result is called the Return Times Theorem.

Theorem 5.1 (The Return Times Theorem).

Let be a probability measure-preserving system, and let . Then there exists a set such that for every and, for any other measure-preserving system and function , the averages

converge for -a.e. .

While the BFKO argument was simple in the sense that it did not require many tools beyond basic ergodic theory, it was originally written in a very concise matter (it was a 3-page proof mentioned in an “appendix” of another paper by Bourgain). Another fairly accessible proof of the Return Times Theorem via joinings was provided by D. Rudolph*.

We remark that this result is a generalization of the Wiener-Wintner Theorem: if we take , the Borel -algebra, the (normalized) Lebesgue measure, a rotation by any , and , then the Return Times Theorem tells us that there exists a set of full measure in that is independent on the value of for which

converge.

We also note that if we replace “for -a.e. with “in ,” the statement becomes a rather simple corollary of the Wiener-Wintner Theorem and the spectral theorem; see 3, Theorem 3.1. Furthermore, if is a Wiener-Wintner dynamical system, then one can obtain an even simpler proof. By the Spectral Theorem, there exists a Borel probability measure on such that

So because is a Wiener-Wintner function of type in , we have

In particular, if is a number such that , we have

This implies that , which implies that there exists a set of full measure that is independent of the other dynamical system such that

for -a.e. . Therefore, for every , we have

for -a.e. , and thus concludes the proof of the Return Times Theorem for the function One can pass from Wiener-Wintner functions of power class to general functions by approximation.

We conclude this section by outlining the development of the Return Times Theorem. The author raised four questions in 1990:

(i)

Consider the averages ; can we have a multidimensional return times result?

(ii)

What is the exact range of the functions for which such a result would hold? Would the dynamic created be powerful enough to allow the convergence to hold beyond the Hölderian inequality?

(iii)

In particular what could happen if both functions and are in ? (This is called the -problem.)

(iv)

What could be the characteristic factors for such averages?

Question (i) was solved by D. Rudolph*.

After some partial results with specific random weights (iid random variables) (see (Assani*, AIHP)), progress for the question (ii) on the averages was obtained in 12 by C. Demeter, M. Lacey, C. Thiele, and T. Tao, where they have shown that the Return Times Theorem holds if and . But some cases remain open.

As for (iii), Z. Buczolich, D. Mauldin, and myself 5 proved that the problem had a negative answer even for the tail which is the term . The current best result on the space for which the return property for the tail of the averages holds for the function while has been obtained by Maria Carro and C. Domingo-Salazar 9. See also S. Baena-Miret and M. Carro*.

Question (iv) was answered by the author and K. Presser*.

6. Future Directions

Here we list some open problems regarding the averages discussed in this paper. For more open problems on the return times averages, see the survey paper by myself and K. Presser*.

(1)

There are several similarities between the Return Times Theorem and the Furstenberg averages. They do have the same characteristic factors, namely the Host-Kra-Ziegler factors. Are they equivalent? Does one imply the other? For instance, is the Return Times Theorem equivalent to the Double Recurrence Theorem?

(2)

Does a polynomial extension of the Wiener-Wintner Theorem hold? In other words, given a measure-preserving system and , does there exist a set of full measure such that for every and every integer-valued polynomial and real-valued polynomial , the averages

converge? E. Lesigne* showed that the case holds. Beyond that, some partial solutions can be found in 3, Chapter 8 and a work of T. Eisner and B. Krause*. A related result was recently obtained by Z. Buczolich and T. Eisner*.

We also continue to ask if a polynomial extension of the Return Times Theorem holds (this was asked in Question 7.2 of the Assani-Presser* survey paper).

(3)

We ask if we can show convergence of multiple recurrence along multiple return times: consider a measure-preserving system and . We define a sequence . We ask if there exists a set of full measure such that for every , for any other measure-preserving system , , , the averages converge -a.e. The case is the classical Return Times Theorem. If , the author and R. Moore* showed that those averages converge in for every . Furthermore, P. Zorin-Kranich* showed the pointwise convergence for .

Acknowledgments

The author thanks the anonymous referees for making comments and suggestions, which were helpful in improving the presentation of the paper. He also thanks Z. Buczolich, A. Kachurovskii, R. Moore, and K. Presser for their comments.

References

[1]
I. Assani, Multiple recurrence and almost sure convergence for weakly mixing dynamical systems, Israel J. Math. 103 (1998), 111–124, DOI 10.1007/BF02762270. MR1613556Show rawAMSref\bib{Assa1}{article}{ author={Assani, I.}, title={Multiple recurrence and almost sure convergence for weakly mixing dynamical systems}, journal={Israel J. Math.}, volume={103}, date={1998}, pages={111--124}, issn={0021-2172}, review={\MR {1613556}}, doi={10.1007/BF02762270}, } Close amsref.
[2]
I. Assani, Pointwise convergence of ergodic averages along cubes, J. Anal. Math. 110 (2010), 241–269, DOI 10.1007/s11854-010-0006-3. MR2753294Show rawAMSref\bib{Assa2}{article}{ author={Assani, I.}, title={Pointwise convergence of ergodic averages along cubes}, journal={J. Anal. Math.}, volume={110}, date={2010}, pages={241--269}, issn={0021-7670}, review={\MR {2753294}}, doi={10.1007/s11854-010-0006-3}, } Close amsref.
[3]
Idris Assani, Wiener Wintner ergodic theorems, World Scientific Publishing Co., Inc., River Edge, NJ, 2003, DOI 10.1142/4538. MR1995517Show rawAMSref\bib{Assa3}{book}{ author={Assani, Idris}, title={Wiener Wintner ergodic theorems}, publisher={World Scientific Publishing Co., Inc., River Edge, NJ}, date={2003}, pages={xii+216}, isbn={981-02-4439-8}, review={\MR {1995517}}, doi={10.1142/4538}, } Close amsref.
[4]
Idris Assani and Zoltán Buczolich, The bilinear Hardy-Littlewood function for the tail, Israel J. Math. 179 (2010), 173–187, DOI 10.1007/s11856-010-0077-y. MR2735039Show rawAMSref\bib{AB}{article}{ author={Assani, Idris}, author={Buczolich, Zolt\'{a}n}, title={The $(L^p,L^q)$ bilinear Hardy-Littlewood function for the tail}, journal={Israel J. Math.}, volume={179}, date={2010}, pages={173--187}, issn={0021-2172}, review={\MR {2735039}}, doi={10.1007/s11856-010-0077-y}, } Close amsref.
[5]
Idris Assani, Zoltán Buczolich, and R. Daniel Mauldin, An counting problem in ergodic theory, J. Anal. Math. 95 (2005), 221–241, DOI 10.1007/BF02791503. MR2145565Show rawAMSref\bib{ABM}{article}{ author={Assani, Idris}, author={Buczolich, Zolt\'{a}n}, author={Mauldin, R. Daniel}, title={An $L^1$ counting problem in ergodic theory}, journal={J. Anal. Math.}, volume={95}, date={2005}, pages={221--241}, issn={0021-7670}, review={\MR {2145565}}, doi={10.1007/BF02791503}, } Close amsref.
[6]
J. Bourgain, Double recurrence and almost sure convergence, J. Reine Angew. Math. 404 (1990), 140–161, DOI 10.1515/crll.1990.404.140. MR1037434Show rawAMSref\bib{Bourg}{article}{ author={Bourgain, J.}, title={Double recurrence and almost sure convergence}, journal={J. Reine Angew. Math.}, volume={404}, date={1990}, pages={140--161}, issn={0075-4102}, review={\MR {1037434}}, doi={10.1515/crll.1990.404.140}, } Close amsref.
[7]
Jean Bourgain, Pointwise ergodic theorems for arithmetic sets, Inst. Hautes Études Sci. Publ. Math. 69 (1989), 5–45. With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein. MR1019960Show rawAMSref\bib{Bourg2}{article}{ author={Bourgain, Jean}, title={Pointwise ergodic theorems for arithmetic sets}, note={With an appendix by the author, Harry Furstenberg, Yitzhak Katznelson and Donald S. Ornstein}, journal={Inst. Hautes \'{E}tudes Sci. Publ. Math.}, number={69}, date={1989}, pages={5--45}, issn={0073-8301}, review={\MR {1019960}}, } Close amsref.
[8]
Zoltán Buczolich and R. Daniel Mauldin, Divergent square averages, Ann. of Math. (2) 171 (2010), no. 3, 1479–1530, DOI 10.4007/annals.2010.171.1479. MR2680392Show rawAMSref\bib{BucMaul}{article}{ author={Buczolich, Zolt\'{a}n}, author={Mauldin, R. Daniel}, title={Divergent square averages}, journal={Ann. of Math. (2)}, volume={171}, date={2010}, number={3}, pages={1479--1530}, issn={0003-486X}, review={\MR {2680392}}, doi={10.4007/annals.2010.171.1479}, } Close amsref.
[9]
María Jesús Carro and Carlos Domingo-Salazar, The return times property for the tail on logarithm-type spaces, Discrete Contin. Dyn. Syst. 38 (2018), no. 4, 2065–2078, DOI 10.3934/dcds.2018084. MR3809026Show rawAMSref\bib{CS}{article}{ author={Carro, Mar\'{\i }a Jes\'{u}s}, author={Domingo-Salazar, Carlos}, title={The return times property for the tail on logarithm-type spaces}, journal={Discrete Contin. Dyn. Syst.}, volume={38}, date={2018}, number={4}, pages={2065--2078}, issn={1078-0947}, review={\MR {3809026}}, doi={10.3934/dcds.2018084}, } Close amsref.
[10]
M. Christ, P. Durcik, V. Kovač, and J. Roos, Pointwise convergence of certain continuous-time double ergodic averages, to appear in Ergodic Theory Dynam. Systems, DOI: 10.1017/etds.2021.45.
[11]
Qing Chu and Nikos Frantzikinakis, Pointwise convergence for cubic and polynomial multiple ergodic averages of non-commuting transformations, Ergodic Theory Dynam. Systems 32 (2012), no. 3, 877–897, DOI 10.1017/S014338571100006X. MR2995648Show rawAMSref\bib{CF}{article}{ author={Chu, Qing}, author={Frantzikinakis, Nikos}, title={Pointwise convergence for cubic and polynomial multiple ergodic averages of non-commuting transformations}, journal={Ergodic Theory Dynam. Systems}, volume={32}, date={2012}, number={3}, pages={877--897}, issn={0143-3857}, review={\MR {2995648}}, doi={10.1017/S014338571100006X}, } Close amsref.
[12]
Ciprian Demeter, Michael T. Lacey, Terence Tao, and Christoph Thiele, Breaking the duality in the return times theorem, Duke Math. J. 143 (2008), no. 2, 281–355, DOI 10.1215/00127094-2008-020. MR2420509Show rawAMSref\bib{DLTT}{article}{ author={Demeter, Ciprian}, author={Lacey, Michael T.}, author={Tao, Terence}, author={Thiele, Christoph}, title={Breaking the duality in the return times theorem}, journal={Duke Math. J.}, volume={143}, date={2008}, number={2}, pages={281--355}, issn={0012-7094}, review={\MR {2420509}}, doi={10.1215/00127094-2008-020}, } Close amsref.
[13]
Sebastián Donoso and Wenbo Sun, Pointwise convergence of some multiple ergodic averages, Adv. Math. 330 (2018), 946–996, DOI 10.1016/j.aim.2018.03.022. MR3787561Show rawAMSref\bib{DS}{article}{ author={Donoso, Sebasti\'{a}n}, author={Sun, Wenbo}, title={Pointwise convergence of some multiple ergodic averages}, journal={Adv. Math.}, volume={330}, date={2018}, pages={946--996}, issn={0001-8708}, review={\MR {3787561}}, doi={10.1016/j.aim.2018.03.022}, } Close amsref.
[14]
Harry Furstenberg, Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions, J. Analyse Math. 31 (1977), 204–256, DOI 10.1007/BF02813304. MR498471Show rawAMSref\bib{Fu}{article}{ author={Furstenberg, Harry}, title={Ergodic behavior of diagonal measures and a theorem of Szemer\'{e}di on arithmetic progressions}, journal={J. Analyse Math.}, volume={31}, date={1977}, pages={204--256}, issn={0021-7670}, review={\MR {498471}}, doi={10.1007/BF02813304}, } Close amsref.
[15]
Bernard Host and Bryna Kra, Nonconventional ergodic averages and nilmanifolds, Ann. of Math. (2) 161 (2005), no. 1, 397–488, DOI 10.4007/annals.2005.161.397. MR2150389Show rawAMSref\bib{HK}{article}{ author={Host, Bernard}, author={Kra, Bryna}, title={Nonconventional ergodic averages and nilmanifolds}, journal={Ann. of Math. (2)}, volume={161}, date={2005}, number={1}, pages={397--488}, issn={0003-486X}, review={\MR {2150389}}, doi={10.4007/annals.2005.161.397}, } Close amsref.
[16]
Wen Huang, Song Shao, and Xiangdong Ye, Pointwise convergence of multiple ergodic averages and strictly ergodic models, J. Anal. Math. 139 (2019), no. 1, 265–305, DOI 10.1007/s11854-019-0061-3. MR4041103Show rawAMSref\bib{HSY}{article}{ author={Huang, Wen}, author={Shao, Song}, author={Ye, Xiangdong}, title={Pointwise convergence of multiple ergodic averages and strictly ergodic models}, journal={J. Anal. Math.}, volume={139}, date={2019}, number={1}, pages={265--305}, issn={0021-7670}, review={\MR {4041103}}, doi={10.1007/s11854-019-0061-3}, } Close amsref.
[17]
Calvin C. Moore, Ergodic theorem, ergodic theory, and statistical mechanics, Proc. Natl. Acad. Sci. USA 112 (2015), no. 7, 1907–1911, DOI 10.1073/pnas.1421798112. MR3324732Show rawAMSref\bib{Moore_C}{article}{ author={Moore, Calvin C.}, title={Ergodic theorem, ergodic theory, and statistical mechanics}, journal={Proc. Natl. Acad. Sci. USA}, volume={112}, date={2015}, number={7}, pages={1907--1911}, issn={0027-8424}, review={\MR {3324732}}, doi={10.1073/pnas.1421798112}, } Close amsref.
[18]
E. Szemerédi, On sets of integers containing no elements in arithmetic progression, Acta Arith. 27 (1975), 199–245, DOI 10.4064/aa-27-1-199-245. MR369312Show rawAMSref\bib{SZ}{article}{ author={Szemer\'{e}di, E.}, title={On sets of integers containing no $k$ elements in arithmetic progression}, journal={Acta Arith.}, volume={27}, date={1975}, pages={199--245}, issn={0065-1036}, review={\MR {369312}}, doi={10.4064/aa-27-1-199-245}, } Close amsref.
[19]
Miguel N. Walsh, Norm convergence of nilpotent ergodic averages, Ann. of Math. (2) 175 (2012), no. 3, 1667–1688, DOI 10.4007/annals.2012.175.3.15. MR2912715Show rawAMSref\bib{Walsh}{article}{ author={Walsh, Miguel N.}, title={Norm convergence of nilpotent ergodic averages}, journal={Ann. of Math. (2)}, volume={175}, date={2012}, number={3}, pages={1667--1688}, issn={0003-486X}, review={\MR {2912715}}, doi={10.4007/annals.2012.175.3.15}, } Close amsref.

Credits

Opening image is courtesy of agsandrew via Getty.

Figures 1, 3, and 5 and photo of Idris Assani are courtesy of Idris Assani.

Figure 2 is courtesy of Wikimedia Commons.

Figure 4 is courtesy of the Library of Congress.

Figure 6 (left) is courtesy of George M. Bergman. Licensed under CC-BY-SA 4.0.

Figure 6 (right) is courtesy of Yosef Adest via Wikimedia Commons.