A topological dynamical system with two different positive sofic entropies

By Dylan Airey, Lewis Bowen, and Yuqing Frank Lin

Abstract

A sofic approximation to a countable group is a sequence of partial actions on finite sets that asymptotically approximates the action of the group on itself by left-translations. A group is sofic if it admits a sofic approximation. Sofic entropy theory is a generalization of classical entropy theory in dynamics to actions by sofic groups. However, the sofic entropy of an action may depend on a choice of sofic approximation. All previously known examples showing this dependence rely on degenerate behavior. This paper exhibits an explicit example of a mixing subshift of finite type with two different positive sofic entropies. The example is inspired by statistical physics literature on 2-colorings of random hyper-graphs.

1. Introduction

The topological entropy of a homeomorphism of a compact Hausdorff space was introduced in Reference 1. It was generalized to actions of amenable groups via Følner sequences in the 1970s Reference 2 and to certain non-amenable groups via sofic approximations more recently Reference 3. It plays a major role in the classification and structure theory of topological dynamical systems.

To explain further, suppose is a countable group with identity and is a map where is a finite set and is the group of permutations of . It is not required that is a homomorphism. Let (the symbol denotes a finite subset) and . Then is called

-multiplicative if

-trace preserving if

-sofic if it is both -multiplicative and -trace preserving.

A sofic approximation to consists of a sequence of maps such that for all finite , and all but finitely many , is -sofic. A group is sofic if it admits a sofic approximation. In this paper we will usually assume .

If acts by homeomorphisms on a compact Hausdorff space and a sofic approximation to is given then the -entropy of the action is a topological conjugacy invariant, denoted by . It is also called sofic entropy if is understood. It was first defined in Reference 3 where the authors obtain a variational principle connecting it with the previously introduced notion of sofic measure entropy Reference 4. It is monotone under embeddings and additive under direct products but not monotone under factor maps. See Reference 5 for a survey.

A curious feature of this new entropy is that it may depend on the choice of sofic approximation. This is not always the case; for example, if is amenable then sofic entropy and classical entropy always agree. However, there are examples of actions by non-amenable groups with sofic approximations satisfying

See Reference 5, Theorem 4.1. The case is considered degenerate: it implies that there are no good models for the action with respect to the given sofic approximation. Until this paper, it was an open problem whether a mixing action could have two different non-negative values of sofic entropy. Our main result is:

Theorem 1.1.

There exists a countable group , a mixing action by homeomorphisms on a compact metrizable space and two sofic approximations to such that

Remark 1.

The range of sofic entropies for an action is the set of all non-negative numbers of the form as varies over all sofic approximations to . By taking disjoint unions of copies of sofic approximations, it is possible to show the range of sofic entropies is an interval (which may be empty or a singleton). So for the example of Theorem 1.1, the range of sofic entropies is uncountable.

Remark 2.

It remains an open problem whether there is a measure-preserving action with two different non-negative sofic entropies. Theorem 1.1 does not settle this problem because it is entirely possible that any invariant measure on with satisfies .

In this paper we often assume , , …, .

1.1. Random sofic approximations

We do not know of any explicit sofic approximations to which are amenable to analysis. Instead, we study random sofic approximations. For the purposes of this paper, these are sequences of probability measures on spaces of homomorphisms such that, for any finite and there is an such that

for all sufficiently large . Because decays super-exponentially, if is any sequence with an exponential lower bound of the form (for some constant ) then there exists a sofic approximation with for all .

It is this non-constructive existence result that enables us to use random sofic approximations to prove Theorem 1.1.

1.2. Proper colorings of random hyper-graphs from a statistical physics viewpoint

The idea for our main construction comes from studies of proper colorings of random hyper-graphs. Although these studies have very different motivations than those that inspired this paper, the examples that they provide are roughly the same as the examples used to prove Theorem 1.1. The relevant literature and an outline is presented next.

A hyper-graph is a pair where is a collection of subsets of . Elements of are called hyper-edges but we will call them edges for brevity’s sake. is -uniform if every edge has cardinality .

A 2-coloring of is a map . An edge is monochromatic for if . A coloring is proper if it has no monochromatic edges.

Let denote a hyper-graph chosen uniformly among all -uniform hyper-graphs with vertices and edges. We will consider the number of proper 2-colorings of when is large but fixed, and the ratio of edges to vertices is bounded above and below by constants.

This random hyper-graph model was studied in Reference 6Reference 7Reference 8. These works are motivated by the satisfiability conjecture. To explain, the lower satisfiability threshold is the supremum over all such that

The upper satisfiability threshold is the infimum over all such that

The satisfiability conjecture posits that . It is still open.

Bounds on these thresholds were first obtained in Reference 6 as follows. Let be the number of proper 2-colorings of a hyper-graph . A first moment computation shows that

where . Let be such that . If then . Therefore .

Let be the supremum over numbers such that the second moment is equal to up to sub-exponential factors. The Paley-Zygmund inequality gives the bound .

In Reference 6, it is shown that

So there is a constant-sized gap between the two thresholds.

A more detailed view of the second moment is illuminating. But before explaining, we need some terminology. Let be the set of natural numbers . A coloring of is equitable if . We will assume from now on that is even so that equitable colorings of exist. Let be the number of equitable proper colorings of a hyper-graph . A computation shows that equals up to sub-exponential factors. This enables us to work with equitable proper colorings in place of all proper colorings. This reduces the computations because there is only one equitable coloring up to the action of the symmetric group .

A computation shows that the second moment factorizes as

where is any equitable 2-coloring. Let be the random hyper-graph chosen by conditioning on the event that is a proper 2-coloring. This is called the planted model and is the planted coloring. So computing the second moment of reduces to computing the first moment of .

The normalized Hamming distance between colorings is

Let be the number of equitable proper colorings with . Then

In Reference 6, it is shown that is equal to (up to sub-exponential factors) where is an explicit function.

Note that (since if is a proper equitable coloring then so is and ). A computation shows . If then is uniquely maximized at . However, if then the maximum of is attained in the interval . In fact, is negative for . So with high probability, there are no proper equitable colorings with . This motivates defining the local cluster, denoted , to be the set of all proper equitable 2-colorings with .

The papers Reference 7Reference 8 obtain a stronger lower bound on the lower satisfiability threshold using an argument they call the enhanced second moment method. To explain, we need some terminology. We say a proper equitable coloring is good if the size of the local cluster is bounded by . One of the main results of Reference 7Reference 8 is that tends to 1 as with and . An application of the Paley-Zygmund inequality to the number of good colorings yields the improved lower bound

The argument showing is combinatorial. It is shown that (with high probability) there is a set with cardinality which is rigid in the following sense: if is any proper equitable 2-coloring then either: the restriction of to is the same as the restriction of to or is at least for some constants . This rigid set is constructed explicitly in terms of local combinatorial data of the coloring on .

In summary, these papers study two random models and . When is in the interval , the typical number of proper colorings of grows exponentially in but is smaller (by an exponential factor) than the expected number of proper colorings of . It is these facts that we will generalize, by replacing with random sofic approximations to a group so that the exponential growth rate of the number of proper colorings roughly corresponds with sofic entropy.

Although the models that we study in this paper are similar to the models in Reference 6Reference 7Reference 8, they are different enough that we develop all results from scratch. Moreover, although the strategies we employ are roughly same, the proof details differ substantially. The reader need not be familiar with these papers to read this paper.

1.3. The action

In the rest of this introduction, we introduce the action in Theorem 1.1 and outline the first steps of its proof. So fix positive integers . Let

be the free product of copies of .

The Cayley hyper-tree of , denoted , has vertex set . The edges are the left-cosets of the generator subgroups. That is, each edge has the form for some and .

Remark 3.

It can be shown by considering each element of as a reduced word in the generators , …, that is a hyper-tree in the sense that there exists a unique “hyper-path” between any two vertices. More precisely, for any , there exists a unique sequence of edges such that , , , for any , and , . More intuitively, there are no “hyper-loops” in .

The group acts on by for . Let be the subset of proper 2-colorings. It is a closed -invariant subspace. Furthermore, is topologically mixing:

Claim 1.

For any nonempty open sets , in , there exists such that for any with , . Here denotes the shortest word length of representations of by generators , …, .

Proof.

It suffices to show the claim for , being cylinder sets. We make a further simplification by assuming each , is a cylinder set on a union of hyper-edges, and a yet further simplification that each , is a cylinder set on a connected union of hyper-edges. Informally, by shifting the “coordinates” on which depends so that they are far enough separated from the coordinates on which depends, we can always fill in the rest of the graph to get a proper coloring.

More precisely, suppose , where such that for any , there is a finite sequence of , , such that , , and , , and and similarly . and must be bichromatic on each edge in their respective domains since and are nonempty.

Let . Then it can be shown for any with that . It follows from our earlier remark that there exists a unique hyper-path connecting to (otherwise there would be a hyperloop in ). Thus for example one can recursively fill in a coloring on the rest of by levels of hyperedges - first the hyperedges adjacent to and , then the next layer of adjacent hyperedges, and so on. At each step, most hyperedges only have one vertex whose color is determined, so it is always possible to color another vertex of an edge to make it bichromatic. Only along the hyper-path connecting to at some step there will be a hyperedge with two vertices whose colors are already determined, but since is large there is still another vertex to color to make the edge bichromatic.

We will show that for certain values of , the action satisfies the conclusion of Theorem 1.1.

1.4. Sofic entropy of the shift action on proper colorings

Given a homomorphism , let be the hyper-graph with vertices and edges equal to the orbits of the generator subgroups. That is, a subset is an edge if and only if for some and .

Recall that a hyper-graph is -uniform if every edge has cardinality . We will say that a homomorphism is uniform if is -uniform. Equivalently, this occurs if for all , decomposes into a disjoint union of -cycles.

A 2-coloring of a hyper-graph is -proper if the number of monochromatic edges is . Using the formulation of sofic entropy in Reference 5 (which was inspired by Reference 9), we show in §2 that if is a sofic approximation to by uniform homomorphisms then the -entropy of is:

1.5. Random hyper-graph models

Definition 1.

Let denote the set of all uniform homomorphisms from to . Let be the uniform probability measure on and let be its expectation operator. The measure is called the uniform model. We will always assume so that is non-empty. In §3 we show that is a random sofic approximation. We will use the uniform model to obtain the sofic approximation which appears in Theorem 1.1.

Recall that if is a finite set, then a 2-coloring is equitable if . We assume from now on that is even so that equitable colorings of exist.

Definition 2.

Fix an equitable coloring . Let be the set of all uniform homomorphisms such that is proper as a coloring on . Let be the uniform probability measure on and let be its expectation operator. The measure is called the planted model and is the planted coloring. When is understood, we will write and instead of and . In §3 we show that is a random sofic approximation. We will use the planted model to obtain the sofic approximation which appears in Theorem 1.1.

Remark 4.

If and are both equitable -colorings then there are natural bijections from to as follows. Given a permutation and , define by . Because and are equitable, there exists such that . The map defines a bijection from to . Moreover defines a hyper-graph-isomorphism from to . Therefore, any random variable on that depends only on the hyper-graph up to hyper-graph-isomorphism has the same distribution under as under . This justifies calling the planted model.

1.6. The strategy and a key lemma

The idea behind the proof of Theorem 1.1 is to show that for some choices of , the uniform model admits an exponential number of proper 2-colorings, but it has exponentially fewer proper 2-colorings than the expected number of proper colorings of the planted model (with probability that decays at most sub-exponentially in ).

To make this strategy more precise, we introduce the following notation. Let denote the number of -proper 2-colorings of . A coloring is -proper if it is -proper. Let be the number of -proper 2-colorings.

In §3, the proof of Theorem 1.1 is reduced to the Key Lemma:

Lemma 1.2 (Key Lemma).

Let . Also let . Then

Moreover, for any

there exists (depending on ) such that for all if

for some then

Also,

In all cases above, the limits are over .

Equations Equation 1 and Equation 2 are proven in §4 and §5 using first and second moment arguments respectively. This part of the paper is similar to the arguments used in Reference 6.

Given and , let be the set of all proper equitable colorings with . In section §5.2, second moment arguments are used to reduce equation Equation 3 to the following:

Proposition 5.9.

Let . Then for all sufficiently large (depending on ), if

for some with then with high probability in the planted model, . In symbols,

In §6, Proposition 5.9 is reduced as follows. First, certain subsets of vertices are defined through local combinatorial constraints. There are two main lemmas concerning these subsets; one of which bounds their density and the other proves they are ‘rigid’. Proposition 5.9 is proven in §6 assuming these lemmas.

The density lemma is proven in §7 using a natural Markov model on the space of proper colorings that is the local-on-average limit of the planted model. Rigidity is proven in §8 using an expansivity argument similar to the way random regular graphs are proven to be good expanders. This completes the last step of the proof of Theorem 1.1.

2. Topological sofic entropy

This section defines topological sofic entropy for subshifts using the formulation from Reference 9. The main result is:

Lemma 2.1.

For any sofic approximation with ,

Let denote a countable group, a finite set (called the alphabet). Let be the shift action on defined by for . Let be a closed -invariant subspace. We denote the restriction of the action to by . Also let be a sofic approximation to .

Given , and the pullback name of at is defined by

For the sake of building some intuition, note that when is a homomorphism, the map is -equivariant (in the sense that ). In particular is periodic. In general, we think of as an approximate periodic point.

Given an open set containing and an , a map is called an -microstate if

Let denote the set of all -microstates. Finally, the -entropy of the action is defined by

where the infimum is over all open neighborhoods of in . This number depends on the action only up to topological conjugacy. It is an exercise in Reference 5 to show that this definition agrees with the definition in Reference 10. We include a proof in Appendix A for completeness.

Proof of Lemma 2.1.

Let be given. Let be the set of -proper 2-colorings. Let be the set of all 2-colorings such that for each generator hyper-edge , . A generator hyper-edge is a subgroup of the form for some . Note is an open superset of .

We claim that . To see this, let . Then if and only if all hyper-edges of containing are bi-chromatic (with respect to ). So if , then is contained in up to monochromatic hyperedges. On the other hand, each monochromatic hyperedge contains exactly vertices whose pullback name is not in . It follows that . This implies .

Given a finite subset of hyper-edges of the Cayley hyper-tree, let be the set of all with the property that for all . If is any open neighborhood of in then contains for some . To see this, suppose that there exist elements for every finite . Let be a cluster point of as increases to the set of all hyper-edges. Then , a contradiction. It follows that

Next, fix a finite subset of hyper-edges of the Cayley hyper-tree. We claim that . To see this, let and be the set of vertices contained in a monochromatic edge of . Now for , if and only if is monochromatic on some edge in . This occurs if and only if there is an element in the union of such that . There are at most such vertices. But , so there are at most such vertices. It follows that . Therefore,

3. Reduction to the key lemma

The purpose of this section is to show how Lemma 1.2 implies Theorem 1.1. This requires replacing the (random) uniform and planted models with (deterministic) sofic approximations. The next lemma facilitates this replacement.

Lemma 3.1.

Let be finite and . Then there are constants such that for all with ,

Proof.

The proof given here is for the uniform model. The planted model is similar.

The proof begins with a series of four reductions. By taking a union bound, it suffices to prove the special case in which for nontrivial. (This is the first reduction).

Let be the reduced form of . This means that ,…, , for all with indices mod and for all . Let be the length of .

For any , the fixed point sets of and have the same size. So after conjugating if necessary, we may assume that either or .

For , the -th beginning subword of is the element . Given a vertex and , let , …, be the path defined by: for each , is the unique hyper-edge of labeled containing . A vertex represents a -simple cycle if and for every , either

,

and ,

or and .

We say that represents a -simple degenerate cycle if and and .

If then either

represents a -simple cycle,

there exists nontrivial with such that some vertex represents a -simple cycle,

or there exists nontrivial with such that some vertex represents a -simple degenerate cycle.

So it suffices to prove there are constants such that for all ,

and

(This is the second reduction).

Two vertices represent vertex-disjoint -cycles if , …, , …, and for all .

Let be the set of all such that there exists a subset satisfying

(1)

,

(2)

every represents a -simple cycle,

(3)

the cycles for are pairwise vertex-disjoint.

If represents a simple -cycle then there are at most vertices such that also represents a simple -cycle but the two cycles are not vertex-disjoint. Since this bound does not depend on , it suffices to prove there exist and such that

for all . (This is the third reduction. The argument is similar for simple degenerate cycles).

Let and , …, be distinct vertices in . For , let be the set of all such that for all

(1)

represents a -simple cycle,

(2)

the cycles , …, are pairwise vertex-disjoint.

By summing over all subsets of size , we obtain

Since grows at most exponentially, it suffices to show there exist and such that for all . (This is the fourth reduction. The argument is similar for simple degenerate cycles).

Set . By the chain rule

In order to estimate , can be expressed a disjoint union over the cycles involved in its definition. To be precise, define an equivalence relation on by: are -equivalent if for every , and

In other words, are -equivalent if they define the same paths according to all vertices up to (so ) and their restrictions to every edge in these paths agree. Of course, is the disjoint union of the -classes. Note that is trivial (everything is equivalent).

In general, if , , …, are measurable sets and the ’s are pairwise disjoint then is a convex combination of (for any probability measure ). Therefore, is a convex combination of probabilities of the form where is an -class.

Now fix an -class (for some with ). Let be the set of all vertices covered by the cycles defining . To be precise, this means is the set of all such that there exists an edge with such that is contained in a path with and . Since each path covers at most vertices, .

If (the case is similar), fix subsets , …, of size . Conditioned on and the event that the first edges of are , …, , the -probability that represents a simple -cycle vertex-disjoint from is bounded by the probability that a uniformly random -element subset of

contains . Since

this probability is bounded by where is a constant not depending on or the choice of . It follows that for all and therefore

This implies the lemma (the argument is similar for simple degenerate cycles).

Proof of Theorem 1.1 from Lemma 1.2.

Choose according to the hypotheses of Lemma 1.2 so that , and all of the conclusions to Lemma 1.2 hold. Construct according to §1.3. We will always take . We will first show the existence of a sofic approximation to such that

Then by Lemma 2.1, we will have .

Observe that Lemma 1.2Equation 1 implies the following: for every there exists (with decreasing as decreases), , and such that for , . In other words, the number of -proper colorings can only exponentially exceed with exponentially small probability. This is because the negation of the above would imply that , contradicting Equation 1. Let be the event that .

Let be the event that . Lemma 1.2Equation 3 implies decays at most subexponentially in . Precisely, for any there exists such that implies .

Let be the event that . Notice that since for any , .

Consider a decreasing sequence , and and depending on as discussed above. Since is decaying only subexponentially, we can choose an increasing sequence satisfying , and for each , for all . Now it follows that implies . By the observation in the preceding paragraph, .

Given and finite, let be the event that is -sofic. Let be a decreasing sequence and be an increasing sequence of finite subsets of .

Lemma 3.1 implies that decays super-exponentially in for any and , so there exists an increasing sequence such that for , . It follows that we can choose a deterministic sofic approximation sequence such that

We next show the existence of . Equation Equation 2 of Lemma 1.2 implies the existence of a number with

Since for every , there exist constants such that

for all .

Now let and be finite. Then there exists such that if and is chosen at random with law , then with positive probability,

(1)

is -sofic,

(2)

.

This is implied by Lemma 3.1 and equation Equation 4. So there exists a sofic approximation to such that

Since , Lemma 2.1 implies .

4. The first moment

To simplify notation, we assume throughout the paper that without further mention. This section proves Equation 1 of Lemma 1.2. The proof is in two parts. Part 1, in Section 4.1, establishes:

Theorem 4.1.

Part 2 has to do with equitable colorings, where a 2-coloring is equitable if

Let be the number of proper equitable colorings of . Section 4.2 establishes

Theorem 4.2.

Moreover,

where .

Combined, Theorems 4.1 and 4.2 imply Equation 1 of Lemma 1.2.

Remark 5.

If then the formula for above is the same as the formula found in Reference 6Reference 7Reference 8 for the exponential growth rate of the number of proper 2-colorings of .

Remark 6.

When we write an error term, such as , we always assume that and the implicit constant is allowed to depend on or .

4.1. Almost proper 2-colorings

For , let . Also let . If is a collection of numbers with , then let

be the Shannon entropy of .

Definition 3.

A -partition of is an unordered partition of into sets of size . Of course, such a partition exists if and only if in which case there are

such partitions. By Stirling’s formula,

Definition 4.

The orbit-partition of a permutation is the partition of into orbits of . Fix a -partition . Then the number of permutations whose orbit partition is equals .

Given , define the -tuple , …, of -partitions by: is the orbit-partition of . Fix a -tuple of -partitions , …, . Then the number of uniform homomorphisms such that for all is . Combined with Equation 5, this shows the number of uniform homomorphisms into is

By Stirling’s formula,

Definition 5.

Let be a -partition, a 2-coloring and a vector with . The pair has type if for all ,

Lemma 4.3.

Let , , …, be such that and . Let . Let be a map such that . Let be the number of -partitions of such that has type . Then

Proof.

The following algorithm constructs all such partitions with no duplications:

Step 1.

Choose an unordered partition of the set into sets of size (, …, ).

Step 2.

Choose an unordered partition of the set into sets of size (, …, ).

Step 3.

Choose a bijection between the collection of subsets of size constructed in part 1 with the collection of subsets of size constructed in part 2.

Step 4.

The partition consists of all sets of the form where is a set of size constructed in Step 1 and is a set of size constructed in Step 2 that it is paired with under Step 3.

The number of choices in Step 1 is . The number of choices in Step 2 is . The number of choices in Step 3 is . So

The lemma follows from this and Stirling’s formula.

Let be the set of all matrices such that

(1)

for all ,

(2)

for all ,

(3)

there exists a number, denoted , such that for all .

Let be the set of all such that is integer-valued.

Lemma 4.4.

Let be an integer-valued matrix (for some ), and the set of all such that and for all . For , let be the set of all such that is integer-valued.

Assume is compact and there exists with for all . Then there is a constant such that if is non-empty then it is -dense in in the following sense. For any there exists such that . Moreover, the constant can be chosen to depend continuously on the vector .

Proof.

To begin, we will define several constants which will enable us to choose . Because is integer-valued, its kernel, denoted , is such that has rank equal to the dimension of . Therefore, is cocompact in . So there is a constant such that for any there is an element with .

By hypothesis, there is a constant and an element such that for all . Because is compact there is another constant such that for all . Let . Now let be arbitrary and suppose is non-empty. We will show there exists such that .

Let

Then for all . Also .

Because is non-empty, there exists . By linearity, is the intersection of the hyperplane with the positive orthant. Thus we can write where and satisfies . Let . Note . Since this implies for all . It is now straightforward to check that . By the triangle inequality

Because and are arbitrary, this implies the Lemma with . Moreover does not depend on the vector ; while can be chosen to depend continuously on .

Lemma 4.5.

Given a matrix define

where . Then for any ,

where the constant implicit in the error term does not depend on .

Proof.

Given and , let be the orbit-partition of . For as above, let be the number of -proper colorings such that has type , …, . It suffices to show that

for all such that . This is because the size of is a polynomial (depending on ) in so the supremum above determines the exponential growth rate of .

To prove this, fix a as above and let be such that is integer-valued. Fix a coloring such that . By symmetry,

The events are jointly independent. So

By symmetry, is the number of -partitions such that has type divided by the number of -partitions. By Lemma 4.3 and Equation 6,

Combine this with Equation 9 to obtain

This simplifies to the formula for using the assumption that for all .

Proof of Theorem 4.1.

By Lemma 4.4 applied to , continuity of and compactness of ,

Theorem 4.1 now follows from Lemma 4.5 by continuity of and compactness of .

4.2. Equitable colorings

Proof of Theorem 4.2.

Let be the set of all such that whenever . By Lemma 4.5, it suffices to show that admits a unique global maximum on and moreover if is the global maximum then and .

The function is symmetric in the index . To exploit this, let be the set of all vectors such that for all and . Let

Note that if is defined by for all . Moreover, since Shannon entropy is strictly concave, for any , if is defined to be the average: then with equality if and only if for all . So it suffices to show that admits a unique global maximum on and moreover if is the global maximum then and .

Because , , and ,

Since this is positive infinity whenever , it follows that every maximum of occurs in the interior of . The method of Lagrange multipliers implies that, at a critical point, there exists such that

So at a critical point,

Solve for to obtain

Note

implies

So define

It follows from the above that whenever is a critical point.

We claim that if and only if (for ). The change of variables in the formula for shows that . So it is enough to prove that for .

To obtain a simpler formula for , set . The binomial formula implies

Because , which implies that the middle coefficient . So

where the last inequality holds because

assuming . This proves the claim.

So if is a critical point then . Put this into the equation above for to obtain

where . Because

it must be that

The formula now follows from a straightforward computation.

5. The second moment

This section gives an estimate on the expected number of proper colorings at a given Hamming distance from the planted coloring. This computation yields Equation 2 of Lemma 1.2 as a corollary. It also reduces the proof of Equation 3 to obtaining an estimate on the typical number of proper colorings near the planted coloring.

Before stating the main result, it seems worthwhile to review notation. Fix with . Fix an equitable 2-coloring . This is the planted coloring. The planted model is the uniform probability measure on the set of all uniform homomorphisms such that is -proper. Also let be the number of equitable proper 2-colorings. For , let be the number of equitable proper 2-colorings such that where is the normalized Hamming distance defined by

We will also write when and/or are understood.

The main result of this section is:

Theorem 5.1.

With notation as above, for any such that is an integer,

(for ) where

is defined to be the unique solution to

and

Moreover, the constant implicit in the error term may depend on but not on . The constant implicit in the term depends continuously on for .

Remark 7.

If then . In the general case, . Theorem 5.1 parallels similar results in Reference 6Reference 7 for the random hyper-graph . This is explained in more detail in the next subsection.

The strategy behind the proof of Theorem 5.1 is as follows. We need to estimate the expected number of equitable colorings at distance from the planted coloring. By symmetry, it suffices to fix another coloring that is at distance from the planted coloring and count the number of uniform homomorphisms such that both and are proper with respect to . This can be handled one generator at a time. Moreover, only the orbit-partition induced by a generator is used in this computation. So, for fixed , we need to estimate the number of -partitions of that are bi-chromatic under both and . To make this strategy precise, we need the next definitions.

Definition 6.

Let be an equitable 2-coloring of . An edge is -bichromatic if . Recall that a -partition is a partition , …, of such that every part has cardinality . A -partition is -bichromatic if every part is -bichromatic.

Given a -bichromatic edge of size , there is a matrix defined by

Let denote the set of all such matrices (over all ). This is a finite set. To be precise, is the set of all matrices such that

, , , for all

.

If is a -bichromatic -partition then it induces a function by

Let be the set of all functions satisfying

,

,

.

Also let be the set of such that is integer-valued for each . A -partition has type if .

Lemma 5.2.

Given an equitable 2-coloring , let be the matrix

Then

In particular, is determined by the Hamming distance .

Proof.

Let . The lemma follows from this system of linear equations:

The first two occur because both and are equitable. The third follows from the definition of normalized Hamming distance and the last holds because partitions .

For , define the matrix by

If is a -partition that has type (for some equitable ) then . This motivates the definition.

The main combinatorial estimate we will need is:

Lemma 5.3.

Let and be equitable. Suppose . Let be the number of -partitions of type . Also let

where is the multinomial . Then

(for ) where the constant implicit in the error term depends on but not on or .

Proof.

The following algorithm constructs all such partitions with no duplications:

Step 1.

Choose a partition of such that

Step 2.

For and , choose an unordered partition of into sets of size .

Step 3.

For with and , choose a bijection .

Step 4.

The -partition consists of all sets of the form over all and .

The number of choices in Step 1 is

The combined number of choices in Steps 1 and 2 is

The number of choices in Step 3 is . So

An application of Stirling’s formula gives

(for ) where the constant implicit in the error term depends on but not on or .

Since ,

Substitute this into the formula above to finish the lemma.

Next we use Lagrange multipliers to maximize . To be precise, for , let be the set of all such that . Define . To motivate this definition, observe that if is an equitable 2-coloring and then . So if is a -partition with type then .

Lemma 5.4.

Let . Then there exists a unique such that

Moreover, if , and are defined by

then .

Proof.

Define by

For all , is constant in . Therefore, it suffices to prove the lemma with in place of .

The function is concave over . This implies the existence of a unique such that

By definition, is the set of all functions satisfying

where is the matrix

For any ,

Since this is positive infinity when , must lie in the interior of . By the method of Lagrange multipliers there exists and a matrix such that

Evaluate Equation 11 at , use Equation 12 and solve for to obtain

for some constants . In fact, since is concave, is the unique critical point and so it is the only element of of this form. So it suffices to check that the purported given in the statement of the lemma has this form and that it is in as claimed. The former is immediate while the latter is a tedious but straightforward computation. For example, to check that , observe that, by the multinomial formula for any ,

Substitute and to obtain

The rest of the verification that is left to the reader.

Proof of Theorem 5.1.

Let be the set of all equitable 2-colorings such that . Also let be the set of all such that is a proper 2-coloring of the hyper-graph . By linearity of expectation,

The cardinality of is . By Stirling’s formula

We have is the same for all . This follows by noting that the distribution of hyper-graphs in the planted model is invariant under any permutation which fixes . If are two configurations with then there is a permutation which fixes and such that . To see this note that we simply need to find a which maps the sets to for each . Such a map exists since for each the two sets have the same size. It follows that

for any fixed .

For , let be the set of uniform homomorphisms such that the orbit-partition of is -bichromatic in the sense that for every in the orbit-partition of . Then the events are i.i.d. and

Therefore,

Note is, up to sub-exponential factors, equal to the maximum of over divided by the number of -partitions of . So equation Equation 6 implies

So Lemma 5.3 implies

We apply Lemma 4.4 to to obtain the existence of with where the constant implicit in the term depends continuously on for . Since is differentiable in a neighborhood of , we have

So Lemma 5.4 implies

Since , . So

On the other hand, Theorem 4.2 implies

Combine this result with Equation 14, Equation 15 and Equation 16 to obtain

Since ,

Substitute this into the previous equation to obtain

where

Observe that in every estimate above, the constant implicit in the error term does not depend on . To finish the lemma, we need only simplify the expression for .

By Lemma 5.4,

Combined with the previous formula for , this implies

To simplify further, use the formula for in Lemma 5.4 to obtain

Thus .

5.1. Analysis of and the proof of Lemma 1.2 inequality Equation 2

Theorem 5.1 reduces inequality Equation 2 to analyzing the function . A related function , defined by

has been analyzed in Reference 6Reference 7. It is shown there is the exponential rate of growth of the number of proper colorings at normalized distance from the planted coloring in the model . Moreover, if is close to then the global maximum of is attained at some . Moreover, has a local maximum at and is symmetric around . It is negative in the region . We will not need these facts directly, and mention them only for context, especially because we will obtain similar results for .

The relevance of to lies in the fact that

As an aside, note that is the Kullback-Leibler divergence of the distribution with respect to .

To prove inequality Equation 2, we first estimate the difference and then estimate . Because the estimates we obtain are useful in the next subsection, we prove more than what is required for just inequality Equation 2.

Lemma 5.5.

Suppose . Define by . Then

The last equation implies .

Proof.

The first equality follows from:

The second estimate follows from:

The third estimate follows from:

The denominator is and the numerator is . The result follows.

The last equation follows from:

The last expression shows that .

Lemma 5.6.

Let be . If

then

In particular, if is sufficiently large then .

Proof.

By direct inspection . By Taylor series expansion, . So

Next we estimate . For convenience, let . Then

Since ,

So

Also,

Add these together to obtain

Corollary 5.7.

Inequality Equation 2 of Lemma 1.2 is true. To be precise, let be constant with respect to . Then for all sufficiently large (depending on ), if

for some with then

Proof.

By definition,

By Theorem 5.1,

where is the set of such that is an integer. Because is continuous, . The constant implicit in the depends continuously on for , while the constant implicit in the error term does not depend on . So

for any . Because is continuous,

Because (since it is a Kullback-Liebler divergence), the first equality of Lemma 5.5 implies

By Lemma 5.6, . By Lemma 5.5, . As varies over , also varies over , so there exists such that . For this value of ,

Combined with Equation 19 this implies the Corollary.

In the next subsection, we will need the following result.

Proposition 5.8.

Let

Then there exists (depending on ) such that for all if

for some then in the interval , attains its unique maximum at 1/2. That is,

and if and then .

Proof.

By Lemma 5.5, it suffices to restrict to the interval (because ). So we will assume without further mention.

Define by

Observe

By Equation 17 and the first inequality of Lemma 5.5,

Moreover, and, by Lemma 5.5, . Therefore,

Observe that . We divide the rest of the proof into five cases depending on where lies in the interval .

Case 1.

Suppose . We claim that . Note and . So

By Taylor series expansion,

So by Equation 21

Thus if is sufficiently large.

Case 2.

Let be a constant such that . Suppose . We claim that if is sufficiently large.

By monotonicity, . Since (for ),

By Equation 21,

This implies the claim.

Case 3.

Let be a constant such that . Suppose . We claim that for all sufficiently large (depending on ).

By Equation 21,

This proves the claim.

Case 4.

We claim that if then for all sufficiently large (independent of the choice of ).

Recall that we define by . By Equation 18,

since , assuming is sufficiently large.

The assumption on implies . So the second equality of Lemma 5.5 implies

Through elementary but messy calculus computations one may show using the fact that that

By Taylor’s theorem,

for some . Since , we have . Furthermore, since is monotone decreasing on , we have . Thus

and for sufficiently large we have . Since , Equation 20 implies

is strictly less than if is sufficiently large.

Case 5.

Suppose . Let . By Equation 18,

Define . We claim that . Since , it suffices to show that for all with . An elementary calculation shows

So if and is sufficiently large. Altogether this proves . So the second equality of Lemma 5.5 implies

By Equation 22 and Equation 20,

This is strictly less than if is sufficiently large.

5.2. Reducing Lemma 1.2 inequality Equation 3 to estimating the local cluster

As in the previous section, fix an equitable coloring . Given a uniform homomorphism , the cluster around is the set

We also call this the local cluster if is understood.

In §6 we prove:

Proposition 5.9.

Let . Then for all sufficiently large (depending on ), if

for some with then with high probability in the planted model, . In symbols,

The rest of this section proves Lemma 1.2 inequality Equation 3 from Proposition 5.9 and the second moment estimates from earlier in this section. So we assume the hypotheses of Proposition 5.9 without further mention.

We say that a coloring is -good if it is equitable and . Let be the set of all -good proper colorings and let be the number of -good proper colorings.

We will say a positive function is sub-exponential in if

Also we say functions and are asymptotic, denoted by , if . Similarly, if .

Lemma 5.10.

where is sub-exponential in .

Proof.

For brevity, let . Let be the probability operator in the planted model of . By definition,

where the asymptotic equality follows from Proposition 5.9. The equality holds by Theorem 4.2.

Lemma 5.11.

, where is sub-exponential in .

Proof.

For a fixed we analyze by breaking the colorings into those that are close (i.e. in the local cluster) and those that are far. So let be the number of good proper colorings such that . (We will also use for the analogous number of equitable proper colorings). Then

The coefficient above accounts for the following symmetry: if is a good coloring with then is a good coloring with . Note that

where the last inequality holds by definition of .

For colorings not in the local cluster,

where the sum is over all in the given range. By definition and Proposition 5.9,

as . Since , the above inequality now implies

where the second inequality holds by Theorem 5.1 for some function which is sub-exponential in . The second-to-last inequality holds because the number of summands is bounded by since is constrained to lie in and by Proposition 5.8, . The last inequality holds for some function that is sub-exponential in since by Theorem 4.2, converges to .

Combine Equation 24, Equation 25 and Equation 26 to obtain

Plug this into Equation 23 to obtain

where the asymptotic holds by Lemma 5.10. This proves the lemma.

Corollary 5.12.

Lemma 1.2 inequality Equation 3 is true. That is:

Proof.

By Theorem 4.2,

as . In particular, for every , for large enough , . So it suffices to prove

Since , it suffices to prove the same statement with in place of . By Lemma 5.10 and Theorem 4.2, converges to as . So we may replace in the statement above with . Then we may multiply by both sides and exponentiate inside the probability. So it suffices to prove

By the Paley-Zygmund inequality and Lemma 5.11

where is sub-exponential in . This implies Equation 27.

6. The local cluster

To prove Proposition 5.9, we show that with high probability in the planted model, there is a ‘rigid’ set of vertices with density approximately . Rigidity here means that any proper coloring either mostly agrees with the planted coloring on the rigid set or it must disagree on a large density subset. Before making these notions precise, we introduce the various subsets, state precise lemmas about them and prove Proposition 5.9 from these lemmas which are proven in the next two sections.

So suppose is a -uniform -regular hyper-graph and is a proper coloring. An edge is -critical if there is a vertex such that . If this is the case, then we say supports with respect to . If is understood then we will omit mention of it. We will apply these notions both to the case when is the Cayley hyper-tree of and when is a finite hyper-graph.

For , , , , define the depth -core of to be the subset satisfying

By induction, for all . Also let .

The set is defined to consist of vertices so that if is re-colored (in some proper coloring) then this re-coloring forces a sequence of re-colorings in the shape of an immersed hyper-tree of degree at least 3 and depth . Re-coloring a vertex of would force re-coloring an infinite immersed tree of degree at least 3.

Also define the attached vertices by: if but there exists an edge , supported by such that . Thus if is re-colored then it forces a re-coloring of some vertex in . In this definition, we allow (letting .

In order to avoid over-counting, we also need to define the subset of vertices such that there exists a vertex , with , and edges , supported by respectively such that

(1)

,

(2)

.

In this definition, we allow .

We will need the following constants:

The significance of is: if is an edge and a vertex then is the probability supports in a uniformly random proper coloring of . So is the expected number of edges that supports. For the values of and used in the Key Lemma 1.2, .

For the next two lemmas, we assume the hypotheses of Proposition 5.9.

Lemma 6.1.

For any there exists such that implies

Lemma 6.1 is proven in §7.

Definition 7.

Fix a proper 2-coloring . Let . A subset is -rigid (with respect to ) if for every proper coloring , is either less than or greater than .

Lemma 6.2.

For any ,

Lemma 6.2 is proven in §8. We can now prove Proposition 5.9:

Proposition 5.9.

Let . Then for all sufficiently large (depending on ), if

for some with then with high probability in the planted model, . In symbols,

Proof.

Let be small constants satisfying

Let be a natural number. Also let be a uniform homomorphism and a proper coloring. To simplify notation, let

By Lemmas 6.1 and 6.2 it suffices to show that if and is -rigid then (for all sufficiently large ). So assume and is -rigid.

Let . By definition, this means . Since is -rigid, this implies

Since this holds for all , it follows that

By Stirling’s formula

Since ,

Thus,

On the other hand,

by Lemma 5.6 and Theorem 4.2.

Therefore, the choice of in Equation 28 implies for all sufficiently large . This also depends on being sufficiently large, but the lower bound on is uniform in .

7. A Markov process on the Cayley hyper-tree

Here we study a -invariant measure on which is, in a sense, the limit of the planted model. We will define it by specifying its values on cylinder sets.

Let be a connected finite union of hyper-edges. Let be a proper coloring. We define to be the set of proper colorings with (where means “restricted to”). We set equal to the reciprocal of the number of proper colorings of . In particular, if is another proper coloring of , then .

Because is totally disconnected, any Borel probability measure on is determined by its values on clopen subsets (since these generate the topology). Since every clopen subset is a finite union of cylinder sets of the form above, Kolmogorov’s Extension Theorem implies the existence of a unique Borel probability on satisfying the aforementioned equalities.

Note this measure has the following Markov property. Let be a random element of with law . Let and let be a hyperedge containing . Let be the set of all such that every path in the Cayley hyper-tree from to an element of passes through . In particular, . Then the distribution of conditioned on is uniformly distributed on the set of all colorings such that there exists some with .

7.1. Local convergence

We will prove the following lemma.

Lemma 7.1.

Let be an equitable coloring with . If is clopen, then for every

To prove this lemma we will first show that if is the function

then concentrates about its expectation using Theorem B.1, and then we will show that this expectation is given by .

Proposition 7.2.

We have

Proof.

For , let be the projection map . For , let be the smallest Borel sigma-algebra such that is -measurable for every .

Since every clopen subset of is a finite union of cylinder sets, the function is -measurable for some finite set .

We will use the normalized Hamming metrics and on and respectively. These are defined in the beginning of Appendix B. We claim is -Lipschitz for some . Let . Because is -measurable,

Now is both left and right invariant. So

for any . Furthermore we immediately have for any that . Together these imply for any where is the distance from to the identity in the word metric on . Thus if we take we see that as desired.

The Proposition now follows from Theorem B.1.

To finish the proof of Lemma 7.1, it now suffices to show the expectation of with respect to the planted model converges to as . We will prove this by an inductive argument, the inductive step of which is covered in the next lemma.

Lemma 7.3.

Let be either a singleton or a connected finite union of hyperedges containing the identity. Let be a proper coloring. Let be the event that . Let be the number of proper colorings of . Then

Remark 8.

The factor of in is there to account for the fact that we are requiring .

Proof.

The statement is immediate if is a singleton. For induction we assume that the statement is true for some finite . Let be a connected union of hyper-edges such that there exists a unique hyper-edge with . By induction, it suffices to prove the statement for .

Note by symmetry that is the same for all . Let us denote this common value by . Define . If then is a bijection from to . This implies

Thus . After replacing with where is the unique element in , we may therefore assume without loss of generality that . By symmetry we may also assume and , , is the hyper-edge generated by .

For any subset , let be the event that for any with . Since the planted model is a random sofic approximation, by Lemma 3.1 we have uniformly in . We will show that

Next we show how to finish the lemma assuming Equation 30. We claim that . To see this, observe that if is a proper coloring of and is any coloring of then the concatenation is a proper coloring of unless for all . Since , this implies that every proper coloring of admits proper extensions to . So as claimed.

Since ,

The lemma now follows from the inductive hypothesis, Equation 30 and .

It now suffices to prove Equation 30. Let denote the union of all subsets , , over all such that , , . That is, is the union of all hyperedges generated by which lie completely within . Because , , , . Multiplication by on the right preserves .

Fix , …, and consider an injective function such that and for all hyper-edges . Let be the set of such that for all . By definition,

For each , …, , let be the set of permutations such that

(1)

the orbit-partition of is a -partition which is properly colored by ,

(2)

for all .

We claim that the map

that sends to , …, is a bijection.

The fact that it is injective is immediate since is generated by , …, . In order to prove that it is surjective, fix elements for all . Define by . It suffices to check that . By induction on the number of edges in , it suffices to assume that the equation holds for some and prove . This follows from:

This proves the claim. Therefore,

Let . We have

Similarly

Each is the restriction to of

distinct where . We can express

where the term follows from the inductive hypothesis and the fact that .

If then for all . Observe that . This follows by conjugating by a permutation in which preserves the color classes of and which maps to for each . Similarly, if then . Therefore,

for any and .

Next we choose and so that extends . Note for all because . Therefore,

where the second equality uses Equation 32. Here we are using the notation to mean .

To compute , let

be the set of possible types of orbit-partitions of permutations of that contribute to the count . To be precise, if then there is a -partition of such that the number of parts of the partition with is .

If we fix then the number of permutations whose orbit partition has type is

where we have used Equation 8. It follows that

where

So

We plug this into Equation 33 to obtain

Define by and for . We establish asymptotic estimates of the sum of in Lemma 7.4 and show that it suffices to sum over a small ball around .

Recall . Let . We define by

Because , if then .

We claim that for any there exists an and such that for , . This follows from observing that is invertible and that . By Lemma 7.4 we have

Furthermore, since and , for sufficiently small we have . Because , the ratio simplifies to

In particular, at we obtain

Suppose . Then . So

Since is arbitrary, this and Equation 36 imply

We plug this into Equation 35 to obtain

Lemma 7.4.

For any we have

Proof.

We use the following general consequence of Stirling’s formula: let , with . Let be any sequence in such that and as . If then .

By Equation 34

By setting , we obtain

We thus have:

where is given by

Note that does not depend on or . Furthermore is continuous and strictly concave. By the method of Lagrange multipliers (see §4.2) its maximum on occurs at . In particular given there is some such that

We claim that is non-empty for sufficiently large . Since this claim and standard arguments involving the exponential growth of imply the lemma.

To prove the claim, we exhibit a member of in a series of three steps. First, let be defined by for each . This satisfies the integrality condition, that is for all . Furthermore .

Second, define by

Since and for each we maintain the integrality condition, and . Furthermore

Finally let . Define by

We claim for sufficiently large. First , with the same equalities holding for . We have since for each and . We also have . Furthermore . Finally by repeated application of the triangle inequality

We show that is small for sufficiently large. This will not only imply that , but also that for each . Note that

This implies . Thus

As and are fixed, we have for large enough . Since for each we have . Thus for sufficiently large we will also have for each . Therefore and we have for sufficiently large.

Proof of Lemma 7.1.

Let be the Borel probability measure on defined by

for any Borel set . By Proposition 7.2, it suffices to show that as for any clopen set . Because clopen sets are finite unions of cylinder sets, it suffices to show that if is a finite subset and then where is the cylinder set . We can further assume to be a connected finite union of hyperedges with . The proof now follows from Lemma 7.3 since

where, in the last equality, is any vertex in . These equations are justified as follows. By symmetry, if , then . On the other hand, if , then . Lastly, .

7.2. The density of the rigid set

This subsection proves Lemma 6.1. So we assume the hypotheses of Proposition 5.9. An element is a 2-coloring of the Cayley hyper-tree of . Interpreted as such, are well-defined subsets of (see §6 to recall the definitions).

For , let

Recall that and . Since we assume the hypothesis of Proposition 5.9, is asymptotic to as .

Proposition 7.5.
Proof.

For brevity, let be the subgroup generated by . So is a hyper-edge of the Cayley hyper-tree. Let be the set of all such that

(1)

supports the edge with respect to and

(2)

.

Since , it follows that . The events for , …, are i.i.d. Let be their common probability.

We write for the probability that a binomial random variable with trials and success probability equals . Since the events , …, are i.i.d., is the event that either 1 or 2 of these events occur and is the event that at least 3 of these events occur, it follows that

Thus

Claim 2.

and for , where

Proof.

To reduce notational clutter, let . Note that is the probability that the edge is critical. So

Conditioned on , is the event that . By symmetry and the Markov property is the -st power of the probability that given that supports . By translation invariance, that probability is the same as the probability that given that does not support the edge . By definition of and the Markov property, this is the same as the probability that a binomial random variable with trials and success probability is at least 3. This implies the claim.

The next step is to bound from below:

The last inequality follows from the fact that . This motivates the next claim:

Claim 3.

Suppose is a number satisfying . Then for all sufficiently large ,

Proof.

The first inequality follows from:

The second inequality follows from:

The third line follows from the general inequality valid for all . To see the limit, observe that under the hypotheses of Proposition 5.9, . So . In particular, and as . The implies the limit. Thus if is large enough then the second inequality holds.

To see the last inequality, observe that since , as . On the other hand, . Thus and are asymptotic to . Since for all sufficiently large , this proves the last inequality assuming is sufficiently large.

Now suppose that is as in Claim 3. Then

The first inequality is implied by Equation 37. The second and third inequalities follow from Claim 3. For example, since , .

Therefore, if satisfies the bounds then satisfies the same bounds. Since , it follows that

Because for any ,

The first equality occurs because decreases to . By Equation 37 and Claim 3 (with in place of ),

Lemma 7.6.

where the implied limit is as and is bounded.

Proof.

As in the previous proof, let be the subgroup generated by . So is a hyper-edge of the Cayley hyper-tree.

Let . We say that an edge is attaching (for ) if it is supported by a vertex and . Let if . Otherwise, let be the number of attaching edges containing . Then by translation invariance,

Let be the set of all such that

(1)

is a critical edge supported by some vertex ,

(2)

,

(3)

.

By the Markov property and symmetry,

Let

be the set of all such that is supported by ,

be the set of all such that ,

be the set of all such that .

By symmetry

Conditioned on , if occurs then there are no more than attaching edge supported by with . By the Markov property and symmetry,

where we have used Equation 38. Also . Thus . So Equation 39 and Equation 40 along with straightforward estimates imply .

Lemma 7.7.

.

Proof.

Given a coloring of the Cayley hyper-tree and , define . Also define . Since and the sets are decreasing in , it suffices to prove that .

Suppose . Then there exists an infinite set such that . So . So for each , there exist and hyper-edges such that

(1)

supports (with respect to ),

(2)

supports (with respect to ),

(3)

,

(4)

.

Because , is necessarily contained in the finite set . So after passing to an infinite subset of if necessary, we may assume there is a fixed element such that . Similarly, we may assume there are edges such that and .

Observe that because implies . Similarly, . Because and the sets are decreasing in , it follows that . Therefore . This verifies all of the conditions showing that and therefore as required.

We can now prove Lemma 6.1:

Proof of Lemma 6.1.

Observe that the sets are clopen for finite . By Lemma 7.1,

for any finite . Since is the decreasing limit of , Lemma 7.7 implies

By Proposition 7.5 and Lemma 7.6,

Together with Equation 41, this implies the lemma.

8. Rigid vertices

This section proves Lemma 6.2. So we assume the hypotheses of Proposition 5.9.

As in the previous section, fix an equitable coloring . We assume and let be a uniformly random uniform homomorphism conditioned on the event that is proper with respect to .

Lemma 8.1 (Expansivity Lemma).

There is a constant such that the following holds. If then with high probability (with respect to the planted model), as , for any with the following is true. For a vertex let denote the set of hyperedges supported by . Let be the set of all edges such that . Then

Proof.
Claim 4.

There exists such that implies

,

,

and for any and

Proof.

Recall that . So the first two requirements are immediate for large enough.

We estimate each of the first three terms on the left as follows. Because , there exists such that implies .

Note,

So by making larger if necessary, we may assume

Since

we may also assume . Combining these inequalities, we obtain

From now on, we assume with as above. To simplify notation, let . Given a -tuple , , …, , of natural numbers, let be the event that there are exactly critical edges of the form and supported by a vertex of color , and critical edges of the form and supported by a vertex of color . We denote . Let be the planted model conditioned on .

This measure can be constructed as follows. Let be the set of triples with , and . First choose edges uniformly at random subject to the conditions:

(1)

each has cardinality and whenever or ,

(2)

each is critical and is supported by a vertex of color with respect to .

Next choose a uniformly random uniform homomorphism subject to:

(1)

is a proper coloring with respect to ,

(2)

each is of the form with respect to ,

(3)

the edges are precisely the critical edges of with respect to .

Then is distributed according to .

For , let be the collection of all subsets such that and . Note that is empty.

To prove the lemma, we claim it suffices to show the following:

Claim 5.

If is sufficiently large and then tends to zero in .

We briefly return to the model not conditioned on . Let be the set of all critical edges. By Lemma 7.1, with high probability in , is asymptotic to as . Let be the event that . So as .

By a first moment argument and the above paragraph, to prove the lemma it suffices to show that tends to in . Now is a convex combination of over those such that , so the lemma follows from Claim 5.

Before proving the above claim we need to prove another claim, which needs the following setup. For and , let be the event that is supported by a vertex in and . For , let . Note that for any , the event is contained in where the union is over all with . So

where the sum is over all and with and .

Before proving the claim above, we need to prove:

Claim 6.

For , any with cardinality , any with , and any , one has .

Proof of Claim 6.

For , let be a random edge in with cardinality as in the sampling algorithm above. Without loss of generality, we imagine that for has been chosen before . Let . We say that an edge is of type if is of the form . Let be those edges of type , and let ,

be the number of vertices such that , and

be the number of vertices such that .

Let . The probability that is supported by a vertex in is (this is conditioned on the edges for ).

Suppose first that is supported by a vertex in with (so ). Then the probability that is

It follows that for

In order to bound this expression, consider

Thus,

A similar argument shows that the same bound above holds for the case .

Because , , so that . Note

where we have used the assumption and Claim 4. Similarly, . Substitute these inequalities above to obtain

We now prove Claim 5. Apply the chain rule and Claim 6 to obtain: if has and with then

By Equation 42

where the sum is over all and with , . Define by and . By hypothesis . Consider the following cases:

Case 1.

. Then we make the following estimates:

It follows that , which is bounded by for large enough .

Case 2.

. We make the following estimates:

so that

for some constant . For sufficiently large and , this is bounded above by by Claim 4 and the choice of and . It follows that in this range of , decays super-polynomially.

This proves Claim 5 and finishes the lemma.

Lemma 8.2.

Let . Then there exists such that implies is -rigid (with high probability in the planted model as ).

Proof.

Without loss of generality, we may assume that .

Observe that the sets are clopen for finite . By Lemma 7.1,

Since the sets are decreasing with , this implies the existence of such that implies

Choose . Let be a -proper coloring. Let

Define similarly. Since (with high probability) and , it follows that (with high probability).

For every , let be the subset of -critical edges such that .

We claim that if then where

Since , . If then supports with respect to . Therefore because is a proper coloring, there must exist a vertex such that . This, combined with , implies since there are only two possible colors. Since , this means that and therefore , which proves the claim.

For every , by the definition of the sets . Since edges can only be supported by one vertex, the sets are pairwise disjoint. So

If then . So it follows from Lemma 8.1 that (with high probability), . Thus is -rigid.

We can now prove Lemma 6.2.

Proof of Lemma 6.2.

Let . By Lemma 8.2, there exists such that implies is -rigid with high probability in the planted model as . So without loss of generality we condition on the event that is -rigid.

Now let . Let be a -proper coloring. Let

We claim that . To see this, let . Then there exists an edge supported by (with respect to ) with . Since is proper and , there must exist a vertex with . Necessarily, . So there exists a function such that is contained in an edge supported by with . Because , is injective. This proves the claim.

Now suppose that . Since and are disjoint, either or . If , we are done because by assumption that is -rigid, and so

If the claim implies . Now in the proof of Lemma 8.2 we have shown that , so and we are again done. This proves the lemma.

Appendix A. Topological sofic entropy notions

In this appendix, we recall the notion of topological sofic entropy from Reference 10 and prove that it coincides with the definition given in §2.

Let be an action of on a compact metrizable space . So for , is a homeomorphism and . We will also denote this action by . Let be a map, be a pseudo-metric on , be finite and . For , let

be pseudo-metrics on . Also let

Informally, elements of are “good models” that approximate partial periodic orbits with respect to the chosen sofic approximation.

For a pseudo-metric space , a subset is -separated if for all , . Let be the maximum cardinality over all -separated subsets of .

Given a sofic approximation to , we define

where the symbol means that varies over all finite subsets of .

We say that a pseudo-metric on is generating if for every there exists such that . By Reference 10, Proposition 2.4, if is continuous and generating, is invariant under topological conjugacy and does not depend on the choice of . So we define where is any continuous generating pseudo-metric. The authors of Reference 10 define the topological sofic entropy of to be . The main result of this appendix is:

Proposition A.1.

Let be a finite set and a closed shift-invariant subspace. Let be the shift action of on . Then where is as defined in §2.

Proof.

To begin, we choose a pseudo-metric on as follows. For , let . Then is continuous and generating. So .

Let , be an open set. We first analyze from the definition of . Note that the topology for is generated by the base where if then . In other words, open sets of are those that specify a configuration on a finite subset of coordinates. For let be the open set containing all elements containing some configuration that appears in in the finite window .

Claim 7.

Every open superset contains some open set of the form .

Because decreases as decreases, it suffices to only consider open sets of the form in the definition of .

Proof.

is a union of elements in and is compact, so that there exists with containing only finitely many base elements. Let be the union of all coordinates specified by base elements in . It follows that contains .

Without loss of generality and for convenience we can assume that is symmetric, i.e. , and contains the identity. This is because we can replace any with the larger set , and both and are monotone decreasing in .

Let . We assume . Now for each we obtain an element and then show that these partial orbits form a good estimate for . Let . For every , choose some that agrees with on . For choose an arbitrary element . Thus .

Now for , . On the other hand we also want , which is true if and . It follows that .

Now consider separation of . We will show that a slightly smaller subset is -separated. By the pigeonhole principle there exists a subset of size at least such that has cardinality at least . Furthermore, if then if for some , where . Since there are at most configurations in with some fixed configuration on , there exists a -separated subset of of size at least . It follows that

On the other hand, suppose we have some . This means that for every , there exists a set of size such that for , . Let . Then and for , for every , .

Define by . Then for any fixed , for every , . Since , it follows that .

Also note that are -separated for any if and only if for some , so that . It follows that

Note that in the definitions of and , is fixed with respect to .

Appendix B. Concentration for the planted model

Definition 8 (Hamming metrics).

Define the normalized Hamming metric on by

Define the normalized Hamming metric on by

The purpose of this section is to prove:

Theorem B.1.

There exist constants (depending only on ) such that for every there exists such that for all , for every -Lipschitz ,

B.1. General considerations

To begin the proof we first introduce some general-purpose tools.

Definition 9.

A metric measure space is a triple where is a metric space and is a Borel probability measure on . We will say is -concentrated if for any 1-Lipschitz function ,

If is -concentrated and is -Lipschitz, then since is -Lipschitz,

Lemma B.2.

Let be -concentrated. If is an -Lipschitz map onto a measure metric space and is the push-forward measure, then is -concentrated.

Proof.

This follows from the observation that if is 1-Lipschitz, then the pullback is -Lipschitz. So equation Equation 43 implies

The next lemma is concerned with the following situation. Suppose is a finite disjoint union of spaces . Even if we have good concentration bounds on the spaces , this does not imply concentration on because it is possible that a -Lipschitz function will have different means when restricted to the ’s. However, if most of the mass of is concentrated on a sub-union (for some ) and the sets are all very close to each other, then there is a weak concentration inequality on .

Lemma B.3.

Let be a measure metric space with diameter . Suppose is a finite disjoint union of spaces , each with positive measure (). Let be the induced probability measure on . Suppose there exist and constants satisfying:

(1)

.

(2)

For every , there exists a measure on with marginals respectively such that

(3)

For each , is -concentrated.

Then for every -Lipschitz function and every ,

Proof.

Let be a 1-Lipschitz function. After adding a constant to if necessary, we may assume . Note that the mean of is a convex combination of its restrictions to the ’s:

Since is 1-Lipschitz with zero mean, . So

where the last inequality uses that and .

For any , the and -means of are -close:

So for any ,

Combined with the previous estimate, this gives

Now we estimate the -probability that is (assuming ):

The next lemma is essentially the same as Reference 11, Proposition 1.11. We include a proof for convenience.

Lemma B.4 (Reference 11).

Suppose is -concentrated and is -concentrated. Define a metric on by . Then is -concentrated.

Proof.

Let be 1-Lipschitz. For , define by . Define by . Then and are -Lipschitz.

If then either or . Thus

Lemma B.5.

Let and be metric-measure spaces. Suppose

(1)

are finite sets, and are uniform probability measures,

(2)

there is a surjective map and a constant such that for all ,

(3)

is -concentrated,

(4)

for each , the fiber is -concentrated (with respect to the uniform measure on and the restricted metric),

(5)

for each there is a probability measure on with marginals equal to the uniform measures on and such that

Then is -concentrated.

Proof.

Let be 1-Lipschitz. Let be its conditional expectation defined by

Also let be its expectation.

We claim that is -Lipschitz. So let . By hypothesis (5)

The first inequality holds because is 1-Lipschitz and the second by hypothesis (5). This proves is -Lipschitz.

Let . Because is -to-1, it pushes forward the measure to . Because is -concentrated,

Because each fiber is -concentrated, for any ,

Average this over to obtain

Combine this with Equation 44 to obtain

which implies the lemma.

B.2. Specific considerations

Given an equitable coloring , let be the stabilizer of :

Lemma B.6.

The group is -concentrated (when equipped with the uniform probability measure and the restriction of the normalized Hamming metric ).

Proof.

The group is isomorphic to the direct product which is isomorphic to . By Reference 11, Corollary 4.3, is -concentrated. So the result follows from Lemmas B.4 and B.2. This uses that the inclusion map from to itself is -Lipschitz when the source is equipped with the sum of the -metrics and the target equipped with the metric.

We need to show that certain subsets of the group are concentrated. To define these subsets, we need the following terminology.

Recall that a -partition of is an unordered partition , …, of such that each has cardinality . Let be the set of all -partitions of . The group acts on by , …, .

Let . The orbit-partition of is the partition of into orbits of . For example, for any the element of containing is . Let be the set of all permutations such that the orbit-partition of is a -partition.

Recall from §4.1 that a -partition has type with respect to a coloring if the number of partition elements of with is . We will also say that a permutation has type with respect to a coloring if its orbit-partition has type with respect to .

Let be the set of all permutations such that has type with respect to .

Lemma B.7.

The subset is either empty or -concentrated (when equipped with the normalized Hamming metric and the uniform probability measure) where is a constant depending only on .

Proof.

Let be the set of all (unordered) -partitions of with type (with respect to ). We will consider this set as a metric space in which the distance between partitions is where denotes symmetric difference.

Let be the map which sends a permutation to its orbit-partition. We will verify the conditions of Lemma B.5 with , and . Condition (1) is immediate.

Observe that is surjective and constant-to-1. In fact for any partition , since an element is obtained by choosing a -cycle for every part of . To be precise, if , …, then is the set of all permutations of the form where is a -cycle with support in . This verifies condition (2) of Lemma B.5.

Observe that acts transitively on . Fix and define a map by . We claim that is -Lipschitz. Indeed, if then

Because is -concentrated by Lemma B.6, Lemma B.2 implies is -concentrated. This verifies condition (3) of Lemma B.5.

We claim that is -concentrated. To see this, let , …, and let be the set of all -cycles with support in . Then is isometric to . The diameter of , viewed as a subset of with the normalized Hamming metric on , is . So the claim follows from Reference 11, Corollary 1.17. This verifies condition (4) of Lemma B.5.

For , let be the set of all pairs such that if then the restriction of to equals the restriction of to . Observe that is non-empty and the projection maps () are constant-to-1. In fact, for any , the set of with is bijective with the set of assignments of -cycles to parts in .

Let be the uniform probability measure on . Since the projection maps are constant-to-1, the marginals of are uniform. Moreover, if then

Thus

This verifies condition (5) of Lemma B.5.

We have now verified all of the conditions of Lemma B.5. The lemma follows.

Let be the set of all such that if is the type of with respect to then . In other words, if and only if the orbit-partition of is proper with respect to (where we think of as a collection of hyper-edges).

Let with and for . For let be the set of all such that if is the type of (with respect to ) then

Lemma B.8.

With notation as above, for sufficiently large

where is a constant depending only on .

Proof.

Let

The orbit-partition map from is constant-to-1 and maps onto and onto . Therefore, it suffices to prove

where is a constant depending only on .

Let be the set of all vectors such that , and .

Recall from Lemma 4.3 that if and is -valued then

where . By the proof of Theorem 4.2 (specifically equation Equation 10), is uniquely maximized in by the vector .

In order to get a lower bound on , observe that there exists such that is -valued and for all . Thus . It follows that

We claim that the Hessian of is negative definite. To see this, one can consider to be a function of . The linear terms in do not contribute to its Hessian. Since the second derivative of is ,

Thus the Hessian is diagonal and every eigenvalue is negative; so it is negative definite.

Thus if is such that then

where is half the smallest absolute value of an eigenvalue of the Hessian of on .

If is the type of a -partition of then , , , …, . Thus the number of different types of -partitions of is bounded by a polynomial in (namely ). Thus

where . This implies the lemma.

Recall that a -cycle is a permutation of the form , …, for some , …, . In other words, has fixed points and one orbit of size . The support of is the complement of the set of -fixed points. It is denoted by . Two permutations are disjoint if their supports are disjoint. A permutation is a disjoint product of -cycles if there exist pairwise disjoint -cycles , …, such that . In this case we say that each is contained in .

Lemma B.9.

Let . Suppose

Suppose and are non-empty (for some integer and equitable coloring ).

For , let be the number of -cycles that are either in or in but not in both. Let

Then is non-empty and there exists a probability measure on with marginals equal to the uniform probability measures on and respectively.

Proof.

Let be a disjoint product of -cycles. The type of with respect to is the vector defined by: is times the number of -cycles contained in such that .

Let . Then there exist disjoint -cycles , …, in such that if and is the type of then . Note by assumption on and . Moreover, there exist -cycles , , such that the collection , …, is pairwise disjoint and the type of is . Then . So which proves is non-empty.

We claim that there is a constant such that for every the number of with is . Indeed the following algorithm constructs all such with no duplications:

Step 1.

Let be a representation of as a disjoint product of -cycles. Choose a vector such that

(1)

there exists a subset with cardinality such that if then is the type of ;

(2)

for all .

Step 2.

Choose a subset satisfying the condition in Step 1.

Step 3.

Choose pairwise disjoint -cycles , …, such that

(1)

() ();

(2)

is not contained in ();

(3)

if then has type .

The range of possible vectors in Step 1 depends only on . The number of choices in Steps 2 and 3 depends only on the choice of in Step 1 and on . This proves the claim.

Similarly, there is a constant such that for every the number of with is . It follows that the uniform probability measure on has marginals equal to the uniform probability measures on and respectively.

Corollary B.10.

Let denote the uniform probability measure on and let be the associated expectation operator. There are constants (depending only on ) such that for every , there exists such that for all , for every 1-Lipschitz ,

Moreover is monotone decreasing.

Proof.

The set is the disjoint union of over . Let . Lemmas B.7, B.8 and B.9 imply that for all sufficiently large , this decomposition of satisfies the criterion in Lemma B.3 where we set , and where depend only on . So for every -Lipschitz function , every and all sufficiently large ,

In particular, there exist such that if the inequality above holds and . By choosing larger if necessary, we require that is monotone decreasing.

Set . Because

where . The corollary is now finished by changing variables.

Proof of Theorem B.1.

The space of homomorphisms is the -fold direct power of the spaces . So the Theorem follows from Corollary B.10 and the proof of Lemma B.4.

Acknowledgments

The second author would like to thank Tim Austin and Allan Sly for helpful conversations. We would like to thank the anonymous reviewer for many corrections which have greatly improved the paper.

Table of Contents

  1. Abstract
  2. 1. Introduction
    1. Theorem 1.1.
    2. 1.1. Random sofic approximations
    3. 1.2. Proper colorings of random hyper-graphs from a statistical physics viewpoint
    4. 1.3. The action
    5. 1.4. Sofic entropy of the shift action on proper colorings
    6. 1.5. Random hyper-graph models
    7. Definition 1.
    8. Definition 2.
    9. 1.6. The strategy and a key lemma
    10. Lemma 1.2 (Key Lemma).
    11. Proposition 5.9.
  3. 2. Topological sofic entropy
    1. Lemma 2.1.
  4. 3. Reduction to the key lemma
    1. Lemma 3.1.
  5. 4. The first moment
    1. Theorem 4.1.
    2. Theorem 4.2.
    3. 4.1. Almost proper 2-colorings
    4. Definition 3.
    5. Definition 4.
    6. Definition 5.
    7. Lemma 4.3.
    8. Lemma 4.4.
    9. Lemma 4.5.
    10. 4.2. Equitable colorings
  6. 5. The second moment
    1. Theorem 5.1.
    2. Definition 6.
    3. Lemma 5.2.
    4. Lemma 5.3.
    5. Lemma 5.4.
    6. 5.1. Analysis of and the proof of Lemma 1.2 inequality 2
    7. Lemma 5.5.
    8. Lemma 5.6.
    9. Corollary 5.7.
    10. Proposition 5.8.
    11. 5.2. Reducing Lemma 1.2 inequality 3 to estimating the local cluster
    12. Proposition 5.9.
    13. Lemma 5.10.
    14. Lemma 5.11.
    15. Corollary 5.12.
  7. 6. The local cluster
    1. Lemma 6.1.
    2. Definition 7.
    3. Lemma 6.2.
    4. Proposition 5.9.
  8. 7. A Markov process on the Cayley hyper-tree
    1. 7.1. Local convergence
    2. Lemma 7.1.
    3. Proposition 7.2.
    4. Lemma 7.3.
    5. Lemma 7.4.
    6. 7.2. The density of the rigid set
    7. Proposition 7.5.
    8. Lemma 7.6.
    9. Lemma 7.7.
  9. 8. Rigid vertices
    1. Lemma 8.1 (Expansivity Lemma).
    2. Lemma 8.2.
  10. Appendix A. Topological sofic entropy notions
    1. Proposition A.1.
  11. Appendix B. Concentration for the planted model
    1. Definition 8 (Hamming metrics).
    2. Theorem B.1.
    3. B.1. General considerations
    4. Definition 9.
    5. Lemma B.2.
    6. Lemma B.3.
    7. Lemma B.4 (11).
    8. Lemma B.5.
    9. B.2. Specific considerations
    10. Lemma B.6.
    11. Lemma B.7.
    12. Lemma B.8.
    13. Lemma B.9.
    14. Corollary B.10.
  12. Acknowledgments

Mathematical Fragments

Theorem 1.1.

There exists a countable group , a mixing action by homeomorphisms on a compact metrizable space and two sofic approximations to such that

Lemma 1.2 (Key Lemma).

Let . Also let . Then

Moreover, for any

there exists (depending on ) such that for all if

for some then

Also,

In all cases above, the limits are over .

Lemma 2.1.

For any sofic approximation with ,

Lemma 3.1.

Let be finite and . Then there are constants such that for all with ,

Equation (4)
Theorem 4.1.
Theorem 4.2.

Moreover,

where .

Definition 3.

A -partition of is an unordered partition of into sets of size . Of course, such a partition exists if and only if in which case there are

such partitions. By Stirling’s formula,

Lemma 4.3.

Let , , …, be such that and . Let . Let be a map such that . Let be the number of -partitions of such that has type . Then

Equation (8)
Lemma 4.4.

Let be an integer-valued matrix (for some ), and the set of all such that and for all . For , let be the set of all such that is integer-valued.

Assume is compact and there exists with for all . Then there is a constant such that if is non-empty then it is -dense in in the following sense. For any there exists such that . Moreover, the constant can be chosen to depend continuously on the vector .

Lemma 4.5.

Given a matrix define

where . Then for any ,

where the constant implicit in the error term does not depend on .

Equation (9)
Equation (10)
Theorem 5.1.

With notation as above, for any such that is an integer,

(for ) where

is defined to be the unique solution to

and

Moreover, the constant implicit in the error term may depend on but not on . The constant implicit in the term depends continuously on for .

Lemma 5.3.

Let and be equitable. Suppose . Let be the number of -partitions of type . Also let

where is the multinomial . Then

(for ) where the constant implicit in the error term depends on but not on or .

Lemma 5.4.

Let . Then there exists a unique such that

Moreover, if , and are defined by

then .

Equation (11)
Equation (12)
Equation (14)
Equation (15)
Equation (16)
Equation (17)
Lemma 5.5.

Suppose . Define by . Then

The last equation implies .

Equation (18)
Lemma 5.6.

Let be . If

then

In particular, if is sufficiently large then .

Equation (19)
Proposition 5.8.

Let

Then there exists (depending on ) such that for all if

for some then in the interval , attains its unique maximum at 1/2. That is,

and if and then .

Equation (20)
Equation (21)
Case 4.

We claim that if then for all sufficiently large (independent of the choice of ).

Recall that we define by . By Equation 18,

since , assuming is sufficiently large.

The assumption on implies . So the second equality of Lemma 5.5 implies

Through elementary but messy calculus computations one may show using the fact that that

By Taylor’s theorem,

for some . Since , we have . Furthermore, since is monotone decreasing on , we have . Thus

and for sufficiently large we have . Since , Equation 20 implies

is strictly less than if is sufficiently large.

Proposition 5.9.

Let . Then for all sufficiently large (depending on ), if

for some with then with high probability in the planted model, . In symbols,

Lemma 5.10.

where is sub-exponential in .

Lemma 5.11.

, where is sub-exponential in .

Equation (23)
Equation (24)
Equation (25)
Equation (26)
Equation (27)
Lemma 6.1.

For any there exists such that implies

Lemma 6.2.

For any ,

Equation (28)
Lemma 7.1.

Let be an equitable coloring with . If is clopen, then for every

Proposition 7.2.

We have

Lemma 7.3.

Let be either a singleton or a connected finite union of hyperedges containing the identity. Let be a proper coloring. Let be the event that . Let be the number of proper colorings of . Then

Equation (30)
Equation (32)
Equation (33)
Equation (34)
Equation (35)
Equation (36)
Lemma 7.4.

For any we have

Proposition 7.5.
Equation (37)
Claim 3.

Suppose is a number satisfying . Then for all sufficiently large ,

Equation (38)
Lemma 7.6.

where the implied limit is as and is bounded.

Equation (39)
Equation (40)
Lemma 7.7.

.

Equation (41)
Lemma 8.1 (Expansivity Lemma).

There is a constant such that the following holds. If then with high probability (with respect to the planted model), as , for any with the following is true. For a vertex let denote the set of hyperedges supported by . Let be the set of all edges such that . Then

Claim 4.

There exists such that implies

,

,

and for any and

Claim 5.

If is sufficiently large and then tends to zero in .

Equation (42)
Claim 6.

For , any with cardinality , any with , and any , one has .

Lemma 8.2.

Let . Then there exists such that implies is -rigid (with high probability in the planted model as ).

Theorem B.1.

There exist constants (depending only on ) such that for every there exists such that for all , for every -Lipschitz ,

Definition 9.

A metric measure space is a triple where is a metric space and is a Borel probability measure on . We will say is -concentrated if for any 1-Lipschitz function ,

If is -concentrated and is -Lipschitz, then since is -Lipschitz,

Lemma B.2.

Let be -concentrated. If is an -Lipschitz map onto a measure metric space and is the push-forward measure, then is -concentrated.

Lemma B.3.

Let be a measure metric space with diameter . Suppose is a finite disjoint union of spaces , each with positive measure (). Let be the induced probability measure on . Suppose there exist and constants satisfying:

(1)

.

(2)

For every , there exists a measure on with marginals respectively such that

(3)

For each , is -concentrated.

Then for every -Lipschitz function and every ,

Lemma B.4 (Reference 11).

Suppose is -concentrated and is -concentrated. Define a metric on by . Then is -concentrated.

Lemma B.5.

Let and be metric-measure spaces. Suppose

(1)

are finite sets, and are uniform probability measures,

(2)

there is a surjective map and a constant such that for all ,

(3)

is -concentrated,

(4)

for each , the fiber is -concentrated (with respect to the uniform measure on and the restricted metric),

(5)

for each there is a probability measure on with marginals equal to the uniform measures on and such that

Then is -concentrated.

Equation (44)
Lemma B.6.

The group is -concentrated (when equipped with the uniform probability measure and the restriction of the normalized Hamming metric ).

Lemma B.7.

The subset is either empty or -concentrated (when equipped with the normalized Hamming metric and the uniform probability measure) where is a constant depending only on .

Lemma B.8.

With notation as above, for sufficiently large

where is a constant depending only on .

Lemma B.9.

Let . Suppose

Suppose and are non-empty (for some integer and equitable coloring ).

For , let be the number of -cycles that are either in or in but not in both. Let

Then is non-empty and there exists a probability measure on with marginals equal to the uniform probability measures on and respectively.

Step 1.

Let be a representation of as a disjoint product of -cycles. Choose a vector such that

(1)

there exists a subset with cardinality such that if then is the type of ;

(2)

for all .

Step 2.

Choose a subset satisfying the condition in Step 1.

Step 3.

Choose pairwise disjoint -cycles , …, such that

(1)

() ();

(2)

is not contained in ();

(3)

if then has type .

The range of possible vectors in Step 1 depends only on . The number of choices in Steps 2 and 3 depends only on the choice of in Step 1 and on . This proves the claim.

Similarly, there is a constant such that for every the number of with is . It follows that the uniform probability measure on has marginals equal to the uniform probability measures on and respectively.

Corollary B.10.

Let denote the uniform probability measure on and let be the associated expectation operator. There are constants (depending only on ) such that for every , there exists such that for all , for every 1-Lipschitz ,

Moreover is monotone decreasing.

References

Reference [1]
R. L. Adler, A. G. Konheim, and M. H. McAndrew, Topological entropy, Trans. Amer. Math. Soc. 114 (1965), 309–319, DOI 10.2307/1994177. MR175106,
Show rawAMSref \bib{adler-konheim-mcandrew}{article}{ author={Adler, R. L.}, author={Konheim, A. G.}, author={McAndrew, M. H.}, title={Topological entropy}, journal={Trans. Amer. Math. Soc.}, volume={114}, date={1965}, pages={309--319}, issn={0002-9947}, review={\MR {175106}}, doi={10.2307/1994177}, }
Reference [2]
Jean Moulin Ollagnier, Ergodic theory and statistical mechanics, Lecture Notes in Mathematics, vol. 1115, Springer-Verlag, Berlin, 1985, DOI 10.1007/BFb0101575. MR781932,
Show rawAMSref \bib{ollagnier-book}{book}{ author={Moulin Ollagnier, Jean}, title={Ergodic theory and statistical mechanics}, series={Lecture Notes in Mathematics}, volume={1115}, publisher={Springer-Verlag, Berlin}, date={1985}, pages={vi+147}, isbn={3-540-15192-3}, review={\MR {781932}}, doi={10.1007/BFb0101575}, }
Reference [3]
David Kerr and Hanfeng Li, Entropy and the variational principle for actions of sofic groups, Invent. Math. 186 (2011), no. 3, 501–558, DOI 10.1007/s00222-011-0324-9. MR2854085,
Show rawAMSref \bib{kerr-li-variational}{article}{ author={Kerr, David}, author={Li, Hanfeng}, title={Entropy and the variational principle for actions of sofic groups}, journal={Invent. Math.}, volume={186}, date={2011}, number={3}, pages={501--558}, issn={0020-9910}, review={\MR {2854085}}, doi={10.1007/s00222-011-0324-9}, }
Reference [4]
Lewis Bowen, Measure conjugacy invariants for actions of countable sofic groups, J. Amer. Math. Soc. 23 (2010), no. 1, 217–245, DOI 10.1090/S0894-0347-09-00637-7. MR2552252,
Show rawAMSref \bib{bowen-jams-2010}{article}{ author={Bowen, Lewis}, title={Measure conjugacy invariants for actions of countable sofic groups}, journal={J. Amer. Math. Soc.}, volume={23}, date={2010}, number={1}, pages={217--245}, issn={0894-0347}, review={\MR {2552252}}, doi={10.1090/S0894-0347-09-00637-7}, }
Reference [5]
Lewis Bowen, Examples in the entropy theory of countable group actions, Ergodic Theory Dynam. Systems 40 (2020), no. 10, 2593–2680, DOI 10.1017/etds.2019.18. MR4138907,
Show rawAMSref \bib{MR4138907}{article}{ author={Bowen, Lewis}, title={Examples in the entropy theory of countable group actions}, journal={Ergodic Theory Dynam. Systems}, volume={40}, date={2020}, number={10}, pages={2593--2680}, issn={0143-3857}, review={\MR {4138907}}, doi={10.1017/etds.2019.18}, }
Reference [6]
Dimitris Achlioptas and Cristopher Moore, Random -SAT: two moments suffice to cross a sharp threshold, SIAM J. Comput. 36 (2006), no. 3, 740–762, DOI 10.1137/S0097539703434231. MR2263010,
Show rawAMSref \bib{MR2263010}{article}{ author={Achlioptas, Dimitris}, author={Moore, Cristopher}, title={Random $k$-SAT: two moments suffice to cross a sharp threshold}, journal={SIAM J. Comput.}, volume={36}, date={2006}, number={3}, pages={740--762}, issn={0097-5397}, review={\MR {2263010}}, doi={10.1137/S0097539703434231}, }
Reference [7]
Amin Coja-Oghlan and Lenka Zdeborova, The condensation transition in random hypergraph 2-coloring, Preprint arXiv:1107.2341, 2011.
Reference [8]
Amin Coja-Oghlan and Lenka Zdeborová, The condensation transition in random hypergraph 2-coloring, Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms, ACM, New York, 2012, pp. 241–250. MR3205212,
Show rawAMSref \bib{MR3205212}{article}{ author={Coja-Oghlan, Amin}, author={Zdeborov\'{a}, Lenka}, title={The condensation transition in random hypergraph 2-coloring}, conference={ title={Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms}, }, book={ publisher={ACM, New York}, }, date={2012}, pages={241--250}, review={\MR {3205212}}, }
Reference [9]
Tim Austin, Additivity properties of sofic entropy and measures on model spaces, Forum Math. Sigma 4 (2016), Paper No. e25, 79, DOI 10.1017/fms.2016.18. MR3542515,
Show rawAMSref \bib{MR3542515}{article}{ author={Austin, Tim}, title={Additivity properties of sofic entropy and measures on model spaces}, journal={Forum Math. Sigma}, volume={4}, date={2016}, pages={Paper No. e25, 79}, review={\MR {3542515}}, doi={10.1017/fms.2016.18}, }
Reference [10]
David Kerr and Hanfeng Li, Soficity, amenability, and dynamical entropy, Amer. J. Math. 135 (2013), no. 3, 721–761, DOI 10.1353/ajm.2013.0024. MR3068400,
Show rawAMSref \bib{kerr-li-sofic-amenable}{article}{ author={Kerr, David}, author={Li, Hanfeng}, title={Soficity, amenability, and dynamical entropy}, journal={Amer. J. Math.}, volume={135}, date={2013}, number={3}, pages={721--761}, issn={0002-9327}, review={\MR {3068400}}, doi={10.1353/ajm.2013.0024}, }
Reference [11]
Michel Ledoux, The concentration of measure phenomenon, Mathematical Surveys and Monographs, vol. 89, American Mathematical Society, Providence, RI, 2001, DOI 10.1090/surv/089. MR1849347,
Show rawAMSref \bib{MR1849347}{book}{ author={Ledoux, Michel}, title={The concentration of measure phenomenon}, series={Mathematical Surveys and Monographs}, volume={89}, publisher={American Mathematical Society, Providence, RI}, date={2001}, pages={x+181}, isbn={0-8218-2864-9}, review={\MR {1849347}}, doi={10.1090/surv/089}, }

Article Information

MSC 2020
Primary: 37A35 (Entropy and other invariants, isomorphism, classification in ergodic theory)
Keywords
  • Sofic entropy
  • random hyper-graphs
Author Information
Dylan Airey
Department of Mathematics, University of Texas at Austin, Austin, Texas
Address at time of publication: Department of Mathematics, Princeton University, Princeton, New Jersey
MathSciNet
Lewis Bowen
Department of Mathematics, University of Texas at Austin, Austin, Texas
MathSciNet
Yuqing Frank Lin
Department of Mathematics, University of Texas at Austin, Austin, Texas
Address at time of publication: Department of Mathematics, Texas A&M University, College Station, Texas
MathSciNet
Additional Notes

The first author was supported in part by NSF grant DGE-1656466. The second author was supported in part by NSF grant DMS-1900386. The third author was supported in part by NSF grant DMS-1900386.

Journal Information
Transactions of the American Mathematical Society, Series B, Volume 9, Issue 2, ISSN 2330-0000, published by the American Mathematical Society, Providence, Rhode Island.
Publication History
This article was received on , revised on , , , , and published on .
Copyright Information
Copyright 2022 by the authors under Creative Commons Attribution-Noncommercial 3.0 License (CC BY NC 3.0)
Article References
  • Permalink
  • Permalink (PDF)
  • DOI 10.1090/btran/101
  • MathSciNet Review: 4383230
  • Show rawAMSref \bib{4383230}{article}{ author={Airey, Dylan}, author={Bowen, Lewis}, author={Lin, Yuqing}, title={A topological dynamical system with two different positive sofic entropies}, journal={Trans. Amer. Math. Soc. Ser. B}, volume={9}, number={2}, date={2022}, pages={35-98}, issn={2330-0000}, review={4383230}, doi={10.1090/btran/101}, }

Settings

Change font size
Resize article panel
Enable equation enrichment

Note. To explore an equation, focus it (e.g., by clicking on it) and use the arrow keys to navigate its structure. Screenreader users should be advised that enabling speech synthesis will lead to duplicate aural rendering.

For more information please visit the AMS MathViewer documentation.