PDFLINK

Descriptive Combinatorics and Distributed Algorithms

Anton Bernshteyn

Communicated by Notices Associate Editor Emilie Purvine

In this article we shall explore a fascinating area called descriptive combinatorics and its recently discovered connections to distributed algorithms—a fundamental part of computer science that is becoming increasingly important in the modern era of decentralized computation. The interdisciplinary nature of these connections means that there is very little common background shared by the researchers who are interested in them. With this in mind, this article was written under the assumption that the reader would have close to no background in either descriptive set theory or computer science. The reader will judge to what degree this endeavor was successful.

The article comprises two parts. In the first part we give a brief introduction to some of the central notions and problems of descriptive combinatorics. The second part is devoted to a survey of some of the results concerning the interactions between descriptive combinatorics and distributed algorithms, as well as a few open problems.

1. A Brief Introduction to Descriptive Combinatorics

1.1. Basic notions of descriptive set theory

1.1.1. Countable computation and Borel sets

Descriptive combinatorics is an area that emerged quite recently (about two decades ago) as the result of a symbiosis between combinatorics (especially graph theory) and descriptive set theory. For excellent surveys of this subject, see KM20 by Kechris and Marks and Pik21 by Pikhurko. In addition to combinatorics and descriptive set theory, descriptive combinatorics has close connections to numerous other branches of mathematics, such as ergodic theory, topological dynamics, probability theory, and model theory, to name a few.

To motivate the questions studied in descriptive combinatorics, it will be beneficial to first briefly discuss descriptive set theory more abstractly.

Figure 1.

A countable circuit that outputs if the input bit string does not contain two consecutive s.

$\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture} \node at (-1.2,0) {inputs}; \node[circle,draw] (x0) at (0,0) {$x_0$}; \node[circle, draw] (x1) at (1,0) {$x_1$}; \node[circle, draw] (x2) at (2,0) {$x_2$}; \node[circle, draw] (x3) at (3,0) {$x_3$}; \node[circle, draw] (x4) at (4,0) {$x_4$}; \node[circle, draw] (x5) at (5,0) {$x_5$}; \node[circle,inner sep=5] (dot0) at (6,0) {$\ldots$}; \node[circle, draw,inner sep=2] (and0) at (0,2) {$\mathtt{and}$}; \node[circle, draw,inner sep=2] (and1) at (1,2) {$\mathtt{and}$}; \node[circle, draw,inner sep=2] (and2) at (2,2) {$\mathtt{and}$}; \node[circle, draw,inner sep=2] (and3) at (3,2) {$\mathtt{and}$}; \node[circle, draw,inner sep=2] (and4) at (4,2) {$\mathtt{and}$}; \node[circle, draw,inner sep=2] (and5) at (5,2) {$\mathtt{and}$}; \node[circle,inner sep=5] (dot1) at (6,2) {$\ldots$}; \node[circle,draw] (or) at (2.5,4) {$\mathtt{or}$}; \node at (1.3,6) {output}; \node[circle,draw,inner sep=2] (not) at (2.5,6) {$\mathtt{not}$}; \begin{scope}[thick,decoration={markings,mark=at position 0.5 with {\arrow{stealth}}}] \draw[postaction={decorate}] (x0) -- (and0); \draw[postaction={decorate}] (x1) -- (and0); \draw[postaction={decorate}] (x1) -- (and1); \draw[postaction={decorate}] (x2) -- (and1); \draw[postaction={decorate}] (x2) -- (and2); \draw[postaction={decorate}] (x3) -- (and2); \draw[postaction={decorate}] (x3) -- (and3); \draw[postaction={decorate}] (x4) -- (and3); \draw[postaction={decorate}] (x4) -- (and4); \draw[postaction={decorate}] (x5) -- (and4); \draw[postaction={decorate}] (x5) -- (and5); \draw[postaction={decorate}] (dot0) -- (and5); \draw[postaction={decorate}] (dot0) -- (dot1); \draw[postaction={decorate}] (and0) -- (or); \draw[postaction={decorate}] (and1) -- (or); \draw[postaction={decorate}] (and2) -- (or); \draw[postaction={decorate}] (and3) -- (or); \draw[postaction={decorate}] (and4) -- (or); \draw[postaction={decorate}] (and5) -- (or); \draw[postaction={decorate}] (dot1) -- (or); \draw[postaction={decorate}] (or) -- (not); \end{scope} \end{tikzpicture}$

One way (out of many) of thinking about descriptive set theory is that it provides a versatile framework for gauging the inherent difficulty of mathematical problems pertaining to countable structures, in a manner analogous to how computational complexity theory gauges the inherent difficulty of finite problems. To pursue this analogy further, we can naturally present the basic concepts of descriptive set theory using a particular model of computation, namely Boolean circuits. An example of an infinite Boolean circuit is shown in Fig. 1. It is a network consisting of nodes, also called gates, joined by directed edges, or wires. The bottom layer comprises the input nodes. Each input node receives a bit value— or . The values then propagate through the network along the wires, with every gate computing a particular Boolean function of the values that feed into it: not, and, or or. Finally, one or more nodes are designated as the outputs. Thus, a Boolean circuit can be used to compute a function , where and are the sets of the input and the output nodes respectively. For example, the circuit in Fig. 1 computes the function that equals if and only if the input string does not contain two consecutive s.

There is one technical issue that we must make explicit here. Not every directed network can be used to perform well-defined computation. For instance, if a network involves a directed cycle of nodes, such as , then the computational process wouldn’t even be able to start. More generally, a Boolean circuit must be well-founded, meaning that it must not contain an infinite descending sequence of nodes such as . It is not hard to show that well-foundedness is the only necessary requirement: any well-founded network of gates implements a well-defined function.

An important part of computational complexity theory is circuit complexity, which studies how large a (finite) circuit needs to be to compute a given function . In descriptive set theory we are similarly interested in functions that can be computed by countable circuits:

Definition 1.1 (Borel subsets of ).

A subset is Borel if its characteristic function can be computed by a countable Boolean circuit. We let denote the family of all Borel subsets of .

While Definition 1.1 applies to subsets of , it can naturally be extended to sets of other types, as long as their members can be somehow encoded by infinite bit strings. For example, a countable graph with vertex set can be represented by its adjacency matrix , i.e., a countable table of s and s whose entry in row and column is if and only if and are adjacent in . We can then call a set of countable graphs Borel if there is a countable Boolean circuit that decides, given the matrix , whether is in or not. For instance, the following sets of countable graphs are Borel:

(The last one may be a bit surprising, since deciding whether a finite graph is -colorable is believed to be a computationally difficult problem. The difficulty miraculously disappears in the countable world, however.) On the other hand, the following set can be shown to not be Borel:

In fact, this set is complete analytic, which is a notion somewhat analogous to NP-completeness from computational complexity, with the added benefit that in descriptive set theory it is a theorem (due to Suslin from 1917) that complete analytic sets cannot be Borel. In general, most subsets of are not Borel: has subsets but only of them are Borel (this result follows by enumerating all countable Boolean circuits).

1.1.2. Borel sets in Polish spaces

A useful (and more classical) perspective on Borel sets is provided by topological considerations. We can define the distance between two infinite bit strings , by the formula

(This can be thought of as a weighted version of the Hamming distance.) The coefficients are chosen so that the series converges (and hence the metric is bounded, in this case by ), but otherwise they are arbitrary. In particular, using a different convergent series with positive terms would yield a different metric, but the resulting topology on the space would be the same—namely, it is the product topology arising from viewing as the product of countably many copies of the discrete space . Actually, it is not hard to explicitly describe this topology without referring to the metric: a set is open if and only if for every sequence , the membership of in can be confirmed by looking at only finitely many entries of . More formally, is open if for each , there is some such that every infinite bit string whose first entries are belongs to .

Now, it turns out that Borel subsets of can be described in purely topological terms and without any reference to Boolean circuits as follows: is the smallest family of subsets of that includes all open sets and is closed under countable Boolean operations (i.e., complements, countable unions, and countable intersections). And this characterization can be taken as the definition of Borel subsets of any topological space:

Definition 1.2 (Borel sets in topological spaces).

Let be a topological space. We let be the smallest family of subsets of that includes all open sets and is closed under countable Boolean operations. A subset is Borel if it is a member of .

Although Definition 1.2 makes sense for an arbitrary topological space , descriptive set theory is mostly concerned with so-called Polish spaces. Formally, a topological space is Polish if it is second-countable (i.e., it has a countable basis) and completely metrizable (i.e., the topology is generated by a complete metric). A particularly compelling reason for focusing on Polish spaces is that the points of a Polish space can be encoded by infinite bit strings in a “well-behaved” way. This statement is made precise by the following remarkable theorem:

Theorem 1.3 (Borel Isomorphism Theorem).

If is an uncountable Polish space, then there is a bijection

such that a set is Borel in if and only if the set is Borel in .

In other words, the computational definition of Borel subsets of given in Definition 1.1 can also be used to identify, via an appropriate coding, Borel sets in any uncountable Polish space. This is a powerful observation because many natural examples of topological spaces are Polish, for instance and, more generally, for , the infinite-dimensional space , the Baire space , the unit intervals , , and , the unit circle and, more generally, any second-countable topological manifold, all compact metric spaces, all separable Banach spaces, etc. Furthermore, it is possible to parameterize various classes of mathematical structures by points in suitably defined Polish spaces. We have already seen how to use adjacency matrices to form a Polish space of countable graphs. In a similar vein, one can assemble, e.g., a Polish space of countable groups. There are also natural Polish spaces of continuous functions, measure-preserving transformations, separable Banach spaces, etc. It is even possible to define a Polish space of all Polish spaces!⁠Footnote¹

For the interested reader, we sketch the construction of the Polish space of Polish spaces. First, it turns out that every Polish space is homeomorphic to a closed subset of . Now, let be a countable basis for the topology of . To each closed set , we assign a code as follows: if and only if . Let be the set of all codes of closed subsets of . One can show that is Polish in the relative topology inherited from . Thus, all Polish spaces are parameterized by the points of the Polish space .

✖

1.1.3. Other classes of sets in descriptive set theory

In addition to Borel sets, there are various other classes of sets that play an important role in descriptive set theory. In this section we briefly introduce three such classes.

Let us temporarily return to the space of infinite bit strings. Recall that a set is Borel if the membership in can be decided by a countable Boolean circuit. Sometimes it makes sense to inquire whether a set is not just Borel but open. As mentioned in §1.1.2, a set is open if for every sequence , the membership of in can be confirmed by looking at only finitely many bits of . Of course, we can also investigate open sets in any other Polish space.

On the other hand, sometimes a set we are working with may not be Borel and yet be “almost” Borel in a certain suitable sense. For instance, a set is measurable if there is a countable Boolean circuit that correctly decides the membership in for a random input point . More precisely, is measurable if there is a countable Boolean circuit with the following property. Let denote the characteristic function of . Sample a sequence randomly by making each be or independently with probability . Then with probability ; in other words, may fail to correctly determine ’s membership in , but the probability of failure is . We can similarly study measurable sets in any Polish space , provided that it is equipped with a Borel probability measure .⁠Footnote² In many applications, sets of measure may be safely ignored, and thus measurability is often a “good enough” substitute for Borelness.

We should point out that the class of measurable sets varies with the choice of the measure . For simplicity, we shall assume throughout this article that whenever we speak of measurable sets, it is with respect to some implicitly fixed probability measure.

✖

Finally, we also often consider the so-called Baire-measurable sets. The notion of Baire-measurability is analogous to measurability, but it is typically easier to work with. Furthermore, it is defined purely topologically, without reference to a measure or any other additional structure. The role of sets of measure is played by the so-called meager sets, i.e., countable unions of nowhere dense sets. We think of meager sets as “topologically negligible.” A set is Baire-measurable if there is a Borel set such that the symmetric difference is meager—informally, and are “topologically almost equal.” The area studying meager and Baire-measurable sets is called Baire category theory (owing to the older term “sets of first category” for meager sets). For a detailed comparison of measure and Baire category, see the excellent book Oxt80 by Oxtoby.

1.2. Borel graphs and their combinatorics

As mentioned earlier, descriptive combinatorics investigates classical combinatorial problems—such as graph coloring, matching, etc.—from the perspective of descriptive set theory. The central notion here is that of a Borel graph:

Definition 1.4 (Borel graphs).

A Borel graph is a graph whose vertex set is a Polish space and whose edge set is a Borel subset of .

In this section, we describe some examples of Borel graphs and highlight a few of their properties that are of interest in descriptive combinatorics.

1.2.1. The hypercube

\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture} \begin{scope}[xshift=1cm] \filldraw(0,0.5) circle (1pt); \filldraw(0,-0.5) circle (1pt); \draw(0,-0.5) -- (0,0.5); \node[anchor=north] at (0,-0.5) {$0$}; \node[anchor=south] at (0,0.5) {$1$}; \end{scope} \begin{scope}[xshift=3cm] \filldraw(-0.5,0.5) circle (1pt); \filldraw(-0.5,-0.5) circle (1pt); \draw(-0.5,-0.5) -- (-0.5,0.5); \filldraw(0.5,0.5) circle (1pt); \filldraw(0.5,-0.5) circle (1pt); \draw(0.5,-0.5) -- (0.5,0.5); \draw(-0.5,-0.5) -- (0.5,-0.5); \draw(-0.5,0.5) -- (0.5,0.5); \node[anchor=north] at (-0.5,-0.5) {$00$}; \node[anchor=north] at (0.5,-0.5) {$01$}; \node[anchor=south] at (-0.5,0.5) {$10$}; \node[anchor=south] at (0.5,0.5) {$11$}; \end{scope} \begin{scope}[xshift=6cm,scale=1.5] \filldraw(-0.7,0.3) circle (1pt); \filldraw(-0.7,-0.7) circle (1pt); \draw(-0.7,-0.7) -- (-0.7,0.3); \filldraw(0.3,0.3) circle (1pt); \filldraw(0.3,-0.7) circle (1pt); \draw(0.3,-0.7) -- (0.3,0.3); \draw(-0.7,-0.7) -- (0.3,-0.7); \draw(-0.7,0.3) -- (0.3,0.3); \filldraw(-0.3,0.7) circle (1pt); \filldraw(-0.3,-0.3) circle (1pt); \draw(-0.3,-0.3) -- (-0.3,0.7); \filldraw(0.7,0.7) circle (1pt); \filldraw(0.7,-0.3) circle (1pt); \draw(0.7,-0.3) -- (0.7,0.7); \draw(-0.3,-0.3) -- (0.7,-0.3); \draw(-0.3,0.7) -- (0.7,0.7); \draw(-0.7,-0.7) -- (-0.3,-0.3); \draw(-0.7,0.3) -- (-0.3,0.7); \draw(0.3,-0.7) -- (0.7,-0.3); \draw(0.3,0.3) -- (0.7,0.7); \node[anchor=north] at (-0.7,-0.7) {$000$}; \node[anchor=north] at (0.3,-0.7) {$010$}; \node[anchor=east] at (-0.7,0.3) {$100$}; \node[anchor=south] at (-0.3,0.7) {$101$}; \node[anchor=180+40] at (-0.3,-0.3) {$001$}; \node[anchor=west] at (0.7,-0.3) {$011$}; \node[anchor=35] at (0.3,0.3) {$110$}; \node[anchor=south] at (0.7,0.7) {$111$}; \end{scope} \end{tikzpicture} — **Figure 2.**
-, -, and -dimensional hypercubes. In contrast to the infinite-dimensional hypercube, these graphs are connected.

For our first example, let be the graph with vertex set in which two bit strings , are adjacent if and only if they differ in exactly one coordinate, i.e., if there is a unique index such that . We call the infinite-dimensional hypercube (it can be viewed as the inverse limit of finite-dimensional hypercubes, the first few of which are shown in Fig. 2). The graph is Borel, since it is easy to construct a countable Boolean circuit which, given two infinite bit strings and , determines if they are adjacent in .

Note that the vertex set of has cardinality . This is typical for graphs studied in descriptive combinatorics, since their vertex sets usually are uncountable Polish spaces.

Next, we observe that, somewhat surprisingly and in contrast to finite-dimensional Boolean cubes, the graph is disconnected. To see this, recall that the connected component of a vertex in a graph is the smallest set that contains and such that there are no edges joining to . Equivalently, is the set of all vertices reachable from in by a finite path. A graph is connected if it has a single connected component. Now take any vertex in . A moment’s thought reveals that the connected component is the set of all sequences that differ from in only finitely many coordinates. For instance, if , then is the set of all bit strings with finitely many s. It follows that every connected component of is countable, which implies that has -many components. This is also a typical feature of graphs studied in descriptive combinatorics.

One of the most important parameters studied in graph theory is the chromatic number of a graph:

Definition 1.5 (Chromatic numbers).

Let be a graph. A subset is -independent if no edge of joins two vertices in . The chromatic number of is the smallest cardinal such that can be partitioned into -many -independent sets.

Let us compute the chromatic number of :

Proposition 1.6.

We have .

Proof.

Clearly, . To prove that , we need to partition into two -independent sets. To this end, choose an arbitrary representative from every connected component of . For , let be the representative from . Since and belong to the same component of , they differ in finitely many coordinates. Thus, we can let be the number of coordinates where and differ and define

If and are neighbors in , then , and hence . Therefore, the sets and are -independent and partition , as desired.

Since we know that , it makes sense to ask the following question:

Can the set be partitioned into two Borel -independent sets?

This is a special case of the following general problem:

Definition 1.7 (Borel chromatic numbers).

Let be a Borel graph. The Borel chromatic number of is the smallest such that can be partitioned into -many Borel -independent sets.⁠Footnote³ The measurable and Baire-measurable chromatic numbers and are defined analogously.

Note that, by definition, if is uncountable, then . This convention is motivated by the fact that every uncountable Polish space has cardinality . However, it may still be meaningful to ask whether can be partitioned into -many Borel -independent sets for some . This issue will not play a major role in this article, since we shall be mostly interested in the case when is finite.

✖

It turns out that, although the chromatic number of is , its Borel chromatic number is uncountable!

Proposition 1.8.

The set cannot be partitioned into countably many Borel -independent sets. In other words, .

Actually, we even have (and, of course, both and are lower bounds for ). To prove this, one shows that every measurable -independent set must have measure , while every Baire-measurable -independent set must be meager. Since countable unions of measure-/meager sets are still measure-/meager, it is impossible to cover the entire space by countably many measurable/Baire-measurable -independent sets.

One interpretation of Proposition 1.8 is that there is no “explicit” partition of the vertex set of into two—or even countably many—-independent sets. What makes the construction in the proof of Proposition 1.6 “inexplicit” is the very first step, namely choosing a single vertex in each connected component of . This step implicitly invokes the so-called Axiom of Choice:

Axiom 1.9 (Choice).

If is a family of nonempty sets, then there is a way to pick one element from each set in . Formally, there is a function that assigns to each an element .

The Axiom of Choice is part of the axiom system ZFC (Zermelo–Fraenkel set theory with the Axiom of Choice) and plays a crucial role in the foundations of mathematics. While most other axioms of ZFC assert the existence of concrete sets, such as the powerset of a given set , the Axiom of Choice postulates the existence of a choice function without actually describing how the choice is to be made. In this sense, the Axiom of Choice is non-constructive—so much so that in the past it was viewed with deep suspicion by many mathematicians. After all, how can we assert that something exists without being able to provide even a single example? Although by now the controversy around the Axiom of Choice has largely died down and most mathematicians use it without reservation, it is still true that the Axiom of Choice—and hence our proof of Proposition 1.6 based on it—is inherently non-constructive. From this perspective, Proposition 1.8 says that it is impossible to come up with an alternative, constructive argument—at least if by “constructive” we mean “Borel.”⁠Footnote⁴

⁴

This is not the only way in which the Axiom of Choice affects chromatic numbers of graphs. For instance, the following remarkable fact was established by Galvin and Komjáth: the statement that the chromatic number is well-defined for every graph is equivalent to the Axiom of Choice.

✖

1.2.2. Translation and rotation

\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture}[scale=0.6] \filldraw[fill=cyan] (8.9,-0.1) rectangle (10,0.1); \begin{scope} \clip(10,0) circle (0.5); \filldraw[fill=cyan] (9,-0.1) rectangle (10.5,0.1); \end{scope} \filldraw[fill=orange] (5.9,-0.1) rectangle (8.5,0.1); \begin{scope} \clip(8.5,0) circle (0.5); \filldraw[fill=orange] (6,-0.1) rectangle (9,0.1); \end{scope} \filldraw[fill=cyan] (4.4+1.5,-0.1) rectangle (5.5+1.5,0.1); \begin{scope} \clip(5.5+1.5,0) circle (0.5); \filldraw[fill=cyan] (4.5+1.5,-0.1) rectangle (6+1.5,0.1); \end{scope} \filldraw[fill=orange] (2.9+1.5,-0.1) rectangle (4+1.5,0.1); \begin{scope} \clip(4+1.5,0) circle (0.5); \filldraw[fill=orange] (3+1.5,-0.1) rectangle (4.5+1.5,0.1); \end{scope} \filldraw[fill=cyan] (1.4+1.5,-0.1) rectangle (2.5+1.5,0.1); \begin{scope} \clip(2.5+1.5,0) circle (0.5); \filldraw[fill=cyan] (1.5+1.5,-0.1) rectangle (3+1.5,0.1); \end{scope} \filldraw[fill=orange] (0+1.5,-0.1) rectangle (1+1.5,0.1); \begin{scope} \clip(1+1.5,0) circle (0.5); \filldraw[fill=orange] (0+1.5,-0.1) rectangle (1.5+1.5,0.1); \end{scope} \filldraw[fill=cyan] (0,-0.1) rectangle (1,0.1); \begin{scope} \clip(1,0) circle (0.5); \filldraw[fill=cyan] (0,-0.1) rectangle (1.5,0.1); \end{scope} \filldraw[fill=orange] (0-1.5,-0.1) rectangle (1-1.5,0.1); \begin{scope} \clip(1-1.5,0) circle (0.5); \filldraw[fill=orange] (0-1.5,-0.1) rectangle (1.5-1.5,0.1); \end{scope} \draw(1-1.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(1,0) ++(30:0.5) arc (30:-30:0.5); \draw(2.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(4,0) ++(30:0.5) arc (30:-30:0.5); \draw(5.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(7,0) ++(30:0.5) arc (30:-30:0.5); \draw(8.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(10,0) ++(30:0.5) arc (30:-30:0.5); \node[fill=white, shape=circle] at (0-1.5, 0) {\LARGE\,\ldots}; \node[fill=white, shape=circle] at (10, 0) {\LARGE\,\ldots}; \node at (0, -0.5) {$-3$}; \node at (1.5, -0.5) {$-2$}; \node at (3, -0.5) {$-1$}; \node at (4.5, -0.5) {$0$}; \node at (4.5+1.5, -0.5) {$1$}; \node at (4.5+3, -0.5) {$2$}; \node at (4.5+4.5, -0.5) {$3$}; \end{tikzpicture} — **Figure 3.**
A partition of into two Borel -independent sets (indicated by the colors).

In the remainder of this article we shall be concerned with Borel graphs of bounded degree, meaning that there is a natural number such that every vertex of has at most neighbors. A typical example is the graph with vertex set in which vertices and are adjacent if and only if . In other words, each has precisely two neighbors in : and . It follows that every connected component of is a bi-infinite path i.e., a path infinite in both directions. In particular, just like the hypercube graph from §1.2.1, has -many countable connected components. (This fact can also be observed directly, since the vertices in the half-open interval belong to distinct components.) Clearly, . In contrast to the graph from §1.2.1 however, we can explicitly describe a partition of into two independent sets and : for every integer , put the points in the interval in if is even and in if is odd (see Fig. 3). This yields

These sets are Borel, so we in fact have .

Next we fix a number and define a graph as follows. The vertex set of is the half-open unit interval . Two vertices , are adjacent in if and only if . Again, every vertex has precisely two neighbors, namely and . (Another way of thinking about this graph is by “wrapping” the unit interval around a circle, which turns adding modulo into rotation by the angle .) Since is irrational, it is not hard to see that has -many connected components, each of which is a bi-infinite path. In other words, the graphs and are isomorphic. In particular, . But can we describe a bipartition of explicitly? An attempt to adapt the construction used for fails, although it does yield a partition of into three Borel independent sets (see Fig. 4). In fact, it turns out that no explicit bipartition of exists. This statement is made precise by the following proposition:

\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture}[scale=0.6] \draw(0,-0.25) -- (0, 0.25); \draw(0,0) -- (11,0); \filldraw[fill=olive] (10.4,-0.1) rectangle (10.5,0.1); \begin{scope} \clip(10.5,0) circle (0.5); \filldraw[fill=olive] (10.5,-0.1) rectangle (11.5,0.1); \end{scope} \filldraw[fill=cyan] (8.9,-0.1) rectangle (10,0.1); \begin{scope} \clip(10,0) circle (0.5); \filldraw[fill=cyan] (9,-0.1) rectangle (10.5,0.1); \end{scope} \filldraw[fill=orange] (5.9,-0.1) rectangle (8.5,0.1); \begin{scope} \clip(8.5,0) circle (0.5); \filldraw[fill=orange] (6,-0.1) rectangle (9,0.1); \end{scope} \filldraw[fill=cyan] (4.4,-0.1) rectangle (5.5,0.1); \begin{scope} \clip(5.5,0) circle (0.5); \filldraw[fill=cyan] (4.5,-0.1) rectangle (6,0.1); \end{scope} \filldraw[fill=orange] (2.9,-0.1) rectangle (4,0.1); \begin{scope} \clip(4,0) circle (0.5); \filldraw[fill=orange] (3,-0.1) rectangle (4.5,0.1); \end{scope} \filldraw[fill=cyan] (1.4,-0.1) rectangle (2.5,0.1); \begin{scope} \clip(2.5,0) circle (0.5); \filldraw[fill=cyan] (1.5,-0.1) rectangle (3,0.1); \end{scope} \filldraw[fill=orange] (0,-0.1) rectangle (1,0.1); \begin{scope} \clip(1,0) circle (0.5); \filldraw[fill=orange] (0,-0.1) rectangle (1.5,0.1); \end{scope} \draw(1,0) ++(30:0.5) arc (30:-30:0.5); \draw(2.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(4,0) ++(30:0.5) arc (30:-30:0.5); \draw(5.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(8.5,0) ++(30:0.5) arc (30:-30:0.5); \draw(10,0) ++(30:0.5) arc (30:-30:0.5); \draw(10.5,0) ++(30:0.5) arc (30:-30:0.5); \node[fill=white, shape=circle] at (7.5, 0) {\LARGE\,\ldots}; \node at (0,-0.5) {$0$}; \node at (11,-0.5) {$1$}; \node at (1.5, -0.5) {$\alpha$}; \node at (3, -0.5) {$2\alpha$}; \node at (4.5, -0.5) {$3\alpha$}; \end{tikzpicture} — **Figure 4.**
A partition of into three Borel -independent sets (indicated by the colors).

Proposition 1.10.

We have . Furthermore, as well.

In other words, and are two graphs that are combinatorially isomorphic, but the topological structures on their vertex sets affect their Borel, measurable, and Baire-measurable chromatic numbers in different ways.

1.2.3. Locally checkable labeling problems

In the above examples, we have been interested in chromatic numbers of graphs as well as in their Borel, measurable, etc. analogs. Naturally, there are many other combinatorial concepts we may want to study. Many of them fall into the general framework of locally checkable labelling (LCL) problems (also known as local coloring problems). In an LCL problem , we are asked to assign labels from a finite set to the vertices of a graph in such a way that certain constraints are satisfied. The constraints must be “local” in the sense that they can be checked by looking at the labels of each vertex and its neighbors. For a formal (and somewhat more general) definition, see, e.g., Ber20, §2.A.1. For our purposes, it will suffice to give a few examples that illustrate what sort of problems fall into this category.

The prototypical example is the -coloring problem: assign the labels , referred to as “colors,” to the vertices of so that the label of each vertex is distinct from the labels of its neighbors. Note that this is a “local” constraint in the sense described above: it only involves comparing the label of a vertex to those of its neighbors. By definition, the set of all vertices receiving a particular label in a -coloring must be a -independent set, and hence admits a -coloring if and only if .

Another well-studied problem is the Maximal Independent Set (MIS) problem: find an independent set that is maximal under inclusion. This can be interpreted as an LCL problem as follows. We can encode a set by its characteristic function, which assigns the labels to the vertices of . It is easy to see that is an MIS if and only if this labelling satisfies the following “local” constraint:

If a vertex is labeled , then all its neighbors are labeled , while if a vertex is labeled , then at least one of its neighbors is labeled .

As our last example, consider the perfect matching problem: find a set of edges such that every vertex is incident to exactly one edge in . If is an edge in , then we say that the vertices and are matched to each other. Thus, a perfect matching splits into pairs of matched vertices. To place the perfect matching problem into the LCL framework, let us assume that is a graph of bounded degree; more precisely, suppose that every vertex of has at most neighbors, for some . For each , we fix an arbitrary ordering of the neighbors of . (If is a Borel graph, it is possible to fix such an ordering “in a Borel way,” in the sense that for each , the set is Borel.) Then we can identify a perfect matching with a labeling of using the labels such that a vertex is given the label when it is matched to its -th neighbor. Such labeling encodes a perfect matching if and only if the following constraint holds:

Let be a vertex labeled and let be the -th neighbor of . If ’s label is , then is the -th neighbor of .

In other words, if is matched to , then must be matched to . This constraint is again “local,” which makes the perfect matching problem an LCL problem (at least on bounded degree graphs).

In descriptive combinatorics, we are interested in Borel solutions to LCL problems:

Definition 1.11 (Borel solutions to LCL problems).

Let be an LCL problem with label set and let be a Borel graph. A labeling of the vertices of by the labels from is a Borel solution to if it fulfills all the constraints of the problem and, additionally, for each label , the set of all vertices labeled is Borel.

Measurable and Baire-measurable solutions to are defined analogously.

By applying Definition 1.11 to specific LCL problems, we obtain, as special cases, Borel -colorings, Borel maximal independent sets, and Borel perfect matchings. In general, a lot of research in descriptive combinatorics can be seen as addressing the following comprehensive question:

Question 1.12 (Descriptive complexity of LCL problems).

Given a class of Borel graphs, which LCL problems admit Borel/measurable/etc. solutions on the graphs in ?

As stated, this question may seem way too general to say anything meaningful about. However, rather surprisingly, some recent work has led to significant progress toward a complete answer! The key to this new work is the discovery of an intimate connection between the answer to Question 1.12 and the complexity of solving the problem on finite graphs using a distributed algorithm. The second part of this paper is dedicated to a survey of this connection.

2. Connections to Distributed Computing

2.1. The model of distributed computation

The area of distributed computing studies computational processes performed by decentralized groups of independent agents. There are several different models of distributed computation that emphasize various aspects of it, such as fault-tolerance, communication bandwidth, etc. For our purposes, the relevant model is called and was formally introduced by Linial in 1992 Lin92 (although some related results had already been established somewhat earlier). For a comprehensive introduction to this model, see the book BE13 by Barenboim and Elkin. We now briefly describe the key elements of this model.

The model operates on an -vertex graph . (Note that, in contrast to the examples discussed in §1.2, the graph here is finite.) We think of as representing a decentralized communication network where each vertex plays the role of a processor and edges represent communication links. The computation proceeds in rounds. During each round, the vertices first perform some local computations and then synchronously broadcast messages to all their neighbors. After a number of rounds, every vertex must generate its own part of the output of the algorithm. For example, if the goal of the algorithm is to solve some LCL problem, then each vertex must eventually decide on its own label. The efficiency of such an algorithm is measured by the number of communication rounds required to produce the output. In particular, there are no restrictions on the complexity of the local computations that the vertices perform during each round (we imagine that the computational resources available to the vertices are infinite) or on the length of the messages that the vertices send to each other.

An important feature of the model is that every vertex of is executing the same algorithm. Therefore, to make this model nontrivial, there must be a way to distinguish the vertices from each other. There are two standard symmetry-breaking approaches, leading to the distinction between deterministic and randomized algorithms:

•: In the deterministic version of the model, each vertex is assigned, as part of its input, an identifier , which is a string of bits. It is guaranteed that the identifiers assigned to different vertices are distinct. Subject to this guarantee, the algorithm must always output a correct solution to the problem, regardless of the specific assignment of the identifiers.
•: In the randomized version of the model, each vertex may generate an arbitrarily long finite sequence of independent uniformly distributed random bits. The algorithm may fail to produce a correct solution to the problem, but the probability of failure must not exceed .

We remark that the randomized version of the model is more computationally powerful than the deterministic one. This is because a deterministic algorithm can be simulated by a randomized one: each vertex can simply generate a random sequence of bits and use it as its identifier—the probability that two identifiers generated in this way coincide is easily seen to be less than .

If and are two vertices whose graph distance in (i.e., the length of the shortest -path) is , then no information from can reach in fewer than communication rounds (this explains the name “” given to the model). Conversely, in rounds every vertex can collect all the data present at the vertices at distance at most from it. Thus, a -round algorithm may be construed simply as a function that, given the structure of the radius- ball around (including all the additional information, such as the assignment of the identifiers or random bit sequences to its vertices), decides on ’s output. In other words, the complexity of a algorithm measures how far in the graph an individual vertex should be allowed to “see” in order to be able to produce its own output. Naturally, if every vertex is allowed to “see” the entire graph, then the model trivializes. Thus, we are interested in algorithms that only allow each vertex to access a relatively small part of the graph.

As an illustration, let us consider a simple example. Suppose that is an -vertex path. Of course, . How fast can a -coloring of be computed by a algorithm? Obviously, rounds suffice (because in that many rounds each vertex will have access to the entire graph). On the other hand, it is fairly easy to show that rounds are not enough:

\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture} \node[outer sep=2,inner sep=0] (x) at (1.5,-0.3) {$x$}; \node[outer sep=2,inner sep=0] (y) at (4,-0.3) {$y$}; \node[outer sep=2,inner sep=0] (z) at (6.7,-0.3) {$z$}; \fill[orange] (0.8,-0.1) rectangle (2.2,0.1); \fill[orange] (3.3,-0.1) rectangle (4.7,0.1); \fill[orange] (6,-0.1) rectangle (7.4,0.1); \draw[decorate,decoration={brace,amplitude=6pt}] (1.5,0.2) -- (4,0.2) node [black,midway,yshift=0.4cm] {even}; \draw[decorate,decoration={brace,amplitude=30pt}] (1.5,0.4) -- (6.7,0.4) node [black,midway,yshift=1.3cm] {odd}; \node(a) at (2.75,-1.5) {same color}; \draw[-{Stealth[length=1.6mm]}] (a) to (x); \draw[-{Stealth[length=1.6mm]}] (a) to (y); \node(b) at (6,-1.5) {different colors}; \draw[-{Stealth[length=1.6mm]}] (b) to (x); \draw[-{Stealth[length=1.6mm]}] (b) to (z); \draw[decorate,decoration={brace,amplitude=6pt},color=red] (3.3,0.1) -- (4.7,0.1); \draw[decorate,decoration={brace,amplitude=6pt},color=red] (6,0.1) -- (7.4,0.1); \node(c) at (6,1.5) {\color{red}{switch!}}; \draw[-{Stealth[length=1.6mm]},color=red] (c) to (4,0.35); \draw[-{Stealth[length=1.6mm]},color=red] (c) to (6.7,0.35); \draw(0,0) -- (8, 0); \filldraw(1.5,0) circle (2pt); \filldraw(4,0) circle (2pt); \filldraw(6.7,0) circle (2pt); \end{tikzpicture} — **Figure 5.**
An illustration for the proof of Proposition 2.1. The three marked intervals are of length .

Proposition 2.1.

There is no -round algorithm (either deterministic or randomized) that finds a -coloring of an -vertex path.

Proof.

We give an argument in the deterministic case (leaving the randomized case as a nice exercise to the reader). Suppose that there is a -round deterministic algorithm for -coloring an -vertex path, where . We can pick three vertices , , such that the distance between and is even, while the distance between and is odd, and, furthermore, all the pairwise distances between , , and exceed (see Fig. 5). Fix any assignment of identifiers and let , , be the colors given by the algorithm to , , and respectively. In a -coloring of a path, the two colors alternate, so we must have . Without loss of generality, say and . Now let us exchange the assignments of the identifiers in the interval of length centered around with those in the interval centered around and let , , be the colors given to , , and by the algorithm with this new assignment of identifiers. Since the output of at a vertex is determined by the identifiers in the interval of length centered around it, we have and . This is a contradiction since the colors of and must be the same, while the colors of and must be different.

2.2. A sample of results

The first hint of the connection between distributed algorithms and descriptive combinatorics can be obtained by comparing some known results in the two areas. For example, consider the following problem:

Fix a natural number . What is the minimum such that a -coloring of a graph of maximum degree can be found efficiently?

Here the maximum degree of a graph is the maximum number of neighbors of a vertex. If we interpret the word “efficiently” to mean “with a fast algorithm,” then we have the following result:

Theorem 2.2 (Goldberg–Plotkin–Shannon BE13, §3).

There exists a deterministic algorithm that finds a -coloring of an -vertex graph of maximum degree in rounds.⁠Footnote⁵

⁵

Here and in the sequel we treat as a constant, meaning that the implied factors in the notation may depend on .

✖

In the statement of Theorem 2.2, is the iterated logarithm of , i.e., the number of times the logarithm function must be iteratively applied to before the result becomes at most . This is an extremely slow-growing function. In particular—and most importantly for us—it is asymptotically of order . In a graph of maximum degree , a vertex can “see” at most other vertices in rounds of the model. Therefore, if , a vertex will definitely not have access to the entire graph. In this sense, rounds is a natural threshold for “truly local” algorithms.

Can we reduce the number of colors to ? An obvious obstacle is that may contain a clique on vertices, all of which would have to receive different colors. On the other hand, the so-called Brooks’s theorem in graph theory asserts that if and is a graph of maximum degree without a -clique, then , i.e., a -coloring of exists. Nevertheless, it turns out that no efficient deterministic algorithm can find such a coloring, even if has no cycles:

Theorem 2.3 (Chang–Kopelowitz–Pettie CKP19).

There is no deterministic algorithm that finds a -coloring of an -vertex acyclic graph of maximum degree in rounds.

In contrast to Theorem 2.3, it is possible to reduce the number of colors using a randomized algorithm:

Theorem 2.4 (Ghaffari–Hirvonen–Kuhn–Maus Gha+18).

There exists a randomized algorithm that finds a -coloring of an -vertex graph of maximum degree in rounds, provided that and has no -cliques.

Now let us turn to the descriptive combinatorics side of the picture. How many colors do we need for a Borel coloring of a Borel graph of maximum degree ? The first result parallels Theorem 2.2:

Theorem 2.5 (Kechris–Solecki–Todorcevic KST99).

If is a Borel graph of maximum degree , then .

On the other hand, reducing the number of colors to is impossible even for graphs with no cycles:

Theorem 2.6 (Marks Mar16).

For any , there is an acyclic Borel graph of maximum degree such that .

However, if we only want a measurable or Baire-measurable coloring, then colors suffice:

Theorem 2.7 (Conley–Marks–Tucker-Drob CMTD16).

If is a Borel graph of maximum degree , then both and are at most , provided that and has no -cliques.

Theorems 2.2–2.4 and 2.5–2.7 clearly mirror each other. It seems that deterministic algorithms somehow correspond to Borel constructions, while randomized algorithms—to measurable and Baire-measurable ones. It turns out that, indeed, this is not a coincidence and one can prove some general connections of this form.

2.3. Efficient distributed algorithms yield descriptive results

In Ber20, the following general result is established:

Theorem 2.8 (Bernshteyn Ber20).

Let be a class of finite graphs closed under adding isolated vertices and let be a Borel graph of bounded degree all of whose finite induced subgraphs are in . Fix an LCL problem .

(a): If can be solved on -vertex graphs from by an -round deterministic algorithm, then admits a Borel solution to .
(b): If can be solved on -vertex graphs from by an -round randomized algorithm, then admits both a measurable and a Baire-measurable solution to .

In short, efficient deterministic algorithms yield Borel solutions, while efficient randomized algorithms yield measurable and Baire-measurable solutions. The following implications are special cases of Theorem 2.8:

In addition to its theoretical significance, Theorem 2.8 has a number of applications to specific problems. For instance, if we already know an efficient algorithm for some LCL problem, we can then use it to derive corresponding results in descriptive combinatorics. Several instances of this are given in Ber20. For example, Theorem 2.8, together with known distributed algorithms due to Chung, Pettie, and Su, is used in Ber20, §3.B to obtain asymptotically optimal upper bounds on the measurable chromatic number of Borel graphs without short cycles. In the opposite direction, impossibility results in descriptive combinatorics yield lower bounds on the running time of distributed algorithms. In a recent paper Bra+21, Brandt et al. used this idea to derive new lower bounds for distributed algorithms via a version of the so-called determinacy method that was originally developed by Marks to prove Theorem 2.6.

Let us now say a few words about the proof of Theorem 2.8. The proof of part (a) is actually not too difficult:

Proof of Theorem 2.8 (a) (sketch).

Suppose that is solved on -vertex graphs from by an -round deterministic algorithm . The idea is to simulate the execution of the algorithm on the given Borel graph by “pretending” that is finite.

Let be the maximum degree of (which is finite by assumption). Take a very large natural number and let be the running time of on -vertex graphs. Let be the set of all bit strings of length .

Claim.

There is a Borel labeling such that whenever and the graph distance between and is at most .

Proof of the claim.

Define an auxiliary graph with the same vertex set as where distinct vertices and are adjacent if and only if their graph distance in is at most . It is not hard to see that the graph is Borel. Since , we just need to find a Borel -coloring of . The maximum degree of is at most , which is less than since . Therefore, by Theorem 2.5, has a Borel -coloring, as desired.

We treat the bit string as a substitute for an identifier of . By construction, although there may be other vertices with the same identifier, they are far from in . In particular, if explores its radius- ball in , it will only see distinct identifiers. Since the algorithm applied on -vertex graphs doesn’t allow a vertex to see outside its radius- ball, we may run on for rounds (as if had vertices) using the function in place of the identifier assignment. The output labeling will then be a solution to , and, since is a Borel assignment, one can show that the output will also be Borel.

Arguments similar to the above proof sketch are common in distributed computing theory. For example, an analogous approach was used by Chang, Kopelowitz, and Pettie in CKP19 to prove that no LCL problem can have deterministic complexity in the interval between and .

The proof of Theorem 2.8 (b) is quite a bit more involved, and we won’t venture to explain it here. Let us just mention one interesting feature of it. The key tool used to prove Theorem 2.8 (b) is the measurable version of the Local Lemma, which is also established in Ber20. The Local Lemma is a well-known powerful tool in probabilistic combinatorics that is often used to show that a given LCL problem has a solution. The measurable version of this lemma additionally guarantees that the solution is measurable. And here we encounter another point of interaction between descriptive combinatorics and distributed computing: the proof of the measurable Local Lemma in Ber20 crucially relies on the efficient randomized algorithm for the Local Lemma developed by Fischer and Ghaffari FG17.

2.4. Perfect harmony between the two worlds: the case of continuous solutions

Theorem 2.8 allows one to turn efficient distributed algorithms into results in descriptive combinatorics. It is natural to ask if some sort of converse to Theorem 2.8 also holds, i.e., if we can obtain efficient algorithms from descriptive results. While in general this question remains widely open, there is one salient case in which an exact correspondence between the descriptive and the distributed worlds has been established. This case concerns continuous solutions to LCL problems.

Definition 2.9 (Continuous solutions to LCL problems).

Let be an LCL problem with label set and let be a Borel graph. A labeling of the vertices of by the labels from is a continuous solution to if it fulfills all the constraints of the problem and, additionally, for each label , the set of all vertices labeled is open.

Note that the set of all vertices receiving a given label in a continuous solution must be both open and closed—clopen, in short. Hence, it is only interesting to study continuous solutions to LCL problems on graphs such that the space has many clopen subsets. Specifically, we focus on zero-dimensional spaces, i.e., spaces whose topology is generated by clopen sets. Even though familiar spaces such as do not have any nontrivial clopen sets, zero-dimensional spaces are rather common. For example, the space is zero-dimensional. Another example of a zero-dimensional Polish space is . Also, given any Polish space , it is possible to refine the topology on to make it zero-dimensional without changing the Borel sets.

The main result we want to describe in this section says, roughly, that an LCL problem can be solved continuously if and only if it can be solved via an efficient deterministic distributed algorithm. To make this statement precise, we need to specify a particular graph or a class of graphs on which we attempt to solve the problem. While it is possible to make the statement somewhat more general, it will be convenient to consider classes of graphs that look very symmetric, such as regular trees or multi-dimensional grids. The following general definition includes these examples as special cases:

Definition 2.10 (-graphs).

Let be a group generated by a fixed finite symmetric⁠Footnote⁶ set .

⁶

A subset is symmetric if for all , as well.

✖

The Cayley graph of corresponding to is the graph with vertex set that includes, for every and , an edge from to . To be precise, we view as an edge-labeled graph, where the label of the edge is , but the reader may safely ignore this technicality.

A -graph is a graph (with edge labels) in which every connected component is isomorphic to .

For instance, the Cayley graph of the additive group with generating set is a bi-infinite path. Similarly, the Cayley graph of with respect to an appropriately chosen generating set is an infinite -dimensional square grid. On the other hand, the Cayley graph of the free group of rank is an infinite -regular tree.

Given any group with a finite symmetric generating set , there is a canonical way to define a particular -graph on a zero-dimensional vertex set, called the shift graph of . The shift graph is an extremely important example that plays a central role not only in descriptive combinatorics, but also in such areas as ergodic theory and topological dynamics. Before giving the general definition, it will be instructive to consider the special case , with generating set . For a subset and an element , we write

The set can naturally be seen as a “shift” of by (hence the term “shift graph”). We say that a set is aperiodic if for every non-zero . This is equivalent to saying that the characteristic function of is not periodic, i.e., there is no integer such that for all . The shift graph is the graph whose vertices are the aperiodic subsets of and in which every aperiodic set has two neighbors: and . Thus, the connected component of in is the following infinite path:

(Since is aperiodic, all the vertices on this path are distinct.) In particular, every component of is isomorphic to the Cayley graph of , so it is indeed a -graph.

Now for the general construction. Let be a group with a finite symmetric generating set . For a set and an element , let

A subset is aperiodic if for all non-identity elements . The vertices of are the aperiodic subsets . For every and , includes an edge from to labeled . By construction, for each , the mapping establishes an isomorphism between and the connected component of in (the fact that is aperiodic guarantees that this mapping is injective). Thus, is a -graph. Furthermore, by identifying each subset of with its characteristic function, we can view as a subset of , which makes a Borel graph on a zero-dimensional Polish space.

\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \begin{tikzpicture} \node(detlittle) at (4,2.5) {$\mathsf{DetLOCAL}(o(\log n))$}; \node(cont) at (4,0) {$\mathsf{CONTINUOUS}$}; \node(Borel) at (7,0) {$\mathsf{BOREL}$}; \node(meas) at (10,0) {$\mathsf{MEASURABLE}$}; \node(Baire) at (14,0) {$\mathsf{BAIRE\ MEASURABLE}$}; \node(Detbig) at (14,2.5) {$\mathsf{DetLOCAL}(O(\log n))$}; \node(Ranlittle) at (8.5,2.5) {$\mathsf{RandLOCAL}(o(\log n))$}; \draw(detlittle) -- node [midway,above,sloped] {$=$} (cont); \draw(cont) -- node [midway,above,sloped] {$\subset$} (Borel); \draw(Borel) -- node [midway,above,sloped] {$\subset$} (meas); \draw(meas) -- node [midway,above,sloped] {$\subset$} (Baire); \draw(Detbig) -- node [midway,above,sloped] {$=$} (Baire); \draw(Borel) -- node [midway,above,sloped] {$\not\subseteq$} node [midway,below,sloped] {$\not\supseteq$} (Ranlittle); \draw(Ranlittle) -- node [midway,above,sloped] {$\subset$} (meas); \end{tikzpicture} — **Figure 6.**
Known equalities and inequalities between classes of LCL problems on -regular trees. Here and denote the classes of LCL problems solvable by a -round determinisitic (respectively, randomized) algorithm. The symbol “” indicates strict inclusion.

We now have the following result, obtained independently by Bernshteyn Ber21 and Seward (unpublished):

Theorem 2.11 (Bernshteyn Ber21/Seward).

Let be a group generated by a finite symmetric set . For every LCL problem , the following statements are equivalent:

(i): admits a continuous solution to .
(ii): There is an -round deterministic algorithm that solves on -vertex subgraphs of the Cayley graph .

The implication (ii) (i) of Theorem 2.11 is proved in a manner analogous to the proof of Theorem 2.8 (a) sketched in §2.3. The implication (i) (ii) is significantly more difficult and involves a careful analysis of the structure of continuous functions on . Interestingly, this analysis again relies, among other things, on tools from computer science such as the so-called method of conditional probabilities. It also builds on earlier work of Gao–Jackson–Seward GJS16, Seward–Tucker-Drob STD16, and Elek Ele18.

Theorem 2.11 explains the analogies between known results in continuous combinatorics and distributed computing. For example, in the case , the continuous combinatorics of the shift graph have been studied in detail by Gao et al. Gao+18. Similarly, Brandt et al. Bra+17 investigated the LCL problems that can be solved on -dimensional square grids by efficient distributed algorithms. The results obtained by these two groups of researchers—independently and using different methods—perfectly parallel each other, exactly as Theorem 2.11 predicts.

2.5. Baire-measurable solutions to LCL problems on trees

The last result we would like to mention is a very recent theorem that establishes a perfect equivalence between yet another pair of facets of the worlds of descriptive combinatorics and distributed computing:

Theorem 2.12 (Brandt et al. Bra+21).

Fix . For every LCL problem , the following statements are equivalent:

(i): Every Borel graph in which every component is an infinite -regular tree admits a Baire-measurable solution to .
(ii): There is an -round deterministic algorithm that solves on -vertex trees of maximum degree .

This is a truly inspiring result, which highlights how deep the connections between descriptive combinatorics and distributed computing lie. What makes it different from Theorems 2.8 and 2.11 is that the complexity bound on the distributed algorithm here is , not . Note that in rounds, a vertex of an -vertex tree of maximum degree is guaranteed to discover a vertex of degree less than —and hence this algorithm cannot be directly used on a -regular graph . Unsurprisingly, the proof of the implication (ii) (i) in Theorem 2.12 is considerably more difficult than the proof of Theorem 2.8 (a). In fact, both implications in Theorem 2.12 are highly nontrivial. The high-level idea of the argument is to isolate a certain combinatorial property of LCL problems and show that this property is equivalent to each of (i), (ii) separately (the former equivalence was established by Bernshteyn (unpublished), the latter—by Brandt et al. Bra+21).

2.6. Final thoughts and open problems

The intimate interactions between descriptive combinatorics and distributed computing have only very recently been discovered, and most questions about them remain open. Perhaps the central open problem is this:

Question 2.13.

Is there a version of Theorem 2.11 for Borel solutions to LCL problems? In other words, is there a precise algorithmic counterpart to the notion of a Borel solution (similar to how -round algorithms are equivalent to continuous solutions)?

The same questions for measurable and (except for graphs with no cycles) Baire-measurable solutions are also open. Even on graphs as simple as -regular trees, many open questions remain. The diagram in Fig. 6 summarizes our knowledge regarding the complexity of LCL problems on -regular trees. In particular, we know that there are LCL problems that admit Borel solutions but cannot be solved by an -round randomized algorithm, but there are also LCL problems that can be solved by such an algorithm but do not admit Borel solutions. Therefore, a new algorithmic framework seems to be needed to capture the notion of a Borel solution precisely.

On classes of graphs other than trees, our knowledge is even scanter. For example, the following basic question is open:

Question 2.14.

Fix an integer . Does there exist an LCL problem such that the shift graph admits a measurable solution to but not a Borel one? What about Baire-measurable solutions?

Another area with close ties to distributed computing and descriptive combinatorics is the study of so-called finitary factors of i.i.d. processes. This is a class of particularly well-behaved random processes on networks of great interest in probability theory. Finitary factors of i.i.d. processes can be used to create a complexity hierarchy of LCL problems, and Grebík and Rozhoň recently discovered that in many cases this hierarchy parallels the one arising from considering descriptive and complexity. For more details on this exciting connection, see the papers GR21 by Grebík and Rozhoň and Bra+21 by Brandt et al.

In spite of all the open problems mentioned above, it really does appear that the combination of tools from descriptive set theory and distributed computing is the key to attaining a deep understanding of the behavior of LCL problems under various “effectiveness” requirements. Exciting discoveries await, and they will surely enrich descriptive combinatorics and distributed computing alike.

Acknowledgment

I am very grateful to the anonymous referees for carefully reading this article and providing many helpful suggestions.

References

[BE13]: L. Barenboim and M. Elkin, Distributed graph coloring: Fundamentals and recent developments, Synthesis Lectures on Distributed Computing Theory, vol. 11, Morgan & Claypool Publishers, Williston, VT, 2013. MR3184899Show rawAMSref\bib{BE}{book}{ author={Barenboim, Leonid}, author={Elkin, Michael}, title={Distributed graph coloring}, series={Synthesis Lectures on Distributed Computing Theory}, volume={11}, subtitle={Fundamentals and recent developments}, publisher={Morgan \& Claypool Publishers, Williston, VT}, date={2013}, pages={xiv+157}, isbn={978-1-62705-018-0}, review={\MR {3184899}}, }Close amsref.^✖
[Ber20]: A. Bernshteyn, Distributed algorithms, the Lovász local lemma, and descriptive combinatorics, 2020, https://arxiv.org/abs/2004.04905.
[Ber21]: A. Bernshteyn, Probabilistic constructions in continuous combinatorics and a bridge to distributed algorithms, 2021, https://arxiv.org/abs/2102.08797.
[Bra+21]: S. Brandt, Y.-J. Chang, J. Grebík, C. Grunau, V. Rozhoň, and Z. Vidnyánszky, Local problems on trees from the perspectives of distributed algorithms, finitary factors, and descriptive combinatorics, 2021, https://arxiv.org/abs/2106.02066.
[Bra+17]: S. Brandt, J. Hirvonen, J. H. Korhonen, T. Lempiäinen, P. R. J. Östergård, C. Purcell, J. Rybicki, J. Suomela, and P. Uznański, problems on grids, ACM Symposium on Principles of Distributed Computing (PODC), 2017, 101–110. Full version: https://arxiv.org/abs/1702.05456.
[CKP19]: Y.-J. Chang, T. Kopelowitz, and S. Pettie, An exponential separation between randomized and deterministic complexity in the local model, SIAM J. Comput. 48 (2019), no. 1, 122–143, DOI 10.1137/17M1117537. MR3904438Show rawAMSref\bib{CKP}{article}{ author={Chang, Yi-Jun}, author={Kopelowitz, Tsvi}, author={Pettie, Seth}, title={An exponential separation between randomized and deterministic complexity in the local model}, journal={SIAM J. Comput.}, volume={48}, date={2019}, number={1}, pages={122--143}, issn={0097-5397}, review={\MR {3904438}}, doi={10.1137/17M1117537}, }Close amsref.^✖
[CMTD16]: C. T. Conley, A. S. Marks, and R. D. Tucker-Drob, Brooks’ theorem for measurable colorings, Forum Math. Sigma 4 (2016), Paper No. e16, 23, DOI 10.1017/fms.2016.14. MR3514902Show rawAMSref\bib{CMTD}{article}{ author={Conley, Clinton T.}, author={Marks, Andrew S.}, author={Tucker-Drob, Robin D.}, title={Brooks' theorem for measurable colorings}, journal={Forum Math. Sigma}, volume={4}, date={2016}, pages={Paper No. e16, 23}, review={\MR {3514902}}, doi={10.1017/fms.2016.14}, }Close amsref.^✖
[Ele18]: G. Elek, Qualitative graph limit theory. Cantor dynamical systems and constant-time distributed algorithms, 2018, https://arxiv.org/abs/1812.07511.
[FG17]: M. Fischer and M. Ghaffari, Sublogarithmic distributed algorithms for Lovász local lemma, and the complexity hierarchy, 31 International Symposium on Distributed Computing, LIPIcs. Leibniz Int. Proc. Inform., vol. 91, Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2017, pp. Art. No. 18, 16. MR3746281Show rawAMSref\bib{FG}{article}{ author={Fischer, Manuela}, author={Ghaffari, Mohsen}, title={Sublogarithmic distributed algorithms for Lov\'{a}sz local lemma, and the complexity hierarchy}, conference={ title={31 International Symposium on Distributed Computing}, }, book={ series={LIPIcs. Leibniz Int. Proc. Inform.}, volume={91}, publisher={Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern}, }, date={2017}, pages={Art. No. 18, 16}, review={\MR {3746281}}, }Close amsref.^✖
[Gao+18]: S. Gao, S. Jackson, E. Krohne, and B. Seward, Continuous combinatorics of abelian group actions, 2018, https://arxiv.org/abs/1803.03872.
[GJS16]: S. Gao, S. Jackson, and B. Seward, Group colorings and Bernoulli subflows, Mem. Amer. Math. Soc. 241 (2016), no. 1141, vi+241, DOI 10.1090/memo/1141. MR3478758Show rawAMSref\bib{GJS2}{article}{ author={Gao, Su}, author={Jackson, Steve}, author={Seward, Brandon}, title={Group colorings and Bernoulli subflows}, journal={Mem. Amer. Math. Soc.}, volume={241}, date={2016}, number={1141}, pages={vi+241}, issn={0065-9266}, isbn={978-1-4704-1847-2}, isbn={978-1-4704-2875-4}, review={\MR {3478758}}, doi={10.1090/memo/1141}, }Close amsref.^✖
[Gha+18]: M. Ghaffari, J. Hirvonen, F. Kuhn, and Y. Maus, Improved distributed -coloring, ACM Symposium on Principles of Distributed Computing (PODC), 2018, 427–436. Full version: https://arxiv.org/abs/1803.03248.
[GR21]: J. Grebík and V. Rozhoň, Local problems on grids from the perspective of distributed algorithms, finitary factors, and descriptive combinatorics, 2021, https://arxiv.org/abs/2103.08394.
[KM20]: A. S. Kechris and A. S. Marks, Descriptive graph combinatorics, 2020, http://www.math.caltech.edu/~kechris/papers/combinatorics20book.pdf.
[KST99]: A. S. Kechris, S. Solecki, and S. Todorcevic, Borel chromatic numbers, Adv. Math. 141 (1999), no. 1, 1–44, DOI 10.1006/aima.1998.1771. MR1667145Show rawAMSref\bib{KST}{article}{ author={Kechris, A. S.}, author={Solecki, S.}, author={Todorcevic, S.}, title={Borel chromatic numbers}, journal={Adv. Math.}, volume={141}, date={1999}, number={1}, pages={1--44}, issn={0001-8708}, review={\MR {1667145}}, doi={10.1006/aima.1998.1771}, }Close amsref.^✖
[Lin92]: N. Linial, Locality in distributed graph algorithms, SIAM J. Comput. 21 (1992), no. 1, 193–201, DOI 10.1137/0221015. MR1148825Show rawAMSref\bib{Linial}{article}{ author={Linial, Nathan}, title={Locality in distributed graph algorithms}, journal={SIAM J. Comput.}, volume={21}, date={1992}, number={1}, pages={193--201}, issn={0097-5397}, review={\MR {1148825}}, doi={10.1137/0221015}, }Close amsref.^✖
[Mar16]: A. S. Marks, A determinacy approach to Borel combinatorics, J. Amer. Math. Soc. 29 (2016), no. 2, 579–600, DOI 10.1090/jams/836. MR3454384Show rawAMSref\bib{Marks}{article}{ author={Marks, Andrew S.}, title={A determinacy approach to Borel combinatorics}, journal={J. Amer. Math. Soc.}, volume={29}, date={2016}, number={2}, pages={579--600}, issn={0894-0347}, review={\MR {3454384}}, doi={10.1090/jams/836}, }Close amsref.^✖
[Oxt80]: J. C. Oxtoby, Measure and category, 2nd ed., Graduate Texts in Mathematics, vol. 2, Springer-Verlag, New York-Berlin, 1980. A survey of the analogies between topological and measure spaces. MR584443Show rawAMSref\bib{Oxtoby}{book}{ author={Oxtoby, John C.}, title={Measure and category}, series={Graduate Texts in Mathematics}, volume={2}, edition={2}, note={A survey of the analogies between topological and measure spaces}, publisher={Springer-Verlag, New York-Berlin}, date={1980}, pages={x+106}, isbn={0-387-90508-1}, review={\MR {584443}}, }Close amsref.^✖
[Pik21]: O. Pikhurko, Borel combinatorics of locally finite graphs, Surveys in combinatorics 2021, London Math. Soc. Lecture Note Ser., vol. 470, Cambridge Univ. Press, Cambridge, 2021, pp. 267–319. MR4273433Show rawAMSref\bib{Pikh_survey}{article}{ author={Pikhurko, Oleg}, title={Borel combinatorics of locally finite graphs}, conference={ title={Surveys in combinatorics 2021}, }, book={ series={London Math. Soc. Lecture Note Ser.}, volume={470}, publisher={Cambridge Univ. Press, Cambridge}, }, date={2021}, pages={267--319}, review={\MR {4273433}}, }Close amsref.^✖
[STD16]: B. Seward and R. D. Tucker-Drob, Borel structurability on the 2-shift of a countable group, Ann. Pure Appl. Logic 167 (2016), no. 1, 1–21, DOI 10.1016/j.apal.2015.07.005. MR3413491Show rawAMSref\bib{STD}{article}{ author={Seward, Brandon}, author={Tucker-Drob, Robin D.}, title={Borel structurability on the 2-shift of a countable group}, journal={Ann. Pure Appl. Logic}, volume={167}, date={2016}, number={1}, pages={1--21}, issn={0168-0072}, review={\MR {3413491}}, doi={10.1016/j.apal.2015.07.005}, }Close amsref.^✖

Graphic without alt text — Anton Bernshteyn is a professor of mathematics at the Georgia Institute of Technology. His email address is bahtoh@gatech.edu.

Article DOI: 10.1090/noti2539

This work is partially supported by the NSF grant DMS-2045412.

Credits

Opener is courtesy of enot-poloskun via Getty.

Figures 1–5 and author photo are courtesy of Anton Bernshteyn.