Notices of the American Mathematical Society

Welcome to the current issue of the Notices of the American Mathematical Society.
With support from AMS membership, we are pleased to share the journal with the global mathematical community.


The Nearly Perfect Prediction Theorem

Joel David Hamkins

Like stock-market traders trying to guess today’s opening price and movements, we aim to predict the current and future values of an unknown function based on the observed history of previous values. The true function , not necessarily continuous, will be revealed only gradually.

Figure 1.
\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \setlength{\extrarowheight}{0.0pt}\begin{tikzpicture}[yscale=.6,point/.style={draw,circle,fill,scale=.25},openpoint/.style={very thick,draw,circle,scale=3,fill=Yellow!30}] \draw(5.03,1.45) node[semithick,Red,openpoint] (ft) {}; \draw[thin,domain=.5:5,samples=600,variable=\x] plot ({\x},{.5*\x+sin(\x r)+1.3*rand*exp(-abs(\x-5))}) (2.75,{1+sin(2.75 r)}) node[below=3pt] {$f\restrict t$} (5.02,1.45) node[semithick,Red,circle,draw,scale=3] {}; \draw[densely dotted,semithick,-,red] (ft) + (0,-1.5) node[below] {$t$} -- (ft) node[right=12pt,red,scale=.9] {$f(t)=\text{?}$}; \end{tikzpicture}

The nearly perfect prediction theorem, a remarkable result of Christopher Hardin and Alan Taylor HT08HT13 shows that there is a nearly perfect prediction strategy, predicting the current and immediate future values of every function almost always correctly.

For predicting the present, we define a prediction strategy on the set of possible histories , each a real-valued function on domain for some , assigning them each to a corresponding present-value prediction . For every function , these are almost always correct:

Specifically, the predictions will be correct except on a countable nowhere dense set of exceptional points. What is more, every real number will be followed by an interval on which the predictions are correct, and indeed there will be arbitrarily long intervals on which the predictions are perfectly correct. It is incredible.

To define the prediction strategy, simply fix a well order on the set of all functions , and let predict in accordance with the -least function standing in agreement with the observed data. That is, given historical information , let , where is the -least function compatible with the prior data, meaning .

To see that these predictions are almost always correct, suppose that the true function is . As time passes, more of the function is revealed and so the class of compatible functions becomes more restrictive. Consequently implies , where these are the -least functions in agreement with the data at those times. Further, if the prediction at time is incorrect, then would be incompatible with the data up to , and so . So the prediction functions climb strictly higher in the hierarchy at every exceptional point, and since this hierarchy is well ordered, the exceptional points themselves must therefore be well ordered in the real numbers.

From this, all the amazing consequences follow. Above every real number there is a next exceptional point, if any, and so there is an open interval on which all predictions are correct. The exceptional points are thus a countable nowhere dense set, countable since we can associate each exceptional with a rational number in this interval. Since there must also be an absolutely smallest exceptional point , if any, all predictions made at times are correct, and so there are arbitrarily long intervals on which all the predictions are correct. The real line is thus exhausted by a countable well-ordered sequence of intervals, which abut each other and their limits, with the predictions fully correct in the interior of each interval. Amazing!

Next, to predict the immediate future values of the function, we extend our earlier idea by defining a continuation prediction strategy , which maps each history to its predicted continuation , the -least function standing in agreement with the observed data .

Figure 2.
\renewcommand{\arraystretch}{1} \setlength{\unitlength}{1.0pt} \setlength{\extrarowheight}{0.0pt}\begin{tikzpicture}[yscale=.55,point/.style={draw,circle,fill,scale=.25},openpoint/.style={very thick,draw,circle,scale=1.2,fill=Yellow}] \draw(5,1.5) node[semithick,Red,openpoint] (ft) {}; \draw[densely dotted,semithick,-,red] (ft) + (0,-1.3) node[below] {$t$} -- (ft); \draw[Sepia!10!Red,semithick,domain=5:8,samples=200,variable=\x] (ft) plot ({\x},{.95+.5*\x+sin(\x r)-exp((\x-5)/2)-.4*rand*exp(-abs(\x-6.5))}) (6.75,.9) node[Sepia!50!Red,align=center,scale=.8] {$F(f\restrict t)=f^*$\\ \small predicted\\ \small continuation}; \draw[semithick,domain=.5:5,samples=500,variable=\x] plot ({\x},{.5*\x+sin(\x r)+.75*rand*exp(-abs(\x-4))}) -- (5,1.5) {} node[point,Sepia!30!Red] {} (2.5,{1+sin(2.5 r)}) node[below=4pt,align=center,scale=.8] {$f\restrict t$\\ \small observed data}; \draw(5,1.5) node[very thick,Red,draw,circle,scale=1.2] {}; \end{tikzpicture}

Incredible as it may seem, these predicted continuations will almost always agree with the true function in an interval extending strictly beyond :

We observed earlier in effect that if , then , since any function compatible with the later data is also compatible with the earlier data . As time passes, therefore, we are climbing in the hierarchy, and we climb strictly higher whenever the prediction at is not fully correct between and . So again the continuation prediction exceptional points will be well ordered in the real numbers, and consequently again they will form a countable nowhere dense set. So the continuation predictions are almost always correct. And again, following every point will be an interval on which all predicted continuations are correct for the duration, and there will again be an absolutely least exceptional point , meaning that the predicted continuations are fully correct for the entire interval , which includes arbitrarily long intervals on which the predictions are correct.

I find it interesting to notice a certain robust stability—whenever a correct continuation prediction is made, then the very same prediction will continue to be made for the duration of that correctness, since it will continue as the -least function in agreement with the observed data as long as it remains correct.

The theorem is interesting even just for continuous functions. Although predicting the present value of a continuous function is trivial, of course, since it is just the limit of as from below, the nearly perfect prediction theorem enables us to predict how a continuous function will continue into the immediate future, which is not trivial. We can ensure that strategies and predict in accordance with continuity when possible simply by placing the continuous functions at the bottom of the hierarchy.

Figure 3.

Stock traders in the platonic realm.

Graphic without alt text

Meanwhile, the reader will have noticed that the proof of the nearly perfect prediction theorem is not constructive, since it relies on the axiom of choice via the well-order theorem. The proof does not provide a useful concrete prediction strategy—it reveals little about the actual predictions to be made. I regret to say that you had best keep your day job.

Rather, we proved the theorem as a pure-existence result, showing that there is a nearly perfect prediction strategy as a matter of abstract mathematical ontology. I am confident that the iridescent mathematical stock traders of the platonic realm are truly making bank.

References

[HT08]
Christopher S. Hardin and Alan D. Taylor, A peculiar connection between the axiom of choice and predicting the future, Amer. Math. Monthly 115 (2008), no. 2, 91–96. MR2384262,
Show rawAMSref \bib{HardinTaylor2008}{article}{ author={Hardin, Christopher~S.}, author={Taylor, Alan~D.}, title={A peculiar connection between the axiom of choice and predicting the future}, date={2008}, issn={0002-9890,1930-0972}, journal={Amer. Math. Monthly}, volume={115}, number={2}, pages={91\ndash 96}, review={\MR {2384262}}, }
[HT13]
Christopher S. Hardin and Alan D. Taylor, The mathematics of coordinated inference: A study of generalized hat problems, Developments in Mathematics, vol. 33, Springer, Cham, 2013, https://doi.org/10.1007/978-3-319-01333-6. MR3100500,
Show rawAMSref \bib{HardinTaylor2013:The-mathematics-of-coordinated-inference}{book}{ author={Hardin, Christopher~S.}, author={Taylor, Alan~D.}, title={The mathematics of coordinated inference}, series={Developments in Mathematics}, publisher={Springer, Cham}, date={2013}, volume={33}, isbn={978-3-319-01332-9; 978-3-319-01333-6}, url={https://doi.org/10.1007/978-3-319-01333-6}, subtitle={A study of generalized hat problems}, review={\MR {3100500}}, }

Credits

Figure 1 and Figure 2 were created by the author using TikZ.

Figure 3 was created by the author via DALL·E.

Author photo is courtesy of Joel David Hamkins.