From 032584fe152ca1e91d0d8152e063b447211ff636 Mon Sep 17 00:00:00 2001 From: Jaron Kent-Dobias Date: Wed, 4 Sep 2024 01:23:01 +0200 Subject: Lots of writing and work. --- topology.tex | 357 ++++++++++++++++++++++++++++++++++++++++++++++++----------- 1 file changed, 291 insertions(+), 66 deletions(-) (limited to 'topology.tex') diff --git a/topology.tex b/topology.tex index 65c717e..884796c 100644 --- a/topology.tex +++ b/topology.tex @@ -176,11 +176,11 @@ can be written as \;\forall\;k=1,\ldots,M\big\} \end{equation} Because the constraints are all smooth functions, $\Omega$ is almost always a manifold without singular points.\footnote{The conditions for a singular point are that -$0=\frac\partial{\partial\mathbf x}V_k(\mathbf x)$ for all $k$. This is -equivalent to asking that the constraints $V_k$ all have a stationary point at -the same place. When the $V_k$ are independent and random, this is vanishingly -unlikely, requiring $NM+1$ independent equations to be simultaneously satisfied. -This means that different connected components of the set of solutions do not + $0=\frac\partial{\partial\mathbf x}V_k(\mathbf x)$ for all $k$. This is + equivalent to asking that the constraints $V_k$ all have a stationary point at + the same place. When the $V_k$ are independent and random, this is vanishingly + unlikely, requiring $NM+1$ independent equations to be simultaneously satisfied. + This means that different connected components of the set of solutions do not intersect, nor are there self-intersections, without extraordinary fine-tuning.} We study the topology of the manifold $\Omega$ by two related means: its average Euler characteristic, and the average number of stationary points of a @@ -190,9 +190,7 @@ direct sum of the Betti numbers of $\Omega$. We find that for the varied cases we study, these two always coincide at the largest exponential order in $N$, putting strong constraints on the resulting topology and geometry. -\section{Methods} - -\subsection{The average Euler characteristic} +\section{The average Euler characteristic} The Euler characteristic $\chi$ of a manifold is a topological invariant \cite{Hatcher_2002_Algebraic}. It is perhaps most familiar in the context of connected compact orientable surfaces, where it @@ -204,14 +202,14 @@ by an alternating sum over the number of cells of increasing dimension, which for 2-manifolds corresponds to the number of vertices, minus the number of edges, plus the number of faces. -Morse theory offers another way to compute the Euler characteristic using the -statistics of stationary points of a function $H:\Omega\to\mathbb R$ \cite{Audin_2014_Morse}. For +Morse theory offers another way to compute the Euler characteristic of a manifold $\Omega$ using the +statistics of stationary points in a function $H:\Omega\to\mathbb R$ \cite{Audin_2014_Morse}. For functions $H$ without any symmetries with respect to the manifold, the surfaces of gradient flow between adjacent stationary points form a complex. The alternating sum over cells to compute $\chi$ becomes an alternating sum over the count of stationary points of $H$ with increasing index, or \begin{equation} - \chi=\sum_{i=0}^N(-1)^i\mathcal N_H(\text{index}=i) + \chi(\Omega)=\sum_{i=0}^N(-1)^i\mathcal N_H(\text{index}=i) \end{equation} Conveniently, we can express this abstract sum as an integral over the manifold using a small variation on the Kac--Rice formula for counting stationary @@ -227,7 +225,7 @@ of the determinant is a nuisance that one must take pains to preserve We need to choose a function $H$ for our calculation. Because $\chi$ is a topological invariant, any choice will work so long as it does not share some -symmetry with the underlying manifold, i.e., that it $H$ satisfies the Smale condition. Because our manifold of random +symmetry with the underlying manifold, i.e., that $H$ satisfies the Smale condition. Because our manifold of random constraints has no symmetries, we can take a simple height function $H(\mathbf x)=\mathbf x_0\cdot\mathbf x$ for some $\mathbf x_0\in\mathbb R^N$ with $\|\mathbf x_0\|^2=N$. $H$ is a height function because when $\mathbf x_0$ is @@ -255,10 +253,12 @@ can be applied to calculate it. To evaluate the average of $\chi$ over the constraints, we first translate the $\delta$ functions and determinant to integral form, with \begin{align} + \label{eq:delta.exp} \delta\big(\partial L(\mathbf x,\pmb\omega)\big) &=\int\frac{d\hat{\mathbf x}}{(2\pi)^N}\frac{d\hat{\pmb\omega}}{(2\pi)^{M+1}} e^{i[\hat{\mathbf x},\hat{\pmb\omega}]\cdot\partial L(\mathbf x,\pmb\omega)} \\ + \label{eq:det.exp} \det\partial\partial L(\mathbf x,\pmb\omega) &=\int d\bar{\pmb\eta}\,d\pmb\eta\,d\bar{\pmb\gamma}\,d\pmb\gamma\, e^{-[\bar{\pmb\eta},\bar{\pmb\gamma}]^T\partial\partial L(\mathbf x,\pmb\omega)[\pmb\eta,\pmb\gamma]} @@ -272,11 +272,11 @@ therefore be averaged over, and the resulting expression treated with standard methods. Details of this calculation can be found in Appendix~\ref{sec:euler}. The result is the reduction of the average Euler characteristic to an expression of the form -\begin{equation} +\begin{equation} \label{eq:pre-saddle.characteristic} \overline{\chi(\Omega)} - =\int dR\,dD\,dm\,d\hat m\,g(R,D,m,\hat m)\,e^{N\mathcal S_\Omega(R,D,m,\hat m)} + =\left(\frac N{2\pi}\right)^2\int dR\,dD\,dm\,d\hat m\,g(R,D,m,\hat m)\,e^{N\mathcal S_\Omega(R,D,m,\hat m)} \end{equation} -where $g$ is a prefactor subexponential in $N$, and $\mathcal S_\Omega$ is an effective action defined by +where $g$ is a prefactor of $o(N^0)$, and $\mathcal S_\Omega$ is an effective action defined by \begin{equation} \begin{aligned} \mathcal S_\Omega(R,D,m,\hat m\mid\alpha,V_0) @@ -300,7 +300,7 @@ The remaining order parameters defined by the scalar products \hat m=-i\frac1N\hat {\mathbf x}\cdot\mathbf x_0 \end{align} -This integral can be evaluated by a saddle point method. For reasons we will +This integral can be evaluated to leading order by a saddle point method. For reasons we will see, it is best to extremize with respect to $R$, $D$, and $\hat m$, leaving a new effective action of $m$ alone. This can be solved to give \begin{equation} @@ -313,11 +313,11 @@ new effective action of $m$ alone. This can be solved to give \Bigg[ \alpha V_0^2f'(1) +(2-\alpha)f(1)\left(\frac{f(1)}{1-m^2}-f'(1)\right) \\ - &\quad+\operatorname{sgn}(m)\alpha\sqrt{ + &\quad+\alpha\sqrt{ \tfrac{4V_0^2}\alpha f(1)f'(1)\left[\tfrac{f(1)}{1-m^2}-f'(1)\right] +\left[\tfrac{f(1)^2}{1-m^2}-\big(V_0^2+f(1)\big)f'(1)\right]^2 } - \Bigg] + \Bigg] \end{aligned} \end{equation} \begin{equation} @@ -339,10 +339,10 @@ $\mathbf x_0$. Therefore, the value $m^*$ that maximizes this action can be understood as the latitude on the sphere where most of the contribution to the Euler characteristic is made. -The action $\mathcal S_\Omega$ is extremized with respect to $m$ at $m^*=0$ or $m^*=-R^*$. -In the latter case, $m*$ takes the value +The action $\mathcal S_\Omega$ is extremized with respect to $m$ at $m=0$ or $m=\pm m^*=\mp R^*(m^*)$. +In the latter case, $m^*$ takes the value \begin{equation} - m^*=\pm\sqrt{1-\frac{\alpha}{f'(1)}\big(V_0^2+f(1)\big)} + m^*=\sqrt{1-\frac{\alpha}{f'(1)}\big(V_0^2+f(1)\big)} \end{equation} and $\mathcal S_\Omega(m^*)=0$. If this solution was always well-defined, it would vanish when the argument of the square root vanishes for \begin{equation} @@ -403,7 +403,7 @@ relying on the saddle point of an exponential integral would be invalid. In the regime of $\mathcal S_\Omega(m)$ where the action is complex-valued, is this breakdown the result of a negative average characteristic, or is it the result of another reason? It is difficult to answer this with the previous calculation -alone. However, in the next subsection we will make a complementary calculation +alone. However, in the next section we will make a complementary calculation that rules out the negative Euler characteristic picture. Instead, we will see that in the complex region, the large-deviation function in $m$ that $\mathcal S_\Omega(m)$ represents breaks down due to the vanishingly small probability of @@ -411,7 +411,7 @@ finding any stationary points. -\subsection{Complexity of a linear test function} +\section{Complexity of a linear test function} One way to definitely narrow possible interpretations of the average Euler characteristic is to compute a complementary average. The Euler characteristic @@ -445,7 +445,7 @@ function around the determinant. Following \cite{Fyodorov_2004_Complexity}, we make use of the identity \begin{equation} \begin{aligned} - |\det A| + |\det A| &=\lim_{\epsilon\to0}\frac{(\det A)^2}{\sqrt{\det(A+i\epsilon I)}\sqrt{\det(A-i\epsilon I)}} \\ &=\frac1{(2\pi)^N}\int d\bar{\pmb\eta}_1\,d\pmb\eta_1\,d\bar{\pmb\eta}_2\,d\pmb\eta_2\,d\mathbf a\,d\mathbf b\, @@ -453,7 +453,7 @@ make use of the identity -\bar{\pmb\eta}_1^TA\pmb\eta_1-\bar{\pmb\eta}_2^TA\pmb\eta_2 -\frac12\mathbf a^T(A+i\epsilon I)\mathbf a-\frac12\mathbf b^T(A-i\epsilon I)\mathbf b } - \end{aligned} +\end{aligned} \end{equation} for an $N\times N$ matrix $A$. Here $\bar{\pmb\eta}_1$, $\pmb\eta_1$, $\bar{\pmb\eta}_2$, and $\pmb\eta_2$ are Grassmann vectors and $\mathbf a$ and @@ -465,7 +465,7 @@ Appendix~\ref{sec:complexity.details}. The result is that, to largest order in $N$, the logarithm of the average number of stationary points is the same as the logarithm of the average Euler characteristic. -\subsection{How to interpret these calculations} +\section{Implications for the topology of solutions} It is not straightforward to directly use the average Euler characteristic to infer something about the number of connected components in the set of @@ -514,13 +514,15 @@ have characteristics of either 0 or 4. \begin{figure} \includegraphics{figs/regime_1.pdf} - \hspace{-3em} + \hspace{-3.5em} \includegraphics{figs/regime_2.pdf} - \hspace{-3em} + \hspace{-3.5em} \includegraphics{figs/regime_3.pdf} + \hspace{-3.5em} + \includegraphics{figs/regime_4.pdf} \caption{ - \textbf{Behavior of the action in our three nontrivial regimes.} The effective action $\mathcal S_\Omega$ as a function of overlap $m$ with + \textbf{Behavior of the action in four regimes.} The effective action $\mathcal S_\Omega$ as a function of overlap $m$ with the height axis for our model with $f(q)=\frac12q^3$ and $\alpha=\frac12$ at three different target values $V_0$. \textbf{Left: the connected regime.} The action is maximized with $\mathcal S_\Omega(m^*)=0$, and no @@ -533,7 +535,7 @@ have characteristics of either 0 or 4. } \end{figure} -\paragraph{The connected regime: \boldmath{$V_0^20$ still exists. The minimum overlap indicates that the solution @@ -568,12 +570,10 @@ components to the solution manifold. While most minima and maxima of the height are located at the equator $m=0$, they are found in exponential number up to the latitude $m_\text{max}$. -\section{Results} - -\subsection{Topology of solutions to many equations and the satisfiability transition} +\paragraph{The \textsc{unsat} regime: \boldmath{$V_\text{\textsc{sat}}^2