summaryrefslogtreecommitdiff
path: root/topology.tex
diff options
context:
space:
mode:
authorJaron Kent-Dobias <jaron@kent-dobias.com>2024-08-27 18:35:29 +0200
committerJaron Kent-Dobias <jaron@kent-dobias.com>2024-08-27 18:35:29 +0200
commit8cff4a74e2792b6037364aa63bd7d78085724ca5 (patch)
treef35b2144ea393fdfd166a22ba4e8ef8ee53fbf92 /topology.tex
parentc0ef21f08ad290bdea74ba39f6002329e4511982 (diff)
downloadSciPostPhys_18_158-8cff4a74e2792b6037364aa63bd7d78085724ca5.tar.gz
SciPostPhys_18_158-8cff4a74e2792b6037364aa63bd7d78085724ca5.tar.bz2
SciPostPhys_18_158-8cff4a74e2792b6037364aa63bd7d78085724ca5.zip
Lots of writing, and new figures.
Diffstat (limited to 'topology.tex')
-rw-r--r--topology.tex247
1 files changed, 162 insertions, 85 deletions
diff --git a/topology.tex b/topology.tex
index 85d1518..65c717e 100644
--- a/topology.tex
+++ b/topology.tex
@@ -1,4 +1,4 @@
-\documentclass[submission, Phys]{SciPost}
+\documentclass{SciPost}
\usepackage[utf8]{inputenc}
\usepackage[T1]{fontenc}
@@ -9,16 +9,23 @@
\urlstyle{sf}
+\fancypagestyle{SPstyle}{
+\fancyhf{}
+\lhead{\colorbox{scipostblue}{\bf \color{white} ~SciPost Physics }}
+\rhead{{\bf \color{scipostdeepblue} ~Submission }}
+\renewcommand{\headrulewidth}{1pt}
+\fancyfoot[C]{\textbf{\thepage}}
+}
+
% Fix \cal and \mathcal characters look (so it's not the same as \mathscr)
\DeclareSymbolFont{usualmathcal}{OMS}{cmsy}{m}{n}
\DeclareSymbolFontAlphabet{\mathcal}{usualmathcal}
\hypersetup{
- colorlinks=true,
- urlcolor={blue!50!black},
- citecolor={blue!50!black},
- filecolor={blue!50!black},
- linkcolor={blue!50!black}
+ colorlinks,
+ linkcolor={red!50!black},
+ citecolor={blue!50!black},
+ urlcolor={blue!80!black}
}
\author{\footnote{\url{jaron.kent-dobias@roma1.infn.it}}}
@@ -26,47 +33,32 @@
\begin{document}
-\begin{center}{\Large \textbf{
+\pagestyle{SPstyle}
+
+\begin{center}{\Large \textbf{\color{scipostdeepblue}{
On the topology of solutions to random continuous constraint satisfaction problems\\
-}}\end{center}
+}}}\end{center}
-% TODO: write the author list here. Use first name (+ other initials) + surname format.
-% Separate subsequent authors by a comma, omit comma and use "and" for the last author.
-% Mark the corresponding author with a superscript star.
\begin{center}
-\bf Jaron Kent-Dobias$^*$
+ \textbf{Jaron Kent-Dobias$^\star$}
\end{center}
-% TODO: write all affiliations here.
-% Format: institute, city, country
\begin{center}
Istituto Nazionale di Fisica Nucleare, Sezione di Roma I, Italy
-\\
-% TODO: provide email address of corresponding author
-${}^\star$ {\small \sf jaron.kent-dobias@roma1.infn.it}
-\end{center}
-
-\begin{center}
-\today
+\\[\baselineskip]
+$\star$ \href{mailto:jaron.kent-dobias@roma1.infn.it}{\small jaron.kent-dobias@roma1.infn.it}
\end{center}
-% For convenience during refereeing (optional),
-% you can turn on line numbers by uncommenting the next line:
-%\linenumbers
-% You should run LaTeX twice in order for the line numbers to appear.
-
-\section*{Abstract}
-{\bf
-% TODO: write your abstract here.
+\section*{\color{scipostdeepblue}{Abstract}}
+\textbf{\boldmath{%
We consider the set of solutions to $M$ random polynomial equations with
independent Gaussian coefficients and a target value $V_0$ on the $(N-1)$-sphere. When solutions
exist, they form a manifold. We compute the average Euler characteristic of
this manifold in the limit of large $N$, and find different behavior
depending on the ratio $\alpha=M/N$. When $\alpha<\alpha_\text{onset}$, the
average characteristic is 2 and there is a single connected component, while
-for $\alpha_\text{onset}<\alpha<\alpha_\text{shatter}$ a large connected
-component coexists with many disconnected components. When
-$\alpha>\alpha_\text{shatter}$ the large connected component vanishes, and the
+for $\alpha_\text{onset}<\alpha<\alpha_\text{shatter}$ many large connected components coexist. When
+$\alpha>\alpha_\text{shatter}$ the large connected components vanish, replaced by small fragments, and the
entire manifold vanishes for $\alpha>\alpha_\text{\textsc{sat}}$. In the
limit $\alpha\to0$ there is a correspondence between this problem and the
topology of constant-energy level sets in the spherical spin glasses. We
@@ -74,16 +66,45 @@ conjecture that the energy $E_\text{shatter}$ associated with the vanishing of
the large connected component corresponds to the asymptotic limit of gradient
descent from a random initial condition.
}
+}
+
+\vspace{\baselineskip}
+
+%%%%%%%%%% BLOCK: Copyright information
+% This block will be filled during the proof stage, and finilized just before publication.
+% It exists here only as a placeholder, and should not be modified by authors.
+\noindent\textcolor{white!90!black}{%
+\fbox{\parbox{0.975\linewidth}{%
+\textcolor{white!40!black}{\begin{tabular}{lr}%
+ \begin{minipage}{0.6\textwidth}%
+ {\small Copyright attribution to authors. \newline
+ This work is a submission to SciPost Physics. \newline
+ License information to appear upon publication. \newline
+ Publication information to appear upon publication.}
+ \end{minipage} & \begin{minipage}{0.4\textwidth}
+ {\small Received Date \newline Accepted Date \newline Published Date}%
+ \end{minipage}
+\end{tabular}}
+}}
+}
+%%%%%%%%%% BLOCK: Copyright information
-% TODO: include a table of contents (optional)
+%%%%%%%%%% TODO: LINENO
+% For convenience during refereeing we turn on line numbers:
+%\linenumbers
+% You should run LaTeX twice in order for the line numbers to appear.
+%%%%%%%%%% END TODO: LINENO
+
+%%%%%%%%%% TODO: TOC
% Guideline: if your paper is longer that 6 pages, include a TOC
% To remove the TOC, simply cut the following block
\vspace{10pt}
\noindent\rule{\textwidth}{1pt}
-\tableofcontents\thispagestyle{fancy}
+\tableofcontents
\noindent\rule{\textwidth}{1pt}
\vspace{10pt}
+%%%%%%%%%% END TODO: TOC
\section{Introduction}
@@ -232,8 +253,6 @@ is the vector of partial derivatives with respect to all $N+M+1$ variables.
This integral is now in a form where standard techniques from mean-field theory
can be applied to calculate it.
-\subsubsection{Calculation of the average Euler characteristic}
-
To evaluate the average of $\chi$ over the constraints, we first translate the $\delta$ functions and determinant to integral form, with
\begin{align}
\delta\big(\partial L(\mathbf x,\pmb\omega)\big)
@@ -377,55 +396,6 @@ These transition values of the target $V_0$ correspond with transition values in
\end{figure}
-
-
-\subsubsection{How to interpret the average Euler characteristic}
-
-It is not straightforward to directly use the average Euler characteristic to
-infer something about the number of connected components in the set of
-solutions. To understand why, a simple example is helpful. Consider the set of
-solutions on the sphere $\|\mathbf x\|^2=N$ that satisfy the single quadratic
-constraint
-\begin{equation}
- 0=\sum_{i=1}^N\sigma_ix_i^2
-\end{equation}
-where each $\sigma_i$ is taken to be $\pm1$ with equal probability. If we take $\mathbf x$ to be ordered such that all terms with $\sigma_i=+1$ come first, this gives
-\begin{equation}
- 0=\sum_{i=1}^{N_+}x_i^2-\sum_{i=N_++1}^Nx_i^2
-\end{equation}
-where $N_+$ is the number of terms with $\sigma_i=+1$. The topology of the resulting manifold can be found by adding and subtracting this constraint from the spherical one, which gives
-\begin{align}
- \frac12=\sum_{i=1}^{N_+}x_i^2
- \qquad
- \frac12=\sum_{i=N_++1}^{N}x_i^2
-\end{align}
-These are two independent equations for spheres of radius $1/\sqrt2$, one of
-dimension $N_+$ and the other of dimension $N-N_+$. Therefore, the topology of
-the configuration space is that of $S^{N_+-1}\times S^{N-N_+-1}$. The Euler
-characteristic of a product space is the product of the Euler characteristics,
-and so we have $\chi(\Omega)=\chi(S^{N_+-1})\chi(S^{N-N_+-1})$.
-
-What is the average value of the Euler characteristic over values of
-$\sigma_i$? First, recall that the Euler characteristic of a sphere $S^d$ is 2
-when $d$ is even and 0 when $d$ is odd. When $N$ is odd, any value
-of $N_+$ will result in one of the two spheres in the product to be
-odd-dimensional, and therefore $\chi(\Omega)=0$, as is always true for
-odd-dimensional manifolds. When $N$ is even, there are two possibilities: when $N_+$ is even then both spheres are odd-dimensional, while when $N_+$ is odd then both spheres are even-dimensional.
-The number of terms $N_+$ with $\sigma_i=+1$ is distributed with the binomial distribution
-\begin{equation}
- P(N_+)=\frac1{2^N}\binom{N}{N_+}
-\end{equation}
-Therefore, the average Euler characteristic for even $N$ is
-\begin{equation}
- \overline{\chi(\Omega)}
- =\sum_{N_+=0}^NP(N_+)\chi(S^{N_+-1})\chi(S^{N-N_+-1})
- =\frac4{2^N}\sum_{n=0}^{N/2}\binom{N}{2n}
- =2
-\end{equation}
-Thus we find the average Euler characteristic in this simple example is 2
-despite the fact that the possible manifolds resulting from the constraints
-have characteristics of either 0 or 4.
-
The many positive values the Euler characteristic can take is one problem, but
a larger one is the many negative values it can take. A manifold with many
handles will have a very negative Euler characteristic, and our calculation
@@ -439,6 +409,8 @@ that in the complex region, the large-deviation function in $m$ that $\mathcal
S_\Omega(m)$ represents breaks down due to the vanishingly small probability of
finding any stationary points.
+
+
\subsection{Complexity of a linear test function}
One way to definitely narrow possible interpretations of the average Euler
@@ -493,6 +465,109 @@ Appendix~\ref{sec:complexity.details}. The result is that, to largest order in
$N$, the logarithm of the average number of stationary points is the same as
the logarithm of the average Euler characteristic.
+\subsection{How to interpret these calculations}
+
+It is not straightforward to directly use the average Euler characteristic to
+infer something about the number of connected components in the set of
+solutions. To understand why, a simple example is helpful. Consider the set of
+solutions on the sphere $\|\mathbf x\|^2=N$ that satisfy the single quadratic
+constraint
+\begin{equation}
+ 0=\sum_{i=1}^N\sigma_ix_i^2
+\end{equation}
+where each $\sigma_i$ is taken to be $\pm1$ with equal probability. If we take $\mathbf x$ to be ordered such that all terms with $\sigma_i=+1$ come first, this gives
+\begin{equation}
+ 0=\sum_{i=1}^{N_+}x_i^2-\sum_{i=N_++1}^Nx_i^2
+\end{equation}
+where $N_+$ is the number of terms with $\sigma_i=+1$. The topology of the resulting manifold can be found by adding and subtracting this constraint from the spherical one, which gives
+\begin{align}
+ \frac12=\sum_{i=1}^{N_+}x_i^2
+ \qquad
+ \frac12=\sum_{i=N_++1}^{N}x_i^2
+\end{align}
+These are two independent equations for spheres of radius $1/\sqrt2$, one of
+dimension $N_+$ and the other of dimension $N-N_+$. Therefore, the topology of
+the configuration space is that of $S^{N_+-1}\times S^{N-N_+-1}$. The Euler
+characteristic of a product space is the product of the Euler characteristics,
+and so we have $\chi(\Omega)=\chi(S^{N_+-1})\chi(S^{N-N_+-1})$.
+
+What is the average value of the Euler characteristic over values of
+$\sigma_i$? First, recall that the Euler characteristic of a sphere $S^d$ is 2
+when $d$ is even and 0 when $d$ is odd. When $N$ is odd, any value
+of $N_+$ will result in one of the two spheres in the product to be
+odd-dimensional, and therefore $\chi(\Omega)=0$, as is always true for
+odd-dimensional manifolds. When $N$ is even, there are two possibilities: when $N_+$ is even then both spheres are odd-dimensional, while when $N_+$ is odd then both spheres are even-dimensional.
+The number of terms $N_+$ with $\sigma_i=+1$ is distributed with the binomial distribution
+\begin{equation}
+ P(N_+)=\frac1{2^N}\binom{N}{N_+}
+\end{equation}
+Therefore, the average Euler characteristic for even $N$ is
+\begin{equation}
+ \overline{\chi(\Omega)}
+ =\sum_{N_+=0}^NP(N_+)\chi(S^{N_+-1})\chi(S^{N-N_+-1})
+ =\frac4{2^N}\sum_{n=0}^{N/2}\binom{N}{2n}
+ =2
+\end{equation}
+Thus we find the average Euler characteristic in this simple example is 2
+despite the fact that the possible manifolds resulting from the constraints
+have characteristics of either 0 or 4.
+
+\begin{figure}
+ \includegraphics{figs/regime_1.pdf}
+ \hspace{-3em}
+ \includegraphics{figs/regime_2.pdf}
+ \hspace{-3em}
+ \includegraphics{figs/regime_3.pdf}
+
+ \caption{
+ \textbf{Behavior of the action in our three nontrivial regimes.} The effective action $\mathcal S_\Omega$ as a function of overlap $m$ with
+ the height axis for our model with $f(q)=\frac12q^3$ and $\alpha=\frac12$
+ at three different target values $V_0$. \textbf{Left: the connected
+ regime.} The action is maximized with $\mathcal S_\Omega(m^*)=0$, and no
+ stationary points are found with overlap less than $m_\text{min}$ with a
+ random point on the sphere. \textbf{Center: the onset regime.} The action
+ is maximized with $\mathcal S_\Omega(m_\text{min})>0$, and is positive up
+ to $m_\text{max}$. No stationary points are found with overlap less than $m_\text{min}$. \textbf{Right: the shattered regime.} The action is
+ maximized with $\mathcal S_\Omega(0)>0$, and is positive up to
+ $m_\text{max}$.
+ }
+\end{figure}
+
+\paragraph{The connected regime: \boldmath{$V_0^2<V_\text{on}$}.}
+
+In our calculation above, $\overline{\chi(\Omega)}=2$ could mean a fine-tuned
+average like this, or it could indicate the presence of manifold homeomorphic
+to $S^{N-M-1}$ for even $N-M-1$. In either case it strongly indicates a single
+connected component, whose few minima and maxima are usually found at the
+latitude $m^*$. Randomly chosen points on the sphere have a typical nearest
+overlap $m^*$ with the solution manifold, but can never have a smaller overlap than
+$m_\text{min}$, indicating that the manifold is extensive.
+
+\paragraph{The onset regime: \boldmath{$V_\text{on}^2<V_0^2<V_\text{sh}$}.}
+
+In this regime $\log\overline{\chi(\Omega)}=O(N)$ but a minimum overlap
+$m_\text{min}>0$ still exists. The minimum overlap indicates that the solution
+manifold is exclusively made up of extensive components, because the existence
+of small components would lead to stationary points near the equator with
+respect to a randomly chosen axis $\mathbf x_0$. The solution manifold is
+homeomorphic to a topological space with large Euler characteristic like the
+product of union of many spheres. In the former case we would have one
+topologically nontrivial connected component, while in the latter we would have
+many simple disconnected components; the reality could be a combination of the
+two. It the framework of this calculation, it is not possible to distinguish between
+these scenarios. In any case, the minima and maxima of the height on the
+solution manifold are typically found at latitude $m_\text{min}$ but are found
+in exponential number up to the latitude $m_\text{max}$.
+
+\paragraph{The shattered regime: \boldmath{$V_\text{sh}^2<V_0^2<V_\text{\textsc{sat}}^2$}.}
+
+Here $\log\overline{\chi(\Omega)}=O(N)$ and the primary contribution to the
+number of stationary points and to the Euler characteristic comes from the
+equator. This indicates the presence of a large number of small disconnected
+components to the solution manifold. While most minima and maxima of the height
+are located at the equator $m=0$, they are found in exponential number up to
+the latitude $m_\text{max}$.
+
\section{Results}
\subsection{Topology of solutions to many equations and the satisfiability transition}
@@ -545,6 +620,8 @@ topology of the solution manifold.
\subsection{Topology of level sets of the spherical spin glasses and the dynamic threshold}
+\cite{Folena_2020_Rethinking, Folena_2021_Gradient}
+
As indicated earlier, for $M=1$ the solution manifold corresponds to the energy
level set of a spherical spin glass with energy density $E=\sqrt NV_0$. All the
results from the previous sections follow, and can be translated to the spin