diff options
-rw-r--r-- | when_annealed.tex | 121 |
1 files changed, 63 insertions, 58 deletions
diff --git a/when_annealed.tex b/when_annealed.tex index ae7e3b8..537381a 100644 --- a/when_annealed.tex +++ b/when_annealed.tex @@ -53,15 +53,15 @@ settings might be affected. \end{abstract} -Random high-dimensional energies, cost functions, and interaction networks are important in many fields. The energy landscape of glasses, the likelihood -landscape of machine learning and inference, and the -interactions between organisms in an ecosystem are just a few examples. A -traditional tool for making sense of their behavior is to analyze the -statistics of points where their dynamics are stationary. For energy or cost -landscapes, these correspond to the minima, maxima, and saddles, while for -ecosystems and other non-gradient dynamical systems these correspond to -equilibria of the dynamics. When many stationary points are present, the system -is considered complex. +Random high-dimensional energies, cost functions, and interaction networks are +important in many fields. The energy landscape of glasses, the likelihood +landscape of machine learning and inference, and the interactions between +organisms in an ecosystem are just a few examples. A traditional tool for +making sense of their behavior is to analyze the statistics of points where +their dynamics are stationary. For energy or cost landscapes, these correspond +to the minima, maxima, and saddles, while for ecosystems and other non-gradient +dynamical systems these correspond to equilibria of the dynamics. When many +stationary points are present, the system is considered complex. Despite the importance of stationary point statistics for understanding complex behavior, they are often calculated using an uncontrolled approximation. @@ -69,13 +69,15 @@ Because their number is so large, it cannot be reliably averaged. The annealed approximation takes this average anyway, risking a systematic bias by rare and atypical samples. The annealed approximation is known to be exact for certain models and in certain circumstances, but it is used outside those circumstances -without much reflection \cite{Wainrib_2013_Topological, Gershenzon_2023_On-Site}. In a few cases researches have made instead the +without much reflection \cite{Wainrib_2013_Topological, +Gershenzon_2023_On-Site}. In a few cases researches have made instead the better-controlled quenched average, which averages the logarithm of the number of stationary points, and find deviations from the annealed approximation with -important implications for the system's behavior \cite{Muller_2006_Marginal, Ros_2019_Complexity, -Kent-Dobias_2023_How, Ros_2023_Quenched}. Generically, the annealed -approximation to the complexity is wrong when a nonvanishing fraction of pairs -of stationary points have nontrivial correlations in their mutual position. +important implications for the system's behavior \cite{Muller_2006_Marginal, +Ros_2019_Complexity, Kent-Dobias_2023_How, Ros_2023_Quenched}. Generically, +the annealed approximation to the complexity is wrong when a nonvanishing +fraction of pairs of stationary points have nontrivial correlations in their +mutual position. A heuristic line of reasoning for the appropriateness of the annealed approximation is sometimes made when the approximation is correct for an @@ -91,10 +93,10 @@ stationary points? In this paper, we show that the behavior of the ground state, or \emph{any} equilibrium behavior, does not govern whether stationary points will have a correct annealed average. In a prototypical family of models of random -functions, we calculate a condition for determining when annealed averages should fail -and stationary points should have nontrivial correlations in their mutual -position. We produce examples of models whose equilibrium is guaranteed to -never see such correlations between thermodynamic states, but where a +functions, we calculate a condition for determining when annealed averages +should fail and stationary points should have nontrivial correlations in their +mutual position. We produce examples of models whose equilibrium is guaranteed +to never see such correlations between thermodynamic states, but where a population of saddle points is nevertheless correlated. We study the mixed spherical models, which are models of Gaussian-correlated @@ -106,16 +108,17 @@ product (or overlap) between the two configurations: \begin{equation} \label{eq:covariance} \overline{H(\pmb\sigma_1)H(\pmb\sigma_2)}=\frac1Nf\bigg(\frac{\pmb\sigma_1\cdot\pmb\sigma_2}N\bigg) \end{equation} -Specifying the covariance function $f$ uniquely specifies the model. The -series coefficients of $f$ need to be nonnnegative in order for $f$ to be a +Specifying the covariance function $f$ uniquely specifies the model. The series +coefficients of $f$ need to be nonnnegative in order for $f$ to be a well-defined covariance. The case where $f$ is a homogeneous polynomial has been extensively studied, and corresponds to the pure spherical models of glass physics or the spiked tensor models of statistical inference. Here we will study cases where $f(q)=\frac12\big(\lambda q^3+(1-\lambda)q^s\big)$ for -$\lambda\in(0,1)$, called $3+s$ models. These are examples of \emph{mixed} spherical models, which -have been studied in the physics and statistics literature and host a zoo of -complex orders and phase transitions \cite{Crisanti_2004_Spherical, -Crisanti_2006_Spherical, Crisanti_2011_Statistical}. +$\lambda\in(0,1)$, called $3+s$ models. These are examples of \emph{mixed} +spherical models, which have been studied in the physics and statistics +literature and host a zoo of complex orders and phase transitions +\cite{Crisanti_2004_Spherical, Crisanti_2006_Spherical, +Crisanti_2011_Statistical}. There are several well-established results on the equilibrium of this model. First, if the function $f$ is convex then it is not possible for the @@ -146,9 +149,9 @@ complexity has nontrivial \textsc{rsb}. As evidenced in that figure, \textsc{rsb} among saddles is possible well outside the bounds from equilibrium. -There are two important features which differentiate stationary points $\pmb\sigma^*$ in the -spherical models: their \emph{energy density} $E=\frac1NH(\pmb\sigma^*)$ and -their \emph{stability} +There are two important features which differentiate stationary points +$\pmb\sigma^*$ in the spherical models: their \emph{energy density} +$E=\frac1NH(\pmb\sigma^*)$ and their \emph{stability} $\mu=\frac1N\operatorname{\mathrm{Tr}}\operatorname{\mathrm{Hess}}H(\pmb\sigma^*)$. The energy density should be familiar, as the `height' in the landscape. The stability is so-called because it governs the spectrum of the stationary point. @@ -176,15 +179,15 @@ $\mu=\mu_\mathrm m$, the spectrum has a pseudogap, and we have marginal minima. \end{figure} The number $\mathcal N(E,\mu)$ of stationary points with energy density $E$ and -stability $\mu$ is exponential in $N$ for these models. Their complexity $\Sigma(E,\mu)$ is -defined by the average of the logarithm of their number, or -$\Sigma(E,\mu)=\frac1N\overline{\log\mathcal N(E,\mu)}$. More often the annealed -complexity is calculated, where the average is taken before the logarithm: -$\Sigma_\mathrm a(E,\mu)=\frac1N\log\overline{\mathcal N(E,\mu)}$. The annealed -complexity has been computed for these models \cite{BenArous_2019_Geometry, -Folena_2020_Rethinking}, and the quenched complexity has been computed for a -couple examples which have correlations among ground-state minima -\cite{Kent-Dobias_2023_How}. +stability $\mu$ is exponential in $N$ for these models. Their complexity +$\Sigma(E,\mu)$ is defined by the average of the logarithm of their number, or +$\Sigma(E,\mu)=\frac1N\overline{\log\mathcal N(E,\mu)}$. More often the +annealed complexity is calculated, where the average is taken before the +logarithm: $\Sigma_\mathrm a(E,\mu)=\frac1N\log\overline{\mathcal N(E,\mu)}$. +The annealed complexity has been computed for these models +\cite{BenArous_2019_Geometry, Folena_2020_Rethinking}, and the quenched +complexity has been computed for a couple examples which have correlations +among ground-state minima \cite{Kent-Dobias_2023_How}. In these models, trivial correlations between stationary points correspond with zero overlap: almost all stationary points are orthogonal to each other. This @@ -197,15 +200,16 @@ ansatz, which corresponds to two kinds of pairs of stationary point: a fraction $x$ of pairs have the trivial zero overlap, and the remaining fraction $1-x$ have a nontrivial overlap $q_1$. In the annealed or replica-symmetric case, $x=1$ and all but a vanishing fraction of stationary points are uncorrelated -with each other. Since other kinds of \textsc{rsb} order encompass {\oldstylenums1}\textsc{rsb}, we are guaranteed that -$\Sigma\leq\Sigma_{\oldstylenums1\textsc{rsb}}\leq\Sigma_\mathrm a$. We -will discuss later in what settings the {\oldstylenums1}\textsc{rsb} complexity -is correct. +with each other. Since other kinds of \textsc{rsb} order encompass +{\oldstylenums1}\textsc{rsb}, we are guaranteed that +$\Sigma\leq\Sigma_{\oldstylenums1\textsc{rsb}}\leq\Sigma_\mathrm a$. We will +discuss later in what settings the {\oldstylenums1}\textsc{rsb} complexity is +correct. When the complexity is calculated using the Kac--Rice formula and a physicists' tool set, the problem is reduced to the evaluation of an integral by the saddle -point method for large $N$ \cite{Kent-Dobias_2023_How}. -The complexity is given by extremizing an effective action, +point method for large $N$ \cite{Kent-Dobias_2023_How}. The complexity is given +by extremizing an effective action, \begin{equation} \Sigma_{\oldstylenums1\textsc{rsb}}(E,\mu) =\lim_{n\to0}\int dq_1\,dx\,\mathcal S_{\oldstylenums1\textsc{rsb}}(q_1,x\mid E,\mu)e^{nN\mathcal S_{\oldstylenums1\textsc{rsb}}(q_1,x\mid E,\mu)} @@ -268,13 +272,13 @@ solution, and some parameters, e.g., $q_1$, are unconstrained and can take any value in the old solution. There is one place where we can consistently search for a bifurcating solution -to the saddle point equations: along the zero complexity line -$\Sigma_\mathrm a(E,\mu)=0$. Going along this line in the replica symmetric solution, the +to the saddle point equations: along the zero complexity line $\Sigma_\mathrm +a(E,\mu)=0$. Going along this line in the replica symmetric solution, the {\oldstylenums1}\textsc{rsb} complexity transitions at a critical point where $x=q_1=1$ \cite{Kent-Dobias_2023_How}. Since all the parameters in the bifurcating solution are known at this point, we can search for it by looking -for a flat direction in the way described above. In the annealed -solution for points describing saddles ($\mu<\mu_\mathrm m$), this line is +for a flat direction in the way described above. In the annealed solution for +points describing saddles ($\mu<\mu_\mathrm m$), this line is \begin{equation} \label{eq:extremal.line} \mu_0=-\frac1{z_f}\left(2Ef'f''+\sqrt{2f''u_f\bigg(\log\frac{f''}{f'}z_f-E^2(f''-f')\bigg)}\right) \end{equation} @@ -319,11 +323,11 @@ by \qquad d=\frac{w_f}{f'f''} \end{equation} -Changing variables to $y$ from $\mu$ is a -convenient choice because the branch of \eqref{eq:extremal.line} is chosen -by the sign of $y$ (the lower-energy branch we are interested in corresponds -with $y>0$) and the relationship between $y$ and $E$ on the extremal line is -$g=2hy^2+eE^2$, where the constants $e$, $g$, and $h$ are given by +Changing variables to $y$ from $\mu$ is a convenient choice because the branch +of \eqref{eq:extremal.line} is chosen by the sign of $y$ (the lower-energy +branch we are interested in corresponds with $y>0$) and the relationship +between $y$ and $E$ on the extremal line is $g=2hy^2+eE^2$, where the constants +$e$, $g$, and $h$ are given by \begin{equation} e=f''-f' \qquad @@ -371,7 +375,8 @@ energy point, so that these two points give the precise range of energies at which \textsc{rsb} saddles are found. An example that conforms with this picture for a $3+5$ mixed model is shown in Fig.~\ref{fig:complexity_35}. -The expression inside the inner square root of \eqref{eq:energies} is proportional to +The expression inside the inner square root of \eqref{eq:energies} is +proportional to \begin{equation} \label{eq:condition} G_f = @@ -465,7 +470,8 @@ extended from $E_{\oldstylenums1\textsc{rsb}}^+$. } \label{fig:order} \end{figure} -There are implications for the emergence of \textsc{rsb} in equilibrium. Consider a specific $H$ with +There are implications for the emergence of \textsc{rsb} in equilibrium. +Consider a specific $H$ with \begin{equation} H(\pmb\sigma) =\frac{\sqrt\lambda}{p!}\sum_{i_1\cdots i_p}J^{(p)}_{i_1\cdots i_p}\sigma_{i_1}\cdots\sigma_{i_p} @@ -501,11 +507,10 @@ $s>2$, this transition line \emph{always} intersects the extremal line among some population of stationary points. However, it is likely that for much of the parameter space the so-called one-full \textsc{rsb} ({\oldstylenums1\textsc{frsb}}) is the correct solution, as it likely is for -large $s$ in the $3+s$ model at hand. Further work to find the -conditions for transitions of the complexity to these forms of order is -necessary. For values of $s$ where there is no -\textsc{rsb} of any kind in the ground state, we expect that the -{\oldstylenums1\textsc{rsb}} complexity is correct. +large $s$ in the $3+s$ model at hand. Further work to find the conditions for +transitions of the complexity to these forms of order is necessary. For values +of $s$ where there is no \textsc{rsb} of any kind in the ground state, we +expect that the {\oldstylenums1\textsc{rsb}} complexity is correct. What are the implications for dynamics? We find that nontrivial correlations tend to exist among saddle points with the maximum or minimum index possible at |