-
The integrable semi-discrete nonlinear Schrödinger equations with nonzero backgrounds: Bilinearization-reduction approach
Authors:
Xiao Deng,
Kui Chen,
Hongyang Chen,
Da-jun Zhang
Abstract:
In this paper the classical and nonlocal semi-discrete nonlinear Schrödinger (sdNLS) equations with nonzero backgrounds are solved by means of the bilinearization-reduction approach. In the first step of this approach, the unreduced sdNLS system with a nonzero background is bilinearized and its solutions are presented in terms of quasi double Casoratians. Then, reduction techniques are implemented…
▽ More
In this paper the classical and nonlocal semi-discrete nonlinear Schrödinger (sdNLS) equations with nonzero backgrounds are solved by means of the bilinearization-reduction approach. In the first step of this approach, the unreduced sdNLS system with a nonzero background is bilinearized and its solutions are presented in terms of quasi double Casoratians. Then, reduction techniques are implemented to deal with complex and nonlocal reductions, which yields solutions for the four classical and nonlocal sdNLS equations with a plane wave background or a hyperbolic function background. These solutions are expressed with explicit formulae and allow classifications according to canonical forms of certain spectral matrix. In particular, we present explicit formulae for general rogue waves for the classical focusing sdNLS equation. Some obtained solutions are analyzed and illustrated.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Edge of chaos as a guiding principle for modern neural network training
Authors:
Lin Zhang,
Ling Feng,
Kan Chen,
Choy Heng Lai
Abstract:
The success of deep neural networks in real-world problems has prompted many attempts to explain their training dynamics and generalization performance, but more guiding principles for the training of neural networks are still needed. Motivated by the edge of chaos principle behind the optimal performance of neural networks, we study the role of various hyperparameters in modern neural network tra…
▽ More
The success of deep neural networks in real-world problems has prompted many attempts to explain their training dynamics and generalization performance, but more guiding principles for the training of neural networks are still needed. Motivated by the edge of chaos principle behind the optimal performance of neural networks, we study the role of various hyperparameters in modern neural network training algorithms in terms of the order-chaos phase diagram. In particular, we study a fully analytical feedforward neural network trained on the widely adopted Fashion-MNIST dataset, and study the dynamics associated with the hyperparameters in back-propagation during the training process. We find that for the basic algorithm of stochastic gradient descent with momentum, in the range around the commonly used hyperparameter values, clear scaling relations are present with respect to the training time during the ordered phase in the phase diagram, and the model's optimal generalization power at the edge of chaos is similar across different training parameter combinations. In the chaotic phase, the same scaling no longer exists. The scaling allows us to choose the training parameters to achieve faster training without sacrificing performance. In addition, we find that the commonly used model regularization method - weight decay - effectively pushes the model towards the ordered phase to achieve better performance. Leveraging on this fact and the scaling relations in the other hyperparameters, we derived a principled guideline for hyperparameter determination, such that the model can achieve optimal performance by saturating it at the edge of chaos. Demonstrated on this simple neural network model and training algorithm, our work improves the understanding of neural network training dynamics, and can potentially be extended to guiding principles of more complex model architectures and algorithms.
△ Less
Submitted 20 July, 2021;
originally announced July 2021.
-
Strain and defects in oblique stripe growth
Authors:
Kelly Chen,
Zachary Deiman,
Ryan Goh,
Sally Jankovic,
Arnd Scheel
Abstract:
We study stripe formation in two-dimensional systems under directional quenching in a phase-diffusion approximation including non-adiabatic boundary effects. We find stripe formation through simple traveling waves for all angles relative to the quenching line using an analytic continuation procedure. We also present comprehensive analytical asymptotic formulas in limiting cases of small and large…
▽ More
We study stripe formation in two-dimensional systems under directional quenching in a phase-diffusion approximation including non-adiabatic boundary effects. We find stripe formation through simple traveling waves for all angles relative to the quenching line using an analytic continuation procedure. We also present comprehensive analytical asymptotic formulas in limiting cases of small and large angles as well as small and large quenching rates. Of particular interest is a regime of small angle and slow quenching rate which is well described by the glide motion of a boundary dislocation along the quenching line. A delocalization bifurcation of this dislocation leads to a sharp decrease of strain created in the growth process at small angles. We complement our results with numerical continuation reliant on a boundary-integral formulation. We also compare results in the phase-diffusion approximation numerically to quenched stripe formation in an anisotropic Swift Hohenberg equation.
△ Less
Submitted 17 May, 2021; v1 submitted 4 February, 2021;
originally announced February 2021.
-
Anticipation and Negative Group Delay in a Retina
Authors:
Po-Yu Chou,
Jo-Fan Chien,
Kevin Sean Chen,
Yu-Ting Huang,
Chun-Chung Chen,
C. K. Chan
Abstract:
The mechanism of negative group delay (NGD) is used to understand the anticipatory capability of a retina. Experiments with retinas from bull frogs are performed to compare with the predictions of the NGD model. In particulars, whole field stochastic stimulation with various time correlations are used to probe anticipatory responses from the retina. We find that the NGD model can reproduce essenti…
▽ More
The mechanism of negative group delay (NGD) is used to understand the anticipatory capability of a retina. Experiments with retinas from bull frogs are performed to compare with the predictions of the NGD model. In particulars, whole field stochastic stimulation with various time correlations are used to probe anticipatory responses from the retina. We find that the NGD model can reproduce essential features of experimental observations characterized by the cross correlations between the stimulation and the retinal responses. The prediction horizon of a retina is found to depend on the correlation time of the stimulation as predicted by the NGD model. Experiments with dark and bright Gaussian light pulses further support the NGD mechanism; but only for the dark pulses indicating that the NGD effect of a retina might originate from its OFF response. Our finding suggests that sensory systems capable of using negative feedback for adaptation can give rise to anticipation as a consequence of the delay in the system.
△ Less
Submitted 10 November, 2020;
originally announced November 2020.
-
Squared eigenfunction symmetry of the D$Δ$mKP hierarchy and its constraint
Authors:
Kui Chen,
Cheng Zhang,
Da-jun Zhang
Abstract:
In this paper squared eigenfunction symmetry of the differential-difference modified Kadomtsev-Petviashvili (D$Δ$mKP) hierarchy and its constraint are considered. Under the constraint, the Lax triplets of the D$Δ$mKP hierarchy, together with their adjoint forms, give rise to the positive relativistic Toda (R-Toda) hierarchy. An invertible transformation is given to connect the positive and negativ…
▽ More
In this paper squared eigenfunction symmetry of the differential-difference modified Kadomtsev-Petviashvili (D$Δ$mKP) hierarchy and its constraint are considered. Under the constraint, the Lax triplets of the D$Δ$mKP hierarchy, together with their adjoint forms, give rise to the positive relativistic Toda (R-Toda) hierarchy. An invertible transformation is given to connect the positive and negative R-Toda hierarchies. The positive R-Toda hierarchy is reduced to the differential-difference Burgers hierarchy. We also consider another D$Δ$mKP hierarchy and show that its squared eigenfunction symmetry constraint gives rise to the Volterra hierarchy. In addition, we revisit the Ragnisco-Tu hierarchy which is a squared eigenfunction symmetry constraint of the differential-difference Kadomtsev-Petviashvili (D$Δ$KP) system. It was thought the Ragnisco-Tu hierarchy does not exist one-field reduction, but here we find an one-field reduction to reduce the hierarchy to the Volterra hierarchy. Besides, the differential-difference Burgers hierarchy are also investigated in Appendix. A multi-dimensionally consistent 3-point discrete Burgers equation is given.
△ Less
Submitted 19 May, 2021; v1 submitted 17 April, 2019;
originally announced April 2019.
-
Covariant hodograph transformations between nonlocal short pulse models and AKNS$(-1)$ system
Authors:
Kui Chen,
Shimin Liu,
Da-jun Zhang
Abstract:
The paper presents hodograph transformation between nonlocal short pulse models and the first member in the AKNS negative hierarchy (AKNS($-1$)). We consider real and complex multi-component cases. It is shown that the independent variables of the short pulse models and AKNS($-1$) that are connected via hodograph transformation are covariant in nonlocal reductions.
The paper presents hodograph transformation between nonlocal short pulse models and the first member in the AKNS negative hierarchy (AKNS($-1$)). We consider real and complex multi-component cases. It is shown that the independent variables of the short pulse models and AKNS($-1$) that are connected via hodograph transformation are covariant in nonlocal reductions.
△ Less
Submitted 22 December, 2017;
originally announced December 2017.
-
Solutions of local and nonlocal equations reduced from the AKNS hierarchy
Authors:
Kui Chen,
Xiao Deng,
Senyue Lou,
Da-jun Zhang
Abstract:
In the paper possible local and nonlocal reductions of the Ablowitz-Kaup-Newell-Suger (AKNS) hierarchy are collected, including the Korteweg-de Vries (KdV) hierarchy, modified KdV hierarchy and their nonlocal versions, nonlinear Schrödinger hierarchy and their nonlocal versions, sine-Gordon equation in nonpotential form and its nonlocal forms. A reduction technique for solutions is employed, by wh…
▽ More
In the paper possible local and nonlocal reductions of the Ablowitz-Kaup-Newell-Suger (AKNS) hierarchy are collected, including the Korteweg-de Vries (KdV) hierarchy, modified KdV hierarchy and their nonlocal versions, nonlinear Schrödinger hierarchy and their nonlocal versions, sine-Gordon equation in nonpotential form and its nonlocal forms. A reduction technique for solutions is employed, by which exact solutions in double Wronskian form are obtained for these reduced equations from those double Wronskian solutions of the AKNS hierarchy. As examples of dynamics we illustrate new interaction of two-soliton solutions of the reverse-$t$ nonlinear Schrödinger equation. Although as a single soliton it is always stationary, two solitons travel along completely symmetric trajectories in $\{x,t\}$ plane and their amplitudes are affected by phase parameters. Asymptotic analysis is given as demonstration. The approach and relation described in this paper are systematic and general and can be used to other nonlocal equations.
△ Less
Submitted 12 November, 2017; v1 submitted 28 October, 2017;
originally announced October 2017.
-
Solutions of the nonlocal nonlinear Schrödinger hierarchy via reduction
Authors:
Kui Chen,
Da-jun Zhang
Abstract:
In this letter we propose an approach to obtain solutions for the nonlocal nonlinear Schrödinger hierarchy from the known ones of the Ablowitz-Kaup-Newell-Segur hierarchy by reduction. These solutions are presented in terms of double Wronskian and some of them are new.The approach is general and can be used for other systems with double Wronskian solutions which admit local and nonlocal reductions…
▽ More
In this letter we propose an approach to obtain solutions for the nonlocal nonlinear Schrödinger hierarchy from the known ones of the Ablowitz-Kaup-Newell-Segur hierarchy by reduction. These solutions are presented in terms of double Wronskian and some of them are new.The approach is general and can be used for other systems with double Wronskian solutions which admit local and nonlocal reductions.
△ Less
Submitted 25 April, 2017;
originally announced April 2017.
-
On a Second Discretization of the ZS-AKNS Spectral Problem: Revisit
Authors:
Kui Chen,
Xiao Deng,
Da-jun Zhang
Abstract:
In this paper we revisit a discrete spectral problem which was proposed by Ragnisco and Tu in 1989, as a second discretization of the ZS-AKNS spectral problem. We show that the spectral problem corresponds to a bidirectional discretization of the derivative of two wave functions $φ_{1,x}$ and $φ_{2,x}$. As a connection with higher dimensional systems, the spectral problem and a related hierarchy c…
▽ More
In this paper we revisit a discrete spectral problem which was proposed by Ragnisco and Tu in 1989, as a second discretization of the ZS-AKNS spectral problem. We show that the spectral problem corresponds to a bidirectional discretization of the derivative of two wave functions $φ_{1,x}$ and $φ_{2,x}$. As a connection with higher dimensional systems, the spectral problem and a related hierarchy can be derived from Lax triads of the differential-difference KP hierarchy via a symmetry constraint. Isospectral and nonisospectral flows derived from the spectral problem compose a Lie algebra. By considering its infinite dimensional subalgebras and continuum limit of recursion operator, three semi-discrete AKNS hierarchies are constructed.
△ Less
Submitted 16 February, 2017; v1 submitted 15 November, 2016;
originally announced November 2016.
-
Scale Dependent Dimension of Luminous Matter in the Universe
Authors:
Per Bak,
Kan Chen
Abstract:
We present a geometrical model of the distribution of luminous matter in the universe, derived from a very simple reaction-diffusion model of turbulent phenomena. The apparent dimension of luminous matter, $D(l)$, depends linearly on the logarithm of the scale $l$ under which the universe is viewed: $D(l) \sim 3\log(l/l_0)/\log(ξ/l_0)$, where $ξ$ is a correlation length. Comparison with data fro…
▽ More
We present a geometrical model of the distribution of luminous matter in the universe, derived from a very simple reaction-diffusion model of turbulent phenomena. The apparent dimension of luminous matter, $D(l)$, depends linearly on the logarithm of the scale $l$ under which the universe is viewed: $D(l) \sim 3\log(l/l_0)/\log(ξ/l_0)$, where $ξ$ is a correlation length. Comparison with data from the SARS red-shift catalogue, and the LEDA database provides a good fit with a correlation length $ξ\sim 300$ Mpc. The geometrical interpretation is clear: At small distances, the universe is zero-dimensional and point-like. At distances of the order of 1 Mpc the dimension is unity, indicating a filamentary, string-like structure; when viewed at larger scales it gradually becomes 2-dimensional wall-like, and finally, at and beyond the correlation length, it becomes uniform.
△ Less
Submitted 25 January, 2000;
originally announced January 2000.
-
Dynamics of Dry Friction: A Numerical Investigation
Authors:
Y. F. Lim,
Kan Chen
Abstract:
We perform extended numerical simulation of the dynamics of dry friction, based on a model derived from the phenomenological description proposed by T. Baumberger et al.. In the case of small deviation from the steady sliding motion, the model is shown to be equivalent to the state- and rate-dependent friction law which was first introduced by Rice and Ruina on the basis of experiments on rocks.…
▽ More
We perform extended numerical simulation of the dynamics of dry friction, based on a model derived from the phenomenological description proposed by T. Baumberger et al.. In the case of small deviation from the steady sliding motion, the model is shown to be equivalent to the state- and rate-dependent friction law which was first introduced by Rice and Ruina on the basis of experiments on rocks. We obtain the dynamical phase diagram that agrees well with the experimental results on the paper-on-paper systems. In particular, the bifurcation between stick-slip and steady sliding are shown to change from a direct (supercritical) Hopf type to an inverted (subcritical) one as the driving velocity increases, in agreement with the experiments.
△ Less
Submitted 18 March, 1998;
originally announced March 1998.
-
A general learning algorithm for solving optimization problems and its application to the spin glass problem
Authors:
Kan Chen
Abstract:
We propose a general learning algorithm for solving optimization problems, based on a simple strategy of trial and adaptation. The algorithm maintains a probability distribution of possible solutions (configurations), which is updated continuously in the learning process. As the probability distribution evolves, better and better solutions are shown to emerge. The performance of the algorithm is…
▽ More
We propose a general learning algorithm for solving optimization problems, based on a simple strategy of trial and adaptation. The algorithm maintains a probability distribution of possible solutions (configurations), which is updated continuously in the learning process. As the probability distribution evolves, better and better solutions are shown to emerge. The performance of the algorithm is illustrated by the application to the problem of finding the ground state of the Ising spin glass. A simple theoretical understanding of the algorithm is also presented.
△ Less
Submitted 17 April, 1997;
originally announced April 1997.
-
A Simple Learning Algorithm for the Traveling Salesman Problem
Authors:
Kan Chen
Abstract:
We propose a learning algorithm for solving the traveling salesman problem based on a simple strategy of trial and adaptation: i) A tour is selected by choosing cities probabilistically according to the ``synaptic'' strengths between cities. ii) The ``synaptic'' strengths of the links that form the tour are then enhanced (reduced) if the tour length is shorter (longer) than the average result of…
▽ More
We propose a learning algorithm for solving the traveling salesman problem based on a simple strategy of trial and adaptation: i) A tour is selected by choosing cities probabilistically according to the ``synaptic'' strengths between cities. ii) The ``synaptic'' strengths of the links that form the tour are then enhanced (reduced) if the tour length is shorter (longer) than the average result of the previous trials. We perform extensive simulations of the random distance traveling-salesman problem. For sufficiently slow learning rates, near optimal tours can be obtained with the average optimal tour lengths close to the lower bounds for the shortest tour lengths.
△ Less
Submitted 15 August, 1996;
originally announced August 1996.