-
Convergence of Markov Chains for Constant Step-size Stochastic Gradient Descent with Separable Functions
Authors:
David Shirokoff,
Philip Zaleski
Abstract:
Stochastic gradient descent (SGD) is a popular algorithm for minimizing objective functions that arise in machine learning. For constant step-size SGD, the iterates form a Markov chain on a general state space. Focusing on a class of separable (non-convex) objective functions, we establish a "Doeblin-type decomposition," in that the state space decomposes into a uniformly transient set and a disjoint union of absorbing sets. Each of the absorbing sets supports a unique invariant measure, with the set of all invariant measures being their convex hull. Moreover, the set of invariant measures is shown to be a global attractor for the Markov chain with a geometric convergence rate. The theory is highlighted with examples that show: (1) the failure of the diffusion approximation to characterize the long-time dynamics of SGD; (2) the global minimum of an objective function may lie outside the support of the invariant measures (i.e., even if initialized at the global minimum, SGD iterates will leave); and (3) bifurcations may enable the SGD iterates to transition between two local minima. Key ingredients in the theory involve viewing the SGD dynamics as a monotone iterated function system and establishing a "splitting condition" of Dubins and Freedman 1966 and Bhattacharya and Lee 1988.
Submitted 24 March, 2025; v1 submitted 18 September, 2024;
originally announced September 2024.
-
A variational model of charged drops in dielectrically matched binary fluids: the effect of charge discreteness
Authors:
Cyrill B. Muratov,
Matteo Novaga,
Philip Zaleski
Abstract:
This paper addresses the ill-posedness of the classical Rayleigh variational model of conducting charged liquid drops by incorporating the discreteness of the elementary charges. Introducing a model that describes two immiscible fluids with the same dielectric constant, with a drop of one fluid containing a fixed number of elementary charges together with their solvation spheres, we interpret the equilibrium shape of the drop as a global minimizer of the sum of its surface energy and the electrostatic repulsive energy between the charges under fixed drop volume. For all model parameters, we establish existence of generalized minimizers that consist of at most a finite number of components "at infinity". We also give several existence and non-existence results for classical minimizers consisting of only a single component. In particular, we identify an asymptotically sharp threshold for the number of charges to yield existence of minimizers in a regime corresponding to macroscopically large drops containing a large number of charges. The obtained non-trivial threshold is significantly below the corresponding threshold for the Rayleigh model, consistent with the ill-posedness of the latter and demonstrating a particular regularizing effect of the charge discreteness. However, when a minimizer does exist in this regime, it approaches a ball with the charge uniformly distributed on the surface as the number of charges goes to infinity, just as in the Rayleigh model. Finally, we provide an explicit solution for the problem with two charges and a macroscopically large drop.
Submitted 17 June, 2024; v1 submitted 9 March, 2023;
originally announced March 2023.
-
Numerical Simulation of Superparamagnetic Nanoparticle Motion in Blood Vessels for Magnetic Drug Delivery
Authors:
M. Lee,
A. Shelke,
S. Singh,
J. Fan,
P. Zaleski,
S. Afkhami
Abstract:
A numerical model is developed for the motion of superparamagnetic nanoparticles in a non-Newtonian blood flow under the influence of a magnetic field. The rheological properties of blood are modeled by the Carreau flow and viscosity, and the stochastic effects of Brownian motion and red blood cell collisions are considered. The model is validated with existing data and good agreement with experimental results is shown. The effectiveness of magnetic drug delivery in various blood vessels is assessed and found to be most successful in arterioles and capillaries. A range of magnetic field strengths is modeled using equations for both a bar magnet and a point dipole: it is shown that the bar magnet is effective at capturing nanoparticles in limited cases, while the point dipole is highly effective across a range of conditions. A parameter study is conducted to show the effects of changing the dipole moment, the distance from the magnet to the blood vessel, and the initial release point of the nanoparticles. The distance from the magnet to the blood vessel is shown to play a significant role in determining the nanoparticle capture rate. The optimal initial release position is found to be located within the tumor radius in capillaries and arterioles to prevent rapid diffusion to the edges of the blood vessel prior to arriving at the tumor, and near the edge of the magnet when a bar magnet is used.
Submitted 7 October, 2021; v1 submitted 30 September, 2021;
originally announced October 2021.
-
Determination of the sheet resistance of an infinite thin plate with five point contacts located at arbitrary positions
Authors:
Krzysztof R. Szymański,
Piotr A. Zaleski
Abstract:
In this paper, a five-probe method of sheet resistance measurement that is independent of probe positions is reported. The method is strict for an infinite homogeneous plane. It has potential applications as a sheet resistance standard based on planar molecular layers. The method can be used to measure the sheet resistance of layers covering objects with a spherical topology, particularly on micro- and nanometric scales, where it is difficult to control probe positioning.
Submitted 24 August, 2020;
originally announced August 2020.
-
Excitation of interfacial waves via near-resonant surface-interfacial wave interactions
Authors:
Joseph Zaleski,
Philip Zaleski,
Yuri V Lvov
Abstract:
We consider interactions between surface and interfacial waves in a two-layer system. Our approach is based on the Hamiltonian structure of the equations of motion, and includes the general procedure for diagonalization of the quadratic part of the Hamiltonian. Such diagonalization allows us to derive the interaction cross section between surface and interfacial waves and to derive the coupled kinetic equations describing spectral energy transfers in this system. Our kinetic equation allows resonant and near-resonant interactions. We find that the energy transfers are dominated by the class III resonances of \cite{Alam}. We apply our formalism to calculate the rate of growth for interfacial waves for different values of the wind velocity. Using our kinetic equation, we also consider the energy transfer from the wind-generated surface waves to interfacial waves for the case when the spectrum of the surface waves is given by the JONSWAP spectrum and interfacial waves are initially absent. We find that such energy transfer can occur on a timescale of hours; there is a range of wind speeds for the most effective energy transfer at approximately the wind speed corresponding to white capping of the sea. Furthermore, interfacial waves oblique to the direction of the wind are also generated.
Submitted 9 December, 2019; v1 submitted 17 April, 2019;
originally announced April 2019.
-
Molecular Polymorphism: Microwave Spectra, Equilibrium Structures, and an Astronomical Investigation of the HNCS Isomeric Family
Authors:
Brett A. McGuire,
Marie-Aline Martin-Drumel,
Sven Thorwirth,
Sandra Brünken,
Valerio Lattanzi,
Justin L. Neill,
Silvia Spezzano,
Zhenhong Yu,
Daniel P. Zaleski,
Anthony J. Remijan,
Brooks H. Pate,
Michael C. McCarthy
Abstract:
The rotational spectra of thioisocyanic acid (HNCS) and its three energetic isomers (HSCN, HCNS, and HSNC) have been observed at high spectral resolution by a combination of chirped-pulse and Fabry-Pérot Fourier-transform microwave spectroscopy between 6 and 40 GHz in a pulsed-jet discharge expansion. Two isomers, thiofulminic acid (HCNS) and isothiofulminic acid (HSNC), calculated here to be 35-37 kcal/mol less stable than the ground state isomer HNCS, have been detected for the first time. Precise rotational, centrifugal distortion, and nitrogen hyperfine coupling constants have been determined for the normal and rare isotopic species of both molecules; all are in good agreement with theoretical predictions obtained at the coupled cluster level of theory. On the basis of isotopic spectroscopy, precise molecular structures have been derived for all four isomers by correcting experimental rotational constants for the effects of rotation-vibration calculated theoretically. Formation and isomerization pathways have also been investigated; the high abundance of HSCN relative to ground state HNCS, and the detection of strong lines of SH using CH$_3$CN and H$_2$S, suggest that HSCN is preferentially produced by the radical-radical reaction HS + CN. A radio astronomical search for HSCN and its isomers has been undertaken toward the high-mass star-forming region Sgr B2(N) in the Galactic Center with the 100 m Green Bank Telescope. While we find clear evidence for HSCN, only a tentative detection of HNCS is proposed, and there is no indication of HCNS or HSNC at the same rms noise level. HSCN, and tentatively HNCS, display clear deviations from a single-excitation temperature model, suggesting weak masing may be occurring in some transitions in this source.
Submitted 13 July, 2016;
originally announced July 2016.
-
Determination of the Riemann modulus and sheet resistivity by a six-point generalization of the van der Pauw method
Authors:
Krzysztof Szymański,
Kamil Łapiński,
Jan L. Cieśliński,
Artur Kobus,
Piotr Zaleski,
Maria Biernacka,
Krystyna Perzyńska
Abstract:
A six-point generalization of the van der Pauw method is presented. The method is applicable to two-dimensional homogeneous systems with an isolated hole. A single measurement performed on contacts located arbitrarily on the sample edge allows determination of the specific resistivity and a dimensionless parameter related to the hole, known as the Riemann modulus. The parameter is invariant under conformal mappings of the sample shape. The hole can be regarded as a high-resistivity defect. Therefore, the method can be applied to the experimental determination of sample inhomogeneity.
Submitted 22 April, 2015;
originally announced April 2015.
-
The Detection of Interstellar Ethanimine (CH3CHNH) from Observations taken during the GBT PRIMOS Survey
Authors:
Ryan A. Loomis,
Daniel P. Zaleski,
Amanda L. Steber,
Justin L. Neill,
Matthew T. Muckle,
Brent J. Harris,
Jan M. Hollis,
Philip R. Jewell,
Valerio Lattanzi,
Frank J. Lovas,
Oscar Martinez, Jr.,
Michael C. McCarthy,
Anthony J. Remijan,
Brooks H. Pate
Abstract:
We have performed reaction product screening measurements using broadband rotational spectroscopy to identify rotational transition matches between laboratory spectra and the Green Bank Telescope PRIMOS radio astronomy survey spectra in Sagittarius B2 North (Sgr B2(N)). The broadband rotational spectrum of molecules created in an electrical discharge of CH3CN and H2S contained several frequency matches to unidentified features in the PRIMOS survey that did not have molecular assignments based on standard radio astronomy spectral catalogs. Several of these transitions are assigned to the E- and Z-isomers of ethanimine. Global fits of the rotational spectra of these isomers in the range of 8 to 130 GHz have been performed for both isomers using previously published mm-wave spectroscopy measurements and the microwave measurements of the current study. Possible interstellar chemistry formation routes for E-ethanimine and Z-ethanimine are discussed. The detection of ethanimine is significant because of its possible role in the formation of alanine - one of the twenty amino acids in the genetic code.
Submitted 5 February, 2013;
originally announced February 2013.
-
Detection of E-cyanomethanimine towards Sagittarius B2(N) in the Green Bank Telescope PRIMOS Survey
Authors:
Daniel P. Zaleski,
Nathan A. Seifert,
Amanda L. Steber,
Matt T. Muckle,
Ryan A. Loomis,
Joanna F. Corby,
Oscar Martinez, Jr.,
Kyle N. Crabtree,
Philip R. Jewell,
Jan M. Hollis,
Frank J. Lovas,
David Vasquez,
Jolie Nyiramahirwe,
Nicole Sciortino,
Kennedy Johnson,
Michael C. McCarthy,
Anthony J. Remijan,
Brooks H. Pate
Abstract:
The detection of E-cyanomethanimine (E-HNCHCN) towards Sagittarius B2(N) is made by comparing the publicly available Green Bank Telescope (GBT) PRIMOS survey spectra (Hollis et al.) to laboratory rotational spectra from a reaction product screening experiment. The experiment uses broadband molecular rotational spectroscopy to monitor the reaction products produced in an electric discharge source using a gas mixture of NH3 and CH3CN. Several transition frequency coincidences between the reaction product screening spectra and previously unassigned interstellar rotational transitions in the PRIMOS survey have been assigned to E-cyanomethanimine. A total of 8 molecular rotational transitions of this molecule between 9 and 50 GHz are observed with the GBT. E-cyanomethanimine, often called the HCN dimer, is an important molecule in prebiotic chemistry because it is a chemical intermediate in proposed synthetic routes of adenine, one of the two purine nucleobases found in DNA and RNA. New analyses of the rotational spectra of both E-cyanomethanimine and Z-cyanomethanimine that incorporate previous mm-wave measurements are also reported.
Submitted 4 February, 2013;
originally announced February 2013.
-
Laboratory and tentative interstellar detection of trans-methyl formate using the publicly available Green Bank Telescope PRIMOS survey
Authors:
Justin L. Neill,
Matt T. Muckle,
Daniel P. Zaleski,
Amanda L. Steber,
Brooks H. Pate,
Valerio Lattanzi,
Silvia Spezzano,
Michael C. McCarthy,
Anthony J. Remijan
Abstract:
The rotational spectrum of the higher-energy trans conformational isomer of methyl formate has been assigned for the first time using several pulsed-jet Fourier transform microwave spectrometers in the 6-60 GHz frequency range. This species has also been sought toward the Sagittarius B2(N) molecular cloud using the publicly available PRIMOS survey from the Green Bank Telescope. We detect seven absorption features in the survey that coincide with laboratory transitions of trans-methyl formate, from which we derive a column density of $3.1^{+2.6}_{-1.2} \times 10^{13}$ cm$^{-2}$ and a rotational temperature of $7.6 \pm 1.5$ K. This excitation temperature is significantly lower than that of the more stable cis conformer in the same source but is consistent with that of other complex molecular species recently detected in Sgr B2(N). The difference in the rotational temperatures of the two conformers suggests that they have different spatial distributions in this source. As the abundance of trans-methyl formate is far higher than would be expected if the cis and trans conformers are in thermodynamic equilibrium, processes that could preferentially form trans-methyl formate in this region are discussed. We also discuss measurements that could be performed to make this detection more certain. This manuscript demonstrates how publicly available broadband radio astronomical surveys of chemically rich molecular clouds can be used in conjunction with laboratory rotational spectroscopy to search for new molecules in the interstellar medium.
Submitted 26 June, 2012;
originally announced June 2012.