Skip to main content

Showing 1–47 of 47 results for author: Yun, C

Searching in archive math. Search in all archives.
.
  1. arXiv:2506.04126  [pdf, ps, other

    cs.LG math.OC

    Incremental Gradient Descent with Small Epoch Counts is Surprisingly Slow on Ill-Conditioned Problems

    Authors: Yujun Kim, Jaeyoung Cha, Chulhee Yun

    Abstract: Recent theoretical results demonstrate that the convergence rates of permutation-based SGD (e.g., random reshuffling SGD) are faster than uniform-sampling SGD; however, these studies focus mainly on the large epoch regime, where the number of epochs $K$ exceeds the condition number $κ$. In contrast, little is known when $K$ is smaller than $κ$, and it is still a challenging open question whether p… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to ICML 2025, 56 pages, 6 figures

  2. arXiv:2505.23152  [pdf, ps, other

    math.OC

    Provable Benefit of Random Permutations over Uniform Sampling in Stochastic Coordinate Descent

    Authors: Donghwa Kim, Jaewook Lee, Chulhee Yun

    Abstract: We analyze the convergence rates of two popular variants of coordinate descent (CD): random CD (RCD), in which the coordinates are sampled uniformly at random, and random-permutation CD (RPCD), in which random permutations are used to select the update indices. Despite abundant empirical evidence that RPCD outperforms RCD in various tasks, the theoretical gap between the two algorithms' performanc… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025. 68 pages, 15 figures

  3. arXiv:2504.12712  [pdf, other

    cs.LG math.OC

    Convergence and Implicit Bias of Gradient Descent on Continual Linear Classification

    Authors: Hyunji Jung, Hanseul Cho, Chulhee Yun

    Abstract: We study continual learning on multiple linear classification tasks by sequentially running gradient descent (GD) for a fixed budget of iterations per task. When all tasks are jointly linearly separable and are presented in a cyclic/random order, we show the directional convergence of the trained linear classifier to the joint (offline) max-margin solution. This is surprising because GD training o… ▽ More

    Submitted 26 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: 67 pages, 11 figures, accepted to ICLR 2025, Camera-ready version

  4. arXiv:2501.00511  [pdf, other

    cs.LG math.OC

    Stochastic Extragradient with Flip-Flop Shuffling & Anchoring: Provable Improvements

    Authors: Jiseok Chae, Chulhee Yun, Donghwan Kim

    Abstract: In minimax optimization, the extragradient (EG) method has been extensively studied because it outperforms the gradient descent-ascent method in convex-concave (C-C) problems. Yet, stochastic EG (SEG) has seen limited success in C-C problems, especially for unconstrained cases. Motivated by the recent progress of shuffling-based stochastic methods, we investigate the convergence of shuffling-based… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 73+7 pages, 4 figures. Published in NeurIPS 2024

  5. arXiv:2405.16002  [pdf, other

    cs.LG math.OC stat.ML

    Does SGD really happen in tiny subspaces?

    Authors: Minhak Song, Kwangjun Ahn, Chulhee Yun

    Abstract: Understanding the training dynamics of deep neural networks is challenging due to their high-dimensional nature and intricate loss landscapes. Recent studies have revealed that, along the training trajectory, the gradient approximately aligns with a low-rank top eigenspace of the training loss Hessian, referred to as the dominant subspace. Given this alignment, this paper explores whether neural n… ▽ More

    Submitted 10 March, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Published at ICLR 2025

  6. arXiv:2403.06624  [pdf, ps, other

    math.AG math.AT math.CO

    On the topology of the moduli of tropical unramified p-covers

    Authors: Yassine El Maazouz, Paul Alexander Helminck, Felix Röhrle, Pedro Souza, Claudia He Yun

    Abstract: We study the topology of the moduli space of unramified $\mathbb{Z}/p$-covers of tropical curves of genus $g \geq 2$, where $p$ is a prime number. We use recent techniques by Chan--Galatius--Payne to identify contractible subcomplexes of the moduli space. We then use this contractibility result to show that this moduli space is simply connected. In the case of genus 2, we determine the homotopy ty… ▽ More

    Submitted 3 October, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 39 pages, 11 figures, 5 tables

    MSC Class: 14T20; 05E14; 14H10

  7. arXiv:2402.10475  [pdf, other

    math.OC cs.LG

    Fundamental Benefit of Alternating Updates in Minimax Optimization

    Authors: Jaewook Lee, Hanseul Cho, Chulhee Yun

    Abstract: The Gradient Descent-Ascent (GDA) algorithm, designed to solve minimax optimization problems, takes the descent and ascent steps either simultaneously (Sim-GDA) or alternately (Alt-GDA). While Alt-GDA is commonly observed to converge faster, the performance gap between the two is not yet well understood theoretically, especially in terms of global convergence rates. To address this theory-practice… ▽ More

    Submitted 15 July, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024 (Spotlight). 76 pages, 2 figures. Additional experiments (quadratic game, GAN) and proofs

  8. arXiv:2311.15051  [pdf, other

    cs.LG math.OC stat.ML

    Gradient Descent with Polyak's Momentum Finds Flatter Minima via Large Catapults

    Authors: Prin Phunyaphibarn, Junghyun Lee, Bohan Wang, Huishuai Zhang, Chulhee Yun

    Abstract: Although gradient descent with Polyak's momentum is widely used in modern machine and deep learning, a concrete understanding of its effects on the training trajectory remains elusive. In this work, we empirically show that for linear diagonal networks and nonlinear neural networks, momentum gradient descent with a large learning rate displays large catapults, driving the iterates towards much fla… ▽ More

    Submitted 29 May, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: v3: major updates; 25 pages, 17 figures; the first two authors contributed equally. The preliminary version was accepted to the NeurIPS 2023 M3L Workshop (oral) under the title "Large Catapults in Momentum Gradient Descent with Warmup: An Empirical Study."

  9. arXiv:2310.01082  [pdf, other

    cs.LG cs.AI math.OC

    Linear attention is (maybe) all you need (to understand transformer optimization)

    Authors: Kwangjun Ahn, Xiang Cheng, Minhak Song, Chulhee Yun, Ali Jadbabaie, Suvrit Sra

    Abstract: Transformer training is notoriously difficult, requiring a careful design of optimizers and use of various heuristics. We make progress towards understanding the subtleties of training Transformers by carefully studying a simple yet canonical linearized shallow Transformer model. Specifically, we train linear Transformers to solve regression tasks, inspired by J.~von Oswald et al.~(ICML 2023), and… ▽ More

    Submitted 13 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  10. arXiv:2307.09265  [pdf, ps, other

    math.AG math.RT

    PGL orbits in tree varieties

    Authors: Izzet Coskun, Demir Eken, Chris Yun

    Abstract: In this paper, we introduce tree varieties as a natural generalization of products of partial flag varieties. We study orbits of the PGL action on tree varieties. We characterize tree varieties with finitely many PGL orbits, generalizing a celebrated theorem of Magyar, Weyman and Zelevinsky. We give criteria that guarantee that a tree variety has a dense PGL orbit and provide many examples of tree… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 25 pages

    MSC Class: Primary: 14L30; 14M15; 14M17. Secondary: 14L35; 51N30

  11. arXiv:2307.04204  [pdf, other

    cs.LG math.OC stat.ML

    Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory

    Authors: Minhak Song, Chulhee Yun

    Abstract: Cohen et al. (2021) empirically study the evolution of the largest eigenvalue of the loss Hessian, also known as sharpness, along the gradient descent (GD) trajectory and observe the Edge of Stability (EoS) phenomenon. The sharpness increases at the early phase of training (referred to as progressive sharpening), and eventually saturates close to the threshold of $2 / \text{(step size)}$. In this… ▽ More

    Submitted 26 October, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 camera-ready; 51 pages

  12. arXiv:2307.01960  [pdf, ps, other

    math.AG math.AT math.CO

    A Serre spectral sequence for the moduli space of tropical curves

    Authors: Christin Bibby, Melody Chan, Nir Gadish, Claudia He Yun

    Abstract: We construct, for all $g\geq 2$ and $n\geq 0$, a spectral sequence of rational $S_n$-representations which computes the $S_n$-equivariant reduced rational cohomology of the tropical moduli spaces of curves $Δ_{g,n}$ in terms of compactly supported cohomology groups of configuration spaces of $n$ points on graphs of genus $g$. Using the canonical $S_n$-equivariant isomorphisms… ▽ More

    Submitted 15 April, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 24 pages plus appendix

    MSC Class: 14H10; 14Q05; 14T20; 55N30; 55R80; 55T10

  13. arXiv:2306.13604  [pdf, other

    math.CO hep-th math.AG

    Positive del Pezzo Geometry

    Authors: Nick Early, Alheydis Geiger, Marta Panizzut, Bernd Sturmfels, Claudia He Yun

    Abstract: Real, complex, and tropical algebraic geometry join forces in a new branch of mathematical physics called positive geometry. We develop the positive geometry of del Pezzo surfaces and their moduli spaces, viewed as very affine varieties. Their connected components are derived from polyhedral spaces with Weyl group symmetries. We study their canonical forms and scattering amplitudes, and we solve t… ▽ More

    Submitted 6 January, 2025; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: 37 pages, 4 figures

  14. arXiv:2306.09850  [pdf, other

    cs.LG math.OC stat.ML

    Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima

    Authors: Dongkuk Si, Chulhee Yun

    Abstract: Sharpness-Aware Minimization (SAM) is an optimizer that takes a descent step based on the gradient at a perturbation $y_t = x_t + ρ\frac{\nabla f(x_t)}{\lVert \nabla f(x_t) \rVert}$ of the current point $x_t$. Existing studies prove convergence of SAM for smooth functions, but they do so by assuming decaying perturbation size $ρ$ and/or no gradient normalization in $y_t$, which is detached from pr… ▽ More

    Submitted 27 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 39 pages. v3 NeurIPS 2023 camera ready version

  15. arXiv:2306.00267  [pdf, other

    cs.LG math.OC stat.ML

    Provable Benefit of Mixup for Finding Optimal Decision Boundaries

    Authors: Junsoo Oh, Chulhee Yun

    Abstract: We investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant $κ$, we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For… ▽ More

    Submitted 5 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: ICML 2023 camera-ready version; 48 pages

  16. Some thoughts and experiments on Bergman's compact amalgamation problem

    Authors: Michael Joswig, Mario Kummer, Andreas Thom, Claudia He Yun

    Abstract: We study the question whether copies of $S^1$ in $\mathrm{SU}(3)$ can be amalgamated in a compact group. This is the simplest instance of a fundamental open problem in the theory of compact groups raised by George Bergman in 1987. Considerable computational experiments suggest that the answer is positive in this case. We obtain a positive answer for a relaxed problem using theoretical consideratio… ▽ More

    Submitted 13 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 15 pages, 2 figures, 3 tables; update contains minor changes that address referee comments

    MSC Class: 22C05; 18B99; 90-05; 90C90

  17. arXiv:2303.07160  [pdf, ps, other

    cs.LG math.OC stat.ML

    Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond

    Authors: Jaeyoung Cha, Jaewook Lee, Chulhee Yun

    Abstract: We study convergence lower bounds of without-replacement stochastic gradient descent (SGD) for solving smooth (strongly-)convex finite-sum minimization problems. Unlike most existing results focusing on final iterate lower bounds in terms of the number of components $n$ and the number of epochs $K$, we seek bounds for arbitrary weighted average iterates that are tight in all factors including the… ▽ More

    Submitted 9 June, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 58 pages

  18. arXiv:2302.12444  [pdf, other

    cs.LG math.OC

    On the Training Instability of Shuffling SGD with Batch Normalization

    Authors: David X. Wu, Chulhee Yun, Suvrit Sra

    Abstract: We uncover how SGD interacts with batch normalization and can exhibit undesirable training dynamics such as divergence. More precisely, we study how Single Shuffle (SS) and Random Reshuffle (RR) -- two widely used variants of SGD -- interact surprisingly differently in the presence of batch normalization: RR leads to much more stable evolution of training loss than SS. As a concrete example, for r… ▽ More

    Submitted 14 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: ICML 2023 camera-ready version, added references; 75 pages

  19. arXiv:2210.05995  [pdf, other

    math.OC stat.ML

    SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization

    Authors: Hanseul Cho, Chulhee Yun

    Abstract: Stochastic gradient descent-ascent (SGDA) is one of the main workhorses for solving finite-sum minimax optimization problems. Most practical implementations of SGDA randomly reshuffle components and sequentially use them (i.e., without-replacement sampling); however, there are few theoretical results on this approach for minimax algorithms, especially outside the easier-to-analyze (strongly-)monot… ▽ More

    Submitted 20 February, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 camera-ready version; 46 pages

  20. arXiv:2209.01070  [pdf, ps, other

    math.CO

    Discrete Morse theory for symmetric Delta-complexes

    Authors: Claudia He Yun

    Abstract: We generalize Forman's discrete Morse theory to the context of symmetric $Δ$-complexes. As an application, we prove that the coloop subcomplex of the link of the origin $LA^{\mathrm{trop},\mathrm{P}}_g$ in the moduli space of principally polarized tropical abelian varieties of dimension $g$ with respect to the perfect cone decomposition is contractible.

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: 16 pages, 5 figures

    MSC Class: 57Q70; 14T15

  21. arXiv:2207.02800  [pdf, ps, other

    math.AG

    Equivariant Hodge polynomials of heavy/light moduli spaces

    Authors: Siddarth Kannan, Stefano Serpente, Claudia He Yun

    Abstract: Let $\bar{\mathcal{M}}_{g, m|n}$ denote Hassett's moduli space of weighted pointed stable curves of genus $g$ for the heavy/light weight data $\left(1^{(m)}, 1/n^{(n)}\right)$, and let $\mathcal{M}_{g, m|n} \subset \bar{\mathcal{M}}_{g, m|n}$ be the locus parameterizing smooth, not necessarily distinctly marked curves. We give a change-of-variables formula which computes the generating function fo… ▽ More

    Submitted 22 April, 2024; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: 21 pages, 3 tables. Edits based on referee suggestions

    MSC Class: 14H10

  22. arXiv:2110.10342  [pdf, other

    cs.LG math.OC stat.ML

    Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond

    Authors: Chulhee Yun, Shashank Rajput, Suvrit Sra

    Abstract: In distributed learning, local SGD (also known as federated averaging) and its simple baseline minibatch SGD are widely studied optimization methods. Most existing analyses of these methods assume independent and unbiased gradient estimates obtained via with-replacement sampling. In contrast, we study shuffling-based variants: minibatch and local Random Reshuffling, which draw stochastic gradients… ▽ More

    Submitted 23 March, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 camera-ready (selected for an oral presentation); 76 pages, 3 figures

  23. The role of viral infectivity in oncolytic virotherapy outcomes: A mathematical study

    Authors: Pantea Pooladvand, Chae-Ok Yun, A-Rum Yoon, Peter S. Kim, Federico Frascoli

    Abstract: A model capturing the dynamics between virus and tumour cells in the context of oncolytic virotherapy is presented and analysed. The ability of the virus to be internalised by uninfected cells is described by an infectivity parameter, which is inferred from available experimental data. The parameter is also able to describe the effects of changes in the tumour environment that affect viral uptake… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 29 pages, 13 figures, 1 table

    MSC Class: 92-10

    Journal ref: Mathematical Biosciences, 334: 108520 (2021)

  24. arXiv:2109.03302  [pdf, ps, other

    math.CO math.AG

    Homology representations of compactified configurations on graphs applied to $\mathcal{M}_{2,n}$

    Authors: Christin Bibby, Melody Chan, Nir Gadish, Claudia He Yun

    Abstract: We obtain new calculations of the top weight rational cohomology of the moduli spaces $\mathcal{M}_{2,n}$, equivalently the rational homology of the tropical moduli spaces $Δ_{2,n}$, as a representation of $S_n$. These calculations are achieved fully for all $n\leq 10$, and partially -- for specific irreducible representations of $S_n$ -- for $n\le 22$. We also present conjectures, verified up to… ▽ More

    Submitted 25 April, 2023; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: 18 pages, minor edits

    MSC Class: 05C10 (primary); 14H10; 14Q05; 14T20; 55R80; 55P65

  25. arXiv:2103.07079  [pdf, other

    cs.LG math.OC

    Can Single-Shuffle SGD be Better than Reshuffling SGD and GD?

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: We propose matrix norm inequalities that extend the Recht-Ré (2012) conjecture on a noncommutative AM-GM inequality by supplementing it with another inequality that accounts for single-shuffle, which is a widely used without-replacement sampling scheme that shuffles only once in the beginning and is overlooked in the Recht-Ré conjecture. Instead of general positive semidefinite matrices, we restri… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: 26 pages, 2 figures

  26. arXiv:2010.11767  [pdf, other

    math.CO math.AG

    Topology of tropical moduli spaces of weighted stable curves in higher genus

    Authors: Siddarth Kannan, Shiyue Li, Stefano Serpente, Claudia He Yun

    Abstract: Given integers $g \geq 0$, $n \geq 1$, and a vector $w \in (\mathbb{Q} \cap (0, 1])^n$ such that ${2g - 2 + \sum w_i > 0}$, we study the topology of the moduli space $Δ_{g, w}$ of $w$-stable tropical curves of genus $g$ with volume 1. The space $Δ_{g, w}$ is the dual complex of the divisor of singular curves in Hassett's moduli space of $w$-stable genus $g$ curves $\overline{\mathcal{M}}_{g, w}$.… ▽ More

    Submitted 15 March, 2022; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 14 pages; 1 figure; final version accepted at Advances in Geometry

    MSC Class: 14T05

  27. DML-GANR: Deep Metric Learning With Generative Adversarial Network Regularization for High Spatial Resolution Remote Sensing Image Retrieval

    Authors: Yun Cao, Yuebin Wang, Junhuan Peng, Liqiang Zhang, Linlin Xu, Kai Yan, Lihua Li

    Abstract: With a small number of labeled samples for training, it can save considerable manpower and material resources, especially when the amount of high spatial resolution remote sensing images (HSR-RSIs) increases considerably. However, many deep models face the problem of overfitting when using a small number of labeled samples. This might degrade HSRRSI retrieval accuracy. Aiming at obtaining more acc… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 17 pages

  28. SLCRF: Subspace Learning with Conditional Random Field for Hyperspectral Image Classification

    Authors: Yun Cao, Jie Mei, Yuebin Wang, Liqiang Zhang, Junhuan Peng, Bing Zhang, Lihua Li, Yibo Zheng

    Abstract: Subspace learning (SL) plays an important role in hyperspectral image (HSI) classification, since it can provide an effective solution to reduce the redundant information in the image pixels of HSIs. Previous works about SL aim to improve the accuracy of HSI recognition. Using a large number of labeled samples, related methods can train the parameters of the proposed solutions to obtain better rep… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 13 pages, 6 figures

  29. arXiv:2010.02501  [pdf, other

    cs.LG math.OC stat.ML

    A Unifying View on Implicit Bias in Training Linear Neural Networks

    Authors: Chulhee Yun, Shankar Krishnan, Hossein Mobahi

    Abstract: We study the implicit bias of gradient flow (i.e., gradient descent with infinitesimal step size) on linear neural network training. We propose a tensor formulation of neural networks that includes fully-connected, diagonal, and convolutional networks as special cases, and investigate the linear version of the formulation called linear tensor networks. With this formulation, we can characterize th… ▽ More

    Submitted 10 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 38 pages, 7 figures. Revision after ICLR 2021 camera-ready version. Figure 2 newly added, theorem statements revised, including correction of Theorem 2

  30. arXiv:2008.04426  [pdf, ps, other

    math.AG math.CO

    The $S_n$-equivariant rational homology of the tropical moduli spaces $Δ_{2,n}$

    Authors: Claudia He Yun

    Abstract: We compute the $S_n$-equivariant rational homology of the tropical moduli spaces $Δ_{2,n}$ for $n\leq 8$ using a cellular chain complex for symmetric $Δ$-complexes in Sage.

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: 17 pages, 2 figures, 6 tables

    MSC Class: 14T10 (Primary); 14Q05 (Secondary)

  31. arXiv:2006.14759  [pdf

    math.FA math.NA

    Existence and convergence theorems for monotone generalized alpa-nonexpansive mappings in uniformly convex partially ordered hyperbolic metric spaces and its application

    Authors: Chang Il Rim, Jong Gyong Kim, Chol-Hui Yun

    Abstract: In this paper, we generalize the existence result in [14] and prove convergence theorems of the iterative scheme in [12, 16] for monotone generalized alpa-nonexpansive mappings in uniformly convex partially ordered hyperbolic metric spaces. And we also give a numerical example to show that this scheme converges faster than the scheme in [14] and apply the result to the integral equation.

    Submitted 25 June, 2020; originally announced June 2020.

  32. arXiv:2006.06946  [pdf, other

    math.OC stat.ML

    SGD with shuffling: optimal rates without component convexity and large epoch requirements

    Authors: Kwangjun Ahn, Chulhee Yun, Suvrit Sra

    Abstract: We study without-replacement SGD for solving finite-sum optimization problems. Specifically, depending on how the indices of the finite-sum are shuffled, we consider the RandomShuffle (shuffle at the beginning of each epoch) and SingleShuffle (shuffle only once) algorithms. First, we establish minimax optimal convergence rates of these algorithms up to poly-log factors. Notably, our analysis is ge… ▽ More

    Submitted 21 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 53 pages; supersedes the preprint arXiv:2004.08657; v2 corrects an erroneous claim about SingleShuffle and newly adds Theorem 24 and Appendix F for SingleShuffle

  33. arXiv:1911.12876  [pdf, ps, other

    q-bio.CB math.DS q-bio.TO

    Mathematical modelling of the interaction between cancer cells and an oncolytic virus: insights into the effects of treatment protocols

    Authors: Adrianne L. Jenner, Chae-Ok Yun, Peter S. Kim, Adelle C. F. Coster

    Abstract: Oncolytic virotherapy is an experimental cancer treatment that uses genetically engineered viruses to target and kill cancer cells. One major limitation of this treatment is that virus particles are rapidly cleared by the immune system, preventing them from arriving at the tumour site. To improve virus survival and infectivity modified virus particles with the polymer polyethylene glycol (PEG) and… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: 15 pages, 6 figures

    Journal ref: Bulletin of Mathematical Biology 80: 1615-1629 (2018)

  34. arXiv:1907.03922  [pdf, ps, other

    cs.LG math.OC stat.ML

    Are deep ResNets provably better than linear predictors?

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: Recent results in the literature indicate that a residual network (ResNet) composed of a single residual block outperforms linear predictors, in the sense that all local minima in its optimization landscape are at least as good as the best linear predictor. However, these results are limited to a single residual block (i.e., shallow ResNets), instead of the deep ResNets composed of multiple residu… ▽ More

    Submitted 29 October, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 15 pages. NeurIPS 2019 Camera-ready version

  35. arXiv:1906.01355  [pdf

    math.DS

    Estimation of errors on perturbation of function contractivity factors and box-counting dimension of hidden variable recurrent fractal interpolation function

    Authors: Mi-Kyong Ri, Chol-Hui Yun

    Abstract: In this paper, we study errors on perturbation of function contractivity factors and box-counting dimension of hidden variable recurrent fractal interpolation function (HVRFIF). The HVRFIF is a hidden variable fractal interpolation function (HVFIF) constructed by recurrent iterated function system (RIFS) with function contractivity factors. The contractivity factors of RIFS determine fractal chara… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: text overlap with arXiv:1904.11884

  36. arXiv:1904.11884  [pdf

    math.DS

    Analytic properties of hidden variable recurrent fractal interpolation function with function contractivity factors

    Authors: Mi-Kyong Ri, Chol-Hui Yun

    Abstract: In this paper, we analyze the smoothness and stability of hidden variable recurrent fractal interpolation functions (HVRFIF) with function contractivity factors introduced in Ref. 1. The HVRFIF is a hidden variable fractal interpolation function (HVFIF) constructed by recurrent iterated function system (RIFS) with function contractivity factors. An attractor of RIFS has a local self-similar or sel… ▽ More

    Submitted 23 April, 2019; originally announced April 2019.

  37. Box-counting dimension and analytic properties of hidden variable fractal interpolation functions with function contractivity factors

    Authors: Chol-Hui Yun, Mi-Kyong Ri

    Abstract: We estimate the bounds of box-counting dimension of hidden variable fractal interpolation functions (HVFIFs) and hidden variable bivariate fractal interpolation functions (HVBFIFs) with four function contractivity factors and present analytic properties of HVFIFs which are constructed to ensure more flexibility and diversity in modeling natural phenomena. Firstly, we construct the HVFIFs and analy… ▽ More

    Submitted 23 April, 2019; originally announced April 2019.

  38. Hidden variable recurrent fractal interpolation function with four function contractivity factors

    Authors: Chol-Hui Yun

    Abstract: In this paper, we introduce a construction of hidden variable recurrent fractal interpolation functions (HVRFIF) with four function contractivity factors. In the fractal interpolation theory, it is very important to ensure flexibility and diversity of the construction of interpolation function. Recurrent iterated function system (RIFS) produce fractal sets with local self-similarity structure. The… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

  39. arXiv:1809.10858  [pdf, ps, other

    math.OC cs.LG stat.ML

    Efficiently testing local optimality and escaping saddles for ReLU networks

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: We provide a theoretical algorithm for checking local optimality and escaping saddles at nondifferentiable points of empirical risks of two-layer ReLU networks. Our algorithm receives any parameter value and returns: local minimum, second-order stationary point, or a strict descent direction. The presence of $M$ data points on the nondifferentiability of the ReLU divides the parameter space into a… ▽ More

    Submitted 28 May, 2019; v1 submitted 28 September, 2018; originally announced September 2018.

    Comments: 23 pages, appeared at ICLR 2019

  40. arXiv:1802.03487  [pdf, ps, other

    cs.LG math.OC stat.ML

    Small nonlinearities in activation functions create bad local minima in neural networks

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: We investigate the loss surface of neural networks. We prove that even for one-hidden-layer networks with "slightest" nonlinearity, the empirical risks have spurious local minima in most cases. Our results thus indicate that in general "no spurious local minima" is a property limited to deep linear networks, and insights obtained from linear networks may not be robust. Specifically, for ReLU(-like… ▽ More

    Submitted 28 May, 2019; v1 submitted 9 February, 2018; originally announced February 2018.

    Comments: 33 pages, appeared at ICLR 2019

  41. arXiv:1707.02444  [pdf, ps, other

    cs.LG math.OC stat.ML

    Global optimality conditions for deep neural networks

    Authors: Chulhee Yun, Suvrit Sra, Ali Jadbabaie

    Abstract: We study the error landscape of deep linear and nonlinear neural networks with the squared error loss. Minimizing the loss of a deep linear neural network is a nonconvex problem, and despite recent progress, our understanding of this loss surface is still incomplete. For deep linear networks, we present necessary and sufficient conditions for a critical point of the risk function to be a global mi… ▽ More

    Submitted 24 March, 2018; v1 submitted 8 July, 2017; originally announced July 2017.

    Comments: 14 pages. A camera-ready version that will appear at ICLR 2018

  42. arXiv:1404.1300  [pdf, other

    math.DS

    A construction of fractal surfaces with function scaling factors on a rectangular grid

    Authors: Chol-Hui Yun, Hui-Chol Choi, Hyong-Chol O

    Abstract: A fractal surface is a set which is a graph of a bivariate continuous function. In the construction of fractal surfaces using IFS, vertical scaling factors in IFS are important one which characterizes a fractal feature of surfaces constructed. We construct IFS with function vertical scaling factors which are 0 on the boundaries of a rectangular grid using arbitrary data set on a rectangular grid a… ▽ More

    Submitted 3 April, 2014; originally announced April 2014.

    Comments: 9 pages, 2 figures

    Report number: KISU-MATH-2014-E-R-008 MSC Class: 37C45; 28A80; 41A05

  43. arXiv:1307.3229  [pdf, other

    math.DS nlin.CD

    Construction of Recurrent Fractal Interpolation Surfaces with Function Scaling Factors and Estimation of Box-counting Dimension on Rectangular Grids

    Authors: Chol-Hui Yun, Hui-Chol Choi, Hyong-Chol O

    Abstract: We consider a construction of recurrent fractal interpolation surfaces with function vertical scaling factors and estimation of their box-counting dimension. A recurrent fractal interpolation surface (RFIS) is an attractor of a recurrent iterated function system (RIFS) which is a graph of bivariate interpolation function. For any given data set on rectangular grids, we construct general recurrent… ▽ More

    Submitted 9 July, 2013; originally announced July 2013.

    Comments: 12 pages, 3 figures

    Report number: KISU-MATH-2013-E-R-004 MSC Class: 37C45; 28A80; 41A05

  44. arXiv:1305.3365  [pdf, other

    math.DS math.NA

    A Construction of the Best Fractal Approximation

    Authors: Yong-Suk Kang, Chol-Hui Yun, Dong-Hyok Kim

    Abstract: In this paper we present a method for constructing the continuous best fractal approximation in the space of bounded functions. We construct the finite-dimensional subspace of the space of bounded functions whose base consists of the continuous fractal functions, and propose how to find the best approximation of given continuous function by element of the constructed space.

    Submitted 28 March, 2014; v1 submitted 15 May, 2013; originally announced May 2013.

    Comments: 9 pages

    Report number: KISU-MATH-2013-E-R-007 MSC Class: Primary 37C45; 28A80; Secondary 41A05

    Journal ref: Electronic Journal of Mathematical Analysis and Applications, Vol.2(2) July 2014, pp.144-151

  45. arXiv:1304.2014  [pdf

    math.DS cs.CV math.GT

    Image Compression predicated on Recurrent Iterated Function Systems

    Authors: Chol-Hui Yun, W. Metzler, M. Barski

    Abstract: Recurrent iterated function systems (RIFSs) are improvements of iterated function systems (IFSs) using elements of the theory of Marcovian stochastic processes which can produce more natural looking images. We construct new RIFSs consisting substantially of a vertical contraction factor function and nonlinear transformations. These RIFSs are applied to image compression.

    Submitted 7 April, 2013; originally announced April 2013.

    Comments: 11 pages, presented at 2nd International Conference on Mathematics & Statistics, 16-19 June, 2008, Athens, Greece

    Report number: KISU-MATH-2008-E-C-001

  46. arXiv:1303.0615  [pdf, other

    math.DS math-ph math.GT

    Construction of Fractal Surfaces by Recurrent Fractal Interpolation Curves

    Authors: Chol-hui Yun, Hyong-chol O., Hui-chol Choi

    Abstract: A method to construct fractal surfaces by recurrent fractal curves is provided. First we construct fractal interpolation curves using a recurrent iterated functions system(RIFS) with function scaling factors and estimate their box-counting dimension. Then we present a method of construction of wider class of fractal surfaces by fractal curves and Lipschitz functions and calculate the box-counting… ▽ More

    Submitted 11 August, 2014; v1 submitted 4 March, 2013; originally announced March 2013.

    Comments: 14 pages, 2 figures

    Report number: KISU-MATH-2013-E-R-003 MSC Class: Primary 37C45; 28A80; Secondary 41A05

    Journal ref: Chaos, Solitons & Fractals, 66(2014), 136-143

  47. arXiv:1208.2081  [pdf

    math.DS

    Box-counting dimension of a kind of fractal interpolation surface on rectangular grids

    Authors: CholHui Yun, MunChol Kim

    Abstract: We estimate a Box-counting dimension of fractal surfaces which are generated by iterated function systems with a vertical contraction factor function on an arbitrary data set over rectangular grids and can express well a lot of natural surfaces with very complicated structures.

    Submitted 9 August, 2012; originally announced August 2012.

    Report number: KISU-MATH-2012-E-R-011

    Journal ref: Romanian Journal of Mathematics and Computer Science, Vol. 2, No. 2, 2012, 61-69