Search | arXiv e-print repository

arXiv:2503.22072 [pdf, other]

CIMR-V: An End-to-End SRAM-based CIM Accelerator with RISC-V for AI Edge Device

Authors: Yan-Cheng Guo and, Tian-Sheuan Chang, Chih-Sheng Lin, Bo-Cheng Chiou, Chih-Ming Lai, Shyh-Shyuan Sheu, Wei-Chung Lo, Shih-Chieh Chang

Abstract: Computing-in-memory (CIM) is renowned in deep learning due to its high energy efficiency resulting from highly parallel computing with minimal data movement. However, current SRAM-based CIM designs suffer from long latency for loading weight or feature maps from DRAM for large AI models. Moreover, previous SRAM-based CIM architectures lack end-to-end model inference. To address these issues, this… ▽ More Computing-in-memory (CIM) is renowned in deep learning due to its high energy efficiency resulting from highly parallel computing with minimal data movement. However, current SRAM-based CIM designs suffer from long latency for loading weight or feature maps from DRAM for large AI models. Moreover, previous SRAM-based CIM architectures lack end-to-end model inference. To address these issues, this paper proposes CIMR-V, an end-to-end CIM accelerator with RISC-V that incorporates CIM layer fusion, convolution/max pooling pipeline, and weight fusion, resulting in an 85.14\% reduction in latency for the keyword spotting model. Furthermore, the proposed CIM-type instructions facilitate end-to-end AI model inference and full stack flow, effectively synergizing the high energy efficiency of CIM and the high programmability of RISC-V. Implemented using TSMC 28nm technology, the proposed design achieves an energy efficiency of 3707.84 TOPS/W and 26.21 TOPS at 50 MHz. △ Less

Submitted 27 March, 2025; originally announced March 2025.

Comments: published in IEEE International Symposium on Circuits and Systems (IEEE ISCAS 2024)

arXiv:2503.11010 [pdf, other]

Scaffold-Assisted Window Junctions for Superconducting Qubit Fabrication

Authors: Chung-Ting Ke, Jun-Yi Tsai, Yen-Chun Chen, Zhen-Wei Xu, Elam Blackwell, Matthew A. Snyder, Spencer Weeden, Peng-Sheng Chen, Chih-Ming Lai, Shyh-Shyuan Sheu, Zihao Yang, Cen-Shawn Wu, Alan Ho, R. McDermott, John Martinis, Chii-Dong Chen

Abstract: The superconducting qubit is one of the promising directions in realizing fault-tolerant quantum computing (FTQC), which requires many high-quality qubits. To achieve this, it is desirable to leverage modern semiconductor industry technology to ensure quality, uniformity, and reproducibility. However, conventional Josephson junction fabrication relies mainly on resist-assistant double-angle evapor… ▽ More The superconducting qubit is one of the promising directions in realizing fault-tolerant quantum computing (FTQC), which requires many high-quality qubits. To achieve this, it is desirable to leverage modern semiconductor industry technology to ensure quality, uniformity, and reproducibility. However, conventional Josephson junction fabrication relies mainly on resist-assistant double-angle evaporation, posing integration challenges. Here, we demonstrate a lift-off-free qubit fabrication that integrates seamlessly with existing industrial technologies. This method employs a silicon oxide (SiO$_2$) scaffold to define an etched window with a well-controlled size to form a Josephson junction. The SiO$_2$, which has a large dielectric loss, is etched away in the final step using vapor HF leaving little residue. This Window junction (WJ) process mitigates the degradation of qubit quality during fabrication and allows clean removal of the scaffold. The WJ process is validated by inspection and Josephson junction measurement. The scaffold removal process is verified by measuring the quality factor of the resonators. Furthermore, compared to scaffolds fabricated by plasma-enhanced chemical vapor deposition (PECVD), qubits made by WJ through physical vapor deposition (PVD) achieve relaxation time up to $57\,μ\text{s}$. Our results pave the way for a lift-off-free qubit fabrication process, designed to be compatible with modern foundry tools and capable of minimizing damage to the substrate and material surfaces. △ Less

Submitted 13 March, 2025; originally announced March 2025.

arXiv:2405.08369 [pdf, ps, other]

A generic approach to homogenization of a diffusion driven by growing incompressible drift

Authors: Brice Franke, Shuenn-Jyi Sheu

Abstract: We study how the resolvent-family of a diffusion behaves, as thedrift grows to infinity. The limit turns out to be a selfadjoint pseudo-resolvent.After reduction of the underlying Hilbert-space, this pseudo-resolvent becomesa resolvent to a strongly continuous semi-group of contractions. We prove thatthis semi-group is associated to some Hunt-process on some suitable state-space which is construct… ▽ More We study how the resolvent-family of a diffusion behaves, as thedrift grows to infinity. The limit turns out to be a selfadjoint pseudo-resolvent.After reduction of the underlying Hilbert-space, this pseudo-resolvent becomesa resolvent to a strongly continuous semi-group of contractions. We prove thatthis semi-group is associated to some Hunt-process on some suitable state-space which is constructed from equivalence classes of the drifts trajectories.Finally, we show a distributional limit theorem for the accelerated diffusiontoward the associated Hunt process. △ Less

Submitted 21 August, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

arXiv:1903.08957 [pdf, ps, other]

Expected exponential utility maximization of insurers with a general diffusion factor model : The complete market case

Authors: Hiroaki Hata, Shuenn-Jyi Sheu, Li-Hsien Sun

Abstract: In this paper, we consider the problem of optimal investment by an insurer. The insurer invests in a market consisting of a bank account and $m$ risky assets. The mean returns and volatilities of the risky assets depend nonlinearly on economic factors that are formulated as the solutions of general stochastic differential equations. The wealth of the insurer is described by a Cramér--Lundberg proc… ▽ More In this paper, we consider the problem of optimal investment by an insurer. The insurer invests in a market consisting of a bank account and $m$ risky assets. The mean returns and volatilities of the risky assets depend nonlinearly on economic factors that are formulated as the solutions of general stochastic differential equations. The wealth of the insurer is described by a Cramér--Lundberg process, and the insurer preferences are exponential. Adapting a dynamic programming approach, we derive Hamilton--Jacobi--Bellman (HJB) equation. And, we prove the unique solvability of HJB equation. In addition, the optimal strategy is also obtained using the coupled forward and backward stochastic differential equations (FBSDEs). Finally, proving the verification theorem, we construct the optimal strategy. △ Less

Submitted 21 March, 2019; originally announced March 2019.

MSC Class: 93E20; 60H30; 91B28; 91B30; 49L20; 90C40; 60J70; 62P05

arXiv:1805.01118 [pdf, ps, other]

Portfolio Optimization with Delay Factor Models

Authors: Shuenn-Jyi Sheu, Li-Hsien Sun, Zheng Zhang

Abstract: We propose an optimal portfolio problem in the incomplete market where the underlying assets depend on economic factors with delayed effects, such models can describe the short term forecasting and the interaction with time lag among different financial markets. The delay phenomenon can be recognized as the integral type and the pointwise type. The optimal strategy is identified through maximizing… ▽ More We propose an optimal portfolio problem in the incomplete market where the underlying assets depend on economic factors with delayed effects, such models can describe the short term forecasting and the interaction with time lag among different financial markets. The delay phenomenon can be recognized as the integral type and the pointwise type. The optimal strategy is identified through maximizing the power utility. Due to the delay leading to the non-Markovian structure, the conventional Hamilton-Jacobi-Bellman (HJB) approach is no longer applicable. By using the stochastic maximum principle, we argue that the optimal strategy can be characterized by the solutions of a decoupled quadratic forward-backward stochastic differential equations(QFBSDEs). The optimality is verified via the super-martingale argument. The existence and uniqueness of the solution to the QFBSDEs are established. In addition, if the market is complete, we also provide a martingale based method to solve our portfolio optimization problem, and investigate its connection with the proposed FBSDE approach. Finally, two particular cases are analyzed where the corresponding FBSDEs can be solved explicitly. △ Less

Submitted 3 May, 2018; originally announced May 2018.

arXiv:1001.2131 [pdf, ps, other]

doi 10.1214/09-AAP618

Asymptotics of the probability minimizing a "down-side" risk

Authors: Hiroaki Hata, Hideo Nagai, Shuenn-Jyi Sheu

Abstract: We consider a long-term optimal investment problem where an investor tries to minimize the probability of falling below a target growth rate. From a mathematical viewpoint, this is a large deviation control problem. This problem will be shown to relate to a risk-sensitive stochastic control problem for a sufficiently large time horizon. Indeed, in our theorem we state a duality in the relation b… ▽ More We consider a long-term optimal investment problem where an investor tries to minimize the probability of falling below a target growth rate. From a mathematical viewpoint, this is a large deviation control problem. This problem will be shown to relate to a risk-sensitive stochastic control problem for a sufficiently large time horizon. Indeed, in our theorem we state a duality in the relation between the above two problems. Furthermore, under a multidimensional linear Gaussian model we obtain explicit solutions for the primal problem. △ Less

Submitted 13 January, 2010; originally announced January 2010.

Comments: Published in at http://dx.doi.org/10.1214/09-AAP618 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP618 MSC Class: 35J60; 49L20; 60F10; 91B28; 93E20 (Primary)

Journal ref: Annals of Applied Probability 2010, Vol. 20, No. 1, 52-89

arXiv:0909.5068 [pdf, ps, other]

doi 10.1103/PhysRevB.80.144113

Sodium ion ordering of Na0.77CoO2 under competing multi-vacancy cluster, superlattice and domain formation

Authors: F. -T. Huang, G. J. Shu, M. -W. Chu, Y. K. Kuo, W. L. Lee, H. S. Sheu, F. C. Chou

Abstract: Hexagonal superlattice formed by sodium multi-vacancy cluster ordering in Na$_{0.77}$CoO$_2$ has been proposed based on synchrotron X-ray Laue diffraction study on electrochemically fine-tuned single crystals. The title compound sits closely to the proposed lower end of the miscibility gap of x ~ 0.77-0.82 phase separated range. The average sodium vacancy cluster size is estimated to be 4.5 Na v… ▽ More Hexagonal superlattice formed by sodium multi-vacancy cluster ordering in Na$_{0.77}$CoO$_2$ has been proposed based on synchrotron X-ray Laue diffraction study on electrochemically fine-tuned single crystals. The title compound sits closely to the proposed lower end of the miscibility gap of x ~ 0.77-0.82 phase separated range. The average sodium vacancy cluster size is estimated to be 4.5 Na vacancies per layer within a large superlattice size of sqrt{19}a*sqrt{19}a*3c. The exceptionally large Na vacancy cluster size favors large twinned simple hexagonal superlattice of sqrt{19}a, in competition with the smaller di-, tri- and quadri-vacancy clusters formed superlattices of sqrt{12}a and sqrt{13}a. Competing electronic correlations are revealed by the observed spin glass-like magnetic hysteresis below ~ 3K and the twin, triple and mono domain transformations during thermal cycling between 273-373K. △ Less

Submitted 28 September, 2009; originally announced September 2009.

Comments: 7 pages, 6 figures

arXiv:0901.3007 [pdf, ps, other]

Max-plus Stochastic Control and Risk-sensitivity

Authors: Wendell H. Fleming, Hidehiro Kaise, Shuenn-Jyi Sheu

Abstract: In the Maslov idempotent probability calculus, expectations of random variables are defined so as to be linear with respect to max-plus addition and scalar multiplication. This paper considers control problems in which the objective is to minimize the max-plus expectation of some max-plus additive running cost. Such problems arise naturally as limits of some types of risk sensitive stochastic co… ▽ More In the Maslov idempotent probability calculus, expectations of random variables are defined so as to be linear with respect to max-plus addition and scalar multiplication. This paper considers control problems in which the objective is to minimize the max-plus expectation of some max-plus additive running cost. Such problems arise naturally as limits of some types of risk sensitive stochastic control problems. The value function is a viscosity solution to a quasivariational inequality (QVI) of dynamic programming. Equivalence of this QVI to a nonlinear parabolic PDE with discontinuous Hamiltonian is used to prove a comparison theorem for viscosity sub- and super-solutions. An example from math finance is given, and an application in nonlinear H-infinity control is sketched. △ Less

Submitted 20 January, 2009; originally announced January 2009.

Comments: 58 pages

MSC Class: 35F20 (Primary) 49L20; 49L25; 93E03 (Secondary)

arXiv:0709.0085 [pdf, ps, other]

doi 10.1103/PhysRevLett.101.127404

Sodium vacancy ordering and the co-existence of localized spins and itinerant charges in NaxCoO2

Authors: F. C. Chou, M. -W. Chu, G. J. Shu, F. T. Huang, Woei Wu Pai, H. S. Sheu, T. Imai, F. L. Ning, Patrick A. Lee

Abstract: The sodium cobaltate family (NaxCoO2) is unique among transition metal oxides because the Co sits on a triangular lattice and its valence can be tuned over a wide range by varying the Na concentration x. Up to now detailed modeling of the rich phenomenology (which ranges from unconventional superconductivity to enhanced thermopower) has been hampered by the difficulty of controlling pure phases.… ▽ More The sodium cobaltate family (NaxCoO2) is unique among transition metal oxides because the Co sits on a triangular lattice and its valence can be tuned over a wide range by varying the Na concentration x. Up to now detailed modeling of the rich phenomenology (which ranges from unconventional superconductivity to enhanced thermopower) has been hampered by the difficulty of controlling pure phases. We discovered that certain Na concentrations are specially stable and are associated with superlattice ordering of the Na clusters. This leads naturally to a picture of co-existence of localized spins and itinerant charge carriers. For x = 0.84 we found a remarkably small Fermi energy of 87 K. Our picture brings coherence to a variety of measurements ranging from NMR to optical to thermal transport. Our results also allow us to take the first step towards modeling the mysterious ``Curie-Weiss'' metal state at x = 0.71. We suggest the local moments may form a quantum spin liquid state and we propose experimental test of our hypothesis. △ Less

Submitted 2 September, 2007; originally announced September 2007.

Comments: 16 pages, 5 figures

Journal ref: Phys. Rev. Lett. 101, 127404 (2008)

arXiv:0708.0280 [pdf, ps, other]

doi 10.1103/PhysRevB.76.184115

Searching for Stable Na-ordered Phases in Single Crystal Samples of gamma-NaxCoO2

Authors: G. J. Shu, Andrea Prodi, S. Y. Chu, Y. S. Lee, H. S. Sheu, F. C. Chou

Abstract: We report on the preparation and characterization of single crystal gamma phase NaxCoO2 with 0.25 < x < 0.84 using a non-aqueous electrochemical chronoamperemetry technique. By carefully mapping the overpotential versus x (for x < 0.84), we find six distinct stable phases with Na levels corresponding to x ~ 0.75, 0.71, 0.50, 0.43, 0.33 and 0.25. The composition with x ~0.55 appears to have a cri… ▽ More We report on the preparation and characterization of single crystal gamma phase NaxCoO2 with 0.25 < x < 0.84 using a non-aqueous electrochemical chronoamperemetry technique. By carefully mapping the overpotential versus x (for x < 0.84), we find six distinct stable phases with Na levels corresponding to x ~ 0.75, 0.71, 0.50, 0.43, 0.33 and 0.25. The composition with x ~0.55 appears to have a critical Na concentration which separates samples with different magnetic behavior as well as different Na ion diffusion mechanisms. Chemical analysis of an aged crystal reveals different Na ion diffusion mechanisms above and below x_c ~ 0.53, where the diffusion process above x_c has a diffusion coefficient about five times larger than that below x_c. The series of crystals were studied with X-ray diffraction, susceptibility, and transport measurements. The crystal with x = 0.5 shows a weak ferromagnetic transition below T=27 K in addition to the usual transitions at T = 51 K and 88 K. The resistivity of the Curie-Weiss metallic Na0.71CoO2 composition has a very low residual resistivity, which attests to the high homogeneity of the crystals prepared by this improved electrochemical method. Our results on the various stable crystal compositions point to the importance of Na ion ordering across the phase diagram. △ Less

Submitted 2 August, 2007; originally announced August 2007.

Comments: 9 pages, 9 figures

arXiv:math/0702828 [pdf, ps, other]

doi 10.1214/074921706000001094

Price systems for markets with transaction costs and control problems for some finance problems

Authors: Tzuu-Shuh Chiang, Shang-Yuan Shiu, Shuenn-Jyi Sheu

Abstract: In a market with transaction costs, the price of a derivative can be expressed in terms of (preconsistent) price systems (after Kusuoka (1995)). In this paper, we consider a market with binomial model for stock price and discuss how to generate the price systems. From this, the price formula of a derivative can be reformulated as a stochastic control problem. Then the dynamic programming approac… ▽ More In a market with transaction costs, the price of a derivative can be expressed in terms of (preconsistent) price systems (after Kusuoka (1995)). In this paper, we consider a market with binomial model for stock price and discuss how to generate the price systems. From this, the price formula of a derivative can be reformulated as a stochastic control problem. Then the dynamic programming approach can be used to calculate the price. We also discuss optimization of expected utility using price systems. △ Less

Submitted 27 February, 2007; originally announced February 2007.

Comments: Published at http://dx.doi.org/10.1214/074921706000001094 in the IMS Lecture Notes Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-LNMS52-LNMS5218 MSC Class: 60K35; 60K35 (Primary)

Journal ref: IMS Lecture Notes Monograph Series 2006, Vol. 52, 257-271

arXiv:math/0602625 [pdf, ps, other]

doi 10.1214/009117905000000431

On the structure of solutions of ergodic type Bellman equation related to risk-sensitive control

Authors: Hidehiro Kaise, Shuenn-Jyi Sheu

Abstract: Bellman equations of ergodic type related to risk-sensitive control are considered. We treat the case that the nonlinear term is positive quadratic form on first-order partial derivatives of solution, which includes linear exponential quadratic Gaussian control problem. In this paper we prove that the equation in general has multiple solutions. We shall specify the set of all the classical solut… ▽ More Bellman equations of ergodic type related to risk-sensitive control are considered. We treat the case that the nonlinear term is positive quadratic form on first-order partial derivatives of solution, which includes linear exponential quadratic Gaussian control problem. In this paper we prove that the equation in general has multiple solutions. We shall specify the set of all the classical solutions and classify the solutions by a global behavior of the diffusion process associated with the given solution. The solution associated with ergodic diffusion process plays particular role. We shall also prove the uniqueness of such solution. Furthermore, the solution which gives us ergodicity is stable under perturbation of coefficients. Finally, we have a representation result for the solution corresponding to the ergodic diffusion. △ Less

Submitted 27 February, 2006; originally announced February 2006.

Comments: Published at http://dx.doi.org/10.1214/009117905000000431 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOP-AOP0091 MSC Class: 60G35 (Primary) 60H30; 93E20 (Secondary)

Journal ref: Annals of Probability 2006, Vol. 34, No. 1, 284-320

arXiv:math/0505245 [pdf, ps, other]

doi 10.1214/105051605000000025

Accelerating diffusions

Authors: Chii-Ruey Hwang, Shu-Yin Hwang-Ma, Shuenn-Jyi Sheu

Abstract: Let U be a given function defined on R^d and π(x) be a density function proportional to \exp -U(x). The following diffusion X(t) is often used to sample from π(x), dX(t)=-\nabla U(X(t)) dt+\sqrt2 dW(t),\qquad X(0)=x_0. To accelerate the convergence, a family of diffusions with π(x) as their common equilibrium is considered, dX(t)=\bigl(-\nabla U(X(t))+C(X(t))\bigr) dt+\sqrt2 dW(t),\qquad X(0)=x_… ▽ More Let U be a given function defined on R^d and π(x) be a density function proportional to \exp -U(x). The following diffusion X(t) is often used to sample from π(x), dX(t)=-\nabla U(X(t)) dt+\sqrt2 dW(t),\qquad X(0)=x_0. To accelerate the convergence, a family of diffusions with π(x) as their common equilibrium is considered, dX(t)=\bigl(-\nabla U(X(t))+C(X(t))\bigr) dt+\sqrt2 dW(t),\qquad X(0)=x_0. Let L_C be the corresponding infinitesimal generator. The spectral gap of L_C in L^2(π) (λ(C)), and the convergence exponent of X(t) to πin variational norm (ρ(C)), are used to describe the convergence rate, where λ(C)= Sup{real part of μ\dvtxμis in the spectrum of L_C, μis not zero}, {-2.8cm}ρ(C) = Inf\biggl{ρ\dvtx\int | p(t,x,y) -π(y)| dy \le g(x) e^{ρt}\biggr}.Roughly speaking, L_C is a perturbation of the self-adjoint L_0 by an antisymmetric operator C\cdot\nabla, where C is weighted divergence free. We prove that λ(C)\le λ(0) and equality holds only in some rare situations. Furthermore, ρ(C)\le λ(C) and equality holds for C=0. In other words, adding an extra drift, C(x), accelerates convergence. Related problems are also discussed. △ Less

Submitted 12 May, 2005; originally announced May 2005.

Comments: Published at http://dx.doi.org/10.1214/105051605000000025 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP087 MSC Class: 60J60; 47D07 (Primary) 65B99; 35P05. (Secondary)

Journal ref: Annals of Applied Probability 2005, Vol. 15, No. 2, 1433-1444

Showing 1–13 of 13 results for author: Sheu, S