Showing 1–9 of 9 results for author: Solgi, R

  1. arXiv:2505.23966  [pdf, ps, other]

    cs.CL

    FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression

    Authors: Jiayi Tian, Ryan Solgi, Jinming Lu, Yifan Yang, Hai Li, Zheng Zhang

    Abstract: Large Language Models (LLMs) have enabled remarkable progress in natural language processing, yet their high computational and memory demands pose challenges for deployment in resource-constrained environments. Although recent low-rank decomposition methods offer a promising path for structural compression, they often suffer from accuracy degradation, expensive calibration procedures, and result i…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.14871  [pdf, ps, other]

    cs.CL cs.LG

    Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models

    Authors: Ryan Solgi, Kai Zhen, Rupak Vignesh Swaminathan, Nathan Susanj, Athanasios Mouchtaris, Siegfried Kunzmann, Zheng Zhang

    Abstract: The efficient implementation of large language models (LLMs) is crucial for deployment on resource-constrained devices. Low-rank tensor compression techniques, such as tensor-train (TT) networks, have been widely studied for over-parameterized neural networks. However, their application to compressing pre-trained LLMs for downstream tasks (post-training) remains challenging d…

    Submitted 20 May, 2025; originally announced May 2025.

  3. arXiv:2310.20077  [pdf, other]

    cs.CL cs.LG

    Partial Tensorized Transformers for Natural Language Processing

    Authors: Subhadra Vadlamannati, Ryan Solgi

    Abstract: The transformer architecture has revolutionized Natural Language Processing (NLP) and other machine-learning tasks, due to its unprecedented accuracy. However, its extensive memory and parameter requirements often hinder its practical application. In this work, we study the effect of tensor-train decomposition to improve the accuracy and compress transformer vision-language neural networks, n…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: In Review under the 16th International Conference on Agents and Artificial Intelligence

  4. arXiv:2205.10651  [pdf, other]

    eess.IV cs.LG cs.NE

    Tensor Shape Search for Optimum Data Compression

    Authors: Ryan Solgi, Zichang He, William Jiahua Liang, Zheng Zhang

    Abstract: Various tensor decomposition methods have been proposed for data compression. In real-world applications of tensor decomposition, selecting the tensor shape for the given data poses a challenge, and the shape of the tensor may affect the error and the compression ratio. In this work, we study the effect of the tensor shape on the tensor decomposition and propose an optimization model to find an…

    Submitted 21 May, 2022; originally announced May 2022.

  5. arXiv:1605.03471  [pdf, other]

    stat.ME

    Nonparametric hierarchical Bayesian quantiles

    Authors: Luke Bornn, Neil Shephard, Reza Solgi

    Abstract: Here we develop a method for performing nonparametric Bayesian inference on quantiles. Relying on geometric measure theory and employing a Hausdorff base measure, we are able to specify meaningful priors for the quantile while treating the distribution of the data otherwise nonparametrically. We further extend the method to a hierarchical model for quantiles of subpopulations, linking subgroups to…

    Submitted 11 May, 2016; originally announced May 2016.

  6. arXiv:1507.08645  [pdf, other]

    stat.ME math.PR math.ST stat.AP stat.CO

    Moment conditions and Bayesian nonparametrics

    Authors: Luke Bornn, Neil Shephard, Reza Solgi

    Abstract: Models phrased through moment conditions are central to much of modern inference. Here these moment conditions are embedded within a nonparametric Bayesian setup. Handling such a model is not probabilistically straightforward as the posterior has support on a manifold. We solve the relevant issues, building new probability and computational tools using Hausdorff measures to analyze them on real and…

    Submitted 13 January, 2016; v1 submitted 30 July, 2015; originally announced July 2015.

  7. arXiv:1502.04266  [pdf]

    eess.SY cs.AI math.OC

    Constrained Nonlinear Model Predictive Control of an MMA Polymerization Process via Evolutionary Optimization

    Authors: Masoud Abbaszadeh, Reza Solgi

    Abstract: In this work, a nonlinear model predictive controller is developed for a batch polymerization process. The physical model of the process is parameterized along a desired trajectory resulting in a trajectory linearized piecewise model (a multiple linear model bank) and the parameters are identified for an experimental polymerization reactor. Then, a multiple model adaptive predictive controller is…

    Submitted 14 February, 2015; originally announced February 2015.

    Comments: 12 pages, 9 figures, 28 references

  8. Zero Variance Markov Chain Monte Carlo for Bayesian Estimators

    Authors: Antonietta Mira, Reza Solgi, Daniele Imparato

    Abstract: Interest is in evaluating, by Markov chain Monte Carlo (MCMC) simulation, the expected value of a function with respect to a, possibly unnormalized, probability distribution. A general purpose variance reduction technique for the MCMC estimator, based on the zero-variance principle introduced in the physics literature, is proposed. Conditions for asymptotic unbiasedness of the zero-variance estima…

    Submitted 26 June, 2012; v1 submitted 14 December, 2010; originally announced December 2010.

    Comments: 26 pages, 4 figures. This is an updated version: the results are the same as the previous one, but presentation is more essential

    MSC Class: 62

    Journal ref: Statistics and Computing, 2012

  9. arXiv:cond-mat/0410289  [pdf, ps, other]

    cond-mat.other q-fin.ST

    Statistical analysis of the price index of Tehran Stock Exchange

    Authors: A. Rasoolizadeh, R. Solgi

    Abstract: This paper presents a statistical analysis of the Tehran Price Index (TePIx) for the period 1992 to 2004. The results show that the return distribution is asymmetric, tending to the right-hand side of the mean. The return distribution can also be fitted by a stable Lévy distribution, and its tails are much fatter than those of the Gaussian distribution. We estimate the tail index of the TePIx returns…

    Submitted 12 October, 2004; originally announced October 2004.