Showing 1–16 of 16 results for author: Cichocki, A

Searching in archive stat.

  1. arXiv:2506.13984  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    Mirror Descent Using the Tempesta Generalized Multi-parametric Logarithms

    Authors: Andrzej Cichocki

    Abstract: In this paper, we develop a wide class of Mirror Descent (MD) algorithms, which play a key role in machine learning. For this purpose we formulate the constrained optimization problem, in which we exploit the Bregman divergence with the Tempesta multi-parametric deformation logarithm as a link function. This link function, also called the mirror function, defines the mapping between the primal and dual s…

    Submitted 8 June, 2025; originally announced June 2025.

  2. arXiv:2004.09222  [pdf, other]

    cs.LG stat.ML

    Towards Understanding Normalization in Neural ODEs

    Authors: Julia Gusak, Larisa Markeeva, Talgat Daulbaev, Alexandr Katrutsa, Andrzej Cichocki, Ivan Oseledets

    Abstract: Normalization is an important and widely investigated technique in deep learning. However, its role in Ordinary Differential Equation based networks (neural ODEs) is still poorly understood. This paper investigates how different normalization techniques affect the performance of neural ODEs. In particular, we show that it is possible to achieve 93% accuracy in the CIFAR-10 classification task, and…

    Submitted 27 April, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  3. arXiv:2003.05271  [pdf, other]

    cs.NE math.NA stat.ML

    Interpolation Technique to Speed Up Gradients Propagation in Neural ODEs

    Authors: Talgat Daulbaev, Alexandr Katrutsa, Larisa Markeeva, Julia Gusak, Andrzej Cichocki, Ivan Oseledets

    Abstract: We propose a simple interpolation-based method for the efficient approximation of gradients in neural ODE models. We compare it with the reverse dynamic method (known in the literature as "adjoint method") to train neural ODEs on classification, density estimation, and inference approximation tasks. We also propose a theoretical justification of our approach using logarithmic norm formalism. As a…

    Submitted 30 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  4. arXiv:2002.12135  [pdf, other]

    cs.LG eess.SP stat.ML

    Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

    Authors: Qiquan Shi, Jiaming Yin, Jiajun Cai, Andrzej Cichocki, Tatsuya Yokota, Lei Chen, Mingxuan Yuan, Jia Zeng

    Abstract: This work proposes a novel approach for multiple time series forecasting. First, the multi-way delay embedding transform (MDT) is employed to represent time series as low-rank block Hankel tensors (BHT). Then, the higher-order tensors are projected to compressed core tensors by applying Tucker decomposition. At the same time, the generalized tensor Autoregressive Integrated Moving Average (ARIMA) i…

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted by AAAI 2020

  5. arXiv:1910.06995  [pdf, other]

    cs.LG stat.ML

    Reduced-Order Modeling of Deep Neural Networks

    Authors: Julia Gusak, Talgat Daulbaev, Evgeny Ponomarev, Andrzej Cichocki, Ivan Oseledets

    Abstract: We introduce a new method for speeding up the inference of deep neural networks. It is somewhat inspired by the reduced-order modeling techniques for dynamical systems. The cornerstone of the proposed method is the maximum volume algorithm. We demonstrate efficiency on neural networks pre-trained on different datasets. We show that in many practical cases it is possible to replace convolutional lay…

    Submitted 25 November, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  6. arXiv:1907.12827  [pdf]

    cs.LG eess.IV stat.ML

    Multi-Kernel Capsule Network for Schizophrenia Identification

    Authors: Tian Wang, Anastasios Bezerianos, Andrzej Cichocki, Junhua Li

    Abstract: Objective: Schizophrenia seriously affects the quality of life. To date, both simple (linear discriminant analysis) and complex (deep neural network) machine learning methods have been utilized to identify schizophrenia based on functional connectivity features. The existing simple methods need two separate steps (i.e., feature extraction and classification) to achieve the identification, which di…

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: IEEE Transactions on Cybernetics (2020)

  7. arXiv:1903.09973  [pdf, other]

    cs.LG cs.CV stat.ML

    MUSCO: Multi-Stage Compression of neural networks

    Authors: Julia Gusak, Maksym Kholiavchenko, Evgeny Ponomarev, Larisa Markeeva, Ivan Oseledets, Andrzej Cichocki

    Abstract: Low-rank tensor approximation is very promising for the compression of deep neural networks. We propose a new simple and efficient iterative approach, which alternates low-rank factorization with a smart rank selection and fine-tuning. We demonstrate the efficiency of our method compared to non-iterative ones. Our approach improves the compression rate while maintaining the accuracy for a var…

    Submitted 15 November, 2019; v1 submitted 24 March, 2019; originally announced March 2019.

  8. arXiv:1806.05017  [pdf]

    q-bio.QM eess.SP stat.ML

    Brain-Computer Interface with Corrupted EEG Data: A Tensor Completion Approach

    Authors: Jordi Sole-Casals, Cesar F. Caiafa, Qibin Zhao, Andrzej Cichocki

    Abstract: One of the current issues in Brain-Computer Interface is how to deal with noisy Electroencephalography measurements organized as multidimensional datasets. On the other hand, recently, significant advances have been made in multidimensional signal completion algorithms that exploit tensor decomposition models to capture the intricate relationship among entries in a multidimensional signal. We prop…

    Submitted 26 July, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: 21 pages, 3 tables, 4 figures

    Journal ref: Sole-Casals, J., Caiafa, C.F., Zhao, Q. et al. Cogn Comput (2018). https://doi.org/10.1007/s12559-018-9574-9

  9. arXiv:1603.01372  [pdf, ps, other]

    math.NA stat.CO

    Numerical CP Decomposition of Some Difficult Tensors

    Authors: Petr Tichavsky, Anh Huy Phan, Andrzej Cichocki

    Abstract: In this paper, a numerical method is proposed for canonical polyadic (CP) decomposition of small-size tensors. The focus is primarily on decomposition of tensors that correspond to small matrix multiplications. Here, the rank of the tensors is equal to the smallest number of scalar multiplications that are necessary to accomplish the matrix multiplication. The proposed method is based on a constrained…

    Submitted 4 March, 2016; originally announced March 2016.

  10. arXiv:1505.02343  [pdf, other]

    cs.LG math.NA stat.ML

    Bayesian Sparse Tucker Models for Dimension Reduction and Tensor Completion

    Authors: Qibin Zhao, Liqing Zhang, Andrzej Cichocki

    Abstract: Tucker decomposition is the cornerstone of modern machine learning on tensorial data analysis, which has attracted considerable attention for multiway feature extraction, compressive sensing, and tensor completion. The most challenging problem is related to the determination of model complexity (i.e., multilinear rank), especially when noise and missing data are present. In addition, existing methods…

    Submitted 10 May, 2015; originally announced May 2015.

  11. arXiv:1412.7146  [pdf, other]

    stat.CO cs.IT

    Log-Determinant Divergences Revisited: Alpha–Beta and Gamma Log-Det Divergences

    Authors: Andrzej Cichocki, Sergio Cruces, Shun-Ichi Amari

    Abstract: In this paper, we review and extend a family of log-det divergences for symmetric positive definite (SPD) matrices and discuss their fundamental properties. We show how to generate from parameterized Alpha-Beta (AB) and Gamma Log-det divergences many well-known divergences, for example, Stein's loss, the S-divergence, also called the Jensen-Bregman LogDet (JBLD) divergence, the Logdet Zero (Bhattachar…

    Submitted 23 December, 2014; v1 submitted 18 December, 2014; originally announced December 2014.

    Comments: 35 pages, 4 figures

  12. arXiv:1404.4412  [pdf, other]

    cs.LG cs.CV stat.ML

    Efficient Nonnegative Tucker Decompositions: Algorithms and Uniqueness

    Authors: Guoxu Zhou, Andrzej Cichocki, Qibin Zhao, Shengli Xie

    Abstract: Nonnegative Tucker decomposition (NTD) is a powerful tool for the extraction of nonnegative parts-based and physically meaningful latent components from high-dimensional tensor data while preserving the natural multilinear structure of data. However, as the data tensor often has multiple modes and is large-scale, existing NTD algorithms suffer from a very high computational complexity in terms of…

    Submitted 16 September, 2015; v1 submitted 16 April, 2014; originally announced April 2014.

    Comments: appears in IEEE Transactions on Image Processing, 2015

  13. arXiv:1402.1673  [pdf, ps, other]

    math.NA stat.OT

    Non-Orthogonal Tensor Diagonalization

    Authors: Petr Tichavsky, Anh Huy Phan, Andrzej Cichocki

    Abstract: Tensor diagonalization means transforming a given tensor to an exactly or nearly diagonal form through multiplying the tensor by non-orthogonal invertible matrices along selected dimensions of the tensor. It is a generalization of approximate joint diagonalization (AJD) of a set of matrices. In particular, we derive (1) a new algorithm for symmetric AJD, which is called two-sided symmetric diagonali…

    Submitted 1 July, 2016; v1 submitted 7 February, 2014; originally announced February 2014.

    Comments: The manuscript was revised deeply, but the main idea is the same. The algorithm has changed significantly

  14. arXiv:1401.6497  [pdf, other]

    cs.LG cs.CV stat.ML

    Bayesian CP Factorization of Incomplete Tensors with Automatic Rank Determination

    Authors: Qibin Zhao, Liqing Zhang, Andrzej Cichocki

    Abstract: CANDECOMP/PARAFAC (CP) tensor factorization of incomplete data is a powerful technique for tensor completion through explicitly capturing the multilinear latent factors. The existing CP algorithms require the tensor rank to be manually specified; however, the determination of tensor rank remains a challenging problem, especially for CP rank. In addition, existing approaches do not take into account…

    Submitted 9 October, 2014; v1 submitted 25 January, 2014; originally announced January 2014.

  15. Frequency Recognition in SSVEP-based BCI using Multiset Canonical Correlation Analysis

    Authors: Yu Zhang, Guoxu Zhou, Jing Jin, Xingyu Wang, Andrzej Cichocki

    Abstract: Canonical correlation analysis (CCA) has been one of the most popular methods for frequency recognition in steady-state visual evoked potential (SSVEP)-based brain-computer interfaces (BCIs). Despite its efficiency, a potential problem is that using pre-constructed sine-cosine waves as the required reference signals in the CCA method often does not result in the optimal recognition accuracy due to…

    Submitted 16 January, 2014; v1 submitted 26 August, 2013; originally announced August 2013.

    Journal ref: International Journal of Neural Systems, 2014, vol.24, no.2, pp.1450013 (14 pages)

  16. arXiv:1305.0395  [pdf, ps, other]

    math.NA cs.LG q-bio.NC stat.ML

    Tensor Decompositions: A New Concept in Brain Data Analysis?

    Authors: Andrzej Cichocki

    Abstract: Matrix factorizations and their extensions to tensor factorizations and decompositions have become prominent techniques for linear and multilinear blind source separation (BSS), especially multiway Independent Component Analysis (ICA), Nonnegative Matrix and Tensor Factorization (NMF/NTF), Smooth Component Analysis (SmoCA) and Sparse Component Analysis (SCA). Moreover, tensor decompositions have ma…

    Submitted 2 May, 2013; originally announced May 2013.

    Journal ref: Control Measurement, and System Integration (SICE), special issue; Measurement of Brain Functions and Bio-Signals, 7, 507-517, (2011)