Skip to main content

Showing 1–4 of 4 results for author: Iacovides, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.15674  [pdf, other

    cs.CL cs.LG

    TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs

    Authors: Yuxuan Gu, Wuyang Zhou, Giorgos Iacovides, Danilo Mandic

    Abstract: The reasoning abilities of Large Language Models (LLMs) can be improved by structurally denoising their weights, yet existing techniques primarily focus on denoising the feed-forward network (FFN) of the transformer block, and can not efficiently utilise the Multi-head Attention (MHA) block, which is the core of transformer architectures. To address this issue, we propose a novel intuitive framewo… ▽ More

    Submitted 15 May, 2025; v1 submitted 26 January, 2025; originally announced January 2025.

    Comments: Accpeted for IEEE International Joint Conference on Neural Networks (IJCNN 2025). The code is available at https://github.com/guyuxuan9/TensorLLM

  2. arXiv:2412.10257  [pdf, other

    cs.CL cs.AI

    Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models

    Authors: Harry J. Davies, Giorgos Iacovides, Danilo P. Mandic

    Abstract: The sheer scale of data required to train modern large language models (LLMs) poses significant risks, as models are likely to gain knowledge of sensitive topics such as bio-security, as well the ability to replicate copyrighted works. Methods designed to remove such knowledge must do so from all prompt directions, in a multi-lingual capacity and without degrading general model performance. To thi… ▽ More

    Submitted 16 December, 2024; v1 submitted 13 December, 2024; originally announced December 2024.

    Comments: 14 pages, 5 figures, 1 table. Fixing typo with the final weight editing equation

  3. arXiv:2410.10728  [pdf, other

    cs.LG cs.AI

    Towards LLM-guided Efficient and Interpretable Multi-linear Tensor Network Rank Selection

    Authors: Giorgos Iacovides, Wuyang Zhou, Danilo Mandic

    Abstract: We propose a novel framework that leverages large language models (LLMs) to guide the rank selection in tensor network models for higher-order data analysis. By utilising the intrinsic reasoning capabilities and domain knowledge of LLMs, our approach offers enhanced interpretability of the rank choices and can effectively optimise the objective function. This framework enables users without specia… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  4. arXiv:2403.12285  [pdf, other

    cs.CL cs.LG q-fin.ST q-fin.TR

    FinLlama: Financial Sentiment Classification for Algorithmic Trading Applications

    Authors: Thanos Konstantinidis, Giorgos Iacovides, Mingxue Xu, Tony G. Constantinides, Danilo Mandic

    Abstract: There are multiple sources of financial news online which influence market movements and trader's decisions. This highlights the need for accurate sentiment analysis, in addition to having appropriate algorithmic trading techniques, to arrive at better informed trading decisions. Standard lexicon based sentiment approaches have demonstrated their power in aiding financial decisions. However, they… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.