Skip to main content

Showing 1–21 of 21 results for author: Lim, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.08045  [pdf, other

    cs.LG cs.CL

    Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection

    Authors: Ying Fu Lim, Jiawen Zhu, Guansong Pang

    Abstract: Log Anomaly Detection (LAD) seeks to identify atypical patterns in log data that are crucial to assessing the security and condition of systems. Although Large Language Models (LLMs) have shown tremendous success in various fields, the use of LLMs in enabling the detection of log anomalies is largely unexplored. This work aims to fill this gap. Due to the prohibitive costs involved in fully fine-t… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 12 pages, 5 figures, accepted by PAKDD 2025 special session

  2. arXiv:2503.05749  [pdf, other

    cs.CY

    Operations & Supply Chain Management: Principles and Practice

    Authors: Fotios Petropoulos, Henk Akkermans, O. Zeynep Aksin, Imran Ali, Mohamed Zied Babai, Ana Barbosa-Povoa, Olga Battaïa, Maria Besiou, Nils Boysen, Stephen Brammer, Alistair Brandon-Jones, Dirk Briskorn, Tyson R. Browning, Paul Buijs, Piera Centobelli, Andrea Chiarini, Paul Cousins, Elizabeth A. Cudney, Andrew Davies, Steven J. Day, René de Koster, Rommert Dekker, Juliano Denicol, Mélanie Despeisse, Stephen M. Disney , et al. (68 additional authors not shown)

    Abstract: Operations and Supply Chain Management (OSCM) has continually evolved, incorporating a broad array of strategies, frameworks, and technologies to address complex challenges across industries. This encyclopedic article provides a comprehensive overview of contemporary strategies, tools, methods, principles, and best practices that define the field's cutting-edge advancements. It also explores the d… ▽ More

    Submitted 20 February, 2025; originally announced March 2025.

  3. arXiv:2406.07796  [pdf

    cs.HC cs.AI

    Battling Botpoop using GenAI for Higher Education: A Study of a Retrieval Augmented Generation Chatbots Impact on Learning

    Authors: Maung Thway, Jose Recatala-Gomez, Fun Siong Lim, Kedar Hippalgaonkar, Leonard W. T. Ng

    Abstract: Generative artificial intelligence (GenAI) and large language models (LLMs) have simultaneously opened new avenues for enhancing human learning and increased the prevalence of poor-quality information in student response - termed Botpoop. This study introduces Professor Leodar, a custom-built, Singlish-speaking Retrieval Augmented Generation (RAG) chatbot designed to enhance educational while redu… ▽ More

    Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures, SI with Annexes A, B and C upon request

  4. arXiv:2404.01353  [pdf, other

    cs.LG cs.AI cs.CL

    Efficiently Distilling LLMs for Edge Applications

    Authors: Achintya Kundu, Fabian Lim, Aaron Chew, Laura Wynter, Penny Chong, Rhui Dih Lee

    Abstract: Supernet training of LLMs is of great interest in industrial applications as it confers the ability to produce a palette of smaller models at constant cost, regardless of the number of models (of different size / latency) produced. We propose a new method called Multistage Low-rank Fine-tuning of Super-transformers (MLFS) for parameter-efficient supernet training. We show that it is possible to ob… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for publication in NAACL 2024 (Industry Track)

  5. Visually Improved Erosion Algorithm for the Procedural Generation of Tile-based Terrain

    Authors: Fong Yuan Lim, Yu Wei Tan, Anand Bhojan

    Abstract: Procedural terrain generation is the process of generating a digital representation of terrain using a computer program or procedure, with little to no human guidance. This paper proposes a procedural terrain generation algorithm based on a graph representation of fluvial erosion that offers several novel improvements over existing algorithms. Namely, the use of a height constraint map with two ty… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    ACM Class: I.3.5

  6. arXiv:2110.07275  [pdf, other

    cs.LG cs.AI

    Order Constraints in Optimal Transport

    Authors: Fabian Lim, Laura Wynter, Shiau Hong Lim

    Abstract: Optimal transport is a framework for comparing measures whereby a cost is incurred for transporting one measure to another. Recent works have aimed to improve optimal transport plans through the introduction of various forms of structure. We introduce novel order constraints into the optimal transport formulation to allow for the incorporation of structure. We define an efficient method for obtain… ▽ More

    Submitted 28 June, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: To appear in Proceedings of ICML 2022. Main Paper + Supplementary

  7. arXiv:2102.11906  [pdf, other

    eess.AS cs.SD

    Handling Background Noise in Neural Speech Generation

    Authors: Tom Denton, Alejandro Luebs, Felicia S. C. Lim, Andrew Storus, Hengchin Yeh, W. Bastiaan Kleijn, Jan Skoglund

    Abstract: Recent advances in neural-network based generative modeling of speech has shown great potential for speech coding. However, the performance of such models drops when the input is not clean speech, e.g., in the presence of background noise, preventing its use in practical applications. In this paper we examine the reason and discuss methods to overcome this issue. Placing a denoising preprocessing… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: 5 pages, 3 figures, presented at the Asilomar Conference on Signals, Systems, and Computers 2020

  8. arXiv:2102.09660  [pdf, other

    eess.AS cs.SD

    Generative Speech Coding with Predictive Variance Regularization

    Authors: W. Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh

    Abstract: The recent emergence of machine-learning based generative models for speech suggests a significant reduction in bit rate for speech codecs is possible. However, the performance of generative models deteriorates significantly with the distortions present in real-world input signals. We argue that this deterioration is due to the sensitivity of the maximum likelihood criterion to outliers and the in… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    MSC Class: 94 ACM Class: I.m

  9. arXiv:2004.09584  [pdf, other

    eess.AS cs.SD eess.SP

    ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric

    Authors: Michael Chinen, Felicia S. C. Lim, Jan Skoglund, Nikita Gureev, Feargus O'Gorman, Andrew Hines

    Abstract: Estimation of perceptual quality in audio and speech is possible using a variety of methods. The combined v3 release of ViSQOL and ViSQOLAudio (for speech and audio, respectively,) provides improvements upon previous versions, in terms of both design and usage. As an open source C++ library or binary with permissive licensing, ViSQOL can now be deployed beyond the research context into production… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX)

  10. arXiv:1910.06464  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder

    Authors: Cristina Gârbacea, Aäron van den Oord, Yazhe Li, Felicia S C Lim, Alejandro Luebs, Oriol Vinyals, Thomas C Walters

    Abstract: In order to efficiently transmit and store speech signals, speech codecs create a minimally redundant representation of the input signal which is then decoded at the receiver with the best possible perceptual quality. In this work we demonstrate that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: ICASSP 2019

    Journal ref: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 735-739. IEEE, 2019

  11. arXiv:1909.04776  [pdf, other

    eess.AS cs.SD

    Generative Speech Enhancement Based on Cloned Networks

    Authors: Michael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund

    Abstract: We propose to implement speech enhancement by the regeneration of clean speech from a salient representation extracted from the noisy signal. The network that extracts salient features is trained using a set of weight-sharing clones of the extractor network. The clones receive mel-frequency spectra of different noisy versions of the same speech signal as input. By encouraging the outputs of the cl… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Accepted WASPAA 2019

  12. arXiv:1908.07045  [pdf, other

    eess.AS cs.SD

    Salient Speech Representations Based on Cloned Networks

    Authors: W. Bastiaan Kleijn, Felicia S. C. Lim, Michael Chinen, Jan Skoglund

    Abstract: We define salient features as features that are shared by signals that are defined as being equivalent by a system designer. The definition allows the designer to contribute qualitative information. We aim to find salient features that are useful as conditioning for generative networks. We extract salient features by jointly training a set of clones of an encoder network. Each network clone receiv… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: Interspeech 2019

  13. arXiv:1712.01120  [pdf, other

    eess.AS cs.SD eess.SP

    Wavenet based low rate speech coding

    Authors: W. Bastiaan Kleijn, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Florian Stimberg, Quan Wang, Thomas C. Walters

    Abstract: Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s. We compare this parametric coder with a waveform coder based on the same generative m… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 5 pages, 2 figures

  14. arXiv:1601.07264  [pdf

    cs.HC cs.CY

    Persuasive Teachable Agent for Intergenerational Learning

    Authors: Su Fang Lim

    Abstract: Teachable agents are computer agents based on the pedagogical concept of learning-by-teaching. During the tutoring process, where students take on the role of the tutor to teach a computer agent tutee, learners have been observed to gain deeper understanding of the subject matter. Teachable agents are commonly used in the areas of science and mathematics learning where learners are able to learn c… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    Comments: This is a book draft

  15. The Single-Uniprior Index-Coding Problem: The Single-Sender Case and The Multi-Sender Extension

    Authors: Lawrence Ong, Chin Keong Ho, Fabian Lim

    Abstract: Index coding studies multiterminal source-coding problems where a set of receivers are required to decode multiple (possibly different) messages from a common broadcast, and they each know some messages a priori. In this paper, at the receiver end, we consider a special setting where each receiver knows only one message a priori, and each message is known to only one receiver. At the broadcasting… ▽ More

    Submitted 31 May, 2016; v1 submitted 3 December, 2014; originally announced December 2014.

    Comments: Author final manuscript

    Journal ref: IEEE Transactions on Information Theory, Vol. 62, No. 6, pp. 3165-3182, June 2016

  16. The Multi-Sender Multicast Index Coding

    Authors: Lawrence Ong, Fabian Lim, Chin Keong Ho

    Abstract: We focus on the following instance of an index coding problem, where a set of receivers are required to decode multiple messages, whilst each knows one of the messages a priori. In particular, here we consider a generalized setting where they are multiple senders, each sender only knows a subset of messages, and all senders are required to collectively transmit the index code. For a single sender,… ▽ More

    Submitted 26 June, 2013; v1 submitted 13 May, 2013; originally announced May 2013.

    Comments: This is an extended version of the same-titled paper accepted and to be presented at the IEEE International Symposium on Information Theory (ISIT), Istanbul, in July 2013

    Journal ref: Proceedings of the 2013 IEEE International Symposium on Information Theory Proceedings (ISIT), Istanbul. Turkey, 7-12 July 2013, pp. 1147-1151

  17. On U-Statistics and Compressed Sensing II: Non-Asymptotic Worst-Case Analysis

    Authors: Fabian Lim, Vladimir Stojanovic

    Abstract: In another related work, U-statistics were used for non-asymptotic "average-case" analysis of random compressed sensing matrices. In this companion paper the same analytical tool is adopted differently - here we perform non-asymptotic "worst-case" analysis. Simple union bounds are a natural choice for "worst-case" analyses, however their tightness is an issue (and questioned in previous works).… ▽ More

    Submitted 30 October, 2012; originally announced October 2012.

    Comments: 12 pages. Submitted to IEEE Transactions on Signal Processing

  18. On U-Statistics and Compressed Sensing I: Non-Asymptotic Average-Case Analysis

    Authors: Fabian Lim, Vladimir Marko Stojanovic

    Abstract: Hoeffding's U-statistics model combinatorial-type matrix parameters (appearing in CS theory) in a natural way. This paper proposes using these statistics for analyzing random compressed sensing matrices, in the non-asymptotic regime (relevant to practice). The aim is to address certain pessimisms of "worst-case" restricted isometry analyses, as observed by both Blanchard & Dossal, et. al. We sho… ▽ More

    Submitted 30 October, 2012; originally announced October 2012.

    Comments: 12 pages. 3 pages supplementary material. Submitted to IEEE Trans. Signal Processing

  19. arXiv:1207.6986  [pdf, other

    cs.DS cs.IT

    Two Embedding Theorems for Data with Equivalences under Finite Group Action

    Authors: Fabian Lim

    Abstract: There is recent interest in compressing data sets for non-sequential settings, where lack of obvious orderings on their data space, require notions of data equivalences to be considered. For example, Varshney & Goyal (DCC, 2006) considered multiset equivalences, while Choi & Szpankowski (IEEE Trans. IT, 2012) considered isomorphic equivalences in graphs. Here equivalences are considered under a re… ▽ More

    Submitted 15 October, 2012; v1 submitted 30 July, 2012; originally announced July 2012.

    Comments: 10 page extended abstract plus two sets of supplementary material. 1 figure. Preliminary report

  20. arXiv:1202.0241  [pdf, ps, other

    cs.IT

    Linear Programming Upper Bounds on Permutation Code Sizes From Coherent Configurations Related to the Kendall Tau Distance Metric

    Authors: Fabian Lim, Manabu Hagiwara

    Abstract: Recent interest on permutation rank modulation shows the Kendall tau metric as an important distance metric. This note documents our first efforts to obtain upper bounds on optimal code sizes (for said metric) ala Delsarte's approach. For the Hamming metric, Delsarte's seminal work on powerful linear programming (LP) bounds have been extended to permutation codes, via association scheme theory. Fo… ▽ More

    Submitted 5 June, 2012; v1 submitted 1 February, 2012; originally announced February 2012.

    Comments: IEEE Intl. Symp. on Inform. Theory 2012. Final Version

  21. arXiv:1007.0379  [pdf, ps, other

    cs.IT

    Reliability Distributions of Truncated Max-log-map (MLM) Detectors Applied to ISI Channels

    Authors: Fabian Lim, Aleksandar Kavcic

    Abstract: The max-log-map (MLM) receiver is an approximated version of the well-known, Bahl-Cocke-Jelinek-Raviv (BCJR) algorithm. The MLM algorithm is attractive due to its implementation simplicity. In practice, sliding-window implementations are preferred; these practical implementations consider truncated signaling neighborhoods around each transmission time instant. In this paper, we consider sliding-wi… ▽ More

    Submitted 11 April, 2013; v1 submitted 2 July, 2010; originally announced July 2010.

    Comments: 17 pages, 11 figures

    Journal ref: IEEE Trans. on Inform. Theory, Volume 59, Issue 4, pp. 2411-2425, April 2013