Showing 1–2 of 2 results for author: Koh, J H

  1. arXiv:2409.01696 [pdf, other]

    cs.CV

    On the Vulnerability of Skip Connections to Model Inversion Attacks

    Authors: Jun Hao Koh, Sy-Tuyen Ho, Ngoc-Bao Nguyen, Ngai-man Cheung

    Abstract: Skip connections are fundamental architecture designs for modern deep neural networks (DNNs) such as CNNs and ViTs. While they help improve model performance significantly, we identify a vulnerability associated with skip connections to Model Inversion (MI) attacks, a type of privacy attack that aims to reconstruct private training data through abusive exploitation of a model. In this paper, as a…

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: Accepted by ECCV 2024

  2. arXiv:2408.03837 [pdf, other]

    cs.CL cs.AI

    WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models

    Authors: Prannaya Gupta, Le Qi Yau, Hao Han Low, I-Shiang Lee, Hugo Maximus Lim, Yu Xin Teoh, Jia Hng Koh, Dar Win Liew, Rishabh Bhardwaj, Rajat Bhardwaj, Soujanya Poria

    Abstract: WalledEval is a comprehensive AI safety testing toolkit designed to evaluate large language models (LLMs). It accommodates a diverse range of models, including both open-weight and API-based ones, and features over 35 safety benchmarks covering areas such as multilingual safety, exaggerated safety, and prompt injections. The framework supports both LLM and judge benchmarking and incorporates custo…

    Submitted 19 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Under review