Showing 1–7 of 7 results for author: Rashid, M R U

  1. arXiv:2503.15815

    cs.AI

    Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing

    Authors: Vishnu Asutosh Dasu, Md Rafi ur Rashid, Vipul Gupta, Saeid Tizpaz-Niari, Gang Tan

    Abstract: This paper explores pruning attention heads as a post-processing bias mitigation method for large language models (LLMs). Modern AI systems such as LLMs are expanding into sensitive social contexts where fairness concerns become especially crucial. Since LLMs develop decision-making patterns by training on massive datasets of human-generated content, they naturally encode and perpetuate societal b…

    Submitted 19 March, 2025; originally announced March 2025.

  2. arXiv:2501.16497

    cs.LG cs.AI cs.CL cs.CR stat.ML

    Smoothed Embeddings for Robust Language Models

    Authors: Ryo Hase, Md Rafi Ur Rashid, Ashley Lewis, Jing Liu, Toshiaki Koike-Akino, Kieran Parsons, Ye Wang

    Abstract: Improving the safety and reliability of large language models (LLMs) is a crucial aspect of realizing trustworthy AI systems. Although alignment methods aim to suppress harmful content generation, LLMs are often still vulnerable to jailbreaking attacks that employ adversarial inputs that subvert alignment and induce harmful outputs. We propose the Randomized Embedding Smoothing and Token Aggregati…

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: Presented in the Safe Generative AI Workshop at NeurIPS 2024

    MSC Class: 68T07 (Primary); 68T50 (Secondary)

  3. arXiv:2411.06426

    cs.CR cs.AI cs.CL cs.LG

    SequentialBreak: Large Language Models Can be Fooled by Embedding Jailbreak Prompts into Sequential Prompt Chains

    Authors: Bijoy Ahmed Saiem, MD Sadik Hossain Shanto, Rakib Ahsan, Md Rafi ur Rashid

    Abstract: As the integration of the Large Language Models (LLMs) into various applications increases, so does their susceptibility to misuse, raising significant security concerns. Numerous jailbreak attacks have been proposed to assess the security defense of LLMs. Current jailbreak attacks mainly rely on scenario camouflage, prompt obfuscation, prompt optimization, and prompt iterative optimization to con…

    Submitted 14 February, 2025; v1 submitted 10 November, 2024; originally announced November 2024.

  4. arXiv:2408.17354

    cs.LG cs.AI cs.CR

    Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage

    Authors: Md Rafi Ur Rashid, Jing Liu, Toshiaki Koike-Akino, Shagufta Mehnaz, Ye Wang

    Abstract: Fine-tuning large language models on private data for downstream applications poses significant privacy risks in potentially exposing sensitive information. Several popular community platforms now offer convenient distribution of a large variety of pre-trained models, allowing anyone to publish without rigorous verification. This scenario creates a privacy threat, as pre-trained models can be inte…

    Submitted 30 August, 2024; originally announced August 2024.

  5. arXiv:2403.10557

    cs.LG cs.AI cs.CL

    Second-Order Information Matters: Revisiting Machine Unlearning for Large Language Models

    Authors: Kang Gu, Md Rafi Ur Rashid, Najrin Sultana, Shagufta Mehnaz

    Abstract: With the rapid development of Large Language Models (LLMs), we have witnessed intense competition among the major LLM products like ChatGPT, LLaMa, and Gemini. However, various issues (e.g. privacy leakage and copyright violation) of the training corpus still remain underexplored. For example, the Times sued OpenAI and Microsoft for infringing on its copyrights by using millions of its articles fo…

    Submitted 13 March, 2024; originally announced March 2024.

  6. arXiv:2310.16152

    cs.CR cs.LG

    FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering

    Authors: Md Rafi Ur Rashid, Vishnu Asutosh Dasu, Kang Gu, Najrin Sultana, Shagufta Mehnaz

    Abstract: Federated learning (FL) has become a key component in various language modeling applications such as machine translation, next-word prediction, and medical record analysis. These applications are trained on datasets from many FL participants that often include privacy-sensitive data, such as healthcare records, phone/credit card numbers, login credentials, etc. Although FL enables computation with…

    Submitted 27 February, 2025; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 20 pages (including bibliography and Appendix), Submitted to ACM CCS '24

  7. arXiv:2308.05832

    cs.CR cs.LG

    FLShield: A Validation Based Federated Learning Framework to Defend Against Poisoning Attacks

    Authors: Ehsanul Kabir, Zeyu Song, Md Rafi Ur Rashid, Shagufta Mehnaz

    Abstract: Federated learning (FL) is revolutionizing how we learn from data. With its growing popularity, it is now being used in many safety-critical domains such as autonomous vehicles and healthcare. Since thousands of participants can contribute in this collaborative setting, it is, however, challenging to ensure security and reliability of such systems. This highlights the need to design FL systems tha…

    Submitted 10 August, 2023; originally announced August 2023.