
Showing 1–20 of 20 results for author: Pentyala, S

Searching in archive cs.
  1. arXiv:2502.17321 [pdf, other]

    cs.CL

    Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

    Authors: Prafulla Kumar Choubey, Xiangyu Peng, Shilpa Bhagavath, Caiming Xiong, Shiva Kumar Pentyala, Chien-Sheng Wu

    Abstract: Automated service agents require well-structured workflows to provide consistent and accurate responses to customer queries. However, these workflows are often undocumented, and their automatic extraction from conversations remains unexplored. In this work, we present a novel framework for extracting and evaluating dialog workflows from historical interactions. Our extraction process consists of t…

    Submitted 24 February, 2025; originally announced February 2025.

  2. arXiv:2412.03766 [pdf, other]

    cs.CR cs.LG

    End to End Collaborative Synthetic Data Generation

    Authors: Sikha Pentyala, Geetha Sitaraman, Trae Claar, Martine De Cock

    Abstract: The success of AI is based on the availability of data to train models. While in some cases a single data custodian may have sufficient data to enable AI, often multiple custodians need to collaborate to reach a cumulative size required for meaningful AI research. The latter is, for example, often the case for rare diseases, with each clinical site having data for only a small number of patients.…

    Submitted 4 December, 2024; originally announced December 2024.

  3. arXiv:2411.02448 [pdf, other]

    cs.CL cs.AI

    Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models

    Authors: Aliyah R. Hsu, James Zhu, Zhichao Wang, Bin Bi, Shubham Mehrotra, Shiva K. Pentyala, Katherine Tan, Xiang-Bo Mao, Roshanak Omrani, Sougata Chaudhuri, Regunathan Radhakrishnan, Sitaram Asur, Claire Na Cheng, Bin Yu

    Abstract: LLMs have demonstrated impressive proficiency in generating coherent and high-quality text, making them valuable across a range of text-generation tasks. However, rigorous evaluation of this generated content is crucial, as ensuring its quality remains a significant challenge due to persistent issues such as factual inaccuracies and hallucination. This paper introduces three fine-tuned general-pur…

    Submitted 20 May, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  4. arXiv:2410.05506 [pdf, other]

    cs.CR cs.LG

    Privacy Vulnerabilities in Marginals-based Synthetic Data

    Authors: Steven Golob, Sikha Pentyala, Anuar Maratkhan, Martine De Cock

    Abstract: When acting as a privacy-enhancing technology, synthetic data generation (SDG) aims to maintain a resemblance to the real data while excluding personally-identifiable information. Many SDG algorithms provide robust differential privacy (DP) guarantees to this end. However, we show that the strongest class of SDG algorithms--those that preserve marginal probabilities, or similar statistics…

    Submitted 31 March, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: Accepted at 3rd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML) 2025

  5. arXiv:2408.15339 [pdf, other]

    cs.LG cs.CL

    UNA: Unifying Alignments of RLHF/PPO, DPO and KTO by a Generalized Implicit Reward Function

    Authors: Zhichao Wang, Bin Bi, Can Huang, Shiva Kumar Pentyala, Zixu James Zhu, Sitaram Asur, Na Claire Cheng

    Abstract: An LLM is pretrained on trillions of tokens, but the pretrained LLM may still generate undesired responses. To solve this problem, alignment techniques such as RLHF, DPO and KTO are proposed. However, these alignment techniques have limitations. For example, RLHF requires training the reward model and policy separately, which is complex, time-consuming, memory intensive and unstable during trainin…

    Submitted 5 April, 2025; v1 submitted 27 August, 2024; originally announced August 2024.

  6. arXiv:2407.16216 [pdf, other]

    cs.CL

    A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More

    Authors: Zhichao Wang, Bin Bi, Shiva Kumar Pentyala, Kiran Ramnath, Sougata Chaudhuri, Shubham Mehrotra, Zixu Zhu, Xiang-Bo Mao, Sitaram Asur, Na Cheng

    Abstract: With advancements in self-supervised learning, the availability of trillions of tokens in a pre-training corpus, instruction fine-tuning, and the development of large Transformers with billions of parameters, large language models (LLMs) are now capable of generating factual and coherent responses to human queries. However, the mixed quality of training data can lead to the generation of undesired re…

    Submitted 23 July, 2024; originally announced July 2024.

  7. arXiv:2406.17923 [pdf, other]

    cs.CL

    PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

    Authors: Shiva Kumar Pentyala, Zhichao Wang, Bin Bi, Kiran Ramnath, Xiang-Bo Mao, Regunathan Radhakrishnan, Sitaram Asur, Na Cheng

    Abstract: Large language models (LLMs) have shown remarkable abilities in diverse natural language processing (NLP) tasks. The LLMs generally undergo supervised fine-tuning (SFT) followed by preference alignment to be usable in downstream applications. However, this sequential training pipeline leads to alignment tax that degrades the LLM performance. This paper introduces PAFT, a new PArallel training pa…

    Submitted 25 June, 2024; originally announced June 2024.

  8. arXiv:2402.08614 [pdf, other]

    cs.CR

    CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources

    Authors: Sikha Pentyala, Mayana Pereira, Martine De Cock

    Abstract: Data is the lifeblood of the modern world, forming a fundamental part of AI, decision-making, and research advances. With increase in interest in data, governments have taken important steps towards a regulated data world, drastically impacting data sharing and data usability and resulting in massive amounts of data confined within the walls of organizations. While synthetic data generation (SDG)…

    Submitted 8 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 41st International Conference on Machine Learning, 2024

  9. arXiv:2402.06699 [pdf, other]

    cs.CR

    High Epsilon Synthetic Data Vulnerabilities in MST and PrivBayes

    Authors: Steven Golob, Sikha Pentyala, Anuar Maratkhan, Martine De Cock

    Abstract: Synthetic data generation (SDG) has become increasingly popular as a privacy-enhancing technology. It aims to maintain important statistical properties of its underlying training data, while excluding any personally identifiable information. There have been a whole host of SDG algorithms developed in recent years to improve and balance both of these aims. Many of these algorithms provide robust di…

    Submitted 9 February, 2024; originally announced February 2024.

  10. arXiv:2311.14137 [pdf, other]

    cs.LG cs.CR

    Privacy-Preserving Algorithmic Recourse

    Authors: Sikha Pentyala, Shubham Sharma, Sanjay Kariyappa, Freddy Lecue, Daniele Magazzeni

    Abstract: When individuals are subject to adverse outcomes from machine learning models, providing a recourse path to help achieve a positive outcome is desirable. Recent work has shown that counterfactual explanations - which can be used as a means of single-step recourse - are vulnerable to privacy issues, putting an individual's privacy at risk. Providing a sequential multi-step path for recourse can amp…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted at 3rd International Workshop on Explainable AI in Finance, ICAIF 2023

  11. arXiv:2303.02916 [pdf, other]

    cs.IR cs.CR cs.CY

    Privacy-Preserving Fair Item Ranking

    Authors: Jia Ao Sun, Sikha Pentyala, Martine De Cock, Golnoosh Farnadi

    Abstract: Users worldwide access massive amounts of curated data in the form of rankings on a daily basis. The societal impact of this ease of access has been studied and work has been done to propose and enforce various notions of fairness in rankings. Current computational methods for fair item ranking rely on disclosing user data to a centralized server, which gives rise to privacy concerns for the users…

    Submitted 6 March, 2023; originally announced March 2023.

  12. arXiv:2210.07332 [pdf, other]

    cs.CR cs.LG

    Secure Multiparty Computation for Synthetic Data Generation from Distributed Data

    Authors: Mayana Pereira, Sikha Pentyala, Anderson Nascimento, Rafael T. de Sousa Jr., Martine De Cock

    Abstract: Legal and ethical restrictions on accessing relevant data inhibit data science research in critical domains such as health, finance, and education. Synthetic data generation algorithms with privacy guarantees are emerging as a paradigm to break this data logjam. Existing approaches, however, assume that the data holders supply their raw data to a trusted curator, who uses it as fuel for synthetic…

    Submitted 28 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  13. arXiv:2205.11584 [pdf, other]

    cs.LG cs.CR

    PrivFairFL: Privacy-Preserving Group Fairness in Federated Learning

    Authors: Sikha Pentyala, Nicola Neophytou, Anderson Nascimento, Martine De Cock, Golnoosh Farnadi

    Abstract: Group fairness ensures that the outcome of machine learning (ML) based decision making systems are not biased towards a certain group of people defined by a sensitive attribute such as gender or ethnicity. Achieving group fairness in Federated Learning (FL) is challenging because mitigating bias inherently requires using the sensitive attribute values of all clients, while FL is aimed precisely at…

    Submitted 26 August, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  14. arXiv:2202.04058 [pdf, other]

    cs.LG cs.CR

    PrivFair: a Library for Privacy-Preserving Fairness Auditing

    Authors: Sikha Pentyala, David Melanson, Martine De Cock, Golnoosh Farnadi

    Abstract: Machine learning (ML) has become prominent in applications that directly affect people's quality of life, including in healthcare, justice, and finance. ML models have been found to exhibit discrimination based on sensitive attributes such as gender, race, or disability. Assessing if an ML model is free of bias remains challenging to date, and by definition has to be done with sensitive user chara…

    Submitted 23 May, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  15. arXiv:2202.02625 [pdf, other]

    cs.CR cs.LG

    Training Differentially Private Models with Secure Multiparty Computation

    Authors: Sikha Pentyala, Davis Railsback, Ricardo Maia, Rafael Dowsley, David Melanson, Anderson Nascimento, Martine De Cock

    Abstract: We address the problem of learning a machine learning model from training data that originates at multiple data owners while providing formal privacy guarantees regarding the protection of each owner's data. Existing solutions based on Differential Privacy (DP) achieve this at the cost of a drop in accuracy. Solutions based on Secure Multiparty Computation (MPC) do not incur such accuracy loss but…

    Submitted 11 March, 2025; v1 submitted 5 February, 2022; originally announced February 2022.

  16. arXiv:2102.03513 [pdf, other]

    cs.CR cs.CV cs.LG

    Privacy-Preserving Video Classification with Convolutional Neural Networks

    Authors: Sikha Pentyala, Rafael Dowsley, Martine De Cock

    Abstract: Many video classification applications require access to personal data, thereby posing an invasive security risk to the users' privacy. We propose a privacy-preserving implementation of single-frame method based video classification with convolutional neural networks that allows a party to infer a label from a video without necessitating the video owner to disclose their video to other entities in…

    Submitted 6 February, 2021; originally announced February 2021.

  17. arXiv:2007.12358 [pdf, other]

    cs.IR cs.AI cs.SI

    Machine Learning Explanations to Prevent Overtrust in Fake News Detection

    Authors: Sina Mohseni, Fan Yang, Shiva Pentyala, Mengnan Du, Yi Liu, Nic Lupfer, Xia Hu, Shuiwang Ji, Eric Ragan

    Abstract: Combating fake news and misinformation propagation is a challenging task in the post-truth era. News feed and search algorithms could potentially lead to unintentional large-scale propagation of false and fabricated information with users being exposed to algorithmically selected false content. Our research investigates the effects of an Explainable AI assistant embedded in news review platforms f…

    Submitted 26 July, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

  18. arXiv:1909.05999 [pdf, other]

    cs.CV

    Towards Generalizable Deepfake Detection with Locality-aware AutoEncoder

    Authors: Mengnan Du, Shiva Pentyala, Yuening Li, Xia Hu

    Abstract: With advancements of deep learning techniques, it is now possible to generate super-realistic images and videos, i.e., deepfakes. These deepfakes could reach mass audience and result in adverse impacts on our society. Although lots of efforts have been devoted to detect deepfakes, their performance drops significantly on previously unseen but related manipulations and the detection generalization…

    Submitted 19 September, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Accepted by CIKM 2020

  19. arXiv:1907.07757 [pdf, other]

    cs.CY cs.CL cs.LG

    XFake: Explainable Fake News Detector with Visualizations

    Authors: Fan Yang, Shiva K. Pentyala, Sina Mohseni, Mengnan Du, Hao Yuan, Rhema Linder, Eric D. Ragan, Shuiwang Ji, Xia Hu

    Abstract: In this demo paper, we present the XFake system, an explainable fake news detector that assists end-users to identify news credibility. To effectively detect and interpret the fakeness of news items, we jointly consider both attributes (e.g., speaker) and statements. Specifically, MIMIC, ATTN and PERT frameworks are designed, where MIMIC is built for attribute analysis, ATTN is for statement seman…

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 4 pages, WebConf'2019 Demo

  20. arXiv:1907.01791 [pdf, other]

    cs.CL cs.AI cs.LG

    Multi-Task Networks With Universe, Group, and Task Feature Learning

    Authors: Shiva Pentyala, Mengwen Liu, Markus Dreyer

    Abstract: We present methods for multi-task learning that take advantage of natural groupings of related tasks. Task groups may be defined along known properties of the tasks, such as task domain or language. Such task groups represent supervised information at the inter-task level and can be encoded into the model. We investigate two variants of neural network architectures that accomplish this, learning d…

    Submitted 3 July, 2019; originally announced July 2019.