Showing 1–9 of 9 results for author: Kang, J S

  1. arXiv:2505.17495  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    ProxySPEX: Inference-Efficient Interpretability via Sparse Feature Interactions in LLMs

    Authors: Landon Butler, Abhineet Agarwal, Justin Singh Kang, Yigit Efe Erginbas, Bin Yu, Kannan Ramchandran

    Abstract: Large Language Models (LLMs) have achieved remarkable performance by capturing complex interactions between input features. To identify these interactions, most existing approaches require enumerating all possible combinations of features up to a given order, causing them to scale poorly with the number of inputs $n$. Recently, Kang et al. (2025) proposed SPEX, an information-theoretic approach th…

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2505.12651  [pdf, ps, other]

    cs.AI

    $\texttt{DIAMONDs}$: A Dataset for $\mathbb{D}$ynamic $\mathbb{I}$nformation $\mathbb{A}$nd $\mathbb{M}$ental modeling $\mathbb{O}$f $\mathbb{N}$umeric $\mathbb{D}$iscussions

    Authors: Sayontan Ghosh, Mahnaz Koupaee, Yash Kumar Lal, Pegah Alipoormolabashi, Mohammad Saqib Hasan, Jun Seok Kang, Niranjan Balasubramanian

    Abstract: Understanding multiparty conversations demands robust Theory of Mind (ToM) capabilities, including the ability to track dynamic information, manage knowledge asymmetries, and distinguish relevant information across extended exchanges. To advance ToM evaluation in such settings, we present a carefully designed scalable methodology for generating high-quality benchmark conversation-question pairs wi…

    Submitted 18 May, 2025; originally announced May 2025.

  3. arXiv:2502.13870  [pdf, other]

    cs.LG cs.AI cs.CL cs.IT

    SPEX: Scaling Feature Interaction Explanations for LLMs

    Authors: Justin Singh Kang, Landon Butler, Abhineet Agarwal, Yigit Efe Erginbas, Ramtin Pedarsani, Kannan Ramchandran, Bin Yu

    Abstract: Large language models (LLMs) have revolutionized machine learning due to their ability to capture complex interactions between input features. Popular post-hoc explanation methods like SHAP provide marginal feature attributions, while their extensions to interaction importances only scale to small input lengths ($\approx 20$). We propose Spectral Explainer (SPEX), a model-agnostic interaction attr…

    Submitted 19 February, 2025; originally announced February 2025.

  4. arXiv:2410.19236  [pdf, other]

    cs.LG cs.CE q-bio.GN stat.CO

    SHAP zero Explains Biological Sequence Models with Near-zero Marginal Cost for Future Queries

    Authors: Darin Tsui, Aryan Musharaf, Yigit Efe Erginbas, Justin Singh Kang, Amirali Aghazadeh

    Abstract: The growing adoption of machine learning models for biological sequences has intensified the need for interpretable predictions, with Shapley values emerging as a theoretically grounded standard for model explanation. While effective for local explanations of individual input sequences, scaling Shapley-based interpretability to extract global biological insights requires evaluating thousands of se…

    Submitted 22 May, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

  5. arXiv:2402.02631  [pdf, other]

    cs.LG

    Learning to Understand: Identifying Interactions via the Möbius Transform

    Authors: Justin S. Kang, Yigit E. Erginbas, Landon Butler, Ramtin Pedarsani, Kannan Ramchandran

    Abstract: One of the key challenges in machine learning is to find interpretable representations of learned functions. The Möbius transform is essential for this purpose, as its coefficients correspond to unique importance scores for sets of input variables. This transform is closely related to widely used game-theoretic notions of importance like the Shapley and Banzhaf value, but it also captures crucial…

    Submitted 15 June, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: 34 pages, 16 figures

  6. arXiv:2301.06200  [pdf, other]

    eess.SP cs.LG

    Efficiently Computing Sparse Fourier Transforms of $q$-ary Functions

    Authors: Yigit Efe Erginbas, Justin Singh Kang, Amirali Aghazadeh, Kannan Ramchandran

    Abstract: Fourier transformations of pseudo-Boolean functions are popular tools for analyzing functions of binary sequences. Real-world functions often have structures that manifest in a sparse Fourier transform, and previous works have shown that under the assumption of sparsity the transform can be computed efficiently. But what if we want to compute the Fourier transform of functions defined over a $q$-a…

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: 29 pages, 3 figures

  7. arXiv:2007.15497  [pdf, other]

    cs.IT

    Minimum Feedback for Collision-Free Scheduling in Massive Random Access

    Authors: Justin Singh Kang, Wei Yu

    Abstract: Consider a massive random access scenario in which a small set of $k$ active users out of a large number of $n$ potential users need to be scheduled in $b\ge k$ slots. What is the minimum common feedback to the users needed to ensure that scheduling is collision-free? Instead of a naive scheme of listing the indices of the $k$ active users in the order in which they should transmit, at a cost of…

    Submitted 20 September, 2021; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: Accepted in IEEE Transactions on Information Theory

  8. arXiv:1905.11912  [pdf, other]

    cs.CL

    A Cross-Domain Transferable Neural Coherence Model

    Authors: Peng Xu, Hamidreza Saghir, Jin Sung Kang, Teng Long, Avishek Joey Bose, Yanshuai Cao, Jackie Chi Kit Cheung

    Abstract: Coherence is an important aspect of text quality and is crucial for ensuring its readability. One important limitation of existing coherence models is that training on one domain does not easily generalize to unseen categories of text. Previous work advocates for generative models for cross-domain generalization, because for discriminative models, the space of incoherent sentence orderings to disc…

    Submitted 9 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted at ACL 2019

  9. arXiv:1904.03111  [pdf, other]

    cs.CL

    PoMo: Generating Entity-Specific Post-Modifiers in Context

    Authors: Jun Seok Kang, Robert L. Logan IV, Zewei Chu, Yang Chen, Dheeru Dua, Kevin Gimpel, Sameer Singh, Niranjan Balasubramanian

    Abstract: We introduce entity post-modifier generation as an instance of a collaborative writing task. Given a sentence about a target entity, the task is to automatically generate a post-modifier phrase that provides contextually relevant information about the entity. For example, for the sentence, "Barack Obama, _______, supported the #MeToo movement.", the phrase "a father of two girls" is a contextually…

    Submitted 8 April, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: NAACL-HLT 2019