Skip to main content

Showing 1–50 of 55 results for author: Si, X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07982  [pdf, ps, other

    cs.AI cs.CL

    $τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment

    Authors: Victor Barres, Honghua Dong, Soham Ray, Xujie Si, Karthik Narasimhan

    Abstract: Existing benchmarks for conversational AI agents simulate single-control environments, where only the AI agent can use tools to interact with the world, while the user remains a passive information provider. This differs from real-world scenarios like technical support, where users need to actively participate in modifying the state of the (shared) world. In order to address this gap, we introduce… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.00563  [pdf, ps, other

    cs.LG cs.AI

    Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments

    Authors: Ziyan Luo, Tianwei Ni, Pierre-Luc Bacon, Doina Precup, Xujie Si

    Abstract: A key approach to state abstraction is approximating behavioral metrics (notably, bisimulation metrics) in the observation space and embedding these learned distances in the representation space. While promising for robustness to task-irrelevant noise, as shown in prior work, accurately estimating these metrics remains challenging, requiring various design choices that create gaps between theory a… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  3. arXiv:2505.19271  [pdf, ps, other

    cs.SE

    VerifyThisBench: Generating Code, Specifications, and Proofs All at Once

    Authors: Xun Deng, Sicheng Zhong, Andreas Veneris, Fan Long, Xujie Si

    Abstract: Large language models (LLMs) have demonstrated remarkable progress in code generation, but many existing benchmarks are approaching saturation and offer little guarantee on the trustworthiness of the generated programs, offering limited insight into deeper reasoning capabilities. We introduce VerifyThisBench, a new benchmark designed to evaluate LLMs on end-to-end program verification tasks that r… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  4. arXiv:2504.17384  [pdf, other

    physics.geo-ph cs.AI

    On the workflow, opportunities and challenges of developing foundation model in geophysics

    Authors: Hanlin Sheng, Xinming Wu, Hang Gao, Haibin Di, Sergey Fomel, Jintao Li, Xu Si

    Abstract: Foundation models, as a mainstream technology in artificial intelligence, have demonstrated immense potential across various domains in recent years, particularly in handling complex tasks and multimodal data. In the field of geophysics, although the application of foundation models is gradually expanding, there is currently a lack of comprehensive reviews discussing the full workflow of integrati… ▽ More

    Submitted 25 April, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  5. arXiv:2504.03048  [pdf, other

    cs.LG cs.CL

    LLM Library Learning Fails: A LEGO-Prover Case Study

    Authors: Ian Berlot-Attwell, Frank Rudzicz, Xujie Si

    Abstract: Recent advancements in the coding, reasoning, and tool-using abilities of LLMs have spurred interest in library learning (i.e., online learning through the creation, storage, and retrieval of reusable and composable functions, knowledge, checklists, or lemmas). Such systems often promise improved task performance through the automatic creation of broadly applicable tools, as well as superior compu… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 24 pages, 5 figures

  6. arXiv:2503.19476  [pdf, ps, other

    cs.LG

    Extracting Interpretable Logic Rules from Graph Neural Networks

    Authors: Chuqin Geng, Ziyu Zhao, Zhaoyue Wang, Haolin Ye, Xujie Si

    Abstract: Graph neural networks (GNNs) operate over both input feature spaces and combinatorial graph structures, making it challenging to understand the rationale behind their predictions. As GNNs gain widespread popularity and demonstrate success across various domains, such as drug discovery, studying their interpretability has become a critical task. To address this, many explainability methods have bee… ▽ More

    Submitted 5 June, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: 22 pages, 9 figures

  7. arXiv:2503.10547  [pdf, other

    cs.CV

    Learning Interpretable Logic Rules from Deep Vision Models

    Authors: Chuqin Geng, Yuhe Jiang, Ziyu Zhao, Haolin Ye, Zhaoyue Wang, Xujie Si

    Abstract: We propose a general framework called VisionLogic to extract interpretable logic rules from deep vision models, with a focus on image classification tasks. Given any deep vision model that uses a fully connected layer as the output head, VisionLogic transforms neurons in the last layer into predicates and grounds them into vision concepts using causal validation. In this way, VisionLogic can provi… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 10 pages, 6 figures

  8. arXiv:2502.13834  [pdf, other

    cs.AI

    Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning

    Authors: Zenan Li, Zhaoyu Li, Wen Tang, Xian Zhang, Yuan Yao, Xujie Si, Fan Yang, Kaiyu Yang, Xiaoxing Ma

    Abstract: Large language models (LLMs) can prove mathematical theorems formally by generating proof steps (\textit{a.k.a.} tactics) within a proof system. However, the space of possible tactics is vast and complex, while the available training data for formal proofs is limited, posing a significant challenge to LLM-based tactic generation. To address this, we introduce a neuro-symbolic tactic generator that… ▽ More

    Submitted 26 February, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Published as a conference paper at ICLR 2025. Code is available at https://github.com/Lizn-zn/NeqLIPS/

  9. arXiv:2502.07829  [pdf, other

    cs.CV cs.LG

    Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing

    Authors: Sihao Wu, Xiaonan Si, Chi Xing, Jianhong Wang, Gaojie Jin, Guangliang Cheng, Lijun Zhang, Xiaowei Huang

    Abstract: The integration of preference alignment with diffusion models (DMs) has emerged as a transformative approach to enhance image generation and editing capabilities. Although integrating diffusion models with preference alignment strategies poses significant challenges for novices at this intersection, comprehensive and systematic reviews of this subject are still notably lacking. To bridge this gap,… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  10. arXiv:2502.05344  [pdf, other

    cs.SE cs.AI

    RAG-Verus: Repository-Level Program Verification with LLMs using Retrieval Augmented Generation

    Authors: Sicheng Zhong, Jiading Zhu, Yifang Tian, Xujie Si

    Abstract: Scaling automated formal verification to real-world projects requires resolving cross-module dependencies and global contexts, which are challenges overlooked by existing function-centric methods. We introduce RagVerus, a framework that synergizes retrieval-augmented generation with context-aware prompting to automate proof synthesis for multi-module repositories, achieving a 27% relative improvem… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  11. arXiv:2501.08281  [pdf, ps, other

    cs.LG

    From Neural Representations to Interpretable Logic Rules

    Authors: Chuqin Geng, Xiaojie Xu, Anqi Xing, Ziyu Zhao, Xujie Si

    Abstract: As deep neural networks continue to excel across various domains, their black-box nature has raised concerns about transparency and trust. In particular, interpretability has become increasingly essential for applications that demand high safety and knowledge rigor, such as drug discovery, autonomous driving, and genomics. However, progress in understanding even the simplest deep neural networks -… ▽ More

    Submitted 11 June, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: 23 pages, 7 figures

  12. arXiv:2412.20801  [pdf, other

    cs.CV

    Generalize Your Face Forgery Detectors: An Insertable Adaptation Module Is All You Need

    Authors: Xiaotian Si, Linghui Li, Liwei Zhang, Ziduo Guo, Kaiguo Yuan, Bingyu Li, Xiaoyong Li

    Abstract: A plethora of face forgery detectors exist to tackle facial deepfake risks. However, their practical application is hindered by the challenge of generalizing to forgeries unseen during the training stage. To this end, we introduce an insertable adaptation module that can adapt a trained off-the-shelf detector using only online unlabeled test data, without requiring modifications to the architectur… ▽ More

    Submitted 30 December, 2024; originally announced December 2024.

    Comments: ICASSP2025 accepted

  13. arXiv:2411.12773  [pdf, other

    cs.CV

    Decoupling Training-Free Guided Diffusion by ADMM

    Authors: Youyuan Zhang, Zehua Liu, Zenan Li, Zhaoyu Li, James J. Clark, Xujie Si

    Abstract: In this paper, we consider the conditional generation problem by guiding off-the-shelf unconditional diffusion models with differentiable loss functions in a plug-and-play fashion. While previous research has primarily focused on balancing the unconditional diffusion model and the guided loss through a tuned weight hyperparameter, we propose a novel framework that distinctly decouples these two co… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  14. arXiv:2411.00773  [pdf, other

    cs.AI

    LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation

    Authors: Bowen Li, Zhaoyu Li, Qiwei Du, Jinqi Luo, Wenshan Wang, Yaqi Xie, Simon Stepputtis, Chen Wang, Katia P. Sycara, Pradeep Kumar Ravikumar, Alexander G. Gray, Xujie Si, Sebastian Scherer

    Abstract: Recent years have witnessed the rapid development of Neuro-Symbolic (NeSy) AI systems, which integrate symbolic reasoning into deep neural networks. However, most of the existing benchmarks for NeSy AI fail to provide long-horizon reasoning tasks with complex multi-agent interactions. Furthermore, they are usually constrained by fixed and simplistic logical rules over limited entities, making them… ▽ More

    Submitted 3 April, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: 25 pages, 8 figures, In Advances in Neural Information Processing Systems (NeurIPS) 37 D&B Track (2024): 69840-69864

    Journal ref: Advances in Neural Information Processing Systems, 37, 69840-69864 (2024)

  15. arXiv:2410.20274  [pdf, other

    cs.LG cs.CL cs.SC

    Library Learning Doesn't: The Curious Case of the Single-Use "Library"

    Authors: Ian Berlot-Attwell, Frank Rudzicz, Xujie Si

    Abstract: Advances in Large Language Models (LLMs) have spurred a wave of LLM library learning systems for mathematical reasoning. These systems aim to learn a reusable library of tools, such as formal Isabelle lemmas or Python programs that are tailored to a family of tasks. Many of these systems are inspired by the human structuring of knowledge into reusable and extendable concepts, but do current method… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

    Comments: 24 pages, 7 figures. Accepted to the 4th MATH-AI Workshop at NeurIPS'24

  16. arXiv:2409.14779  [pdf, other

    cs.AR

    Hardware/Algorithm Co-design for Real-Time I/O Control with Improved Timing Accuracy and Robustness

    Authors: Zhe Jiang, Shuai Zhao, Ran Wei, Xin Si, Gang Chen, Nan Guan

    Abstract: In safety-critical systems, timing accuracy is the key to achieving precise I/O control. To meet such strict timing requirements, dedicated hardware assistance has recently been investigated and developed. However, these solutions are often fragile, due to unforeseen timing defects. In this paper, we propose a robust and timing-accurate I/O co-processor, which manages I/O tasks using Execution Tim… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: Accepted at the 2024 IEEE Real-Time Systems Symposium (RTSS)

    ACM Class: C.3; D.4.7

  17. arXiv:2409.04962  [pdf, other

    physics.geo-ph cs.LG

    A foundation model enpowered by a multi-modal prompt engine for universal seismic geobody interpretation across surveys

    Authors: Hang Gao, Xinming Wu, Luming Liang, Hanlin Sheng, Xu Si, Gao Hui, Yaxing Li

    Abstract: Seismic geobody interpretation is crucial for structural geology studies and various engineering applications. Existing deep learning methods show promise but lack support for multi-modal inputs and struggle to generalize to different geobody types or surveys. We introduce a promptable foundation model for interpreting any geobodies across seismic surveys. This model integrates a pre-trained visio… ▽ More

    Submitted 13 September, 2024; v1 submitted 7 September, 2024; originally announced September 2024.

  18. arXiv:2408.09034  [pdf, ps, other

    cs.PL

    Modernizing SMT-Based Type Error Localization

    Authors: Max Kopinsky, Brigitte Pientka, Xujie Si

    Abstract: Traditional implementations of strongly-typed functional programming languages often miss the root cause of type errors. As a consequence, type error messages are often misleading and confusing - particularly for students learning such a language. We describe Tyro, a type error localization tool which determines the optimal source of an error for ill-typed programs following fundamental ideas by P… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures. About Tyro, available at https://github.com/JKTKops/tyro. To be published in FMCAD 2024

    ACM Class: D.3.3

  19. arXiv:2407.05411  [pdf, other

    cs.SE

    Assessing Code Generation with Intermediate Languages

    Authors: Xun Deng, Sicheng Zhong, Honghua Dong, Jingyu Hu, Sidi Mohamed Beillahi, Xujie Si, Fan Long

    Abstract: Intermediate step methodologies like chain of thoughts (COT) have demonstrated effectiveness in enhancing the performance of Large Language Models (LLMs) on code generation. This study explores the utilization of intermediate languages, including various programming languages, natural language solutions, and pseudo-code, and systematically evaluates their impact on the performance of LLMs in code… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  20. arXiv:2406.13161  [pdf, other

    cs.AI cs.CL cs.LG cs.PL

    APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts

    Authors: Honghua Dong, Qidong Su, Yubo Gao, Zhaoyu Li, Yangjun Ruan, Gennady Pekhimenko, Chris J. Maddison, Xujie Si

    Abstract: Large Language Models (LLMs) have become increasingly capable of handling diverse tasks with the aid of well-crafted prompts and integration of external tools, but as task complexity rises, the workflow involving LLMs can be complicated and thus challenging to implement and maintain. To address this challenge, we propose APPL, A Prompt Programming Language that acts as a bridge between computer pr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  21. Contextual Distillation Model for Diversified Recommendation

    Authors: Fan Li, Xu Si, Shisong Tang, Dingmin Wang, Kunyan Han, Bing Han, Guorui Zhou, Yang Song, Hechang Chen

    Abstract: The diversity of recommendation is equally crucial as accuracy in improving user experience. Existing studies, e.g., Determinantal Point Process (DPP) and Maximal Marginal Relevance (MMR), employ a greedy paradigm to iteratively select items that optimize both accuracy and diversity. However, prior methods typically exhibit quadratic complexity, limiting their applications to the re-ranking stage… ▽ More

    Submitted 14 August, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: accepted by KDD 2024 v2

  22. arXiv:2405.17503  [pdf, other

    cs.SE cs.AI cs.CL cs.PL

    Code Repair with LLMs gives an Exploration-Exploitation Tradeoff

    Authors: Hao Tang, Keya Hu, Jin Peng Zhou, Sicheng Zhong, Wei-Long Zheng, Xujie Si, Kevin Ellis

    Abstract: Iteratively improving and repairing source code with large language models (LLMs), known as refinement, has emerged as a popular way of generating programs that would be too complex to construct in one shot. Given a bank of test cases, together with a candidate program, an LLM can improve that program by being prompted with failed test cases. But it remains an open question how to best iteratively… ▽ More

    Submitted 29 October, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  23. arXiv:2405.17216  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Autoformalizing Euclidean Geometry

    Authors: Logan Murphy, Kaiyu Yang, Jialiang Sun, Zhaoyu Li, Anima Anandkumar, Xujie Si

    Abstract: Autoformalization involves automatically translating informal math into formal theorems and proofs that are machine-verifiable. Euclidean geometry provides an interesting and controllable domain for studying autoformalization. In this paper, we introduce a neuro-symbolic framework for autoformalizing Euclidean geometry, which combines domain knowledge, SMT solvers, and large language models (LLMs)… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024. The first two authors contributed equally

  24. arXiv:2404.09939  [pdf, other

    cs.AI

    A Survey on Deep Learning for Theorem Proving

    Authors: Zhaoyu Li, Jialiang Sun, Logan Murphy, Qidong Su, Zenan Li, Xian Zhang, Kaiyu Yang, Xujie Si

    Abstract: Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in natural language to rigorous derivations in formal systems. In recent years, the advancement of deep learning, especially the emergence of large language models, has sparked a notable surge of research exploring these techniques to enhance the process of theorem proving. This paper presents a comprehensive… ▽ More

    Submitted 21 August, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  25. arXiv:2404.04731  [pdf, other

    cs.PL cs.SE

    SAT-DIFF: A Tree Diffing Framework Using SAT Solving

    Authors: Chuqin Geng, Haolin Ye, Yihan Zhang, Brigitte Pientka, Xujie Si

    Abstract: Computing differences between tree-structured data is a critical but challenging problem in software analysis. In this paper, we propose a novel tree diffing approach called SatDiff, which reformulates the structural diffing problem into a MaxSAT problem. By encoding the necessary transformations from the source tree to the target tree, SatDiff generates correct, minimal, and type safe low-level e… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 23 pages, 7 figures

  26. arXiv:2404.04662  [pdf, other

    cs.LG cs.PL

    Learning Minimal Neural Specifications

    Authors: Chuqin Geng, Zhaoyue Wang, Haolin Ye, Xujie Si

    Abstract: Formal verification is only as good as the specification of a system, which is also true for neural network verification. Existing specifications follow the paradigm of data as specification, where the local neighborhood around a reference data point is considered correct or robust. While these specifications provide a fair testbed for assessing model robustness, they are too restrictive for verif… ▽ More

    Submitted 14 March, 2025; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: 30 pages,7 figures

  27. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  28. arXiv:2401.08399  [pdf, other

    cs.CV

    TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding

    Authors: Yun Liu, Haolin Yang, Xu Si, Ling Liu, Zipeng Li, Yuxiang Zhang, Yebin Liu, Li Yi

    Abstract: Humans commonly work with multiple objects in daily life and can intuitively transfer manipulation skills to novel objects by understanding object functional regularities. However, existing technical approaches for analyzing and synthesizing hand-object manipulation are mostly limited to handling a single hand and object due to the lack of data support. To address this, we construct TACO, an exten… ▽ More

    Submitted 25 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  29. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  30. arXiv:2310.20078  [pdf, other

    cs.SE

    TorchProbe: Fuzzing Dynamic Deep Learning Compilers

    Authors: Qidong Su, Chuqin Geng, Gennady Pekhimenko, Xujie Si

    Abstract: Static and dynamic computational graphs represent two distinct approaches to constructing deep learning frameworks. The former prioritizes compiler-based optimizations, while the latter focuses on programmability and user-friendliness. The recent release of PyTorch 2.0, which supports compiling arbitrary deep learning programs in Python, signifies a new direction in the evolution of deep learning… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  31. arXiv:2310.02133  [pdf, other

    cs.AI cs.LG

    Learning Reliable Logical Rules with SATNet

    Authors: Zhaoyu Li, Jinpei Guo, Yuhe Jiang, Xujie Si

    Abstract: Bridging logical reasoning and deep learning is crucial for advanced AI systems. In this work, we present a new framework that addresses this goal by generating interpretable and verifiable logical rules through differentiable learning, without relying on pre-specified logical structures. Our approach builds upon SATNet, a differentiable MaxSAT solver that learns the underlying rules from input-ou… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  32. arXiv:2309.16941  [pdf, other

    cs.LG

    G4SATBench: Benchmarking and Advancing SAT Solving with Graph Neural Networks

    Authors: Zhaoyu Li, Jinpei Guo, Xujie Si

    Abstract: Graph neural networks (GNNs) have recently emerged as a promising approach for solving the Boolean Satisfiability Problem (SAT), offering potential alternatives to traditional backtracking or local search SAT solvers. However, despite the growing volume of literature in this field, there remains a notable absence of a unified dataset and a fair benchmark to evaluate and compare existing approaches… ▽ More

    Submitted 10 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  33. arXiv:2309.02320  [pdf, other

    physics.geo-ph cs.AI cs.LG

    SeisCLIP: A seismology foundation model pre-trained by multi-modal data for multi-purpose seismic feature extraction

    Authors: Xu Si, Xinming Wu, Hanlin Sheng, Jun Zhu, Zefeng Li

    Abstract: Training specific deep learning models for particular tasks is common across various domains within seismology. However, this approach encounters two limitations: inadequate labeled data for certain tasks and limited generalization across regions. To address these challenges, we develop SeisCLIP, a seismology foundation model trained through contrastive learning from multi-modal data. It consists… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 27 pages, 9 figures, 4 tables

  34. arXiv:2306.13918  [pdf, other

    physics.geo-ph cs.LG

    Multi-task multi-station earthquake monitoring: An all-in-one seismic Phase picking, Location, and Association Network (PLAN)

    Authors: Xu Si, Xinming Wu, Zefeng Li, Shenghou Wang, Jun Zhu

    Abstract: Earthquake monitoring is vital for understanding the physics of earthquakes and assessing seismic hazards. A standard monitoring workflow includes the interrelated and interdependent tasks of phase picking, association, and location. Although deep learning methods have been successfully applied to earthquake monitoring, they mostly address the tasks separately and ignore the geographic relationshi… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 30 pages, 12 figures, 3 tables

  35. arXiv:2306.07883  [pdf, other

    cs.LG cs.CR

    Temporal Gradient Inversion Attacks with Robust Optimization

    Authors: Bowen Li, Hanlin Gu, Ruoxin Chen, Jie Li, Chentao Wu, Na Ruan, Xueming Si, Lixin Fan

    Abstract: Federated Learning (FL) has emerged as a promising approach for collaborative model training without sharing private data. However, privacy concerns regarding information exchanged during FL have received significant research attention. Gradient Inversion Attacks (GIAs) have been proposed to reconstruct the private data retained by local clients from the exchanged gradients. While recovering priva… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 24 pages

  36. arXiv:2305.18234  [pdf, other

    eess.SP cs.AI cs.LG

    Temporal Aware Mixed Attention-based Convolution and Transformer Network (MACTN) for EEG Emotion Recognition

    Authors: Xiaopeng Si, Dong Huang, Yulin Sun, Dong Ming

    Abstract: Emotion recognition plays a crucial role in human-computer interaction, and electroencephalography (EEG) is advantageous for reflecting human emotional states. In this study, we propose MACTN, a hierarchical hybrid model for jointly modeling local and global temporal information. The model is inspired by neuroscience research on the temporal dynamics of emotions. MACTN extracts local emotional fea… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  37. arXiv:2305.03089  [pdf, other

    cs.SE

    Idiolect: A Reconfigurable Voice Coding Assistant

    Authors: Breandan Considine, Nicholas Albion, Xujie Si

    Abstract: This paper presents Idiolect, an open source (https://github.com/OpenASR/idiolect) IDE plugin for voice coding and a novel approach to building bots that allows for users to define custom commands on-the-fly. Unlike traditional chatbots, Idiolect does not pretend to be an omniscient virtual assistant but rather a reconfigurable voice programming system that empowers users to create their own comma… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  38. arXiv:2305.02230  [pdf, other

    cs.CY cs.AI

    Can ChatGPT Pass An Introductory Level Functional Language Programming Course?

    Authors: Chuqin Geng, Yihan Zhang, Brigitte Pientka, Xujie Si

    Abstract: The recent introduction of ChatGPT has drawn significant attention from both industry and academia due to its impressive capabilities in solving a diverse range of tasks, including language translation, text summarization, and computer programming. Its capability for writing, modifying, and even correcting code together with its ease of use and access is already dramatically impacting computer sci… ▽ More

    Submitted 3 May, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: 16 pages, 8 figures

  39. arXiv:2305.01206  [pdf, other

    cs.LO cs.AI cs.LG cs.PL cs.SC

    Chronosymbolic Learning: Efficient CHC Solving with Symbolic Reasoning and Inductive Learning

    Authors: Ziyan Luo, Xujie Si

    Abstract: Solving Constrained Horn Clauses (CHCs) is a fundamental challenge behind a wide range of verification and analysis tasks. Data-driven approaches show great promise in improving CHC solving without the painstaking manual effort of creating and tuning various heuristics. However, a large performance gap exists between data-driven CHC solvers and symbolic reasoning-based solvers. In this work, we de… ▽ More

    Submitted 4 June, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

  40. Identifying Different Student Clusters in Functional Programming Assignments: From Quick Learners to Struggling Students

    Authors: Chuqin Geng, Wenwen Xu, Yingjie Xu, Brigitte Pientka, Xujie Si

    Abstract: Instructors and students alike are often focused on the grade in programming assignments as a key measure of how well a student is mastering the material and whether a student is struggling. This can be, however, misleading. Especially when students have access to auto-graders, their grades may be heavily skewed. In this paper, we analyze student assignment submission data collected from a functio… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 7 pages, 4 figures

  41. arXiv:2212.10006  [pdf, other

    cs.LG cs.CR

    Multi-head Uncertainty Inference for Adversarial Attack Detection

    Authors: Yuqi Yang, Songyun Yang, Jiyang Xie. Zhongwei Si, Kai Guo, Ke Zhang, Kongming Liang

    Abstract: Deep neural networks (DNNs) are sensitive and susceptible to tiny perturbation by adversarial attacks which causes erroneous predictions. Various methods, including adversarial defense and uncertainty inference (UI), have been developed in recent years to overcome the adversarial attacks. In this paper, we propose a multi-head uncertainty inference (MH-UI) framework for detecting adversarial attac… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  42. arXiv:2211.08486  [pdf, other

    cs.CV cs.AI cs.LG

    Scalar Invariant Networks with Zero Bias

    Authors: Chuqin Geng, Xiaojie Xu, Haolin Ye, Xujie Si

    Abstract: Just like weights, bias terms are the learnable parameters of many popular machine learning models, including neural networks. Biases are thought to enhance the representational power of neural networks, enabling them to solve a variety of tasks in computer vision. However, we argue that biases can be disregarded for some image-related tasks such as image classification, by considering the intrins… ▽ More

    Submitted 29 May, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 22 pages, 8 figures

  43. arXiv:2211.03880  [pdf, other

    cs.AI cs.LG

    NSNet: A General Neural Probabilistic Framework for Satisfiability Problems

    Authors: Zhaoyu Li, Xujie Si

    Abstract: We present the Neural Satisfiability Network (NSNet), a general neural framework that models satisfiability problems as probabilistic inference and meanwhile exhibits proper explainability. Inspired by the Belief Propagation (BP), NSNet uses a novel graph neural network (GNN) to parameterize BP in the latent space, where its hidden representations maintain the same probabilistic interpretation as… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  44. arXiv:2210.16114  [pdf, other

    cs.LG cs.AI cs.LO cs.SE

    Towards Reliable Neural Specifications

    Authors: Chuqin Geng, Nham Le, Xiaojie Xu, Zhaoyue Wang, Arie Gurfinkel, Xujie Si

    Abstract: Having reliable specifications is an unavoidable challenge in achieving verifiable correctness, robustness, and interpretability of AI systems. Existing specifications for neural networks are in the paradigm of data as specification. That is, the local neighborhood centering around a reference input is considered to be correct (or robust). While existing specifications contribute to verifying adve… ▽ More

    Submitted 17 March, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: 19 pages, 16 figures

  45. arXiv:2210.03682  [pdf, other

    cs.PL cs.LG

    Novice Type Error Diagnosis with Natural Language Models

    Authors: Chuqin Geng, Haolin Ye, Yixuan Li, Tianyu Han, Brigitte Pientka, Xujie Si

    Abstract: Strong static type systems help programmers eliminate many errors without much burden of supplying type annotations. However, this flexibility makes it highly non-trivial to diagnose ill-typed programs, especially for novice programmers. Compared to classic constraint solving and optimization-based approaches, the data-driven approach has shown great promise in identifying the root causes of type… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 17 pages, 8 figures

  46. arXiv:2203.15275  [pdf, other

    eess.SP cs.AI cs.LG

    A Multi-size Kernel based Adaptive Convolutional Neural Network for Bearing Fault Diagnosis

    Authors: Guangwei Yu, Gang Li, Xingtong Si, Zhuoyuan Song

    Abstract: Bearing fault identification and analysis is an important research area in the field of machinery fault diagnosis. Aiming at the common faults of rolling bearings, we propose a data-driven diagnostic algorithm based on the characteristics of bearing vibrations called multi-size kernel based adaptive convolutional neural network (MSKACNN). Using raw bearing vibration signals as the inputs, MSKACNN… ▽ More

    Submitted 15 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: 21 pages, 16 figures

    MSC Class: 62H30 ACM Class: G.3

  47. arXiv:2112.08010  [pdf, ps, other

    cs.SE

    Static Code Analyzer Using Micro-Grammar

    Authors: Hanwen Zhu, Junyoung Jang, Xujie Si

    Abstract: [THIS IS AN UNDERGRADUATE PROJECT] This paper discusses the effectiveness of the bug finder based on "micro-grammar".

    Submitted 13 August, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: 4 pages

  48. arXiv:2111.04264  [pdf, other

    cs.CV

    Cross-Modal Object Tracking: Modality-Aware Representations and A Unified Benchmark

    Authors: Chenglong Li, Tianhao Zhu, Lei Liu, Xiaonan Si, Zilin Fan, Sulan Zhai

    Abstract: In many visual systems, visual tracking often bases on RGB image sequences, in which some targets are invalid in low-light conditions, and tracking performance is thus affected significantly. Introducing other modalities such as depth and infrared data is an effective way to handle imaging limitations of individual sources, but multi-modal imaging platforms usually require elaborate designs and ca… ▽ More

    Submitted 11 November, 2021; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: In Submission

  49. arXiv:2108.10821  [pdf, other

    cs.LG

    Graph Contrastive Pre-training for Effective Theorem Reasoning

    Authors: Zhaoyu Li, Binghong Chen, Xujie Si

    Abstract: Interactive theorem proving is a challenging and tedious process, which requires non-trivial expertise and detailed low-level instructions (or tactics) from human experts. Tactic prediction is a natural way to automate this process. Existing methods show promising results on tactic prediction by learning a deep neural network (DNN) based model from proofs written by human experts. In this paper, w… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

  50. arXiv:2106.11072  [pdf, other

    cs.AI cs.LG stat.ML

    Techniques for Symbol Grounding with SATNet

    Authors: Sever Topan, David Rolnick, Xujie Si

    Abstract: Many experts argue that the future of artificial intelligence is limited by the field's ability to integrate symbolic logical reasoning into deep learning architectures. The recently proposed differentiable MAXSAT solver, SATNet, was a breakthrough in its capacity to integrate with a traditional neural network and solve visual reasoning problems. For instance, it can learn the rules of Sudoku pure… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Code available at https://github.com/SeverTopan/SATNet