Skip to main content

Showing 1–32 of 32 results for author: Sheng, V S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.03288  [pdf, other

    cs.SE cs.AI

    CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models

    Authors: Zhenyu Xu, Victor S. Sheng

    Abstract: The rise of large language models (LLMs) like ChatGPT has significantly improved automated code generation, enhancing software development efficiency. However, this introduces challenges in academia, particularly in distinguishing between human-written and LLM-generated code, which complicates issues of academic integrity. Existing detection methods, such as pre-trained models and watermarking, fa… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

  2. arXiv:2412.10690  [pdf, other

    cs.SI

    Affiliation-based Local Community Detection across Multiple Networks

    Authors: Li Ni, Zhou Xie, Yiwen Zhang, Wenjian Luo, Victor S. Sheng

    Abstract: Real-world networks are often constructed from different sources or domains, including various types of entities and diverse relationships between networks, thus forming multi-domain networks. A single network typically fails to capture the complete graph structure and the diverse relationships among multiple networks. Consequently, leveraging multiple networks is crucial for a comprehensive detec… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

    Comments: 11 pages,6 figures

  3. arXiv:2411.06989  [pdf, other

    cs.CL cs.AI

    The Backpropagation of the Wave Network

    Authors: Xin Zhang, Victor S. Sheng

    Abstract: This paper provides an in-depth analysis of Wave Network, a novel token representation method derived from the Wave Network, designed to capture both global and local semantics of input text through wave-inspired complex vectors. In complex vector token representation, each token is represented with a magnitude component, capturing the global semantics of the entire input text, and a phase compone… ▽ More

    Submitted 10 January, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

  4. arXiv:2411.04393  [pdf, other

    cs.AI cs.LG

    Bridging the Gap: Representation Spaces in Neuro-Symbolic AI

    Authors: Xin Zhang, Victor S. Sheng

    Abstract: Neuro-symbolic AI is an effective method for improving the overall performance of AI models by combining the advantages of neural networks and symbolic learning. However, there are differences between the two in terms of how they process data, primarily because they often use different data representation methods, which is often an important factor limiting the overall performance of the two. From… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  5. arXiv:2411.04383  [pdf, other

    cs.AI cs.LG

    Neuro-Symbolic AI: Explainability, Challenges, and Future Trends

    Authors: Xin Zhang, Victor S. Sheng

    Abstract: Explainability is an essential reason limiting the application of neural networks in many vital fields. Although neuro-symbolic AI hopes to enhance the overall explainability by leveraging the transparency of symbolic learning, the results are less evident than imagined. This article proposes a classification for explainability by considering both model design and behavior of 191 studies from 2013… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  6. arXiv:2411.02674  [pdf, other

    cs.CL cs.AI

    Wave Network: An Ultra-Small Language Model

    Authors: Xin Zhang, Victor S. Sheng

    Abstract: We propose an innovative token representation and update method in a new ultra-small language model: the Wave network. Specifically, we use a complex vector to represent each token, encoding both global and local semantics of the input text. A complex vector consists of two components: a magnitude vector representing the global semantics of the input text, and a phase vector capturing the relation… ▽ More

    Submitted 11 November, 2024; v1 submitted 4 November, 2024; originally announced November 2024.

  7. arXiv:2410.21282  [pdf, other

    cs.CY cs.AI cs.SE

    Logic Error Localization in Student Programming Assignments Using Pseudocode and Graph Neural Networks

    Authors: Zhenyu Xu, Kun Zhang, Victor S. Sheng

    Abstract: Pseudocode is extensively used in introductory programming courses to instruct computer science students in algorithm design, utilizing natural language to define algorithmic behaviors. This learning approach enables students to convert pseudocode into source code and execute it to verify their algorithms' correctness. This process typically introduces two types of errors: syntax errors and logic… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  8. arXiv:2410.10876  [pdf, other

    cs.CL cs.CR cs.LG

    FreqMark: Frequency-Based Watermark for Sentence-Level Detection of LLM-Generated Text

    Authors: Zhenyu Xu, Kun Zhang, Victor S. Sheng

    Abstract: The increasing use of Large Language Models (LLMs) for generating highly coherent and contextually relevant text introduces new risks, including misuse for unethical purposes such as disinformation or academic dishonesty. To address these challenges, we propose FreqMark, a novel watermarking technique that embeds detectable frequency-based watermarks in LLM-generated text during the token sampling… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  9. arXiv:2410.08241  [pdf, other

    cs.SE cs.AI cs.PL

    LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT

    Authors: Zhenyu Xu, Victor S. Sheng

    Abstract: Logical errors in programming don't raise compiler alerts, making them hard to detect. These silent errors can disrupt a program's function or cause run-time issues. Their correction requires deep insight into the program's logic, highlighting the importance of automated detection and repair. In this paper, we introduce LecPrompt to localize and repair logical errors, an prompt-based approach that… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  10. arXiv:2410.07271  [pdf, other

    cs.SE cs.AI

    Multi-Task Program Error Repair and Explanatory Diagnosis

    Authors: Zhenyu Xu, Victor S. Sheng

    Abstract: Program errors can occur in any type of programming, and can manifest in a variety of ways, such as unexpected output, crashes, or performance issues. And program error diagnosis can often be too abstract or technical for developers to understand, especially for beginners. The goal of this paper is to present a novel machine-learning approach for Multi-task Program Error Repair and Explanatory Dia… ▽ More

    Submitted 5 January, 2025; v1 submitted 9 October, 2024; originally announced October 2024.

  11. arXiv:2410.06545  [pdf, other

    cs.CR cs.LG

    Signal Watermark on Large Language Models

    Authors: Zhenyu Xu, Victor S. Sheng

    Abstract: As Large Language Models (LLMs) become increasingly sophisticated, they raise significant security concerns, including the creation of fake news and academic misuse. Most detectors for identifying model-generated text are limited by their reliance on variance in perplexity and burstiness, and they require substantial computational resources. In this paper, we proposed a watermarking method embeddi… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

  12. arXiv:2310.15080  [pdf, other

    cs.LG cs.CL cs.DC

    Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization

    Authors: Tianshi Che, Ji Liu, Yang Zhou, Jiaxiang Ren, Jiwen Zhou, Victor S. Sheng, Huaiyu Dai, Dejing Dou

    Abstract: Federated learning (FL) is a promising paradigm to enable collaborative model training with decentralized data. However, the training process of Large Language Models (LLMs) generally incurs the update of significant parameters, which limits the applicability of FL techniques to tackle the LLMs in real scenarios. Prompt tuning can significantly reduce the number of parameters to update, but it eit… ▽ More

    Submitted 11 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted by EMNLP 2023

  13. arXiv:2310.14318  [pdf, other

    cs.IR

    Intent Contrastive Learning with Cross Subsequences for Sequential Recommendation

    Authors: Xiuyuan Qin, Huanhuan Yuan, Pengpeng Zhao, Guanfeng Liu, Fuzhen Zhuang, Victor S. Sheng

    Abstract: The user purchase behaviors are mainly influenced by their intentions (e.g., buying clothes for decoration, buying brushes for painting, etc.). Modeling a user's latent intention can significantly improve the performance of recommendations. Previous works model users' intentions by considering the predefined label in auxiliary information or introducing stochastic data augmentation to learn purpos… ▽ More

    Submitted 25 November, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: 10pages, 5figures, WSDM2024. arXiv admin note: text overlap with arXiv:2304.07763

  14. arXiv:2310.13925  [pdf, other

    cs.IR

    Meta-optimized Joint Generative and Contrastive Learning for Sequential Recommendation

    Authors: Yongjing Hao, Pengpeng Zhao, Junhua Fang, Jianfeng Qu, Guanfeng Liu, Fuzhen Zhuang, Victor S. Sheng, Xiaofang Zhou

    Abstract: Sequential Recommendation (SR) has received increasing attention due to its ability to capture user dynamic preferences. Recently, Contrastive Learning (CL) provides an effective approach for sequential recommendation by learning invariance from different views of an input. However, most existing data or model augmentation methods may destroy semantic sequential interaction characteristics and oft… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  15. arXiv:2305.04322  [pdf, other

    cs.IR

    Contrastive Enhanced Slide Filter Mixer for Sequential Recommendation

    Authors: Xinyu Du, Huanhuan Yuan, Pengpeng Zhao, Junhua Fang, Guanfeng Liu, Yanchi Liu, Victor S. Sheng, Xiaofang Zhou

    Abstract: Sequential recommendation (SR) aims to model user preferences by capturing behavior patterns from their item historical interaction data. Most existing methods model user preference in the time domain, omitting the fact that users' behaviors are also influenced by various frequency patterns that are difficult to separate in the entangled chronological items. However, few attempts have been made to… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  16. arXiv:2304.14668  [pdf, other

    cs.IR

    Ensemble Modeling with Contrastive Knowledge Distillation for Sequential Recommendation

    Authors: Hanwen Du, Huanhuan Yuan, Pengpeng Zhao, Fuzhen Zhuang, Guanfeng Liu, Lei Zhao, Victor S. Sheng

    Abstract: Sequential recommendation aims to capture users' dynamic interest and predicts the next item of users' preference. Most sequential recommendation methods use a deep neural network as sequence encoder to generate user and item representations. Existing works mainly center upon designing a stronger sequence encoder. However, few attempts have been made with training an ensemble of networks as sequen… ▽ More

    Submitted 15 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: Accepted by SIGIR 2023

  17. arXiv:2304.11383  [pdf, other

    cs.AI

    Sequential Recommendation with Probabilistic Logical Reasoning

    Authors: Huanhuan Yuan, Pengpeng Zhao, Xuefeng Xian, Guanfeng Liu, Victor S. Sheng, Lei Zhao

    Abstract: Deep learning and symbolic learning are two frequently employed methods in Sequential Recommendation (SR). Recent neural-symbolic SR models demonstrate their potential to enable SR to be equipped with concurrent perception and cognition capacities. However, neural-symbolic SR remains a challenging problem due to open issues like representing users and items in logical reasoning. In this paper, we… ▽ More

    Submitted 15 May, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

  18. arXiv:2304.09184  [pdf, other

    cs.IR

    Frequency Enhanced Hybrid Attention Network for Sequential Recommendation

    Authors: Xinyu Du, Huanhuan Yuan, Pengpeng Zhao, Jianfeng Qu, Fuzhen Zhuang, Guanfeng Liu, Victor S. Sheng

    Abstract: The self-attention mechanism, which equips with a strong capability of modeling long-range dependencies, is one of the extensively used techniques in the sequential recommendation field. However, many recent studies represent that current self-attention based models are low-pass filters and are inadequate to capture high-frequency information. Furthermore, since the items in the user behaviors are… ▽ More

    Submitted 17 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 11 pages, 7 figures, The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

  19. arXiv:2302.04383  [pdf, ps, other

    cs.LG cs.CR

    Privacy-Preserving Representation Learning for Text-Attributed Networks with Simplicial Complexes

    Authors: Huixin Zhan, Victor S. Sheng

    Abstract: Although recent network representation learning (NRL) works in text-attributed networks demonstrated superior performance for various graph inference tasks, learning network representations could always raise privacy concerns when nodes represent people or human-related variables. Moreover, standard NRLs that leverage structural information from a graph proceed by first encoding pairwise relations… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted by AAAI-23 DC

  20. arXiv:2302.04373  [pdf, ps, other

    cs.LG cs.CR

    Measuring the Privacy Leakage via Graph Reconstruction Attacks on Simplicial Neural Networks (Student Abstract)

    Authors: Huixin Zhan, Kun Zhang, Keyi Lu, Victor S. Sheng

    Abstract: In this paper, we measure the privacy leakage via studying whether graph representations can be inverted to recover the graph used to generate them via graph reconstruction attack (GRA). We propose a GRA that recovers a graph's adjacency matrix from the representations via a graph decoder that minimizes the reconstruction loss between the partial graph and the reconstructed graph. We study three t… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted at AAAI 2023

    MSC Class: 51Hxx ACM Class: I.2.6

  21. A Roadmap to Domain Knowledge Integration in Machine Learning

    Authors: Himel Das Gupta, Victor S. Sheng

    Abstract: Many machine learning algorithms have been developed in recent years to enhance the performance of a model in different aspects of artificial intelligence. But the problem persists due to inadequate data and resources. Integrating knowledge in a machine learning model can help to overcome these obstacles up to a certain degree. Incorporating knowledge is a complex task though because of various fo… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

  22. arXiv:2208.07216  [pdf, other

    cs.CV cs.LG

    Class-attention Video Transformer for Engagement Intensity Prediction

    Authors: Xusheng Ai, Victor S. Sheng, Chunhua Li, Zhiming Cui

    Abstract: In order to deal with variant-length long videos, prior works extract multi-modal features and fuse them to predict students' engagement intensity. In this paper, we present a new end-to-end method Class Attention in Video Transformer (CavT), which involves a single vector to process class embedding and to uniformly perform end-to-end learning on variant-length long videos and fixed-length short v… ▽ More

    Submitted 10 November, 2022; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 5 figures

  23. arXiv:2208.03895  [pdf, other

    cs.IR

    Contrastive Learning with Bidirectional Transformers for Sequential Recommendation

    Authors: Hanwen Du, Hui Shi, Pengpeng Zhao, Deqing Wang, Victor S. Sheng, Yanchi Liu, Guanfeng Liu, Lei Zhao

    Abstract: Contrastive learning with Transformer-based sequence encoder has gained predominance for sequential recommendation. It maximizes the agreements between paired sequence augmentations that share similar semantics. However, existing contrastive learning approaches in sequential recommendation mainly center upon left-to-right unidirectional Transformers as base encoders, which are suboptimal for seque… ▽ More

    Submitted 17 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022

  24. Knowledge Distillation via Weighted Ensemble of Teaching Assistants

    Authors: Durga Prasad Ganta, Himel Das Gupta, Victor S. Sheng

    Abstract: Knowledge distillation in machine learning is the process of transferring knowledge from a large model called the teacher to a smaller model called the student. Knowledge distillation is one of the techniques to compress the large network (teacher) to a smaller network (student) that can be deployed in small devices such as mobile phones. When the network size gap between the teacher and student i… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:1902.03393 by other authors

  25. arXiv:2204.10128  [pdf, other

    cs.IR cs.LG

    Learnable Model Augmentation Self-Supervised Learning for Sequential Recommendation

    Authors: Yongjing Hao, Pengpeng Zhao, Xuefeng Xian, Guanfeng Liu, Deqing Wang, Lei Zhao, Yanchi Liu, Victor S. Sheng

    Abstract: Sequential Recommendation aims to predict the next item based on user behaviour. Recently, Self-Supervised Learning (SSL) has been proposed to improve recommendation performance. However, most of existing SSL methods use a uniform data augmentation scheme, which loses the sequence correlation of an original sequence. To this end, in this paper, we propose a Learnable Model Augmentation self-superv… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  26. arXiv:2111.10539  [pdf, other

    cs.IR cs.LG

    Edge-Enhanced Global Disentangled Graph Neural Network for Sequential Recommendation

    Authors: Yunyi Li, Pengpeng Zhao, Guanfeng Liu, Yanchi Liu, Victor S. Sheng, Jiajie Xu, Xiaofang Zhou

    Abstract: Sequential recommendation has been a widely popular topic of recommender systems. Existing works have contributed to enhancing the prediction ability of sequential recommendation systems based on various methods, such as recurrent networks and self-attention mechanisms. However, they fail to discover and distinguish various relationships between items, which could be underlying factors which motiv… ▽ More

    Submitted 22 November, 2021; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 13 pages, 7 figures, 5 tables. Submitted to ICDE 2022

  27. arXiv:2111.10536  [pdf, other

    cs.IR cs.LG

    Quaternion-Based Graph Convolution Network for Recommendation

    Authors: Yaxing Fang, Pengpeng Zhao, Guanfeng Liu, Yanchi Liu, Victor S. Sheng, Lei Zhao, Xiaofang Zhou

    Abstract: Graph Convolution Network (GCN) has been widely applied in recommender systems for its representation learning capability on user and item embeddings. However, GCN is vulnerable to noisy and incomplete graphs, which are common in real world, due to its recursive message propagation mechanism. In the literature, some work propose to remove the feature transformation during message propagation, but… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: 13 pages, 7 figures, 6 tables. Submitted to ICDE 2022

  28. arXiv:2012.13662  [pdf, other

    cs.CV

    Coarse to Fine: Multi-label Image Classification with Global/Local Attention

    Authors: Fan Lyu, Fuyuan Hu, Victor S. Sheng, Zhengtian Wu, Qiming Fu, Baochuan Fu

    Abstract: In our daily life, the scenes around us are always with multiple labels especially in a smart city, i.e., recognizing the information of city operation to response and control. Great efforts have been made by using Deep Neural Networks to recognize multi-label images. Since multi-label image classification is very complicated, people seek to use the attention mechanism to guide the classification… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: Accepted by IEEE International Smart Cities Conference 2018

  29. arXiv:1906.08204  [pdf

    cs.CR

    A Novel DDoS Attack Detection Method Using Optimized Generalized Multiple Kernel Learning

    Authors: Jieren Cheng, Junqi Li, Xiangyan Tang, Victor S. Sheng, Chen Zhang, Mengyang Li

    Abstract: Distributed Denial of Service (DDoS) attack has become one of the most destructive network attacks which can pose a mortal threat to Internet security. Existing detection methods can not effectively detect early attacks. In this paper, we propose a detection method of DDoS attacks based on generalized multiple kernel learning (GMKL) combining with the constructed parameter R. The super-fusion feat… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  30. arXiv:1905.13030  [pdf, other

    cs.IR

    Deep Cross Networks with Aesthetic Preference for Cross-domain Recommendation

    Authors: Jian Liu, Pengpeng Zhao, Yanchi Liu, Victor S. Sheng, Fuzheng Zhuang, Jiajie Xu, Xiaofang Zhou, Hui Xiong

    Abstract: When purchasing appearance-first products, e.g., clothes, product appearance aesthetics plays an important role in the decision process. Moreover, user's aesthetic preference, which can be regarded as a personality trait and a basic requirement, is domain independent and could be used as a bridge between domains for knowledge transfer. However, existing work has rarely considered the aesthetic inf… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: text overlap with arXiv:1901.07199, arXiv:1804.06769 by other authors

  31. arXiv:1905.07893  [pdf

    cs.CR

    Adaptive DDoS attack detection method based on multiple-kernel learning

    Authors: Jieren Cheng, Chen Zhang, Xiangyan Tang, Victor S. Sheng, Zhe Dong, Junqi Li, Jing Chen

    Abstract: Distributed denial of service (DDoS) attacks have caused huge economic losses to society. They have become one of the main threats to Internet security. Most of the current detection methods based on a single feature and fixed model parameters cannot effectively detect early DDoS attacks in cloud and big data environment. In this paper, an adaptive DDoS attack detection method (ADADM) based on mul… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

  32. arXiv:1806.06671  [pdf, other

    cs.IR

    Where to Go Next: A Spatio-temporal LSTM model for Next POI Recommendation

    Authors: Pengpeng Zhao, Haifeng Zhu, Yanchi Liu, Zhixu Li, Jiajie Xu, Victor S. Sheng

    Abstract: Next Point-of-Interest (POI) recommendation is of great value for both location-based service providers and users. Recently Recurrent Neural Networks (RNNs) have been proved to be effective on sequential recommendation tasks. However, existing RNN solutions rarely consider the spatio-temporal intervals between neighbor check-ins, which are essential for modeling user check-in behaviors in next POI… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.