Skip to main content

Showing 1–47 of 47 results for author: Lian, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03002  [pdf, ps, other

    eess.SY cs.AI

    Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality

    Authors: Yuansheng Lian, Ke Zhang, Meng Li, Shen Li

    Abstract: Modeling the decision-making behavior of vehicles presents unique challenges, particularly during unprotected left turns at intersections, where the uncertainty of human drivers is especially pronounced. In this context, connected autonomous vehicle (CAV) technology emerges as a promising avenue for effectively managing such interactions while ensuring safety and efficiency. Traditional approaches… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2506.23026  [pdf, ps, other

    cs.IR

    Machine Assistant with Reliable Knowledge: Enhancing Student Learning via RAG-based Retrieval

    Authors: Yongsheng Lian

    Abstract: We present Machine Assistant with Reliable Knowledge (MARK), a retrieval-augmented question-answering system designed to support student learning through accurate and contextually grounded responses. The system is built on a retrieval-augmented generation (RAG) framework, which integrates a curated knowledge base to ensure factual consistency. To enhance retrieval effectiveness across diverse ques… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  3. arXiv:2506.08053  [pdf, ps, other

    cs.ET cs.NI eess.SP

    Power Domain Sparse Dimensional Constellation Multiple Access (PD-SDCMA) for Enabled Flexible PONs

    Authors: Yuhao Lian, Xiao Han, Xinmao Deng

    Abstract: With the commercial deployment of 5G and the in-depth research of 6G, the demand for high-speed data services in the next-generation fiber optic access systems is growing increasingly. Passive optical networks (PONs) have become a research hotspot due to their characteristics of low loss, high bandwidth, and low cost. However, the traditional orthogonal multiple access (OMA-PON) has difficulty mee… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2502.16271 by other authors

  4. arXiv:2504.08850  [pdf, other

    cs.DC cs.AI

    SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting

    Authors: Jiaming Xu, Jiayi Pan, Yongkang Zhou, Siming Chen, Jinhao Li, Yaoxiu Lian, Junyi Wu, Guohao Dai

    Abstract: Early exiting has recently emerged as a promising technique for accelerating large language models (LLMs) by effectively reducing the hardware computation and memory access. In this paper, we present SpecEE, a fast LLM inference engine with speculative early exiting. (1) At the algorithm level, we propose the speculation-based lightweight predictor design by exploiting the probabilistic correlatio… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Accepted by ISCA 2025

  5. arXiv:2503.17662  [pdf, other

    cs.CL

    Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning

    Authors: Ke Ji, Yixin Lian, Linxu Li, Jingsheng Gao, Weiyuan Li, Bin Dai

    Abstract: In recent years, large language models (LLMs) have achieved breakthrough progress in many dialogue generation tasks. However, their lack of emotion and fine-grained role awareness limits the model's ability to provide personalized and diverse interactions further. Current methods face high costs in collecting high-quality annotated data for scenarios such as role-playing, and traditional human ali… ▽ More

    Submitted 25 March, 2025; v1 submitted 22 March, 2025; originally announced March 2025.

    Comments: 18 pages, 4 figures

  6. arXiv:2503.14178  [pdf

    cs.HC

    Figame: A Family Digital Game Based on JME for Shaping Parent-Child Healthy Gaming Relationship

    Authors: Liyi Zhang, Yujie Peng, Yi Lian, Mengru Xue

    Abstract: With the development of technology, digital games have permeated into family and parent-child relationships, leading to cognitive deficiencies and inter-generational conflicts that have yet to be effectively addressed. Building on previous research on digital games and parent-child relationships, we have developed Figame, a Joint Media Engagement (JME) based parent-child digital game aimed at fost… ▽ More

    Submitted 30 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  7. arXiv:2503.12695  [pdf, other

    cs.RO eess.SY

    CDKFormer: Contextual Deviation Knowledge-Based Transformer for Long-Tail Trajectory Prediction

    Authors: Yuansheng Lian, Ke Zhang, Meng Li

    Abstract: Predicting the future movements of surrounding vehicles is essential for ensuring the safe operation and efficient navigation of autonomous vehicles (AVs) in urban traffic environments. Existing vehicle trajectory prediction methods primarily focus on improving overall performance, yet they struggle to address long-tail scenarios effectively. This limitation often leads to poor predictions in rare… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  8. arXiv:2503.09512  [pdf, other

    cs.LG cs.CL

    Reinforcement Learning is all You Need

    Authors: Yongsheng Lian

    Abstract: Inspired by the success of DeepSeek R1 in reasoning via reinforcement learning without human feedback, we train a 3B language model using the Countdown Game with pure reinforcement learning. Our model outperforms baselines on four of five benchmarks, demonstrating improved generalization beyond its training data. Notably, response length does not correlate with reasoning quality, and while "aha mo… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 15 pages, 2 figures

    Report number: Report-no: SPEED-2025-0312

  9. arXiv:2502.16271  [pdf, other

    cs.ET eess.SP

    Power Domain Sparse Dimensional Constellation Multiple Access (PD-SDCMA): A Novel PD-NOMA for More Access Users

    Authors: Zihan Li, Youzhi Li, Chenyu Liuand Yuhao Lian

    Abstract: With the advent of the 6G mobile communication network era, the existing non-orthogonal multiple-access (NOMA) technology faces the challenge of high successive interference in multi-user scenarios, which limits its ability to support more user access. To address this, this paper proposes a novel power-domain sparse-dimensional constellation multiple-access scheme (PD-SDCMA). Through the signal sp… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  10. arXiv:2502.04038  [pdf, other

    cs.CL

    Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents

    Authors: Yuchen Lian, Arianna Bisazza, Tessa Verhoef

    Abstract: Differential Case Marking (DCM) refers to the phenomenon where grammatical case marking is applied selectively based on semantic, pragmatic, or other factors. The emergence of DCM has been studied in artificial language learning experiments with human participants, which were specifically aimed at disentangling the effects of learning from those of communication (Smith & Culbertson, 2020). Multi-a… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  11. arXiv:2501.12202  [pdf, other

    cs.CV

    Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

    Authors: Zibo Zhao, Zeqiang Lai, Qingxiang Lin, Yunfei Zhao, Haolin Liu, Shuhui Yang, Yifei Feng, Mingxin Yang, Sheng Zhang, Xianghui Yang, Huiwen Shi, Sicong Liu, Junta Wu, Yihang Lian, Fan Yang, Ruining Tang, Zebin He, Xinzhou Wang, Jian Liu, Xuhui Zuo, Zhuo Chen, Biwen Lei, Haohan Weng, Jing Xu, Yiling Zhu , et al. (49 additional authors not shown)

    Abstract: We present Hunyuan3D 2.0, an advanced large-scale 3D synthesis system for generating high-resolution textured 3D assets. This system includes two foundation components: a large-scale shape generation model -- Hunyuan3D-DiT, and a large-scale texture synthesis model -- Hunyuan3D-Paint. The shape generative model, built on a scalable flow-based diffusion transformer, aims to create geometry that pro… ▽ More

    Submitted 26 February, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: GitHub link: https://github.com/Tencent/Hunyuan3D-2

  12. arXiv:2411.02293  [pdf, other

    cs.CV cs.AI

    Hunyuan3D 1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

    Authors: Xianghui Yang, Huiwen Shi, Bowen Zhang, Fan Yang, Jiacheng Wang, Hongxu Zhao, Xinhai Liu, Xinzhou Wang, Qingxiang Lin, Jiaao Yu, Lifu Wang, Jing Xu, Zebin He, Zhuo Chen, Sicong Liu, Junta Wu, Yihang Lian, Shaoxiong Yang, Yuhong Liu, Yong Yang, Di Wang, Jie Jiang, Chunchao Guo

    Abstract: While 3D generative models have greatly improved artists' workflows, the existing diffusion models for 3D generation suffer from slow generation and poor generalization. To address this issue, we propose a two-stage approach named Hunyuan3D 1.0 including a lite version and a standard version, that both support text- and image-conditioned generation. In the first stage, we employ a multi-view diffu… ▽ More

    Submitted 23 January, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Comments: Technical Report; 3D Generation

  13. arXiv:2411.01424  [pdf, other

    cs.SI cs.DB

    Effective Community Detection Over Streaming Bipartite Networks (Technical Report)

    Authors: Nan Zhang, Yutong Ye, Yuyang Wang Xiang Lian, Mingsong Chen

    Abstract: The streaming bipartite graph is extensively used to model the dynamic relationship between two types of entities in many real-world applications, such as movie recommendations, location-based services, and online shopping. Since it contains abundant information, discovering the dense subgraph with high structural cohesiveness (i.e., community detection) in the bipartite streaming graph is becomin… ▽ More

    Submitted 2 November, 2024; originally announced November 2024.

  14. arXiv:2410.16119  [pdf, other

    cs.LG cs.AI

    SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation

    Authors: Xinyi Zhou, Xing Li, Yingzhao Lian, Yiwen Wang, Lei Chen, Mingxuan Yuan, Jianye Hao, Guangyong Chen, Pheng Ann Heng

    Abstract: We introduce SeaDAG, a semi-autoregressive diffusion model for conditional generation of Directed Acyclic Graphs (DAGs). Considering their inherent layer-wise structure, we simulate layer-wise autoregressive generation by designing different denoising speed for different layers. Unlike conventional autoregressive generation that lacks a global graph structure view, our method maintains a complete… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  15. arXiv:2410.04466  [pdf, ps, other

    cs.AR cs.LG

    Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective

    Authors: Jinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, Jun Liu, Yaoxiu Lian, Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various fields, from natural language understanding to text generation. Compared to non-generative LLMs like BERT and DeBERTa, generative LLMs like GPT series and Llama series are currently the main focus due to their superior algorithmic performance. The advancements in generative LLMs are closely intertwined with the d… ▽ More

    Submitted 13 June, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: Collect and update results in recent half year. 54 pages. Github link: https://github.com/Kimho666/LLM_Hardware_Survey

  16. arXiv:2410.02120  [pdf, ps, other

    cs.NI cs.LG eess.SY

    Lossy Cooperative UAV Relaying Networks: Outage Probability Analysis and Location Optimization

    Authors: Ya Lian, Wensheng Lin, Lixin Li, Fucheng Yang, Zhu Han, Tad Matsumoto

    Abstract: In this paper, performance of a lossy cooperative unmanned aerial vehicle (UAV) relay communication system is analyzed. In this system, the UAV relay adopts lossy forward (LF) strategy and the receiver has certain distortion requirements for the received information. For the system described above, we first derive the achievable rate distortion region of the system. Then, on the basis of the regio… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  17. arXiv:2409.11414  [pdf, other

    cs.AR cs.AI cs.SE

    RTLRewriter: Methodologies for Large Models aided RTL Code Optimization

    Authors: Xufeng Yao, Yiwen Wang, Xing Li, Yingzhao Lian, Ran Chen, Lei Chen, Mingxuan Yuan, Hong Xu, Bei Yu

    Abstract: Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages. Currently, optimization relies heavily on manual efforts by skilled engineers, often requiring multiple iterations based on synthesis feedback. In contrast, existing compiler-based methods fall short in addressing complex designs. This paper int… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: ICCAD2024

  18. arXiv:2409.10572  [pdf, other

    stat.ML cs.CE cs.LG

    A clustering adaptive Gaussian process regression method: response patterns based real-time prediction for nonlinear solid mechanics problems

    Authors: Ming-Jian Li, Yanping Lian, Zhanshan Cheng, Lehui Li, Zhidong Wang, Ruxin Gao, Daining Fang

    Abstract: Numerical simulation is powerful to study nonlinear solid mechanics problems. However, mesh-based or particle-based numerical methods suffer from the common shortcoming of being time-consuming, particularly for complex problems with real-time analysis requirements. This study presents a clustering adaptive Gaussian process regression (CAG) method aiming for real-time prediction for nonlinear struc… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

  19. arXiv:2409.10331  [pdf

    q-fin.RM cs.LG

    Research and Design of a Financial Intelligent Risk Control Platform Based on Big Data Analysis and Deep Machine Learning

    Authors: Shuochen Bi, Yufan Lian, Ziyue Wang

    Abstract: In the financial field of the United States, the application of big data technology has become one of the important means for financial institutions to enhance competitiveness and reduce risks. The core objective of this article is to explore how to fully utilize big data technology to achieve complete integration of internal and external data of financial institutions, and create an efficient and… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 10 pages, 5 figures

  20. arXiv:2408.11611  [pdf, ps, other

    cs.IR cs.LG

    DTN: Deep Multiple Task-specific Feature Interactions Network for Multi-Task Recommendation

    Authors: Yaowen Bi, Yuteng Lian, Jie Cui, Jun Liu, Peijian Wang, Guanghui Li, Xuejun Chen, Jinglin Zhao, Hao Wen, Jing Zhang, Zhaoqi Zhang, Wenzhuo Song, Yang Sun, Weiwei Zhang, Mingchen Cai, Jian Dong, Guanxing Zhang

    Abstract: Neural-based multi-task learning (MTL) has been successfully applied to many recommendation applications. However, these MTL models (e.g., MMoE, PLE) did not consider feature interaction during the optimization, which is crucial for capturing complex high-order features and has been widely used in ranking models for real-world recommender systems. Moreover, through feature importance analysis acro… ▽ More

    Submitted 4 July, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

  21. arXiv:2407.13999  [pdf, other

    cs.CL

    NeLLCom-X: A Comprehensive Neural-Agent Framework to Simulate Language Learning and Group Communication

    Authors: Yuchen Lian, Tessa Verhoef, Arianna Bisazza

    Abstract: Recent advances in computational linguistics include simulating the emergence of human-like languages with interacting neural network agents, starting from sets of random symbols. The recently introduced NeLLCom framework (Lian et al., 2023) allows agents to first learn an artificial language and then use it to communicate, with the aim of studying the emergence of specific linguistics properties.… ▽ More

    Submitted 11 October, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted to CoNLL2024

  22. arXiv:2402.13035  [pdf, other

    cs.CL cs.AI

    Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models

    Authors: Che Zhang, Zhenyang Xiao, Chengcheng Han, Yixin Lian, Yuejian Fang

    Abstract: Self-correction has achieved impressive results in enhancing the style and security of the generated output from large language models (LLMs). However, recent studies suggest that self-correction might be limited or even counterproductive in reasoning tasks due to LLMs' difficulties in identifying logical mistakes. In this paper, we aim to enhance the self-checking capabilities of LLMs by constr… ▽ More

    Submitted 17 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  23. arXiv:2402.11903  [pdf, other

    cs.CL cs.AI

    DiLA: Enhancing LLM Tool Learning with Differential Logic Layer

    Authors: Yu Zhang, Hui-Ling Zhen, Zehua Pei, Yingzhao Lian, Lihao Yin, Mingxuan Yuan, Bei Yu

    Abstract: Considering the challenges faced by large language models (LLMs) in logical reasoning and planning, prior efforts have sought to augment LLMs with access to external solvers. While progress has been made on simple reasoning problems, solving classical constraint satisfaction problems, such as the Boolean Satisfiability Problem (SAT) and Graph Coloring Problem (GCP), remains difficult for off-the-s… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.12295 by other authors

  24. arXiv:2311.16442  [pdf, other

    cs.LG cs.DC

    Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization

    Authors: Jinhao Li, Jiaming Xu, Shiyao Li, Shan Huang, Jun Liu, Yaoxiu Lian, Guohao Dai

    Abstract: Large language models (LLMs) have demonstrated impressive abilities in various domains while the inference cost is expensive. Many previous studies exploit quantization methods to reduce LLM inference cost by reducing latency and memory consumption. Applying 2-bit single-precision weight quantization brings >3% accuracy loss, so the state-of-the-art methods use mixed-precision methods for LLMs (e.… ▽ More

    Submitted 9 November, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  25. arXiv:2310.05074  [pdf, other

    cs.CL cs.AI

    DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

    Authors: Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang

    Abstract: Chain-of-Thought (CoT) prompting has proven to be effective in enhancing the reasoning capabilities of Large Language Models (LLMs) with at least 100 billion parameters. However, it is ineffective or even detrimental when applied to reasoning tasks in Smaller Language Models (SLMs) with less than 10 billion parameters. To address this limitation, we introduce Dialogue-guided Chain-of-Thought (Dial… ▽ More

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023

  26. arXiv:2310.02629  [pdf, other

    cs.SD eess.AS

    BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

    Authors: Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie

    Abstract: Mixture-of-experts based models, which use language experts to extract language-specific representations effectively, have been well applied in code-switching automatic speech recognition. However, there is still substantial space to improve as similar pronunciation across languages may result in ineffective multi-language modeling and inaccurate language boundary estimation. To eliminate these dr… ▽ More

    Submitted 7 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted by ASRU2023

  27. arXiv:2306.08401  [pdf, other

    cs.CL

    LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming

    Authors: Jingsheng Gao, Yixin Lian, Ziyi Zhou, Yuzhuo Fu, Baoyuan Wang

    Abstract: Open-domain dialogue systems have made promising progress in recent years. While the state-of-the-art dialogue agents are built upon large-scale text-based social media data and large pre-trained models, there is no guarantee these agents could also perform well in fast-growing scenarios, such as live streaming, due to the bounded transferability of pre-trained models and biased distributions of p… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Main Conference

  28. arXiv:2305.16885  [pdf, other

    cs.CL

    Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification

    Authors: Ke Ji, Yixin Lian, Jingsheng Gao, Baoyuan Wang

    Abstract: Due to the complex label hierarchy and intensive labeling cost in practice, the hierarchical text classification (HTC) suffers a poor performance especially when low-resource or few-shot settings are considered. Recently, there is a growing trend of applying prompts on pre-trained language models (PLMs), which has exhibited effectiveness in the few-shot flat text classification tasks. However, lim… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 14 pages, 8 figures, Accepted by ACL 2023

  29. arXiv:2303.11138  [pdf, other

    stat.ML cs.LG eess.SY math.OC

    Fault Detection via Occupation Kernel Principal Component Analysis

    Authors: Zachary Morrison, Benjamin P. Russo, Yingzhao Lian, Rushikesh Kamalapurkar

    Abstract: The reliable operation of automatic systems is heavily dependent on the ability to detect faults in the underlying dynamical system. While traditional model-based methods have been widely used for fault detection, data-driven approaches have garnered increasing attention due to their ease of deployment and minimal need for expert knowledge. In this paper, we present a novel principal component ana… ▽ More

    Submitted 26 June, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  30. arXiv:2301.13083  [pdf, other

    cs.CL cs.AI

    Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off

    Authors: Yuchen Lian, Arianna Bisazza, Tessa Verhoef

    Abstract: Artificial learners often behave differently from human learners in the context of neural agent-based simulations of language emergence and change. A common explanation is the lack of appropriate cognitive biases in these learners. However, it has also been proposed that more naturalistic settings of language learning and use could lead to more human-like results. We investigate this latter accoun… ▽ More

    Submitted 31 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to TACL, pre-MIT Press publication version

  31. arXiv:2301.05834  [pdf, ps, other

    math.CO cs.IT

    On lattice tilings of $\mathbb{Z}^{n}$ by limited magnitude error balls $\mathcal{B}(n,2,1,1)$

    Authors: Tao Zhang, Yanlu Lian, Gennian Ge

    Abstract: Limited magnitude error model has applications in flash memory. In this model, a perfect code is equivalent to a tiling of $\mathbb{Z}^n$ by limited magnitude error balls. In this paper, we give a complete classification of lattice tilings of $\mathbb{Z}^n$ by limited magnitude error balls $\mathcal{B}(n,2,1,1)$.

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 15 pages

  32. arXiv:2205.15703  [pdf, other

    eess.SY cs.LG

    Lessons Learned from Data-Driven Building Control Experiments: Contrasting Gaussian Process-based MPC, Bilevel DeePC, and Deep Reinforcement Learning

    Authors: Loris Di Natale, Yingzhao Lian, Emilio T. Maddalena, Jicheng Shi, Colin N. Jones

    Abstract: This manuscript offers the perspective of experimentalists on a number of modern data-driven techniques: model predictive control relying on Gaussian processes, adaptive data-driven control based on behavioral theory, and deep reinforcement learning. These techniques are compared in terms of data requirements, ease of use, computational burden, and robustness in the context of real-world applicati… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  33. arXiv:2204.00376  [pdf, other

    cs.CV

    Few-shot One-class Domain Adaptation Based on Frequency for Iris Presentation Attack Detection

    Authors: Yachun Li, Ying Lian, Jingjing Wang, Yuhui Chen, Chunmao Wang, Shiliang Pu

    Abstract: Iris presentation attack detection (PAD) has achieved remarkable success to ensure the reliability and security of iris recognition systems. Most existing methods exploit discriminative features in the spatial domain and report outstanding performance under intra-dataset settings. However, the degradation of performance is inevitable under cross-dataset settings, suffering from domain shift. In co… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: Camera Ready, ICASSP 2022

  34. arXiv:2105.12371  [pdf, other

    cs.IR cs.LG

    Quotient Space-Based Keyword Retrieval in Sponsored Search

    Authors: Yijiang Lian, Shuang Li, Chaobing Feng, YanFeng Zhu

    Abstract: Synonymous keyword retrieval has become an important problem for sponsored search ever since major search engines relax the exact match product's matching requirement to a synonymous level. Since the synonymous relations between queries and keywords are quite scarce, the traditional information retrieval framework is inefficient in this scenario. In this paper, we propose a novel quotient space-ba… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  35. arXiv:2104.07637  [pdf, other

    cs.CL cs.AI cs.LG

    The Effect of Efficient Messaging and Input Variability on Neural-Agent Iterated Language Learning

    Authors: Yuchen Lian, Arianna Bisazza, Tessa Verhoef

    Abstract: Natural languages display a trade-off among different strategies to convey syntactic structure, such as word order or inflection. This trade-off, however, has not appeared in recent simulations of iterated language learning with neural network agents (Chaabouni et al., 2019b). We re-evaluate this result in light of three factors that play an important role in comparable experiments from the Langua… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: To appear at EMNLP 2021

  36. arXiv:2102.10560  [pdf, other

    cs.IR

    A Concept Knowledge-Driven Keywords Retrieval Framework for Sponsored Search

    Authors: Yijiang Lian, Yubo Liu, Zhicong Ye, Liang Yuan, Yanfeng Zhu, Min Zhao, Jianyi Cheng, Xinwei Feng

    Abstract: In sponsored search, retrieving synonymous keywords for exact match type is important for accurately targeted advertising. Data-driven deep learning-based method has been proposed to tackle this problem. An apparent disadvantage of this method is its poor generalization performance on entity-level long-tail instances, even though they might share similar concept-level patterns with frequent instan… ▽ More

    Submitted 21 February, 2021; originally announced February 2021.

  37. arXiv:2101.02392  [pdf, other

    cs.LG cs.CR

    Detecting Log Anomalies with Multi-Head Attention (LAMA)

    Authors: Yicheng Guo, Yujin Wen, Congwei Jiang, Yixin Lian, Yi Wan

    Abstract: Anomaly detection is a crucial and challenging subject that has been studied within diverse research areas. In this work, we explore the task of log anomaly detection (especially computer system logs and user behavior logs) by analyzing logs' sequential information. We propose LAMA, a multi-head attention based sequential model to process log streams as template activity (event) sequences. A nex… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  38. arXiv:2010.05594  [pdf, other

    cs.CL

    MultiWOZ 2.3: A multi-domain task-oriented dialogue dataset enhanced with annotation corrections and co-reference annotation

    Authors: Ting Han, Ximing Liu, Ryuichi Takanobu, Yixin Lian, Chongxuan Huang, Dazhen Wan, Wei Peng, Minlie Huang

    Abstract: Task-oriented dialogue systems have made unprecedented progress with multiple state-of-the-art (SOTA) models underpinned by a number of publicly available MultiWOZ datasets. Dialogue state annotations are error-prone, leading to sub-optimal performance. Various efforts have been put in rectifying the annotation errors presented in the original MultiWOZ dataset. In this paper, we introduce MultiWOZ… ▽ More

    Submitted 14 June, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  39. arXiv:2008.02014  [pdf, other

    cs.LG cs.IR stat.ML

    Optimizing AD Pruning of Sponsored Search with Reinforcement Learning

    Authors: Yijiang Lian, Zhijie Chen, Xin Pei, Shuang Li, Yifei Wang, Yuefeng Qiu, Zhiheng Zhang, Zhipeng Tao, Liang Yuan, Hanju Guan, Kefeng Zhang, Zhigang Li, Xiaochun Liu

    Abstract: Industrial sponsored search system (SSS) can be logically divided into three modules: keywords matching, ad retrieving, and ranking. During ad retrieving, the ad candidates grow exponentially. A query with high commercial value might retrieve a great deal of ad candidates such that the ranking module could not afford. Due to limited latency and computing resources, the candidates have to be pruned… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

  40. arXiv:2008.01969  [pdf, other

    cs.IR

    Retrieve Synonymous keywords for Frequent Queries in Sponsored Search in a Data Augmentation Way

    Authors: Yijiang Lian, Zhenjun You, Fan Wu, Wenqiang Liu, Jing Jia

    Abstract: In sponsored search, retrieving synonymous keywords is of great importance for accurately targeted advertising. The semantic gap between queries and keywords and the extremely high precision requirements (>= 95\%) are two major challenges to this task. To the best of our knowledge, the problem has not been openly discussed. In an industrial sponsored search system, the retrieved keywords for frequ… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

  41. arXiv:1902.00592  [pdf, other

    cs.IR

    An end-to-end Generative Retrieval Method for Sponsored Search Engine --Decoding Efficiently into a Closed Target Domain

    Authors: Yijiang Lian, Zhijie Chen, Jinlong Hu, Kefeng Zhang, Chunwei Yan, Muchenxuan Tong, Wenying Han, Hanju Guan, Ying Li, Ying Cao, Yang Yu, Zhigang Li, Xiaochun Liu, Yue Wang

    Abstract: In this paper, we present a generative retrieval method for sponsored search engine, which uses neural machine translation (NMT) to generate keywords directly from query. This method is completely end-to-end, which skips query rewriting and relevance judging phases in traditional retrieval systems. Different from standard machine translation, the target space in the retrieval setting is a constrai… ▽ More

    Submitted 18 March, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 8 pages, 8 figures, conference

  42. arXiv:1812.06585  [pdf, other

    cs.NE cs.AI

    Generalizable Meta-Heuristic based on Temporal Estimation of Rewards for Large Scale Blackbox Optimization

    Authors: Mingde Zhao, Hongwei Ge, Yi Lian, Kai Zhang

    Abstract: The generalization abilities of heuristic optimizers may deteriorate with the increment of the search space dimensionality. To achieve generalized performance across Large Scale Blackbox Optimization (LSBO) tasks, it ispossible to ensemble several heuristics and devise a meta-heuristic to control their initiation. This paper first proposes a methodology of transforming LSBO problems into online de… ▽ More

    Submitted 18 September, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

    Comments: 7 pages of contents, 1 page of references, 2 pages for appendix

  43. arXiv:1810.09517  [pdf

    physics.med-ph cs.ET eess.SP

    A High Accuracy and High Sensitivity System Architecture for Electrical Impedance Tomography System

    Authors: Hui Li, Boxiao Liu, Yongfu Li, Guoxing Wang, Yong Lian

    Abstract: A high accuracy and high sensitivity system architecture is proposed for the read-out circuit of electrical impedance tomography system-on-chip. The switched ratiometric technique is applied in the proposed architecture. The proposed system architecture minimizes the device noise by processing signals from both read-out electrodes and the stimulus. The quantized signals are post-processed in the d… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

  44. arXiv:1808.03679  [pdf

    physics.ins-det cs.LG stat.ML

    Machine Learning Promoting Extreme Simplification of Spectroscopy Equipment

    Authors: Jianchao Lee, Qiannan Duan, Sifan Bi, Ruen Luo, Yachao Lian, Hanqiang Liu, Ruixing Tian, Jiayuan Chen, Guodong Ma, Jinhong Gao, Zhaoyi Xu

    Abstract: The spectroscopy measurement is one of main pathways for exploring and understanding the nature. Today, it seems that racing artificial intelligence will remould its styles. The algorithms contained in huge neural networks are capable of substituting many of expensive and complex components of spectrum instruments. In this work, we presented a smart machine learning strategy on the measurement of… ▽ More

    Submitted 13 September, 2019; v1 submitted 5 August, 2018; originally announced August 2018.

    Comments: This is the second version. On pages 7 through 8, we have added a new case about the spectral properties of mixtures. Specifically, paragraph 1 on page 8 and Fig.7 is added

  45. A Smart Cushion for Real-Time Heart Rate Monitoring

    Authors: Chacko John Deepu, Zhihao Chen, Ju Teng Teo, Soon Huat Ng, Xiefeng Yang, Yong Lian

    Abstract: This paper presents a smart cushion for real time heart rate monitoring. The cushion comprises of an integrated micro-bending fiber sensor, which records the BCG (Ballistocardiogram) signal without direct skin-electrode contact, and an optical transceiver that does signal amplification, digitization, and pre-filtering. To remove the artifacts and extract heart rate from BCG signal, a computational… ▽ More

    Submitted 29 September, 2014; originally announced September 2014.

    Comments: 2012 IEEE Biomedical Circuits and Systems Conference

  46. An ECG-on-Chip for Wearable Cardiac Monitoring Devices

    Authors: C. J. Deepu, X. Y. Xu, X. D. Zou, L. B. Yao, Y. Lian

    Abstract: This paper describes a highly integrated, low power chip solution for ECG signal processing in wearable devices. The chip contains an instrumentation amplifier with programmable gain, a band-pass filter, a 12-bit SAR ADC, a novel QRS detector, 8K on-chip SRAM, and relevant control circuitry and CPU interfaces. The analog front end circuits accurately senses and digitizes the raw ECG signal, which… ▽ More

    Submitted 29 September, 2014; originally announced September 2014.

    Journal ref: 5th IEEE International Symposium on Electronic Design Test and Applications 2010

  47. An ECG-on-Chip with 535-nW/Channel Integrated Lossless Data Compressor for Wireless Sensors

    Authors: C. J. Deepu, X. Zhang, W. -S. Liew, D. L. T. Wong, Y. Lian

    Abstract: This paper presents a low-power ECG recording system-on-chip (SoC) with on-chip low-complexity lossless ECG compression for data reduction in wireless/ambulatory ECG sensor devices. The chip uses a linear slope predictor for data compression, and incorporates a novel low-complexity dynamic coding-packaging scheme to frame the prediction error into fixed-length 16-bit format. The proposed technique… ▽ More

    Submitted 29 September, 2014; originally announced September 2014.

    Journal ref: IEEE Journal of Solid-State Circuits, Nov 2014