
Showing 1–50 of 51 results for author: Zhuo, H

  1. arXiv:2504.17066  [pdf, other]

    cs.LG cs.CY cs.SE stat.ML

    Whence Is A Model Fair? Fixing Fairness Bugs via Propensity Score Matching

    Authors: Kewen Peng, Yicheng Yang, Hao Zhuo

    Abstract: Fairness-aware learning aims to mitigate discrimination against specific protected social groups (e.g., those categorized by gender, ethnicity, age) while minimizing predictive performance loss. Despite efforts to improve fairness in machine learning, prior studies have shown that many models remain unfair when measured against various fairness metrics. In this paper, we examine whether the way tr…

    Submitted 1 May, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

  2. arXiv:2504.15439  [pdf, other]

    cs.LG cs.SE

    Combating Toxic Language: A Review of LLM-Based Strategies for Software Engineering

    Authors: Hao Zhuo, Yicheng Yang, Kewen Peng

    Abstract: Large Language Models (LLMs) have become integral to software engineering (SE), where they are increasingly used in development workflows. However, their widespread use raises concerns about the presence and propagation of toxic language--harmful or offensive content that can foster exclusionary environments. This paper provides a comprehensive review of recent research on toxicity detection and m…

    Submitted 21 April, 2025; originally announced April 2025.

  3. arXiv:2504.12587  [pdf, other]

    cs.LG cs.SE

    Software Engineering Principles for Fairer Systems: Experiments with GroupCART

    Authors: Kewen Peng, Hao Zhuo, Yicheng Yang, Tim Menzies

    Abstract: Discrimination-aware classification aims to make accurate predictions while satisfying fairness constraints. Traditional decision tree learners typically optimize for information gain in the target attribute alone, which can result in models that unfairly discriminate against protected social groups (e.g., gender, ethnicity). Motivated by these shortcomings, we propose GroupCART, a tree-based ense…

    Submitted 16 April, 2025; originally announced April 2025.

  4. arXiv:2502.08048  [pdf, other]

    physics.optics hep-ex

    Efficiently Laser Driven Terahertz Surface Plasmon Polaritons on Long Metal Wire

    Authors: Shuoting Shao, Xiangbing Wang, Rong Huang, Guangyue Hu, Min Chen, Huibo Tang, Longyu Kuang, Yuxi Liu, Yuqiu Gu, Yongkun Ding, Ruxin Li, Hongbin Zhuo, Mingyang Yu

    Abstract: We experimentally demonstrate a novel scheme for efficiently generating intense terahertz (THz) surface plasmon polaritons (SPPs) on a sub-wavelength-diameter meter-long metal wire. Driven by a subrelativistic femtosecond laser (a0=0.3, 3 mJ) focused at the wire's midpoint, single-cycle ten-megawatt THz SPPs are excited and propagate bidirectionally along it over 25 cm. The measured laser-to-SPP…

    Submitted 21 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  5. arXiv:2403.00783  [pdf, other]

    cs.AI

    On the Roles of LLMs in Planning: Embedding LLMs into Planning Graphs

    Authors: Hankz Hankui Zhuo, Xin Chen, Rong Pan

    Abstract: Plan synthesis aims to generate a course of actions or policies to transit given initial states to goal states, provided domain models that could be designed by experts or learnt from training data or interactions with the world. Intrigued by the claims of emergent planning capabilities in large language models (LLMs), works have been proposed to investigate the planning effectiveness of LLMs, wit…

    Submitted 26 July, 2024; v1 submitted 18 February, 2024; originally announced March 2024.

  6. arXiv:2312.15864  [pdf, other]

    cs.AI

    BalMCTS: Balancing Objective Function and Search Nodes in MCTS for Constraint Optimization Problems

    Authors: Yingkai Xiao, Jingjin Liu, Hankz Hankui Zhuo

    Abstract: Constraint Optimization Problems (COP) pose intricate challenges in combinatorial problems usually addressed through Branch and Bound (B&B) methods, which involve maintaining priority queues and iteratively selecting branches to search for solutions. However, conventional approaches take a considerable amount of time to find optimal solutions, and it is also crucial to quickly identify a near-opt…

    Submitted 25 December, 2023; originally announced December 2023.

  7. arXiv:2308.13782  [pdf, other]

    cs.CL cs.AI

    Planning with Logical Graph-based Language Model for Instruction Generation

    Authors: Fan Zhang, Kebing Jin, Hankz Hankui Zhuo

    Abstract: Despite the superior performance of large language models in generating natural language texts, it is hard to generate texts with correct logic according to a given task, because neural models have difficulty capturing implied rules from free-form texts. In this paper, we propose a novel graph-based language model, Logical-GLM, to infuse logic into language models for more valid text generation…

    Submitted 5 July, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 9 pages, 8 figures

  8. arXiv:2308.00108  [pdf, other]

    cs.CL cs.AI

    DPBERT: Efficient Inference for BERT based on Dynamic Planning

    Authors: Weixin Wu, Hankz Hankui Zhuo

    Abstract: Large-scale pre-trained language models such as BERT have contributed significantly to the development of NLP. However, those models require large computational resources, making them difficult to apply to mobile devices where computing power is limited. In this paper we aim to address the weakness of existing input-adaptive inference methods which fail to take full advantage of the structure o…

    Submitted 26 July, 2023; originally announced August 2023.

  9. arXiv:2306.08359  [pdf, other]

    cs.AI cs.LG

    Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning

    Authors: Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu, Jianye Hao

    Abstract: Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task. Agents typically fail to reach the goal state and fall into traps, which affects the overall performance of the system. To overcome this issue, we present SOMARL, a framework that uses prior knowledge to reduce the exploration space and assist learning. In SOMARL,…

    Submitted 14 June, 2023; originally announced June 2023.

  10. arXiv:2305.17866  [pdf, other]

    cs.AI cs.IR

    Sequential Condition Evolved Interaction Knowledge Graph for Traditional Chinese Medicine Recommendation

    Authors: Jingjin Liu, Hankz Hankui Zhuo, Kebing Jin, Jiamin Yuan, Zhimin Yang, Zhengan Yao

    Abstract: Traditional Chinese Medicine (TCM) has a rich history of utilizing natural herbs to treat a diversity of illnesses. In practice, TCM diagnosis and treatment are highly personalized and organically holistic, requiring comprehensive consideration of the patient's state and symptoms over time. However, existing TCM recommendation approaches overlook the changes in patient status and only explore pote…

    Submitted 6 October, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  11. arXiv:2305.13823  [pdf, other]

    cs.AI

    XRoute Environment: A Novel Reinforcement Learning Environment for Routing

    Authors: Zhanwen Zhou, Hankz Hankui Zhuo, Xiaowu Zhang, Qiyuan Deng

    Abstract: Routing is a crucial and time-consuming stage in modern design automation flow for advanced technology nodes. Great progress in the field of reinforcement learning makes it possible to use those approaches to improve the routing quality and efficiency. However, the scale of the routing problems solved by reinforcement learning-based methods in recent studies is too small for these methods to be us…

    Submitted 5 June, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:1907.11180 by other authors

  12. arXiv:2305.03613  [pdf, other]

    physics.plasm-ph physics.comp-ph

    Branching of high-current relativistic electron beam in porous materials

    Authors: K. Jiang, T. W. Huang, R. Li, M. Y. Yu, H. B. Zhuo, S. Z. Wu, C. T. Zhou, S. C. Ruan

    Abstract: Propagation of high-current relativistic electron beam (REB) in plasma is relevant to many high-energy astrophysical phenomena as well as applications based on high-intensity lasers and charged-particle beams. Here we report a new regime of beam-plasma interaction arising from REB propagation in medium with fine structures. In this regime, the REB cascades into thin branches with local density hun…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 8 pages, 5 figures

    Journal ref: Phys. Rev. Lett. 130, 185001 (2023)

  13. arXiv:2304.12090  [pdf, other]

    cs.AI

    Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey

    Authors: Chao Yu, Shicheng Ye, Hankz Hankui Zhuo

    Abstract: Reinforcement Learning (RL) has achieved tremendous development in recent years, but still faces significant obstacles in addressing complex real-life problems due to the issues of poor system generalization, low sample efficiency as well as safety and interpretability concerns. The core reason underlying such dilemmas can be attributed to the fact that most of the work has focused on the computat…

    Submitted 23 February, 2025; v1 submitted 24 April, 2023; originally announced April 2023.

  14. arXiv:2303.17984  [pdf, other]

    cs.MA

    Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning

    Authors: Zifan Wu, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

    Abstract: Research in model-based reinforcement learning has made significant progress in recent years. Compared to single-agent settings, the exponential dimension growth of the joint state-action space in multi-agent systems dramatically increases the complexity of the environment dynamics, which makes it infeasible to learn an accurate global model and thus necessitates the use of agent-wise local models…

    Submitted 31 March, 2023; originally announced March 2023.

  15. arXiv:2301.08502  [pdf, other]

    cs.LG

    Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning

    Authors: Zifan Wu, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

    Abstract: In Model-based Reinforcement Learning (MBRL), model learning is critical since an inaccurate model can bias policy learning via generating misleading samples. However, learning an accurate model can be difficult since the policy is continually updated and the induced distribution over visited states used for model learning shifts accordingly. Prior methods alleviate this issue by quantifying the u…

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Accepted by NeurIPS 2022

  16. arXiv:2212.05412  [pdf, other]

    cs.AI

    A Hierarchical Temporal Planning-Based Approach for Dynamic Hoist Scheduling Problems

    Authors: Kebing Jin, Yingkai Xiao, Hankz Hankui Zhuo, Renyong Ma

    Abstract: Hoist scheduling has become a bottleneck in electroplating industry applications with the development of autonomous devices. Although a few approaches have been proposed to target this challenging problem, they generally cannot scale to large-scale scheduling problems. In this paper, we formulate the hoist scheduling problem as a new temporal planning problem in the form of adapted PDDL, and pro…

    Submitted 11 December, 2022; originally announced December 2022.

  17. arXiv:2211.15666  [pdf, other]

    cs.LG cs.AI cs.CV

    Learning Visual Planning Models from Partially Observed Images

    Authors: Kebing Jin, Zhanhao Xiao, Hankui Hankz Zhuo, Hai Wan, Jiaran Cai

    Abstract: There has been increasing attention on planning model learning in classical planning. Most existing approaches, however, focus on learning planning models from structured data in symbolic representations. It is often difficult to obtain such structured data in real-world scenarios. Although a number of approaches have been developed for learning planning models from fully observed unstructured dat…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 25 pages, 5 figures

  18. arXiv:2202.08373  [pdf, other]

    cs.LG cs.AI cs.CL

    Text-Based Action-Model Acquisition for Planning

    Authors: Kebing Jin, Huaixun Chen, Hankz Hankui Zhuo

    Abstract: Although there have been approaches that are capable of learning action models from plan traces, there is no work on learning action models from textual observations, which are pervasive and much easier to collect from real-world applications compared to plan traces. In this paper we propose a novel approach to learning action models from natural language texts by integrating Constraint Satisfactio…

    Submitted 17 February, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  19. arXiv:2202.07138  [pdf, other]

    cs.AI cs.CL

    Integrating AI Planning with Natural Language Processing: A Combination of Explicit and Tacit Knowledge

    Authors: Kebing Jin, Hankz Hankui Zhuo

    Abstract: Natural language processing (NLP) aims at investigating the interactions between agents and humans, processing and analyzing large amounts of natural language data. Large-scale language models play an important role in current natural language processing. However, the challenges of explainability and complexity come along with the developments of language models. One way is to introduce logical re…

    Submitted 13 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

  20. arXiv:2202.01256  [pdf, other]

    cs.AI

    Introduction to The Dynamic Pickup and Delivery Problem Benchmark -- ICAPS 2021 Competition

    Authors: Jianye Hao, Jiawen Lu, Xijun Li, Xialiang Tong, Xiang Xiang, Mingxuan Yuan, Hankz Hankui Zhuo

    Abstract: The Dynamic Pickup and Delivery Problem (DPDP) is an essential problem within the logistics domain. So far, research on this problem has mainly focused on using artificial data which fails to reflect the complexity of real-world problems. In this draft, we would like to introduce a new benchmark from real business scenarios as well as a simulator supporting the dynamic evaluation. The benchmark an…

    Submitted 18 January, 2022; originally announced February 2022.

  21. arXiv:2112.09836  [pdf, other]

    cs.AI cs.LG

    Creativity of AI: Hierarchical Planning Model Learning for Facilitating Deep Reinforcement Learning

    Authors: Hankz Hankui Zhuo, Shuting Deng, Mu Jin, Zhihao Ma, Kebing Jin, Chen Chen, Chao Yu

    Abstract: Despite achieving great success in real-world applications, Deep Reinforcement Learning (DRL) still suffers from three critical issues, i.e., data efficiency, lack of interpretability, and transferability. Recent research shows that embedding symbolic knowledge into DRL is promising in addressing those challenges. Inspired by this, we introduce a novel deep reinforcement learning framew…

    Submitted 7 July, 2023; v1 submitted 17 December, 2021; originally announced December 2021.

  22. arXiv:2112.06028  [pdf, other]

    cs.AI

    Retrosynthetic Planning with Experience-Guided Monte Carlo Tree Search

    Authors: Siqi Hong, Hankz Hankui Zhuo, Kebing Jin, Guang Shao, Zhanwen Zhou

    Abstract: In retrosynthetic planning, the huge number of possible routes to synthesize a complex molecule using simple building blocks leads to a combinatorial explosion of possibilities. Even experienced chemists often have difficulty selecting the most promising transformations. The current approaches rely on human-defined or machine-trained score functions which have limited chemical knowledge or use exp…

    Submitted 9 June, 2023; v1 submitted 11 December, 2021; originally announced December 2021.

  23. arXiv:2111.09475  [pdf, other]

    cs.AI

    Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines

    Authors: Xuejing Zheng, Chao Yu, Chen Chen, Jianye Hao, Hankz Hankui Zhuo

    Abstract: Continuously learning new tasks using high-level ideas or knowledge is a key capability of humans. In this paper, we propose Lifelong reinforcement learning with Sequential linear temporal logic formulas and Reward Machines (LSRM), which enables an agent to leverage previously learned knowledge to speed up learning of logically specified tasks. For the sake of more flexible specification of tasks, w…

    Submitted 17 November, 2021; originally announced November 2021.

  24. arXiv:2111.04051  [pdf, other]

    cs.AI

    Coordinated Proximal Policy Optimization

    Authors: Zifan Wu, Chao Yu, Deheng Ye, Junge Zhang, Haiyin Piao, Hankz Hankui Zhuo

    Abstract: We present Coordinated Proximal Policy Optimization (CoPPO), an algorithm that extends the original Proximal Policy Optimization (PPO) to the multi-agent setting. The key idea lies in the coordinated adaptation of step size during the policy update process among multiple agents. We prove the monotonicity of policy improvement when optimizing a theoretically-grounded joint objective, and derive a s…

    Submitted 7 November, 2021; originally announced November 2021.

  25. Gradient-Based Mixed Planning with Symbolic and Numeric Action Parameters

    Authors: Kebing Jin, Hankz Hankui Zhuo, Zhanhao Xiao, Hai Wan, Subbarao Kambhampati

    Abstract: Dealing with planning problems with both logical relations and numeric changes in real-world dynamic environments is challenging. Existing numeric planning systems for the problem often discretize numeric variables or impose convex constraints on numeric variables, which harms the performance when solving problems. In this paper, we propose a novel algorithm framework to solve numeric planning pro…

    Submitted 9 October, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: 41 pages, 22 figures. Accepted by Artificial Intelligence

  26. arXiv:2109.14327  [pdf, other]

    physics.plasm-ph physics.optics

    Branched flow of intense laser light in plasma with uneven density distribution

    Authors: K. Jiang, T. W. Huang, C. N. Wu, M. Y. Yu, H. Zhang, S. Z. Wu, H. B. Zhuo, A. Pukhov, C. T. Zhou, S. C. Ruan

    Abstract: Branched flow is an interesting phenomenon that can occur in diverse systems. It is usually linear in the sense that the flow does not alter the medium properties. Branched flow of light on thin films was recently discovered. A question of interest is thus if nonlinear branched flow of light can also occur. Here we found using particle-in-cell simulations that with intense laser propagating in pla…

    Submitted 15 May, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: 6 pages, 5 figures

  27. arXiv:2103.08228  [pdf, other]

    cs.AI

    Learning Symbolic Rules for Interpretable Deep Reinforcement Learning

    Authors: Zhihao Ma, Yuzheng Zhuang, Paul Weng, Hankz Hankui Zhuo, Dong Li, Wulong Liu, Jianye Hao

    Abstract: Recent progress in deep reinforcement learning (DRL) can be largely attributed to the use of neural networks. However, this black-box approach fails to explain the learned policy in a human understandable way. To address this challenge and improve the transparency, we propose a Neural Symbolic Reinforcement Learning framework by introducing symbolic logic into DRL. This framework features a fertil…

    Submitted 16 March, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  28. arXiv:2005.01996  [pdf, other]

    eess.IV cs.CV

    NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results

    Authors: Andreas Lugmayr, Martin Danelljan, Radu Timofte, Namhyuk Ahn, Dongwoon Bai, Jie Cai, Yun Cao, Junyang Chen, Kaihua Cheng, SeYoung Chun, Wei Deng, Mostafa El-Khamy, Chiu Man Ho, Xiaozhong Ji, Amin Kheradmand, Gwantae Kim, Hanseok Ko, Kanghyu Lee, Jungwon Lee, Hao Li, Ziluan Liu, Zhi-Song Liu, Shuai Liu, Yunhua Lu, Zibo Meng, et al. (21 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real world super-resolution. It focuses on the participating methods and final results. The challenge addresses the real world setting, where paired true high and low-resolution images are unavailable. For training, only one set of source input images is therefore provided along with a set of unpaired high-quality target images. In Track 1: Image Proc…

    Submitted 5 May, 2020; originally announced May 2020.

  29. arXiv:2002.11501  [pdf, other]

    cs.LG stat.ML

    Dual Graph Representation Learning

    Authors: Huiling Zhu, Xin Luo, Hankz Hankui Zhuo

    Abstract: Graph representation learning embeds nodes in large graphs as low-dimensional vectors and is of great benefit to many downstream applications. Most embedding frameworks, however, are inherently transductive and unable to generalize to unseen nodes or learn representations across different graphs. Although inductive approaches can generalize to unseen nodes, they neglect different contexts of nodes…

    Submitted 24 February, 2020; originally announced February 2020.

  30. arXiv:1911.12949  [pdf, other]

    cs.AI

    Refining HTN Methods via Task Insertion with Preferences

    Authors: Zhanhao Xiao, Hai Wan, Hankui Hankz Zhuo, Andreas Herzig, Laurent Perrussel, Peilin Chen

    Abstract: Hierarchical Task Network (HTN) planning is showing its power in real-world planning. Although domain experts have partial hierarchical domain knowledge, it is time-consuming to specify all HTN methods, leaving them incomplete. On the other hand, traditional HTN learning approaches focus only on declarative goals, omitting the hierarchical domain knowledge. In this paper, we propose a novel learni…

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: 8 pages, 7 figures, accepted at AAAI-20

  31. arXiv:1911.05701  [pdf, other]

    cs.LG cs.AI

    Transfer Value Iteration Networks

    Authors: Junyi Shen, Hankz Hankui Zhuo, Jin Xu, Bin Zhong, Sinno Jialin Pan

    Abstract: Value iteration networks (VINs) have been demonstrated to have a good generalization ability for reinforcement learning tasks across similar domains. However, based on our experiments, a policy learned by VINs still fails to generalize well on a domain whose action space and feature space are not identical to those in the domain where it is trained. In this paper, we propose a transfer learning a…

    Submitted 26 November, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

  32. arXiv:1909.09616  [pdf, other]

    cs.AI

    Repositioning Bikes with Carrier Vehicles and Bike Trailers in Bike Sharing Systems

    Authors: Xinghua Zheng, Ming Tang, Hankz Hankui Zhuo, Kevin X. Wen

    Abstract: Bike Sharing Systems (BSSs) have been adopted in many major cities of the world due to traffic congestion and carbon emissions. Although there have been approaches to exploiting either bike trailers via crowdsourcing or carrier vehicles to reposition bikes in the "right" stations at the "right" time, they do not jointly consider the usage of both bike trailers and carrier vehicles. In this pap…

    Submitted 20 September, 2019; originally announced September 2019.

  33. arXiv:1908.09800  [pdf, other]

    cs.AI

    Learning Action Models from Disordered and Noisy Plan Traces

    Authors: Hankz Hankui Zhuo, Jing Peng, Subbarao Kambhampati

    Abstract: There is increasing awareness in the planning community that the burden of specifying complete domain models is too high, which impedes the applicability of planning technology in many real-world domains. Although there have been many learning systems that help automatically learn domain models, most existing work assumes that the input traces are completely correct. A more realistic situation is th…

    Submitted 9 September, 2019; v1 submitted 26 August, 2019; originally announced August 2019.

    Comments: 8 pages

  34. arXiv:1907.08352  [pdf, other]

    cs.AI

    Representation Learning for Classical Planning from Partially Observed Traces

    Authors: Zhanhao Xiao, Hai Wan, Hankui Hankz Zhuo, Jinxia Lin, Yanan Liu

    Abstract: Specifying a complete domain model is time-consuming, which has been a bottleneck of AI planning technique application in many real-world scenarios. Most classical domain-model learning approaches output a domain model in the form of the declarative planning language, such as STRIPS or PDDL, and solve new planning instances by invoking an existing planner. However, planning in such a representatio…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: 11 pages, 6 figures

  35. arXiv:1906.00638  [pdf, other]

    cs.IR cs.CL

    Federated Hierarchical Hybrid Networks for Clickbait Detection

    Authors: Feng Liao, Hankz Hankui Zhuo, Xiaoling Huang, Yu Zhang

    Abstract: Online media outlets adopt clickbait techniques to lure readers to click on articles in a bid to expand their reach and subsequently increase revenue through ad monetization. As the adverse effects of clickbait attract more and more attention, researchers have started to explore machine learning techniques to automatically detect clickbaits. Previous work on clickbait detection assumes that all th…

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: 10 pages

  36. arXiv:1901.08277  [pdf, other]

    cs.LG cs.AI

    Federated Deep Reinforcement Learning

    Authors: Hankz Hankui Zhuo, Wenfeng Feng, Yufeng Lin, Qian Xu, Qiang Yang

    Abstract: In deep reinforcement learning, building policies of high-quality is challenging when the feature space of states is small and the training data is limited. Despite the success of previous transfer learning approaches in deep reinforcement learning, directly transferring data or models from an agent to another agent is often not allowed due to the privacy of data and/or models in many privacy-awar…

    Submitted 9 February, 2020; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 9 pages, 5 figures

  37. arXiv:1806.05320  [pdf, other]

    cs.CV

    SCSP: Spectral Clustering Filter Pruning with Soft Self-adaption Manners

    Authors: Huiyuan Zhuo, Xuelin Qian, Yanwei Fu, Heng Yang, Xiangyang Xue

    Abstract: Deep Convolutional Neural Networks (CNNs) have achieved significant success in the computer vision field. However, the high computational cost of deep complex models prevents their deployment on edge devices with limited memory and computational resources. In this paper, we propose a novel filter pruning method for convolutional neural network compression, namely spectral clustering filter pruning with soft…

    Submitted 13 June, 2018; originally announced June 2018.

  38. arXiv:1804.07013  [pdf]

    cs.AI

    An Integrated Development Environment for Planning Domain Modeling

    Authors: Yuncong Li, Hankz Hankui Zhuo

    Abstract: To make the task of describing planning domains and problems more comprehensible to non-experts in planning, visual representations have been used in planning domain modeling in recent years. However, current knowledge engineering tools with visual modeling, like itSIMPLE (Vaquero et al. 2012) and VIZ (Vodrážka and Chrpa 2010), are less efficient than the traditional method of hand-c…

    Submitted 19 April, 2018; originally announced April 2018.

  39. arXiv:1803.02632  [pdf, other]

    cs.AI cs.CL

    Extracting Action Sequences from Texts Based on Deep Reinforcement Learning

    Authors: Wenfeng Feng, Hankz Hankui Zhuo, Subbarao Kambhampati

    Abstract: Extracting action sequences from natural language texts is challenging, as it requires commonsense inferences based on world knowledge. Although there has been work on extracting action scripts, instructions, navigation actions, etc., they require that either the set of candidate actions be provided in advance, or that action descriptions are restricted to a specific form, e.g., description templa…

    Submitted 11 May, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

    Comments: 7 pages, 6 figures

  40. arXiv:1803.02208  [pdf, other]

    cs.AI

    Discovering Underlying Plans Based on Shallow Models

    Authors: Hankz Hankui Zhuo, Yantian Zha, Subbarao Kambhampati

    Abstract: Plan recognition aims to discover target plans (i.e., sequences of actions) behind observed actions, with history plan libraries or domain models in hand. Previous approaches either discover plans by maximally "matching" observed actions to plan libraries, assuming target plans are from plan libraries, or infer plans by executing domain models to best explain the observed actions, assuming that co…

    Submitted 3 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1511.05662

  41. arXiv:1708.03744  [pdf, ps, other]

    physics.plasm-ph

    Effective suppression of parametric instabilities with decoupled broadband lasers in plasma

    Authors: Yao Zhao, Suming Weng, Min Chen, Jun Zheng, Hongbin Zhuo, Chuang Ren, Zhengming Sheng, Jie Zhang

    Abstract: A theoretical analysis for the stimulated Raman scattering (SRS) instability driven by two laser beams with certain frequency difference is presented. It is found that strong coupling and enhanced SRS take place only when the unstable regions corresponding respectively to the two beams are overlapped in the wavenumber space. Hence a threshold of the beam frequency difference for their decoupling i…

    Submitted 23 March, 2022; v1 submitted 12 August, 2017; originally announced August 2017.

    Comments: 6 pages, 4 figures

  42. arXiv:1703.06587  [pdf, other]

    cs.IR

    Paper2vec: Citation-Context Based Document Distributed Representation for Scholar Recommendation

    Authors: Han Tian, Hankz Hankui Zhuo

    Abstract: Due to the availability of references of research papers and the rich information contained in papers, various citation analysis approaches have been proposed to identify similar documents for scholar recommendation. Despite the success of previous approaches, they are, however, based on co-occurrence of items. Once there are no co-occurrence items available in documents, they will not work wel…

    Submitted 19 March, 2017; originally announced March 2017.

  43. arXiv:1703.04854  [pdf, ps, other]

    cs.IR cs.CL

    Distributed-Representation Based Hybrid Recommender System with Short Item Descriptions

    Authors: Junhua He, Hankz Hankui Zhuo, Jarvan Law

    Abstract: Collaborative filtering (CF) aims to build a model from users' past behaviors and/or similar decisions made by other users, and use the model to recommend items for users. Despite the success of previous collaborative filtering approaches, they are all based on the assumption that there are sufficient rating scores available for building high-quality recommendation models. In real world applica…

    Submitted 14 March, 2017; originally announced March 2017.

    Comments: 10 pages, 5 figures

  44. arXiv:1702.07543   

    cs.AI

    Embedding Knowledge Graphs Based on Transitivity and Antisymmetry of Rules

    Authors: Mengya Wang, Hankui Zhuo, Huiling Zhu

    Abstract: Representation learning of knowledge graphs encodes entities and relation types into a continuous low-dimensional vector space and learns embeddings of entities and relation types. Most existing methods only concentrate on knowledge triples, ignoring logic rules which contain rich background knowledge. Although there has been some work aiming at leveraging both knowledge triples and logic rules, they…

    Submitted 19 April, 2017; v1 submitted 24 February, 2017; originally announced February 2017.

    Comments: This paper has been withdrawn by the authors due to a crucial sign error in equations

  45. arXiv:1702.07117  [pdf, other]

    cs.CL

    LTSG: Latent Topical Skip-Gram for Mutually Learning Topic Model and Vector Representations

    Authors: Jarvan Law, Hankz Hankui Zhuo, Junhua He, Erhu Rong

    Abstract: Topic models have been widely used in discovering latent topics which are shared across documents in text mining. Vector representations, word embeddings and topic embeddings, map words and topics into a low-dimensional and dense real-value vector space, which have obtained high performance in NLP tasks. However, most of the existing models assume the result trained by one of them are perfect corr…

    Submitted 23 February, 2017; originally announced February 2017.

  46. arXiv:1701.02918  [pdf, other]

    physics.plasm-ph

    Containing intense laser light in circular cavity with magnetic trap door

    Authors: X. H. Yang, W. Yu, M. Y. Yu, H. Xu, Y. Y. Ma, Z. M. Sheng, H. B. Zhuo, Z. Y. Ge, F. Q. Shao

    Abstract: It is shown by particle-in-cell simulation that intense circularly polarized (CP) laser light can be contained in the cavity of a solid-density circular Al-plasma shell for hundreds of light-wave periods before it is dissipated by laser-plasma interaction. A right-hand CP laser pulse can propagate almost without reflection into the cavity through a highly magnetized overdense H-plasma slab filling…

    Submitted 11 January, 2017; originally announced January 2017.

  47. arXiv:1610.05572  [pdf, ps, other]

    physics.plasm-ph

    Study of filamentation instability on the divergence of ultraintense laser-driven electrons

    Authors: X. H. Yang, H. B. Zhuo, H. Xu, Z. Y. Ge, F. Q. Shao, M. Borghesi, Y. Y. Ma

    Abstract: Generation of relativistic electron (RE) beams during ultraintense laser pulse interaction with plasma targets is studied by collisional particle-in-cell (PIC) simulations. A strong magnetic field with a transverse scale length of several local plasma skin depths, associated with RE current propagation in the target, is generated by filamentation instability (FI) in collisional plasmas, inducing a gr…

    Submitted 14 October, 2016; originally announced October 2016.

  48. arXiv:1511.08158  [pdf, other]

    cs.AI cs.RO

    Plan Explicability and Predictability for Robot Task Planning

    Authors: Yu Zhang, Sarath Sreedharan, Anagha Kulkarni, Tathagata Chakraborti, Hankz Hankui Zhuo, Subbarao Kambhampati

    Abstract: Intelligent robots and machines are becoming pervasive in human-populated environments. A desirable capability of these agents is to respond to goal-oriented commands by autonomously constructing task plans. However, such autonomy can add significant cognitive load and potentially introduce safety risks to humans when agents behave unexpectedly. Hence, for such agents to be helpful, one important…

    Submitted 12 April, 2016; v1 submitted 25 November, 2015; originally announced November 2015.

    Comments: Added physical robot evaluations

  49. arXiv:1511.05662  [pdf, other]

    cs.AI

    Discovering Underlying Plans Based on Distributed Representations of Actions

    Authors: Xin Tian, Hankz Hankui Zhuo, Subbarao Kambhampati

    Abstract: Plan recognition aims to discover target plans (i.e., sequences of actions) behind observed actions, with history plan libraries or domain models in hand. Previous approaches either discover plans by maximally "matching" observed actions to plan libraries, assuming target plans are from plan libraries, or infer plans by executing domain models to best explain the observed actions, assuming complet…

    Submitted 18 November, 2015; originally announced November 2015.

  50. $D_{sJ}(2860)$ From The Semileptonic Decays Of $B_s$ Mesons

    Authors: Long-Fei Gan, Jian-Rong Zhang, Ming-Qiu Huang, Hong-Bin Zhuo, Yan-Yun Ma, Qing-Jun Zhu, Jian-Xun Liu, Guo-Bo Zhang

    Abstract: In the framework of heavy quark effective theory, the leading order Isgur-Wise form factors relevant to semileptonic decays of the ground state $\bar{b}s$ meson $B_{s}$ into orbitally excited $D$-wave $\bar{c}s$ mesons, including the newly observed narrow $D^{*}_{s1}(2860)$ and $D^{*}_{s3}(2860)$ states by the LHCb Collaboration, are calculated with the QCD sum rule method. With these universal fo…

    Submitted 3 April, 2015; v1 submitted 26 December, 2014; originally announced December 2014.

    Comments: 13 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:1112.5227, arXiv:1009.0980