Skip to main content

Showing 1–47 of 47 results for author: Ren, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23705  [pdf, ps, other

    cs.LG cs.RO

    Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better

    Authors: Danny Driess, Jost Tobias Springenberg, Brian Ichter, Lili Yu, Adrian Li-Bell, Karl Pertsch, Allen Z. Ren, Homer Walke, Quan Vuong, Lucy Xiaoyang Shi, Sergey Levine

    Abstract: Vision-language-action (VLA) models provide a powerful approach to training control policies for physical systems, such as robots, by combining end-to-end learning with transfer of semantic knowledge from web-scale vision-language model (VLM) training. However, the constraints of real-time control are often at odds with the design of VLMs: the most powerful VLMs have tens or hundreds of billions o… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.15818  [pdf, ps, other

    cs.CV

    InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition

    Authors: Yijie Zheng, Weijie Wu, Qingyun Li, Xuehui Wang, Xu Zhou, Aiai Ren, Jun Shen, Long Zhao, Guoqing Li, Xue Yang

    Abstract: Language-Guided object recognition in remote sensing imagery is crucial for large-scale mapping and automated data annotation. However, existing open-vocabulary and visual grounding methods rely on explicit category cues, limiting their ability to handle complex or implicit queries that require advanced reasoning. To address this issue, we introduce a new suite of tasks, including Instruction-Orie… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2505.07728  [pdf, other

    cs.RO cs.AI cs.LG

    Guiding Data Collection via Factored Scaling Curves

    Authors: Lihan Zha, Apurva Badithela, Michael Zhang, Justin Lidard, Jeremy Bao, Emily Zhou, David Snyder, Allen Z. Ren, Dhruv Shah, Anirudha Majumdar

    Abstract: Generalist imitation learning policies trained on large datasets show great promise for solving diverse manipulation tasks. However, to ensure generalization to different conditions, policies need to be trained with data collected across a large set of environmental factor variations (e.g., camera pose, table height, distractors) $-$ a prohibitively expensive undertaking, if done exhaustively. We… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Project website: https://factored-data-scaling.github.io

  4. arXiv:2505.01985  [pdf, other

    math.OC cs.LG

    Optimization over Trained (and Sparse) Neural Networks: A Surrogate within a Surrogate

    Authors: Hung Pham, Aiden Ren, Ibrahim Tahir, Jiatai Tong, Thiago Serra

    Abstract: We can approximate a constraint or an objective function that is uncertain or nonlinear with a neural network that we embed in the optimization model. This approach, which is known as constraint learning, faces the challenge that optimization models with neural network surrogates are harder to solve. Such difficulties have motivated studies on model reformulation, specialized optimization algorith… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  5. arXiv:2504.16054  [pdf, other

    cs.LG cs.RO

    $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization

    Authors: Physical Intelligence, Kevin Black, Noah Brown, James Darpinian, Karan Dhabalia, Danny Driess, Adnan Esmail, Michael Equi, Chelsea Finn, Niccolo Fusai, Manuel Y. Galliker, Dibya Ghosh, Lachy Groom, Karol Hausman, Brian Ichter, Szymon Jakubczak, Tim Jones, Liyiming Ke, Devin LeBlanc, Sergey Levine, Adrian Li-Bell, Mohith Mothukuri, Suraj Nair, Karl Pertsch, Allen Z. Ren , et al. (11 additional authors not shown)

    Abstract: In order for robots to be useful, they must perform practically relevant tasks in the real world, outside of the lab. While vision-language-action (VLA) models have demonstrated impressive results for end-to-end robot control, it remains an open question how far such models can generalize in the wild. We describe $π_{0.5}$, a new model based on $π_{0}$ that uses co-training on heterogeneous tasks… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  6. arXiv:2503.22394  [pdf, other

    cs.CV cs.AI

    Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision

    Authors: Rulin Zhou, Wenlong He, An Wang, Qiqi Yao, Haijun Hu, Jiankun Wang, Xi Zhang an Hongliang Ren

    Abstract: Accurate tissue point tracking in endoscopic videos is critical for robotic-assisted surgical navigation and scene understanding, but remains challenging due to complex deformations, instrument occlusion, and the scarcity of dense trajectory annotations. Existing methods struggle with long-term tracking under these conditions due to limited feature utilization and annotation dependence. We present… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  7. arXiv:2501.14817  [pdf

    cs.LG cs.CE

    A Cutting Mechanics-based Machine Learning Modeling Method to Discover Governing Equations of Machining Dynamics

    Authors: Alisa Ren, Mason Ma, Jiajie Wu, Jaydeep Karandikar, Chris Tyler, Tony Shi, Tony Schmitz

    Abstract: This paper proposes a cutting mechanics-based machine learning (CMML) modeling method to discover governing equations of machining dynamics. The main idea of CMML design is to integrate existing physics in cutting mechanics and unknown physics in data to achieve automated model discovery, with the potential to advance machining modeling. Based on existing physics in cutting mechanics, CMML first e… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

  8. arXiv:2412.05563  [pdf, ps, other

    cs.CL cs.AI

    A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

    Authors: Ola Shorinwa, Zhiting Mei, Justin Lidard, Allen Z. Ren, Anirudha Majumdar

    Abstract: The remarkable performance of large language models (LLMs) in content generation, coding, and common-sense reasoning has spurred widespread integration into many facets of society. However, integration of LLMs raises valid questions on their reliability and trustworthiness, given their propensity to generate hallucinations: plausible, factually-incorrect responses, which are expressed with strikin… ▽ More

    Submitted 1 July, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

  9. arXiv:2411.01790  [pdf, other

    cs.AI cs.LG

    Thinking Forward and Backward: Effective Backward Planning with Large Language Models

    Authors: Allen Z. Ren, Brian Ichter, Anirudha Majumdar

    Abstract: Large language models (LLMs) have exhibited remarkable reasoning and planning capabilities. Most prior work in this area has used LLMs to reason through steps from an initial to a goal state or criterion, thereby effectively reasoning in a forward direction. Nonetheless, many planning problems exhibit an inherent asymmetry such that planning backward from the goal is significantly easier -- for ex… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: Under review

  10. arXiv:2410.01971  [pdf, other

    cs.RO cs.LG

    Run-time Observation Interventions Make Vision-Language-Action Models More Visually Robust

    Authors: Asher J. Hancock, Allen Z. Ren, Anirudha Majumdar

    Abstract: Vision-language-action (VLA) models trained on large-scale internet data and robot demonstrations have the potential to serve as generalist robot policies. However, despite their large-scale training, VLAs are often brittle to task-irrelevant visual details such as distractor objects or background colors. We introduce Bring Your Own VLA (BYOVLA): a run-time intervention scheme that (1) dynamically… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: Website: https://aasherh.github.io/byovla/

  11. arXiv:2409.00588  [pdf, other

    cs.RO cs.LG

    Diffusion Policy Policy Optimization

    Authors: Allen Z. Ren, Justin Lidard, Lars L. Ankile, Anthony Simeonov, Pulkit Agrawal, Anirudha Majumdar, Benjamin Burchfiel, Hongkai Dai, Max Simchowitz

    Abstract: We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework including best practices for fine-tuning diffusion-based policies (e.g. Diffusion Policy) in continuous control and robot learning tasks using the policy gradient (PG) method from reinforcement learning (RL). PG methods are ubiquitous in training RL policies with other policy parameterizations; nevertheless, they had… ▽ More

    Submitted 9 December, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: Website: diffusion-ppo.github.io

  12. arXiv:2408.03562  [pdf, other

    cs.CL cs.AI

    A Comparison of LLM Finetuning Methods & Evaluation Metrics with Travel Chatbot Use Case

    Authors: Sonia Meyer, Shreya Singh, Bertha Tam, Christopher Ton, Angel Ren

    Abstract: This research compares large language model (LLM) fine-tuning methods, including Quantized Low Rank Adapter (QLoRA), Retrieval Augmented fine-tuning (RAFT), and Reinforcement Learning from Human Feedback (RLHF), and additionally compared LLM evaluation methods including End to End (E2E) benchmark method of "Golden Answers", traditional natural language processing (NLP) metrics, RAG Assessment (Rag… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  13. arXiv:2404.05425  [pdf, other

    cs.SE

    Requirements Elicitation in Government Projects: A Preliminary Empirical Study

    Authors: Anqi Ren, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Kaijia Xu, Xishuo Zhang, Chetan Arora

    Abstract: Government development projects vary significantly from private sector initiatives in scope, stakeholder complexity, and regulatory requirements. There is a lack of empirical studies focusing on requirements engineering (RE) activities specifically for government projects. We addressed this gap by conducting a series of semi-structured interviews with 12 professional software practitioners working… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  14. arXiv:2403.15941  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Explore until Confident: Efficient Exploration for Embodied Question Answering

    Authors: Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigh

    Abstract: We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions… ▽ More

    Submitted 7 July, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  15. arXiv:2403.08185  [pdf, other

    cs.RO eess.SY

    Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception

    Authors: Zhiting Mei, Anushri Dixit, Meghan Booker, Emily Zhou, Mariko Storey-Matsutani, Allen Z. Ren, Ola Shorinwa, Anirudha Majumdar

    Abstract: Rapid advances in perception have enabled large pre-trained models to be used out of the box for transforming high-dimensional, noisy, and partial observations of the world into rich occupancy representations. However, the reliability of these models and consequently their safe integration onto robots remains unknown when deployed in environments unseen during training. To provide safety guarantee… ▽ More

    Submitted 17 April, 2025; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Videos and code can be found at https://perceive-with-confidence.github.io

  16. arXiv:2402.11450  [pdf, other

    cs.RO

    Learning to Learn Faster from Human Feedback with Language Model Predictive Control

    Authors: Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauza, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil Joshi, Ben Jyenis, Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore , et al. (25 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to exhibit a wide range of capabilities, such as writing robot code from language commands -- enabling non-experts to direct robot behaviors, modify them based on feedback, or compose them to perform new tasks. However, these capabilities (driven by in-context learning) are limited to short-term interactions, where users' feedback remains relevant for o… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  17. arXiv:2307.01928  [pdf, other

    cs.RO cs.AI stat.AP

    Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

    Authors: Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

    Abstract: Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions. In this work, we present KnowNo, which is a framework for measuring and aligning the uncertainty of LLM-based planners such that they know when they don't know and ask for… ▽ More

    Submitted 4 September, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Conference on Robot Learning (CoRL) 2023, Oral Presentation

  18. arXiv:2307.00390  [pdf, other

    cs.SE

    PersonaGen: A Tool for Generating Personas from User Feedback

    Authors: Xishuo Zhang, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Anqi Ren, Chetan Arora

    Abstract: Personas are crucial in software development processes, particularly in agile settings. However, no effective tools are available for generating personas from user feedback in agile software development processes. To fill this gap, we propose a novel tool that uses the GPT-4 model and knowledge graph to generate persona templates from well-processed user feedback, facilitating requirement analysis… ▽ More

    Submitted 6 July, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

  19. arXiv:2304.04602  [pdf, other

    cs.RO cs.HC cs.LG

    Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

    Authors: Zihan Ding, Yuanpei Chen, Allen Z. Ren, Shixiang Shane Gu, Qianxu Wang, Hao Dong, Chi Jin

    Abstract: Generating human-like behavior on robots is a great challenge especially in dexterous manipulation tasks with robotic hands. Scripting policies from scratch is intractable due to the high-dimensional control space, and training policies with reinforcement learning (RL) and manual reward engineering can also be hard and lead to unnatural motions. Leveraging the recent progress on RL from Human Feed… ▽ More

    Submitted 13 September, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  20. arXiv:2302.04903  [pdf, other

    cs.RO cs.LG eess.SY

    AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer

    Authors: Allen Z. Ren, Hongkai Dai, Benjamin Burchfiel, Anirudha Majumdar

    Abstract: Simulation parameter settings such as contact models and object geometry approximations are critical to training robust robotic policies capable of transferring from simulation to real-world deployment. Previous approaches typically handcraft distributions over such parameters (domain randomization), or identify parameters that best match the dynamics of the real environment (system identification… ▽ More

    Submitted 30 September, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Conference on Robot Learning (CoRL), 2023

  21. arXiv:2210.05857  [pdf, other

    cs.RO

    FlowDrone: Wind Estimation and Gust Rejection on UAVs Using Fast-Response Hot-Wire Flow Sensors

    Authors: Nathaniel Simon, Allen Z. Ren, Alexander Piqué, David Snyder, Daphne Barretto, Marcus Hultmark, Anirudha Majumdar

    Abstract: Unmanned aerial vehicles (UAVs) are finding use in applications that place increasing emphasis on robustness to external disturbances including extreme wind. However, traditional multirotor UAV platforms do not directly sense wind; conventional flow sensors are too slow, insensitive, or bulky for widespread integration on UAVs. Instead, drones typically observe the effects of wind indirectly throu… ▽ More

    Submitted 24 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Submitted to ICRA 2023. See supplementary video at https://youtu.be/KWqkH9Z-338

  22. arXiv:2206.13074  [pdf, other

    cs.RO cs.AI cs.LG

    Leveraging Language for Accelerated Learning of Tool Manipulation

    Authors: Allen Z. Ren, Bharat Govil, Tsung-Yen Yang, Karthik Narasimhan, Anirudha Majumdar

    Abstract: Robust and generalized tool manipulation requires an understanding of the properties and affordances of different tools. We investigate whether linguistic information about a tool (e.g., its geometry, common uses) can help control policies adapt faster to new tools for a given task. We obtain diverse descriptions of various tools in natural language and use pre-trained language models to generate… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

  23. arXiv:2202.05894  [pdf, other

    cs.RO

    Failure Prediction with Statistical Guarantees for Vision-Based Robot Control

    Authors: Alec Farid, David Snyder, Allen Z. Ren, Anirudha Majumdar

    Abstract: We are motivated by the problem of performing failure prediction for safety-critical robotic systems with high-dimensional sensor observations (e.g., vision). Given access to a black-box control policy (e.g., in the form of a neural network) and a dataset of training environments, we present an approach for synthesizing a failure predictor with guaranteed bounds on false-positive and false-negativ… ▽ More

    Submitted 5 May, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  24. Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees

    Authors: Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac

    Abstract: Safety is a critical component of autonomous systems and remains a challenge for learning-based policies to be utilized in the real world. In particular, policies learned using reinforcement learning often fail to generalize to novel environments due to unsafe behavior. In this paper, we propose Sim-to-Lab-to-Real to bridge the reality gap with a probabilistically guaranteed safety-aware policy di… ▽ More

    Submitted 1 April, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to Special Issue on Risk-aware Autonomous Systems: Theory and Practice, Artificial Intelligence

  25. arXiv:2111.08761  [pdf, other

    cs.RO cs.LG eess.SY

    Stronger Generalization Guarantees for Robot Learning by Combining Generative Models and Real-World Data

    Authors: Abhinav Agarwal, Sushant Veer, Allen Z. Ren, Anirudha Majumdar

    Abstract: We are motivated by the problem of learning policies for robotic systems with rich sensory inputs (e.g., vision) in a manner that allows us to guarantee generalization to environments unseen during training. We provide a framework for providing such generalization guarantees by leveraging a finite dataset of real-world environments in combination with a (potentially inaccurate) generative model of… ▽ More

    Submitted 22 July, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  26. arXiv:2107.06353  [pdf, other

    cs.RO cs.LG

    Distributionally Robust Policy Learning via Adversarial Environment Generation

    Authors: Allen Z. Ren, Anirudha Majumdar

    Abstract: Our goal is to train control policies that generalize well to unseen environments. Inspired by the Distributionally Robust Optimization (DRO) framework, we propose DRAGEN - Distributionally Robust policy learning via Adversarial Generation of ENvironments - for iteratively improving robustness of policies to realistic distribution shifts by generating adversarial environments. The key idea is to l… ▽ More

    Submitted 7 July, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: IEEE Robotics and Automation Letters, 2022. Presented at ICRA 2022

  27. arXiv:2106.09166  [pdf, other

    cs.LG cs.AI cs.PF

    Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI

    Authors: Geng Yuan, Zhiheng Liao, Xiaolong Ma, Yuxuan Cai, Zhenglun Kong, Xuan Shen, Jingyan Fu, Zhengang Li, Chengming Zhang, Hongwu Peng, Ning Liu, Ao Ren, Jinhui Wang, Yanzhi Wang

    Abstract: Recent research demonstrated the promise of using resistive random access memory (ReRAM) as an emerging technology to perform inherently parallel analog domain in-situ matrix-vector multiplication -- the intensive and key computation in deep neural networks (DNNs). However, hardware failure, such as stuck-at-fault defects, is one of the main concerns that impedes the ReRAM devices to be a feasible… ▽ More

    Submitted 18 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: In Proceedings of the 22nd International Symposium on Quality Electronic Design (ISQED), 2021

  28. CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework

    Authors: Yu Zhang, Moming Duan, Duo Liu, Li Li, Ao Ren, Xianzhang Chen, Yujuan Tan, Chengliang Wang

    Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and fast convergence speed. However, this synchronous communication strategy has… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: This paper will be presented at IJCNN 2021

  29. FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems

    Authors: Li Li, Moming Duan, Duo Liu, Yu Zhang, Ao Ren, Xianzhang Chen, Yujuan Tan, Chengliang Wang

    Abstract: Federated Learning (FL) is a novel distributed machine learning which allows thousands of edge devices to train model locally without uploading data concentrically to the server. But since real federated settings are resource-constrained, FL is encountered with systems heterogeneity which causes a lot of stragglers directly and then leads to significantly accuracy reduction indirectly. To solve th… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: This paper will be presented at IJCNN 2021

  30. arXiv:2010.05649  [pdf, other

    cs.LG cs.AI

    Multivariate Time Series Classification with Hierarchical Variational Graph Pooling

    Authors: Ziheng Duan, Haoyan Xu, Yueyang Wang, Yida Huang, Anni Ren, Zhongbin Xu, Yizhou Sun, Wei Wang

    Abstract: With the advancement of sensing technology, multivariate time series classification (MTSC) has recently received considerable attention. Existing deep learning-based MTSC techniques, which mostly rely on convolutional or recurrent neural networks, are primarily concerned with the temporal dependency of single time series. As a result, they struggle to express pairwise dependencies among multivaria… ▽ More

    Submitted 5 November, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

  31. arXiv:2008.08617  [pdf, other

    cs.LG stat.ML

    MTHetGNN: A Heterogeneous Graph Embedding Framework for Multivariate Time Series Forecasting

    Authors: Yueyang Wang, Ziheng Duan, Yida Huang, Haoyan Xu, Jie Feng, Anni Ren

    Abstract: Multivariate time series forecasting, which analyzes historical time series to predict future trends, can effectively help decision-making. Complex relations among variables in MTS, including static, dynamic, predictable, and latent relations, have made it possible to mining more features of MTS. Modeling complex relations are not only essential in characterizing latent dependency as well as model… ▽ More

    Submitted 14 December, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

  32. arXiv:2008.07730  [pdf, other

    cs.LG stat.ML

    Parallel Extraction of Long-term Trends and Short-term Fluctuation Framework for Multivariate Time Series Forecasting

    Authors: Yifu Zhou, Ziheng Duan, Haoyan Xu, Jie Feng, Anni Ren, Yueyang Wang, Xiaoqian Wang

    Abstract: Multivariate time series forecasting is widely used in various fields. Reasonable prediction results can assist people in planning and decision-making, generate benefits and avoid risks. Normally, there are two characteristics of time series, that is, long-term trend and short-term fluctuation. For example, stock prices will have a long-term upward trend with the market, but there may be a small d… ▽ More

    Submitted 22 March, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

  33. arXiv:2008.01913  [pdf, other

    cs.RO cs.LG eess.SY math.OC

    Generalization Guarantees for Imitation Learning

    Authors: Allen Z. Ren, Sushant Veer, Anirudha Majumdar

    Abstract: Control policies from imitation learning can often fail to generalize to novel environments due to imperfect demonstrations or the inability of imitation learning algorithms to accurately infer the expert's policies. In this paper, we present rigorous generalization guarantees for imitation learning by leveraging the Probably Approximately Correct (PAC)-Bayes framework to provide upper bounds on t… ▽ More

    Submitted 3 December, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Presented at the Conference on Robot Learning (CoRL), 2020

  34. arXiv:1911.08020  [pdf, other

    cs.LG cs.NE

    DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks

    Authors: Ao Ren, Tao Zhang, Yuhao Wang, Sheng Lin, Peiyan Dong, Yen-kuang Chen, Yuan Xie, Yanzhi Wang

    Abstract: The rapidly growing parameter volume of deep neural networks (DNNs) hinders the artificial intelligence applications on resource constrained devices, such as mobile and wearable devices. Neural network pruning, as one of the mainstream model compression techniques, is under extensive study to reduce the number of parameters and computations. In contrast to irregular pruning that incurs high index… ▽ More

    Submitted 20 November, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: This paper has been accepted by AAAI'2020

  35. arXiv:1909.11025  [pdf, other

    cs.AI

    Interpretable Models for Understanding Immersive Simulations

    Authors: Nicholas Hoernle, Kobi Gal, Barbara Grosz, Leilah Lyons, Ada Ren, Andee Rubin

    Abstract: This paper describes methods for comparative evaluation of the interpretability of models of high dimensional time series data inferred by unsupervised machine learning algorithms. The time series data used in this investigation were logs from an immersive simulation like those commonly used in education and healthcare training. The structures learnt by the models provide representations of partic… ▽ More

    Submitted 4 May, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: To be published in IJCAI 2020

  36. arXiv:1907.09077  [pdf, other

    cs.NE cs.ET cs.LG eess.SP

    A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron SuperconductingTechnology

    Authors: Ruizhe Cai, Ao Ren, Olivia Chen, Ning Liu, Caiwen Ding, Xuehai Qian, Jie Han, Wenhui Luo, Nobuyuki Yoshikawa, Yanzhi Wang

    Abstract: The Adiabatic Quantum-Flux-Parametron (AQFP) superconducting technology has been recently developed, which achieves the highest energy efficiency among superconducting logic families, potentially huge gain compared with state-of-the-art CMOS. In 2016, the successful fabrication and testing of AQFP-based circuits with the scale of 83,000 JJs have demonstrated the scalability and potential of implem… ▽ More

    Submitted 21 July, 2019; originally announced July 2019.

  37. arXiv:1902.03562  [pdf, ps, other

    cs.CR

    A Novel Secure Authentication Scheme for Heterogeneous Internet of Thing

    Authors: Jingwei Liu, Ailian Ren, Lihuan Zhang, Rong Sun, Xiaojiang Du, Mohsen Guizani

    Abstract: Today, Internet of Things (IoT) technology is being increasingly popular which is applied in a wide range of industry sectors such as healthcare, transportation and some critical infrastructures. With the widespread applications of IoT technology, people's lives have changed dramatically. Due to its capabilities of sensitive data-aware, information collection, communication and processing, it rais… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: 7 pages, 4 figures

  38. arXiv:1812.11677  [pdf, other

    cs.LG cs.AI cs.AR cs.CV

    ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers

    Authors: Ao Ren, Tianyun Zhang, Shaokai Ye, Jiayu Li, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang

    Abstract: To facilitate efficient embedded and hardware implementations of deep neural networks (DNNs), two important categories of DNN model compression techniques: weight pruning and weight quantization are investigated. The former leverages the redundancy in the number of weights, whereas the latter leverages the redundancy in bit representation of weights. However, there lacks a systematic framework of… ▽ More

    Submitted 30 December, 2018; originally announced December 2018.

  39. arXiv:1805.04142  [pdf, other

    cs.NE cs.ET

    Towards Budget-Driven Hardware Optimization for Deep Convolutional Neural Networks using Stochastic Computing

    Authors: Zhe Li, Ji Li, Ao Ren, Caiwen Ding, Jeffrey Draper, Qinru Qiu, Bo Yuan, Yanzhi Wang

    Abstract: Recently, Deep Convolutional Neural Network (DCNN) has achieved tremendous success in many machine learning applications. Nevertheless, the deep structure has brought significant increases in computation complexity. Largescale deep learning systems mainly operate in high-performance server clusters, thus restricting the application extensions to personal or mobile devices. Previous works on GPU an… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: Accepted by IEEE Computer Society Annual Symposium on VLSI 2018

  40. arXiv:1804.11239  [pdf, other

    cs.DC cs.AR cs.LG cs.NE

    Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs

    Authors: Caiwen Ding, Ao Ren, Geng Yuan, Xiaolong Ma, Jiayu Li, Ning Liu, Bo Yuan, Yanzhi Wang

    Abstract: Both industry and academia have extensively investigated hardware accelerations. In this work, to address the increasing demands in computational capability and memory requirement, we propose structured weight matrices (SWM)-based compression techniques for both \emph{field programmable gate array} (FPGA) and \emph{application-specific integrated circuit} (ASIC) implementations. In algorithm part,… ▽ More

    Submitted 28 March, 2018; originally announced April 2018.

    Comments: 6 pages, 7 figures, GLSVLSI2018

  41. arXiv:1802.01016  [pdf, other

    cs.NE

    An Area and Energy Efficient Design of Domain-Wall Memory-Based Deep Convolutional Neural Networks using Stochastic Computing

    Authors: Xiaolong Ma, Yipeng Zhang, Geng Yuan, Ao Ren, Zhe Li, Jie Han, Jingtong Hu, Yanzhi Wang

    Abstract: With recent trend of wearable devices and Internet of Things (IoTs), it becomes attractive to develop hardware-based deep convolutional neural networks (DCNNs) for embedded applications, which require low power/energy consumptions and small hardware footprints. Recent works demonstrated that the Stochastic Computing (SC) technique can radically simplify the hardware implementation of arithmetic un… ▽ More

    Submitted 3 February, 2018; originally announced February 2018.

  42. arXiv:1802.00824  [pdf

    cs.ET

    Algorithm-Hardware Co-Optimization of the Memristor-Based Framework for Solving SOCP and Homogeneous QCQP Problems

    Authors: Ao Ren, Sijia Liu, Ruizhe Cai, Wujie Wen, Pramod K Varshney, Yanzhi Wang

    Abstract: A memristor crossbar, which is constructed with memristor devices, has the unique ability to change and memorize the state of each of its memristor elements. It also has other highly desirable features such as high density, low power operation and excellent scalability. Hence the memristor crossbar technology can potentially be utilized for developing low-complexity and high-scalability solution f… ▽ More

    Submitted 2 February, 2018; originally announced February 2018.

  43. arXiv:1802.00822  [pdf, other

    cs.LG cs.AR stat.ML

    VIBNN: Hardware Acceleration of Bayesian Neural Networks

    Authors: Ruizhe Cai, Ao Ren, Ning Liu, Caiwen Ding, Luhao Wang, Xuehai Qian, Massoud Pedram, Yanzhi Wang

    Abstract: Bayesian Neural Networks (BNNs) have been proposed to address the problem of model uncertainty in training and inference. By introducing weights associated with conditioned probability distributions, BNNs are capable of resolving the overfitting issue commonly seen in conventional neural networks and allow for small-data training, through the variational inference process. Frequent usage of Gaussi… ▽ More

    Submitted 2 February, 2018; originally announced February 2018.

  44. arXiv:1710.03792  [pdf, other

    cs.AI

    Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations

    Authors: Hongjia Li, Tianshu Wei, Ao Ren, Qi Zhu, Yanzhi Wang

    Abstract: The recent breakthroughs of deep reinforcement learning (DRL) technique in Alpha Go and playing Atari have set a good example in handling large state and actions spaces of complicated control problems. The DRL technique is comprised of (i) an offline deep neural network (DNN) construction phase, which derives the correlation between each state-action pair of the system and its value function, and… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

  45. arXiv:1703.04135  [pdf, other

    cs.CV

    Hardware-Driven Nonlinear Activation for Stochastic Computing Based Deep Convolutional Neural Networks

    Authors: Ji Li, Zihao Yuan, Zhe Li, Caiwen Ding, Ao Ren, Qinru Qiu, Jeffrey Draper, Yanzhi Wang

    Abstract: Recently, Deep Convolutional Neural Networks (DCNNs) have made unprecedented progress, achieving the accuracy close to, or even better than human-level perception in various tasks. There is a timely need to map the latest software DCNNs to application-specific hardware, in order to achieve orders of magnitude improvement in performance, energy efficiency and compactness. Stochastic Computing (SC),… ▽ More

    Submitted 12 March, 2017; originally announced March 2017.

    Comments: This paper is accepted by 2017 International Joint Conference on Neural Networks (IJCNN)

  46. arXiv:1611.05939  [pdf, other

    cs.CV

    SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

    Authors: Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan, Yanzhi Wang

    Abstract: With recent advancing of Internet of Things (IoTs), it becomes very attractive to implement the deep convolutional neural networks (DCNNs) onto embedded/portable systems. Presently, executing the software-based DCNNs requires high-performance server clusters in practice, restricting their widespread deployment on the mobile devices. To overcome this issue, considerable research efforts have been c… ▽ More

    Submitted 31 January, 2017; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: This paper is accepted by 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

  47. arXiv:1507.03000   

    cs.CR

    Toward Practical Differential Privacy in Smart Grid with Capacity-Limited Rechargeable Batteries

    Authors: Zijian Zhang, Zhan Qin, Liehuang Zhu, Wei Jiang, Chen Xu, and Kui Ren

    Abstract: The technology of differential privacy, adding a noise drawn from the Laplace distribution, successfully overcomes a difficulty of keeping both the privacy of individual data and the utility of the statistical result simultaneously. Therefore, it is prevalent to use a rechargeable battery as the noise for achieving differential privacy in the application of smart grid. Unfortunately, to the best o… ▽ More

    Submitted 22 July, 2015; v1 submitted 10 July, 2015; originally announced July 2015.

    Comments: This paper has been withdrawn by the author due to a crucial error in definition 3 and theorem 2