Skip to main content

Showing 1–50 of 85 results for author: Chien, S

.
  1. arXiv:2505.02613  [pdf, ps, other

    eess.IV cs.LG

    Lane-Wise Highway Anomaly Detection

    Authors: Mei Qiu, William Lorenz Reindl, Yaobin Chen, Stanley Chien, Shu Hu

    Abstract: This paper proposes a scalable and interpretable framework for lane-wise highway traffic anomaly detection, leveraging multi-modal time series data extracted from surveillance cameras. Unlike traditional sensor-dependent methods, our approach uses AI-powered vision models to extract lane-specific features, including vehicle count, occupancy, and truck percentage, without relying on costly hardware… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

  2. arXiv:2502.14803  [pdf, other

    cs.RO eess.SY

    Planning, scheduling, and execution on the Moon: the CADRE technology demonstration mission

    Authors: Gregg Rabideau, Joseph Russino, Andrew Branch, Nihal Dhamani, Tiago Stegun Vaquero, Steve Chien, Jean-Pierre de la Croix, Federico Rossi

    Abstract: NASA's Cooperative Autonomous Distributed Robotic Exploration (CADRE) mission, slated for flight to the Moon's Reiner Gamma region in 2025/2026, is designed to demonstrate multi-agent autonomous exploration of the Lunar surface and sub-surface. A team of three robots and a base station will autonomously explore a region near the lander, collecting the data required for 3D reconstruction of the sur… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: To be presented at AAMAS 2025

  3. arXiv:2501.13375  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement

    Authors: Meng-Ping Lin, Jen-Cheng Hou, Chia-Wei Chen, Shao-Yi Chien, Jun-Cheng Chen, Xugang Lu, Yu Tsao

    Abstract: Speech enhancement (SE) aims to improve the quality and intelligibility of speech in noisy environments. Recent studies have shown that incorporating visual cues in audio signal processing can enhance SE performance. Given that human speech communication naturally involves audio, visual, and linguistic modalities, it is reasonable to expect additional improvements by integrating linguistic informa… ▽ More

    Submitted 26 May, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

  4. arXiv:2411.06297  [pdf, other

    cs.CV

    Adaptive Aspect Ratios with Patch-Mixup-ViT-based Vehicle ReID

    Authors: Mei Qiu, Lauren Ann Christopher, Stanley Chien, Lingxi Li

    Abstract: Vision Transformers (ViTs) have shown exceptional performance in vehicle re-identification (ReID) tasks. However, non-square aspect ratios of image or video inputs can negatively impact re-identification accuracy. To address this challenge, we propose a novel, human perception driven, and general ViT-based ReID framework that fuses models trained on various aspect ratios. Our key contributions are… ▽ More

    Submitted 9 November, 2024; originally announced November 2024.

  5. arXiv:2409.14430  [pdf, other

    cs.CV cs.AI

    Pomo3D: 3D-Aware Portrait Accessorizing and More

    Authors: Tzu-Chieh Liu, Chih-Ting Liu, Shao-Yi Chien

    Abstract: We propose Pomo3D, a 3D portrait manipulation framework that allows free accessorizing by decomposing and recomposing portraits and accessories. It enables the avatars to attain out-of-distribution (OOD) appearances of simultaneously wearing multiple accessories. Existing methods still struggle to offer such explicit and fine-grained editing; they either fail to generate additional objects on give… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  6. arXiv:2409.13953  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Training Large ASR Encoders with Differential Privacy

    Authors: Geeticka Chauhan, Steve Chien, Om Thakkar, Abhradeep Thakurta, Arun Narayanan

    Abstract: Self-supervised learning (SSL) methods for large speech models have proven to be highly effective at ASR. With the interest in public deployment of large pre-trained models, there is a rising concern for unintended memorization and leakage of sensitive data points from the training data. In this paper, we apply differentially private (DP) pre-training to a SOTA Conformer-based encoder, and study i… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: In proceedings of the IEEE Spoken Language Technologies Workshop, 2024

  7. arXiv:2407.09966  [pdf, other

    cs.CV eess.IV

    Optimizing ROI Benefits Vehicle ReID in ITS

    Authors: Mei Qiu, Lauren Ann Christopher, Lingxi Li, Stanley Chien, Yaobin Chen

    Abstract: Vehicle re-identification (ReID) is a computer vision task that matches the same vehicle across different cameras or viewpoints in a surveillance system. This is crucial for Intelligent Transportation Systems (ITS), where the effectiveness is influenced by the regions from which vehicle images are cropped. This study explores whether optimal vehicle detection regions, guided by detection confidenc… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  8. arXiv:2407.04688  [pdf, other

    cs.CV

    Enhancing Vehicle Re-identification and Matching for Weaving Analysis

    Authors: Mei Qiu, Wei Lin, Stanley Chien, Lauren Christopher, Yaobin Chen, Shu Hu

    Abstract: Vehicle weaving on highways contributes to traffic congestion, raises safety issues, and underscores the need for sophisticated traffic management systems. Current tools are inadequate in offering precise and comprehensive data on lane-specific weaving patterns. This paper introduces an innovative method for collecting non-overlapping video data in weaving zones, enabling the generation of quantit… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2406.15686  [pdf, other

    cs.CR cs.NI

    Transport-Level Encryption in Datacenter Networks

    Authors: Tianyi Gao, Xinshu Ma, Suhas Narreddy, Eugenio Luo, Steven W. D. Chien, Michio Honda

    Abstract: Cloud applications need network data encryption to isolate from other tenants and protect their data from potential eavesdroppers in the network infrastructure. This paper presents SDT, a protocol design for emerging datacenter transport protocols to integrate data encryption while using existing NIC offloading designed for TLS over TCP. Therefore, SDT could enable a deployment path of new transpo… ▽ More

    Submitted 24 September, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  10. arXiv:2404.15212  [pdf, other

    cs.CV eess.IV

    Real-time Lane-wise Traffic Monitoring in Optimal ROIs

    Authors: Mei Qiu, Wei Lin, Lauren Ann Christopher, Stanley Chien, Yaobin Chen, Shu Hu

    Abstract: In the US, thousands of Pan, Tilt, and Zoom (PTZ) traffic cameras monitor highway conditions. There is a great interest in using these highway cameras to gather valuable road traffic data to support traffic analysis and decision-making for highway safety and efficient traffic management. However, there are too many cameras for a few human traffic operators to effectively monitor, so a fully automa… ▽ More

    Submitted 28 March, 2024; originally announced April 2024.

  11. arXiv:2404.06135  [pdf, other

    cs.CV

    Efficient Concertormer for Image Deblurring and Beyond

    Authors: Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang

    Abstract: The Transformer architecture has achieved remarkable success in natural language processing and high-level vision tasks over the past few years. However, the inherent complexity of self-attention is quadratic to the size of the image, leading to unaffordable computational costs for high-resolution vision tasks. In this paper, we introduce Concertormer, featuring a novel Concerto Self-Attention (CS… ▽ More

    Submitted 3 December, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  12. arXiv:2401.14576  [pdf

    cs.DC cs.PF

    iFast: Host-Side Logging for Scientific Applications

    Authors: Steven W. D. Chien, Kento Sato, Artur Podobas, Niclas Jansson, Stefano Markidis, Michio Honda

    Abstract: We have seen an increase in the heterogeneity of storage technologies potentially available to scientific applications, such as burst buffers, managed cloud parallel file systems (PFS), and object stores. However, those applications cannot easily utilize those technologies, because they are designed for traditional HPC systems that offer very high remote storage and network bandwidth. We present i… ▽ More

    Submitted 2 August, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Submitted to VLDB 2025

  13. arXiv:2309.07173  [pdf, other

    cs.LG cs.CV

    Using Unsupervised and Supervised Learning and Digital Twin for Deep Convective Ice Storm Classification

    Authors: Jason Swope, Steve Chien, Emily Dunkel, Xavier Bosch-Lluis, Qing Yue, William Deal

    Abstract: Smart Ice Cloud Sensing (SMICES) is a small-sat concept in which a primary radar intelligently targets ice storms based on information collected by a lookahead radiometer. Critical to the intelligent targeting is accurate identification of storm/cloud types from eight bands of radiance collected by the radiometer. The cloud types of interest are: clear sky, thin cirrus, cirrus, rainy anvil, and co… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  14. arXiv:2308.07717  [pdf, other

    cs.CV

    Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global Pixels

    Authors: Ching-Hsun Tseng, Shao-Ju Chien, Po-Shen Wang, Shin-Jye Lee, Wei-Huan Hu, Bin Pu, Xiao-jun Zeng

    Abstract: Motion mode (M-mode) recording is an essential part of echocardiography to measure cardiac dimension and function. However, the current diagnosis cannot build an automatic scheme, as there are three fundamental obstructs: Firstly, there is no open dataset available to build the automation for ensuring constant results and bridging M-mode echocardiography with real-time instance segmentation (RIS);… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  15. arXiv:2307.16141  [pdf, other

    cs.LG

    Pupil Learning Mechanism

    Authors: Rua-Huan Tsaih, Yu-Hang Chien, Shih-Yi Chien

    Abstract: Studies on artificial neural networks rarely address both vanishing gradients and overfitting issues. In this study, we follow the pupil learning procedure, which has the features of interpreting, picking, understanding, cramming, and organizing, to derive the pupil learning mechanism (PLM) by which to modify the network structure and weights of 2-layer neural networks (2LNNs). The PLM consists of… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

  16. arXiv:2306.04931  [pdf

    eess.SY

    A New Scoring Method for the Evaluation of Vehicle Road Departure Detection Systems

    Authors: Dan Shen, Lingxi Li, Stanley Chien, Yaobin Chen, Rini Sherony

    Abstract: Road departure detection systems (RDDSs) for eliminating unintentional road departure collisions have been developed and equipped on some commercial vehicles in recent years. In order to provide a standardized and objective performance evaluation of RDDSs without the affections of systems complex nature of RDDSs and the design requirements, this paper proposes the development of the scoring method… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  17. arXiv:2303.08544  [pdf, other

    cs.GT cs.CR cs.NI

    Joint Security-vs-QoS Game Theoretical Optimization for Intrusion Response Mechanisms for Future Network Systems

    Authors: Arash Bozorgchenani, Charilaos C. Zarakovitis, Su Fong Chien, Qiang Ni, Antonios Gouglidis, Wissam Mallouli, Heng Siong Lim

    Abstract: Network connectivity exposes the network infrastructure and assets to vulnerabilities that attackers can exploit. Protecting network assets against attacks requires the application of security countermeasures. Nevertheless, employing countermeasures incurs costs, such as monetary costs, along with time and energy to prepare and deploy the countermeasures. Thus, an Intrusion Response System (IRS) s… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 12 pages, 8 figures

  18. arXiv:2303.03177  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis

    Authors: Vikramjit Mitra, Vasudha Kowtha, Hsiang-Yun Sherry Chien, Erdrin Azemi, Carlos Avendano

    Abstract: Pre-trained model representations have demonstrated state-of-the-art performance in speech recognition, natural language processing, and other applications. Speech models, such as Bidirectional Encoder Representations from Transformers (BERT) and Hidden units BERT (HuBERT), have enabled generating lexical and acoustic representations to benefit speech recognition applications. We investigated the… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 5 pages, conference

  19. arXiv:2303.00654  [pdf, other

    cs.LG cs.CR stat.ML

    How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy

    Authors: Natalia Ponomareva, Hussein Hazimeh, Alex Kurakin, Zheng Xu, Carson Denison, H. Brendan McMahan, Sergei Vassilvitskii, Steve Chien, Abhradeep Thakurta

    Abstract: ML models are ubiquitous in real world applications and are a constant focus of research. At the same time, the community has started to realize the importance of protecting the privacy of ML training data. Differential Privacy (DP) has become a gold standard for making formal statements about data anonymization. However, while some adoption of DP has happened in industry, attempts to apply DP t… ▽ More

    Submitted 31 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Journal ref: Journal of Artificial Intelligence Research 77 (2023) 1113-1201

  20. arXiv:2302.14832  [pdf

    astro-ph.IM astro-ph.EP physics.geo-ph

    Planetary Exploration Horizon 2061 Report Chapter 5: Enabling technologies for planetary exploration

    Authors: Manuel Grande, Linli Guo, Michel Blanc, Advenit Makaya, Sami Asmar, David Atkinson, Anne Bourdon, Pascal Chabert, Steve Chien, John Day, Alberto G. Fairen, Anthony Freeman, Antonio Genova, Alain Herique, Wlodek Kofman, Joseph Lazio, Olivier Mousis, Gian Gabriele Ori, Victor Parro, Robert Preston, Jose A Rodriguez-Manfredi, Veerle Sterken, Keith Stephenson, Joshua Vander Hook, Hunter Waite , et al. (1 additional authors not shown)

    Abstract: The main objective of this chapter is to present an overview of the different areas of key technologies that will be needed to fly the technically most challenging of the representative missions identified in chapter 4 (the Pillar 2 Horizon 2061 report). It starts with a description of the future scientific instruments which will address the key questions of Horizon 2061 described in chapter 3 (th… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 100 pages, 23 figures, Horizon 2061 is a science-driven, foresight exercise, for future scientific investigations

  21. arXiv:2301.08248  [pdf, other

    cs.RO astro-ph.EP astro-ph.IM cs.AI cs.HC

    Enabling Astronaut Self-Scheduling using a Robust Advanced Modelling and Scheduling system: an assessment during a Mars analogue mission

    Authors: Michael Saint-Guillain, Jean Vanderdonckt, Nicolas Burny, Vladimir Pletser, Tiago Vaquero, Steve Chien, Alexander Karl, Jessica Marquez, John Karasinski, Cyril Wain, Audrey Comein, Ignacio S. Casla, Jean Jacobs, Julien Meert, Cheyenne Chamart, Sirga Drouet, Julie Manon

    Abstract: Human long duration exploration missions (LDEMs) raise a number of technological challenges. This paper addresses the question of the crew autonomy: as the distances increase, the communication delays and constraints tend to prevent the astronauts from being monitored and supported by a real time ground control. Eventually, future planetary missions will necessarily require a form of astronaut sel… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

  22. arXiv:2212.12660  [pdf

    eess.SY cs.CV

    Risk assessment and mitigation of e-scooter crashes with naturalistic driving data

    Authors: Avinash Prabu, Zhengming Zhang, Renran Tian, Stanley Chien, Lingxi Li, Yaobin Chen, Rini Sherony

    Abstract: Recently, e-scooter-involved crashes have increased significantly but little information is available about the behaviors of on-road e-scooter riders. Most existing e-scooter crash research was based on retrospectively descriptive media reports, emergency room patient records, and crash reports. This paper presents a naturalistic driving study with a focus on e-scooter and vehicle encounters. The… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  23. SceNDD: A Scenario-based Naturalistic Driving Dataset

    Authors: Avinash Prabu, Nitya Ranjan, Lingxi Li, Renran Tian, Stanley Chien, Yaobin Chen, Rini Sherony

    Abstract: In this paper, we propose SceNDD: a scenario-based naturalistic driving dataset that is built upon data collected from an instrumented vehicle in downtown Indianapolis. The data collection was completed in 68 driving sessions with different drivers, where each session lasted about 20--40 minutes. The main goal of creating this dataset is to provide the research community with real driving scenario… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Conference: 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC). Link: https://ieeexplore.ieee.org/document/9921953

  24. arXiv:2212.11979  [pdf

    eess.SY cs.CV

    A Wearable Data Collection System for Studying Micro-Level E-Scooter Behavior in Naturalistic Road Environment

    Authors: Avinash Prabu, Dan Shen, Renran Tian, Stanley Chien, Lingxi Li, Yaobin Chen, Rini Sherony

    Abstract: As one of the most popular micro-mobility options, e-scooters are spreading in hundreds of big cities and college towns in the US and worldwide. In the meantime, e-scooters are also posing new challenges to traffic safety. In general, e-scooters are suggested to be ridden in bike lanes/sidewalks or share the road with cars at the maximum speed of about 15-20 mph, which is more flexible and much fa… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Conference: Fast-zero'21, Kanazawa, Japan Date of publication: Sep 2021 Publisher: JSAE

    Journal ref: https://tech.jsae.or.jp/paperinfo/en/content/conf2021-02.11/

  25. arXiv:2211.02625  [pdf, other

    eess.SP cs.LG

    MAEEG: Masked Auto-encoder for EEG Representation Learning

    Authors: Hsiang-Yun Sherry Chien, Hanlin Goh, Christopher M. Sandino, Joseph Y. Cheng

    Abstract: Decoding information from bio-signals such as EEG, using machine learning has been a challenge due to the small data-sets and difficulty to obtain labels. We propose a reconstruction-based self-supervised learning model, the masked auto-encoder for EEG (MAEEG), for learning EEG representations by learning to reconstruct the masked EEG features using a transformer architecture. We found that MAEEG… ▽ More

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures, accepted by Workshop on Learning from Time Series for Health, NeurIPS2022 as poster presentation

  26. arXiv:2207.03334  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation

    Authors: Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi

    Abstract: Estimating dimensional emotions, such as activation, valence and dominance, from acoustic speech signals has been widely explored over the past few years. While accurate estimation of activation and dominance from speech seem to be possible, the same for valence remains challenging. Previous research has shown that the use of lexical information can improve valence estimation performance. Lexical… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: 5 pages, 3 figures, Interspeech 2022

  27. Temporal Multimodal Multivariate Learning

    Authors: Hyoshin Park, Justice Darko, Niharika Deshpande, Venktesh Pandey, Hui Su, Masahiro Ono, Dedrick Barkely, Larkin Folsom, Derek Posselt, Steve Chien

    Abstract: We introduce temporal multimodal multivariate learning, a new family of decision making models that can indirectly learn and transfer online information from simultaneous observations of a probability distribution with more than one peak or more than one outcome variable from one time stage to another. We approximate the posterior by sequentially removing additional uncertainties across different… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 11 pages, 12 figures, SIGKDD Conference on Knowledge Discovery and Data Mining,

    ACM Class: F.4.1

  28. arXiv:2204.09606  [pdf, other

    cs.CL cs.CR cs.LG cs.SD eess.AS

    Detecting Unintended Memorization in Language-Model-Fused ASR

    Authors: W. Ronny Huang, Steve Chien, Om Thakkar, Rajiv Mathews

    Abstract: End-to-end (E2E) models are often being accompanied by language models (LMs) via shallow fusion for boosting their overall quality as well as recognition of rare words. At the same time, several prior works show that LMs are susceptible to unintentionally memorizing rare or unique sequences in the training data. In this work, we design a framework for detecting memorization of random textual seque… ▽ More

    Submitted 28 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Interspeech 2022

  29. arXiv:2201.12328  [pdf, other

    cs.LG

    Toward Training at ImageNet Scale with Differential Privacy

    Authors: Alexey Kurakin, Shuang Song, Steve Chien, Roxana Geambasu, Andreas Terzis, Abhradeep Thakurta

    Abstract: Differential privacy (DP) is the de facto standard for training machine learning (ML) models, including neural networks, while ensuring the privacy of individual examples in the training set. Despite a rich literature on how to train ML models with differential privacy, it remains extremely challenging to train real-life, large neural networks with both reasonable accuracy and privacy. We set ou… ▽ More

    Submitted 8 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 25 pages, 7 figures. Code available at https://github.com/google-research/dp-imagenet

  30. arXiv:2112.12496  [pdf, other

    cs.CV cs.AI

    FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition

    Authors: Chih-Ting Liu, Chien-Yi Wang, Shao-Yi Chien, Shang-Hong Lai

    Abstract: Current state-of-the-art deep learning based face recognition (FR) models require a large number of face identities for central training. However, due to the growing privacy awareness, it is prohibited to access the face images on user devices to continually improve face recognition models. Federated Learning (FL) is a technique to address the privacy issue, which can collaboratively optimize the… ▽ More

    Submitted 21 March, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: This paper was accepted by AAAI 2022 Conference on Artificial Intelligence and selected as an oral paper

  31. arXiv:2112.06879  [pdf, other

    cs.RO cs.MA

    Multi-Robot On-site Shared Analytics Information and Computing

    Authors: Joshua Vander Hook, Federico Rossi, Tiago Vaquero, Martina Troesch, Marc Sanchez Net, Joshua Schoolcraft, Jean-Pierre de la Croix, Steve Chien

    Abstract: Computation load-sharing across a network of heterogeneous robots is a promising approach to increase robots capabilities and efficiency as a team in extreme environments. However, in such environments, communication links may be intermittent and connections to the cloud or internet may be nonexistent. In this paper we introduce a communication-aware, computation task scheduling problem for multi-… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 14 pages, 11 figures. Extended version of journal submission in preparation

  32. arXiv:2112.03570  [pdf, other

    cs.CR cs.LG

    Membership Inference Attacks From First Principles

    Authors: Nicholas Carlini, Steve Chien, Milad Nasr, Shuang Song, Andreas Terzis, Florian Tramer

    Abstract: A membership inference attack allows an adversary to query a trained machine learning model to predict whether or not a particular example was contained in the model's training dataset. These attacks are currently evaluated using average-case "accuracy" metrics that fail to characterize whether the attack can confidently identify any members of the training set. We argue that attacks should instea… ▽ More

    Submitted 12 April, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

  33. arXiv:2111.13876  [pdf, other

    cs.CV

    Learning Discriminative Shrinkage Deep Networks for Image Deconvolution

    Authors: Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang

    Abstract: Most existing methods usually formulate the non-blind deconvolution problem into a maximum-a-posteriori framework and address it by manually designing kinds of regularization terms and data terms of the latent clear images. However, explicitly designing these two terms is quite challenging and usually leads to complex optimization problems which are difficult to solve. In this paper, we propose an… ▽ More

    Submitted 20 July, 2022; v1 submitted 27 November, 2021; originally announced November 2021.

  34. arXiv:2111.12885  [pdf, other

    cs.DC

    A Dense Tensor Accelerator with Data Exchange Mesh for DNN and Vision Workloads

    Authors: Yu-Sheng Lin, Wei-Chao Chen. Chia-Lin Yang, Shao-Yi Chien

    Abstract: We propose a dense tensor accelerator called VectorMesh, a scalable, memory-efficient architecture that can support a wide variety of DNN and computer vision workloads. Its building block is a tile execution unit~(TEU), which includes dozens of processing elements~(PEs) and SRAM buffers connected through a butterfly network. A mesh of FIFOs between the TEUs facilitates data exchange between tiles… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  35. arXiv:2111.10970  [pdf, other

    cs.RO cs.AI cs.HC eess.SY

    Operations for Autonomous Spacecraft

    Authors: Rebecca Castano, Tiago Vaquero, Federico Rossi, Vandi Verma, Ellen Van Wyk, Dan Allard, Bennett Huffmann, Erin M. Murphy, Nihal Dhamani, Robert A. Hewitt, Scott Davidoff, Rashied Amini, Anthony Barrett, Julie Castillo-Rogez, Steve A. Chien, Mathieu Choukroun, Alain Dadaian, Raymond Francis, Benjamin Gorr, Mark Hofstadter, Mitch Ingham, Cristina Sorice, Iain Tierney

    Abstract: Onboard autonomy technologies such as planning and scheduling, identification of scientific targets, and content-based data summarization, will lead to exciting new space science missions. However, the challenge of operating missions with such onboard autonomous capabilities has not been studied to a level of detail sufficient for consideration in mission concepts. These autonomy capabilities will… ▽ More

    Submitted 21 November, 2021; originally announced November 2021.

    Comments: 16 pages, 18 Figures, 1 Table, to be published in IEEE Aerospace 2022 (AeroConf 2022)

    Journal ref: Proceedings of the 2022 IEEE Aerospace Conference (IEEE AERO 2022), 1-20

  36. arXiv:2110.02017  [pdf

    physics.optics

    In-plane subwavelength near field optical capsule for lab-on-a-chip optical nano-tweezer

    Authors: Oleg V. Minin, Shuo-Chih Chien, Wei-Yu Chen, Cheng-Yang Liu, Igor V. Minin

    Abstract: In this letter, we propose a new proof-of-concept of optical nano-tweezer on the basis of a pair of dielectric rectangular rods capable of generating a novel class of controlled finite-volume near field light capsules. The finite-difference time-domain simulations of light spatial structure and optical trapping forces of the gold nanoparticle immersed in water demonstrate the physical concept of a… ▽ More

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: 5 pages, 5 Figures

    MSC Class: 65Z05 ACM Class: J.2

    Journal ref: Optics Letters 47, 794-797 (2022)

  37. arXiv:2107.09802  [pdf, other

    cs.LG cs.CR stat.ML

    Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates

    Authors: Steve Chien, Prateek Jain, Walid Krichene, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of differentially private (DP) matrix completion under user-level privacy. We design a joint differentially private variant of the popular Alternating-Least-Squares (ALS) method that achieves: i) (nearly) optimal sample complexity for matrix completion (in terms of number of items, users), and ii) the best known privacy/utility trade-off both theoretically, as well as on bench… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  38. arXiv:2107.06676  [pdf, other

    cs.LG cs.CE cs.DC cs.NE

    Higgs Boson Classification: Brain-inspired BCPNN Learning with StreamBrain

    Authors: Martin Svedin, Artur Podobas, Steven W. D. Chien, Stefano Markidis

    Abstract: One of the most promising approaches for data analysis and exploration of large data sets is Machine Learning techniques that are inspired by brain models. Such methods use alternative learning rules potentially more efficiently than established learning rules. In this work, we focus on the potential of brain-inspired ML for exploiting High-Performance Computing (HPC) resources to solve ML problem… ▽ More

    Submitted 17 August, 2021; v1 submitted 14 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at The 2nd Workshop on Artificial Intelligence and Machine Learning for Scientific Applications (AI4S 2021)

  39. arXiv:2106.10465  [pdf, other

    cs.CV

    Interactive Object Segmentation with Dynamic Click Transform

    Authors: Chun-Tse Lin, Wei-Chih Tu, Chih-Ting Liu, Shao-Yi Chien

    Abstract: In the interactive segmentation, users initially click on the target object to segment the main body and then provide corrections on mislabeled regions to iteratively refine the segmentation masks. Most existing methods transform these user-provided clicks into interaction maps and concatenate them with image as the input tensor. Typically, the interaction maps are determined by measuring the dist… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: This paper was accepted by IEEE International Conference on Image Processing (ICIP) 2021

  40. arXiv:2106.07204  [pdf, other

    cs.CV

    Hard Samples Rectification for Unsupervised Cross-domain Person Re-identification

    Authors: Chih-Ting Liu, Man-Yu Lee, Tsai-Shien Chen, Shao-Yi Chien

    Abstract: Person re-identification (re-ID) has received great success with the supervised learning methods. However, the task of unsupervised cross-domain re-ID is still challenging. In this paper, we propose a Hard Samples Rectification (HSR) learning scheme which resolves the weakness of original clustering-based methods being vulnerable to the hard positive and negative samples in the target unlabelled d… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: This paper was accepted by IEEE International Conference on Image Processing (ICIP) 2021

  41. arXiv:2106.05373  [pdf, other

    cs.DC cs.LG cs.NE

    StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs

    Authors: Artur Podobas, Martin Svedin, Steven W. D. Chien, Ivy B. Peng, Naresh Balaji Ravichandran, Pawel Herman, Anders Lansner, Stefano Markidis

    Abstract: The modern deep learning method based on backpropagation has surged in popularity and has been used in multiple domains and application areas. At the same time, there are other -- less-known -- machine learning algorithms with a mature and solid theoretical foundation whose performance remains unexplored. One such example is the brain-like Bayesian Confidence Propagation Neural Network (BCPNN). In… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at the International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART 2021)

  42. arXiv:2106.04979  [pdf

    cs.DC

    Benchmarking the Nvidia GPU Lineage: From Early K80 to Modern A100 with Asynchronous Memory Transfers

    Authors: Martin Svedin, Steven W. D. Chien, Gibson Chikafa, Niclas Jansson, Artur Podobas

    Abstract: For many, Graphics Processing Units (GPUs) provides a source of reliable computing power. Recently, Nvidia introduced its 9th generation HPC-grade GPUs, the Ampere 100, claiming significant performance improvements over previous generations, particularly for AI-workloads, as well as introducing new architectural features such as asynchronous data movement. But how well does the A100 perform on non… ▽ More

    Submitted 3 July, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 7 pages

  43. arXiv:2106.03719  [pdf, other

    cs.CV

    Incremental False Negative Detection for Contrastive Learning

    Authors: Tsai-Shien Chen, Wei-Chih Hung, Hung-Yu Tseng, Shao-Yi Chien, Ming-Hsuan Yang

    Abstract: Self-supervised learning has recently shown great potential in vision tasks through contrastive learning, which aims to discriminate each image, or instance, in the dataset. However, such instance-level learning ignores the semantic relationship among instances and sometimes undesirably repels the anchor from the semantically similar samples, termed as "false negatives". In this work, we show that… ▽ More

    Submitted 16 March, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  44. arXiv:2105.10678  [pdf, other

    cs.CV

    Video-based Person Re-identification without Bells and Whistles

    Authors: Chih-Ting Liu, Jun-Cheng Chen, Chu-Song Chen, Shao-Yi Chien

    Abstract: Video-based person re-identification (Re-ID) aims at matching the video tracklets with cropped video frames for identifying the pedestrians under different cameras. However, there exists severe spatial and temporal misalignment for those cropped tracklets due to the imperfect detection and tracking results generated with obsolete methods. To address this issue, we present a simple re-Detect and Li… ▽ More

    Submitted 22 September, 2021; v1 submitted 22 May, 2021; originally announced May 2021.

    Comments: This paper was accepted by CVPR 2021 Biometrics Workshop

  45. arXiv:2105.05944  [pdf, other

    cs.LG

    Slower is Better: Revisiting the Forgetting Mechanism in LSTM for Slower Information Decay

    Authors: Hsiang-Yun Sherry Chien, Javier S. Turek, Nicole Beckage, Vy A. Vo, Christopher J. Honey, Ted L. Willke

    Abstract: Sequential information contains short- to long-range dependencies; however, learning long-timescale information has been a challenge for recurrent neural networks. Despite improvements in long short-term memory networks (LSTMs), the forgetting mechanism results in the exponential decay of information, limiting their capacity to capture long-timescale information. Here, we propose a power law forge… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 16 pages, 10 figures

  46. arXiv:2012.06717  [pdf, other

    cs.CL

    Mapping the Timescale Organization of Neural Language Models

    Authors: Hsiang-Yun Sherry Chien, Jinhan Zhang, Christopher. J. Honey

    Abstract: In the human brain, sequences of language input are processed within a distributed and hierarchical architecture, in which higher stages of processing encode contextual information over longer timescales. In contrast, in recurrent neural networks which perform natural language processing, we know little about how the multiple timescales of contextual information are functionally organized. Therefo… ▽ More

    Submitted 17 March, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: 23 pages, 4 main figures, 10 appendix figures; published as a conference paper at ICLR 2021

  47. arXiv:2012.01874  [pdf, other

    eess.IV cs.CV

    How to Exploit the Transferability of Learned Image Compression to Conventional Codecs

    Authors: Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien

    Abstract: Lossy image compression is often limited by the simplicity of the chosen loss measure. Recent research suggests that generative adversarial networks have the ability to overcome this limitation and serve as a multi-modal loss, especially for textures. Together with learned image compression, these two techniques can be used to great effect when relaxing the commonly employed tight measures of dist… ▽ More

    Submitted 6 March, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 10 pages, 5 figures

  48. arXiv:2011.08733  [pdf, ps, other

    cs.AI

    Using Explainable Scheduling for the Mars 2020 Rover Mission

    Authors: Jagriti Agrawal, Amruta Yelamanchili, Steve Chien

    Abstract: Understanding the reasoning behind the behavior of an automated scheduling system is essential to ensure that it will be trusted and consequently used to its full capabilities in critical applications. In cases where a scheduler schedules activities in an invalid location, it is usually easy for the user to infer the missing constraint by inspecting the schedule with the invalid activity to determ… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Submitted to the International Workshop of Explainable AI Planning (XAIP) at the International Conference on Automated Planning and Scheduling (ICAPS) 2020

  49. arXiv:2010.05810  [pdf, other

    cs.CV

    Viewpoint-Aware Channel-Wise Attentive Network for Vehicle Re-Identification

    Authors: Tsai-Shien Chen, Man-Yu Lee, Chih-Ting Liu, Shao-Yi Chien

    Abstract: Vehicle re-identification (re-ID) matches images of the same vehicle across different cameras. It is fundamentally challenging because the dramatically different appearance caused by different viewpoints would make the framework fail to match two vehicles of the same identity. Most existing works solved the problem by extracting viewpoint-aware feature via spatial attention mechanism, which, yet,… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: CVPR Workshop 2020

  50. arXiv:2010.01892  [pdf, other

    cs.CV

    Joint Pruning & Quantization for Extremely Sparse Neural Networks

    Authors: Po-Hsiang Yu, Sih-Sian Wu, Jan P. Klopp, Liang-Gee Chen, Shao-Yi Chien

    Abstract: We investigate pruning and quantization for deep neural networks. Our goal is to achieve extremely high sparsity for quantized networks to enable implementation on low cost and low power accelerator hardware. In a practical scenario, there are particularly many applications for dense prediction tasks, hence we choose stereo depth estimation as target. We propose a two stage pruning and quantizat… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 13 page, 16 figures