Showing 1–50 of 66 results for author: Hsieh, J

Searching in archive cs.

  1. arXiv:2506.22864  [pdf, ps, other]

    cs.CV cs.AI cs.CL

    Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval

    Authors: Li-Cheng Shen, Jih-Kang Hsieh, Wei-Hua Li, Chu-Song Chen

    Abstract: Text-to-image retrieval (TIR) aims to find relevant images based on a textual query, but existing approaches are primarily based on whole-image captions and lack interpretability. Meanwhile, referring expression segmentation (RES) enables precise object localization based on natural language descriptions but is computationally expensive when applied across large image collections. To bridge this g…

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: ICMR 2025

  2. arXiv:2506.10361  [pdf, ps, other]

    cs.CV

    FaceLiVT: Face Recognition using Linear Vision Transformer with Structural Reparameterization For Mobile Device

    Authors: Novendra Setyawan, Chi-Chia Sun, Mao-Hsiu Hsu, Wen-Kai Kuo, Jun-Wei Hsieh

    Abstract: This paper introduces FaceLiVT, a lightweight yet powerful face recognition model that integrates a hybrid Convolution Neural Network (CNN)-Transformer architecture with an innovative and lightweight Multi-Head Linear Attention (MHLA) mechanism. By combining MHLA alongside a reparameterized token mixer, FaceLiVT effectively reduces computational complexity while preserving competitive accuracy. Ex…

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 2025 ICIP

  3. arXiv:2505.17360  [pdf, ps, other]

    cs.CC cs.DS

    The Quasi-Polynomial Low-Degree Conjecture is False

    Authors: Rares-Darius Buhai, Jun-Ting Hsieh, Aayush Jain, Pravesh K. Kothari

    Abstract: There is a growing body of work on proving hardness results for average-case estimation problems by bounding the low-degree advantage (LDA) - a quantitative estimate of the closeness of low-degree moments - between a null distribution and a related planted distribution. Such hardness results are now ubiquitous not only for foundational average-case problems but also central questions in statistics…

    Submitted 22 May, 2025; originally announced May 2025.

  4. arXiv:2504.15087  [pdf, other]

    math.CO cs.CC cs.DM cs.DS math.GR

    Explicit Lossless Vertex Expanders

    Authors: Jun-Ting Hsieh, Alexander Lubotzky, Sidhanth Mohanty, Assaf Reiner, Rachel Yun Zhang

    Abstract: We give the first construction of explicit constant-degree lossless vertex expanders. Specifically, for any $\varepsilon > 0$ and sufficiently large $d$, we give an explicit construction of an infinite family of $d$-regular graphs where every small set $S$ of vertices has $(1-\varepsilon)d|S|$ neighbors (which implies $(1-2\varepsilon)d|S|$ unique-neighbors). Our results also extend naturally to c…

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 33 pages, 3 figures

  5. arXiv:2504.13392  [pdf, ps, other]

    cs.CV cs.HC

    POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation

    Authors: Evans Xu Han, Alice Qian Zhang, Hong Shen, Haiyi Zhu, Paul Pu Liang, Jane Hsieh

    Abstract: State-of-the-art visual generative AI tools hold immense potential to assist users in the early ideation stages of creative tasks -- offering the ability to generate (rather than search for) novel and unprecedented (instead of existing) images of considerable quality that also adhere to boundless combinations of user specifications. However, many large-scale text-to-image systems are designed for…

    Submitted 17 April, 2025; originally announced April 2025.

  6. arXiv:2503.15512  [pdf, other]

    cs.HC cs.AI cs.LG

    Beyond Accuracy, SHAP, and Anchors -- On the difficulty of designing effective end-user explanations

    Authors: Zahra Abba Omar, Nadia Nahar, Jacob Tjaden, Inès M. Gilles, Fikir Mekonnen, Jane Hsieh, Christian Kästner, Alka Menon

    Abstract: Modern machine learning produces models that are impossible for users or developers to fully understand -- raising concerns about trust, oversight and human dignity. Transparency and explainability methods aim to provide some help in understanding models, but it remains challenging for developers to design explanations that are understandable to target users and effective for their purpose. Emergi…

    Submitted 28 January, 2025; originally announced March 2025.

  7. arXiv:2502.13221  [pdf, other]

    cs.LG cs.AI cs.CY cs.GT

    Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations

    Authors: Lee Cohen, Jack Hsieh, Connie Hong, Judy Hanwen Shen

    Abstract: In an era of increasingly capable foundation models, job seekers are turning to generative AI tools to enhance their application materials. However, unequal access to and knowledge about generative AI tools can harm both employers and candidates by reducing the accuracy of hiring decisions and giving some candidates an unfair advantage. To address these challenges, we introduce a new variant of th…

    Submitted 18 February, 2025; originally announced February 2025.

  8. arXiv:2502.07417  [pdf, other]

    cs.CV

    Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving

    Authors: Novendra Setyawan, Ghufron Wahyu Kurniawan, Chi-Chia Sun, Wen-Kai Kuo, Jun-Wei Hsieh

    Abstract: The perception system plays a critical role in an autonomous driving system for ensuring safety. The driving scene perception system fundamentally represents an object detection task that requires achieving a balance between accuracy and processing speed. Many contemporary methods focus on improving detection accuracy but often overlook the importance of real-time detection capabilities when comput…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Under Review on IEEE Transactions on Intelligent Transportation Systems

  9. MicroViT: A Vision Transformer with Low Complexity Self Attention for Edge Device

    Authors: Novendra Setyawan, Chi-Chia Sun, Mao-Hsiu Hsu, Wen-Kai Kuo, Jun-Wei Hsieh

    Abstract: The Vision Transformer (ViT) has demonstrated state-of-the-art performance in various computer vision tasks, but its high computational demands make it impractical for edge devices with limited resources. This paper presents MicroViT, a lightweight Vision Transformer architecture optimized for edge devices by significantly reducing computational complexity while maintaining high accuracy. The core…

    Submitted 9 February, 2025; originally announced February 2025.

  10. Gig2Gether: Data-sharing to Empower, Unify and Demystify Gig Work

    Authors: Jane Hsieh, Angie Zhang, Sajel Surati, Sijia Xie, Yeshua Ayala, Nithila Sathiya, Tzu-Sheng Kuo, Min Kyung Lee, Haiyi Zhu

    Abstract: The wide adoption of platformized work has generated remarkable advancements in the labor patterns and mobility of modern society. Underpinning such progress, gig workers are exposed to unprecedented challenges and accountabilities: lack of data transparency, social and physical isolation, as well as insufficient infrastructural safeguards. Gig2Gether presents a space designed for workers to engag…

    Submitted 6 February, 2025; originally announced February 2025.

  11. arXiv:2501.07991  [pdf]

    physics.optics cs.AI

    Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins

    Authors: Ilker Oguz, Louis J. E. Suter, Jih-Liang Hsieh, Mustafa Yildirim, Niyazi Ulas Dinc, Christophe Moser, Demetri Psaltis

    Abstract: The ability to train ever-larger neural networks brings artificial intelligence to the forefront of scientific and technical discoveries. However, their exponentially increasing size creates a proportionally greater demand for energy and computational hardware. Incorporating complex physical events in networks as fixed, efficient computation modules can address this demand by decreasing the comple…

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 17 pages, 6 figures

  12. arXiv:2501.07917  [pdf]

    cs.ET physics.app-ph physics.optics

    Roadmap on Neuromorphic Photonics

    Authors: Daniel Brunner, Bhavin J. Shastri, Mohammed A. Al Qadasi, H. Ballani, Sylvain Barbay, Stefano Biasi, Peter Bienstman, Simon Bilodeau, Wim Bogaerts, Fabian Böhm, G. Brennan, Sonia Buckley, Xinlun Cai, Marcello Calvanese Strinati, B. Canakci, Benoit Charbonnier, Mario Chemnitz, Yitong Chen, Stanley Cheung, Jeff Chiles, Suyeon Choi, Demetrios N. Christodoulides, Lukas Chrostowski, J. Chu, J. H. Clegg, et al. (125 additional authors not shown)

    Abstract: This roadmap consolidates recent advances while exploring emerging applications, reflecting the remarkable diversity of hardware platforms, neuromorphic concepts, and implementation philosophies reported in the field. It emphasizes the critical role of cross-disciplinary collaboration in this rapidly evolving field.

    Submitted 16 January, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  13. arXiv:2412.02973  [pdf, other]

    cs.CY

    Supporting Gig Worker Needs and Advancing Policy Through Worker-Centered Data-Sharing

    Authors: Jane Hsieh, Angie Zhang, Mialy Rasetarinera, Erik Chou, Daniel Ngo, Karen Lightman, Min Kyung Lee, Haiyi Zhu

    Abstract: The proliferating adoption of platform-based gig work increasingly raises concerns for worker conditions. Past studies documented how platforms leveraged design to exploit labor, withheld information to generate power asymmetries, and left workers alone to manage logistical overheads as well as social isolation. However, researchers also called attention to the potential of helping workers overcom…

    Submitted 11 December, 2024; v1 submitted 3 December, 2024; originally announced December 2024.

  14. arXiv:2411.14361  [pdf, other]

    cs.CC math.CO

    Improved Lower Bounds for all Odd-Query Locally Decodable Codes

    Authors: Arpon Basu, Jun-Ting Hsieh, Pravesh K. Kothari, Andrew D. Lin

    Abstract: We prove that for every odd $q\geq 3$, any $q$-query binary, possibly non-linear locally decodable code ($q$-LDC) $E:\{\pm1\}^k \rightarrow \{\pm1\}^n$ must satisfy $k \leq \tilde{O}(n^{1-2/q})$. For even $q$, this bound was established in a sequence of prior works. For $q=3$, the above bound was achieved in a recent work of Alrabiah, Guruswami, Kothari and Manohar using an argument that crucially…

    Submitted 21 November, 2024; originally announced November 2024.

  15. arXiv:2411.11627  [pdf, ps, other]

    math.CO cs.CC cs.DM cs.DS

    Explicit Two-Sided Vertex Expanders Beyond the Spectral Barrier

    Authors: Jun-Ting Hsieh, Ting-Chun Lin, Sidhanth Mohanty, Ryan O'Donnell, Rachel Yun Zhang

    Abstract: We construct the first explicit two-sided vertex expanders that bypass the spectral barrier. Previously, the strongest known explicit vertex expanders were given by $d$-regular Ramanujan graphs, whose spectral properties imply that every small subset of vertices $S$ has at least $0.5d|S|$ distinct neighbors. However, it is possible to construct Ramanujan graphs containing a small set $S$ with no…

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 28 pages

  16. PolicyCraft: Supporting Collaborative and Participatory Policy Design through Case-Grounded Deliberation

    Authors: Tzu-Sheng Kuo, Quan Ze Chen, Amy X. Zhang, Jane Hsieh, Haiyi Zhu, Kenneth Holstein

    Abstract: Community and organizational policies are typically designed in a top-down, centralized fashion, with limited input from impacted stakeholders. This can result in policies that are misaligned with community needs or perceived as illegitimate. How can we support more collaborative, participatory approaches to policy design? In this paper, we present PolicyCraft, a system that structures collaborati…

    Submitted 5 February, 2025; v1 submitted 23 September, 2024; originally announced September 2024.

    Journal ref: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)

  17. arXiv:2409.03684  [pdf, ps, other]

    quant-ph cs.DS cs.LG

    Predicting quantum channels over general product distributions

    Authors: Sitan Chen, Jaume de Dios Pont, Jun-Ting Hsieh, Hsin-Yuan Huang, Jane Lange, Jerry Li

    Abstract: We investigate the problem of predicting the output behavior of unknown quantum channels. Given query access to an $n$-qubit channel $E$ and an observable $O$, we aim to learn the mapping $\rho \mapsto \mathrm{Tr}(O E[\rho])$ to within a small error for most $\rho$ sampled from a distribution $D$. Previously, Huang, Chen, and Preskill proved a surprising result that even if…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 20 pages, comments welcome

  18. Data Collectives as a means to Improve Accountability, Combat Surveillance and Reduce Inequalities

    Authors: Jane Hsieh, Angie Zhang, Seyun Kim, Varun Nagaraj Rao, Samantha Dalal, Alexandra Mateescu, Rafael Do Nascimento Grohmann, Motahhare Eslami, Min Kyung Lee, Haiyi Zhu

    Abstract: Platform-based laborers face unprecedented challenges and working conditions that result from algorithmic opacity, insufficient data transparency, and unclear policies and regulations. The CSCW and HCI communities increasingly turn to worker data collectives as a means to advance related policy and regulation, hold platforms accountable for data transparency and disclosure, and empower the collect…

    Submitted 1 September, 2024; originally announced September 2024.

  19. arXiv:2405.16296  [pdf, other]

    cs.CV cs.GR cs.MM

    Neural Network-Based Tracking and 3D Reconstruction of Baseball Pitch Trajectories from Single-View 2D Video

    Authors: Jhen Hsieh

    Abstract: In this paper, we present a neural network-based approach for tracking and reconstructing the trajectories of baseball pitches from 2D video footage to 3D coordinates. We utilize OpenCV's CSRT algorithm to accurately track the baseball and fixed reference points in 2D video frames. These tracked pixel coordinates are then used as input features for our neural network model, which comprises multipl…

    Submitted 25 May, 2024; originally announced May 2024.

  20. arXiv:2405.10238  [pdf, other]

    cs.DS cs.CC

    Rounding Large Independent Sets on Expanders

    Authors: Mitali Bafna, Jun-Ting Hsieh, Pravesh K. Kothari

    Abstract: We develop a new approach for approximating large independent sets when the input graph is a one-sided spectral expander - that is, the uniform random walk matrix of the graph has its second eigenvalue bounded away from 1. Consequently, we obtain a polynomial time algorithm to find linear-sized independent sets in one-sided expanders that are almost $3$-colorable or are promised to contain an inde…

    Submitted 5 November, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 57 pages, 3 figures

  21. arXiv:2405.05373  [pdf, other]

    cs.DS cs.CC math.MG

    Certifying Euclidean Sections and Finding Planted Sparse Vectors Beyond the $\sqrt{n}$ Dimension Threshold

    Authors: Venkatesan Guruswami, Jun-Ting Hsieh, Prasad Raghavendra

    Abstract: We consider the task of certifying that a random $d$-dimensional subspace $X$ in $\mathbb{R}^n$ is well-spread - every vector $x \in X$ satisfies $c\sqrt{n} \|x\|_2 \leq \|x\|_1 \leq \sqrt{n}\|x\|_2$. In a seminal work, Barak et al. showed a polynomial-time certification algorithm when $d \leq O(\sqrt{n})$. On the other hand, when $d \gg \sqrt{n}$, the certification task is information-theoretica…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 32 pages, 2 Figures

  22. arXiv:2404.09432  [pdf, other]

    cs.CV cs.AI cs.LG

    The 8th AI City Challenge

    Authors: Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Pranamesh Chakraborty, Sanjita Prajapati, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Fady Alnajjar, Ganzorig Batnasan, Ping-Yang Chen, Jun-Wei Hsieh, Xunlei Wu, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Rama Chellappa

    Abstract: The eighth AI City Challenge highlighted the convergence of computer vision and artificial intelligence in areas like retail, warehouse settings, and Intelligent Traffic Systems (ITS), presenting significant research opportunities. The 2024 edition featured five tracks, attracting unprecedented interest from 726 teams in 47 countries and regions. Track 1 dealt with multi-target multi-camera (MTMC)…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Summary of the 8th AI City Challenge Workshop in conjunction with CVPR 2024

  23. arXiv:2403.15004  [pdf, other]

    cs.CV cs.LG

    ParFormer: A Vision Transformer with Parallel Mixer and Sparse Channel Attention Patch Embedding

    Authors: Novendra Setyawan, Ghufron Wahyu Kurniawan, Chi-Chia Sun, Jun-Wei Hsieh, Jing-Ming Guo, Wen-Kai Kuo

    Abstract: Convolutional Neural Networks (CNNs) and Transformers have achieved remarkable success in computer vision tasks. However, their deep architectures often lead to high computational redundancy, making them less suitable for resource-constrained environments, such as edge devices. This paper introduces ParFormer, a novel vision transformer that addresses this challenge by incorporating a Parallel Mix…

    Submitted 1 October, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: Under Review in IEEE Transactions on Cognitive and Developmental System

  24. arXiv:2403.02363  [pdf, other]

    cs.LG cs.AI

    Addressing Long-Tail Noisy Label Learning Problems: a Two-Stage Solution with Label Refurbishment Considering Label Rarity

    Authors: Ying-Hsuan Wu, Jun-Wei Hsieh, Li Xin, Shin-You Teng, Yi-Kuan Hsieh, Ming-Ching Chang

    Abstract: Real-world datasets commonly exhibit noisy labels and class imbalance, such as long-tailed distributions. While previous research addresses this issue by differentiating noisy and clean samples, reliance on information from predictions based on noisy long-tailed data introduces potential errors. To overcome the limitations of prior works, we introduce an effective two-stage approach by combining s…

    Submitted 4 March, 2024; originally announced March 2024.

  25. arXiv:2401.11590  [pdf, ps, other]

    cs.CC math.CO

    Small Even Covers, Locally Decodable Codes and Restricted Subgraphs of Edge-Colored Kikuchi Graphs

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Sidhanth Mohanty, David Munhá Correia, Benny Sudakov

    Abstract: Given a $k$-uniform hypergraph $H$ on $n$ vertices, an even cover in $H$ is a collection of hyperedges that touch each vertex an even number of times. Even covers are a generalization of cycles in graphs and are equivalent to linearly dependent subsets of a system of linear equations modulo $2$. As a result, they arise naturally in the context of well-studied questions in coding theory and refutin…

    Submitted 25 November, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: 19 pages

  26. arXiv:2312.16771  [pdf, other]

    cs.CV

    Scale-Aware Crowd Count Network with Annotation Error Correction

    Authors: Yi-Kuan Hsieh, Jun-Wei Hsieh, Yu-Chee Tseng, Ming-Ching Chang, Li Xin

    Abstract: Traditional crowd counting networks suffer from information loss when feature maps are downsized through pooling layers, leading to inaccuracies in counting crowds at a distance. Existing methods often assume correct annotations during training, disregarding the impact of noisy annotations, especially in crowded scenes. Furthermore, the use of a fixed Gaussian kernel fails to account for the varyi…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 7 pages, 6 figures. arXiv admin note: text overlap with arXiv:2211.06835

  27. arXiv:2311.09354  [pdf]

    q-bio.QM cs.LG eess.IV

    Nondestructive, quantitative viability analysis of 3D tissue cultures using machine learning image segmentation

    Authors: Kylie J. Trettner, Jeremy Hsieh, Weikun Xiao, Jerry S. H. Lee, Andrea M. Armani

    Abstract: Ascertaining the collective viability of cells in different cell culture conditions has typically relied on averaging colorimetric indicators and is often reported out in simple binary readouts. Recent research has combined viability assessment techniques with image-based deep-learning models to automate the characterization of cellular properties. However, further development of viability measure…

    Submitted 11 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 52 total pages, Main text and SI included, 35 figures (5 main text, 30 supplemental), 9 tables, 6 datasets (provided on linked GitHub), linked image files on Zenodo

  28. arXiv:2310.00393  [pdf, ps, other]

    cs.DS cs.CC

    New SDP Roundings and Certifiable Approximation for Cubic Optimization

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Lucas Pesenti, Luca Trevisan

    Abstract: We give new rounding schemes for SDP relaxations for the problems of maximizing cubic polynomials over the unit sphere and the $n$-dimensional hypercube. In both cases, the resulting algorithms yield a $O(\sqrt{n/k})$ multiplicative approximation in $2^{O(k)} \text{poly}(n)$ time. In particular, we obtain a $O(\sqrt{n/\log n})$ approximation in polynomial time. For the unit sphere, this improves o…

    Submitted 30 September, 2023; originally announced October 2023.

  29. arXiv:2309.16897  [pdf, other]

    cs.CC cs.DS

    Efficient Algorithms for Semirandom Planted CSPs at the Refutation Threshold

    Authors: Venkatesan Guruswami, Jun-Ting Hsieh, Pravesh K. Kothari, Peter Manohar

    Abstract: We present an efficient algorithm to solve semirandom planted instances of any Boolean constraint satisfaction problem (CSP). The semirandom model is a hybrid between worst-case and average-case input models, where the input is generated by (1) choosing an arbitrary planted assignment $x^*$, (2) choosing an arbitrary clause structure, and (3) choosing literal negations for each clause from an arbi…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: FOCS 2023

  30. arXiv:2308.12817  [pdf, other]

    cs.CV

    MixNet: Toward Accurate Detection of Challenging Scene Text in the Wild

    Authors: Yu-Xiang Zeng, Jun-Wei Hsieh, Xin Li, Ming-Ching Chang

    Abstract: Detecting small scene text instances in the wild is particularly challenging, where the influence of irregular positions and nonideal lighting often leads to detection errors. We present MixNet, a hybrid architecture that combines the strengths of CNNs and Transformers, capable of accurately detecting small text from challenging natural scenes, regardless of the orientations, styles, and lighting…

    Submitted 27 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  31. arXiv:2308.07427  [pdf, other]

    cs.HC cs.SE

    Nip it in the Bud: Moderation Strategies in Open Source Software Projects and the Role of Bots

    Authors: Jane Hsieh, Joselyn Kim, Laura Dabbish, Haiyi Zhu

    Abstract: Much of our modern digital infrastructure relies critically upon open sourced software. The communities responsible for building this cyberinfrastructure require maintenance and moderation, which is often supported by volunteer efforts. Moderation, as a non-technical form of labor, is a necessary but often overlooked task that maintainers undertake to sustain the community around an OSS project. T…

    Submitted 14 August, 2023; originally announced August 2023.

  32. arXiv:2307.05954  [pdf, other]

    math.PR cs.CC cs.DS

    Ellipsoid Fitting Up to a Constant

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Aaron Potechin, Jeff Xu

    Abstract: In [Sau11,SPW13], Saunderson, Parrilo and Willsky asked the following elegant geometric question: what is the largest $m= m(d)$ such that there is an ellipsoid in $\mathbb{R}^d$ that passes through $v_1, v_2, \ldots, v_m$ with high probability when the $v_i$s are chosen independently from the standard Gaussian distribution $N(0,I_{d})$. The existence of such an ellipsoid is equivalent to the exist…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: ICALP 2023

  33. Designing Individualized Policy and Technology Interventions to Improve Gig Work Conditions

    Authors: Jane Hsieh, Oluwatobi Adisa, Sachi Bafna, Haiyi Zhu

    Abstract: The gig economy is characterized by short-term contract work completed by independent workers who are paid to perform "gigs", and who have control over when, whether and how they conduct work. Gig economy platforms (e.g., Uber, Lyft, Instacart) offer workers increased job opportunities, lower barriers to entry, and improved flexibility. However, growing evidence suggests that worker well-being and…

    Submitted 22 June, 2023; originally announced June 2023.

  34. arXiv:2306.09662  [pdf, other]

    cs.LG cs.AI cs.MA

    Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction

    Authors: Cheng Ruei Tang, Jun Wei Hsieh, Shin You Teng

    Abstract: Existing traffic signal control systems rely on oversimplified rule-based methods, and even RL-based methods are often suboptimal and unstable. To address this, we propose a cooperative multi-objective architecture called Multi-Objective Multi-Agent Deep Deterministic Policy Gradient (MOMA-DDPG), which estimates multiple reward terms for traffic signal control optimization using age-decaying weigh…

    Submitted 16 July, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2205.11291

  35. arXiv:2305.19170  [pdf]

    cs.LG physics.optics

    Forward-Forward Training of an Optical Neural Network

    Authors: Ilker Oguz, Junjie Ke, Qifei Wang, Feng Yang, Mustafa Yildirim, Niyazi Ulas Dinc, Jih-Liang Hsieh, Christophe Moser, Demetri Psaltis

    Abstract: Neural networks (NN) have demonstrated remarkable capabilities in various tasks, but their computation-intensive nature demands faster and more energy-efficient hardware implementations. Optics-based platforms, using technologies such as silicon photonics and spatial light modulators, offer promising avenues for achieving this goal. However, training multiple trainable layers in tandem with these…

    Submitted 10 August, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  36. arXiv:2305.17449  [pdf, ps, other]

    cs.CV

    FishEye8K: A Benchmark and Dataset for Fisheye Camera Object Detection

    Authors: Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Erkhembayar Ganbold, Jun-Wei Hsieh, Ming-Ching Chang, Ping-Yang Chen, Byambaa Dorj, Hamad Al Jassmi, Ganzorig Batnasan, Fady Alnajjar, Mohammed Abduljabbar, Fang-Pang Lin

    Abstract: With the advance of AI, road object detection has been a prominent topic in computer vision, mostly using perspective cameras. Fisheye lens provides omnidirectional wide coverage for using fewer cameras to monitor road intersections, however with view distortions. To our knowledge, there is no existing open dataset prepared for traffic surveillance on fisheye cameras. This paper introduces an open…

    Submitted 6 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: CVPR Workshops 2023

  37. arXiv:2305.04206  [pdf, other]

    cs.CV cs.AI

    RATs-NAS: Redirection of Adjacent Trails on GCN for Neural Architecture Search

    Authors: Yu-Ming Zhang, Jun-Wei Hsieh, Chun-Chieh Lee, Kuo-Chin Fan

    Abstract: Various hand-designed CNN architectures have been developed, such as VGG, ResNet, DenseNet, etc., and achieve State-of-the-Art (SoTA) levels on different tasks. Neural Architecture Search (NAS) now focuses on automatically finding the best CNN architecture to handle the above tasks. However, the verification of a searched architecture is very time-consuming and makes predictor-based methods become…

    Submitted 8 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

  38. Navigating Multi-Stakeholder Incentives and Preferences: Co-Designing Alternatives for the Future of Gig Worker Well-Being

    Authors: Jane Hsieh, Miranda Karger, Lucas Zagal, Haiyi Zhu

    Abstract: Gig workers, and the products and services they provide, play an increasingly ubiquitous role in our daily lives. But despite growing evidence suggesting that worker well-being in gig economy platforms have become significant societal problems, few studies have investigated possible solutions. We take a stride in this direction by engaging workers, platform employees, and local regulators in a ser…

    Submitted 5 June, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

  39. arXiv:2302.01212  [pdf, other]

    math.CO cs.CC cs.DM cs.DS

    Explicit two-sided unique-neighbor expanders

    Authors: Jun-Ting Hsieh, Theo McKenzie, Sidhanth Mohanty, Pedro Paredes

    Abstract: We study the problem of constructing explicit sparse graphs that exhibit strong vertex expansion. Our main result is the first two-sided construction of imbalanced unique-neighbor expanders, meaning bipartite graphs where small sets contained in both the left and right bipartitions exhibit unique-neighbor expansion, along with algebraic properties relevant to constructing quantum codes. Our cons…

    Submitted 15 January, 2024; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: New version contains stronger result, and many new technical ingredients. 45 pages, 2 figures

    MSC Class: 05C48 ACM Class: G.2.1; G.2.2

  40. arXiv:2212.01287  [pdf, other]

    cs.CV cs.AI

    SARAS-Net: Scale and Relation Aware Siamese Network for Change Detection

    Authors: Chao-Peng Chen, Jun-Wei Hsieh, Ping-Yang Chen, Yi-Kuan Hsieh, Bor-Shiun Wang

    Abstract: Change detection (CD) aims to find the difference between two images at different times and outputs a change map to represent whether the region has changed or not. To achieve a better result in generating the change map, many State-of-The-Art (SoTA) methods design a deep learning model that has a powerful discriminative ability. However, these methods still get lower performance because they igno…

    Submitted 2 December, 2022; originally announced December 2022.

  41. arXiv:2211.08824  [pdf, other]

    cs.CV

    SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking

    Authors: Yu-Hsiang Wang, Jun-Wei Hsieh, Ping-Yang Chen, Ming-Ching Chang, Hung Hin So, Xin Li

    Abstract: Despite recent progress in Multiple Object Tracking (MOT), several obstacles such as occlusions, similar objects, and complex scenes remain an open challenge. Meanwhile, a systematic study of the cost-performance tradeoff for the popular tracking-by-detection paradigm is still lacking. This paper introduces SMILEtrack, an innovative object tracker that effectively addresses these challenges by int…

    Submitted 22 January, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Our paper was accepted by AAAI2024

  42. arXiv:2211.06835  [pdf, other]

    cs.CV cs.AI

    Scale-Aware Crowd Counting Using a Joint Likelihood Density Map and Synthetic Fusion Pyramid Network

    Authors: Yi-Kuan Hsieh, Jun-Wei Hsieh, Yu-Chee Tseng, Ming-Ching Chang, Bor-Shiun Wang

    Abstract: We develop a Synthetic Fusion Pyramid Network (SPF-Net) with a scale-aware loss function design for accurate crowd counting. Existing crowd-counting methods assume that the training annotation points were accurate and thus ignore the fact that noisy annotations can lead to large model-learning bias and counting error, especially for counting highly dense crowds that appear far away. To the best of…

    Submitted 2 January, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 8 pages, 8 figures, 4 tables

  43. arXiv:2210.11173  [pdf, other]

    cs.LG

    Mathematical Justification of Hard Negative Mining via Isometric Approximation Theorem

    Authors: Albert Xu, Jhih-Yi Hsieh, Bhaskar Vundurthy, Eliana Cohen, Howie Choset, Lu Li

    Abstract: In deep metric learning, the Triplet Loss has emerged as a popular method to learn many computer vision and natural language processing tasks such as facial recognition, object detection, and visual-semantic embeddings. One issue that plagues the Triplet Loss is network collapse, an undesirable phenomenon where the network projects the embeddings of all data onto a single point. Researchers predom…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 9 pages, 6 figures, submitted to AAAI 2023

  44. arXiv:2210.00698  [pdf, other]

    cs.CV cs.LG

    NAS-based Recursive Stage Partial Network (RSPNet) for Light-Weight Semantic Segmentation

    Authors: Yi-Chun Wang, Jun-Wei Hsieh, Ming-Ching Chang

    Abstract: Current NAS-based semantic segmentation methods focus on accuracy improvements rather than light-weight design. In this paper, we proposed a two-stage framework to design our NAS-based RSPNet model for light-weight semantic segmentation. The first architecture search determines the inner cell structure, and the second architecture search considers exponentially growing paths to finalize the outer…

    Submitted 2 October, 2022; originally announced October 2022.

  45. arXiv:2210.00546  [pdf, other]

    cs.CV cs.LG

    Siamese-NAS: Using Trained Samples Efficiently to Find Lightweight Neural Architecture by Prior Knowledge

    Authors: Yu-Ming Zhang, Jun-Wei Hsieh, Chun-Chieh Lee, Kuo-Chin Fan

    Abstract: In the past decade, many architectures of convolution neural networks were designed by handcraft, such as Vgg16, ResNet, DenseNet, etc. They all achieve state-of-the-art level on different tasks in their time. However, it still relies on human intuition and experience, and it also takes so much time consumption for trial and error. Neural Architecture Search (NAS) focused on this issue. In recent…

    Submitted 2 October, 2022; originally announced October 2022.

  46. arXiv:2209.01332   

    cs.CV

    Class-Specific Channel Attention for Few-Shot Learning

    Authors: Ying-Yu Chen, Jun-Wei Hsieh, Ming-Ching Chang

    Abstract: Few-Shot Learning (FSL) has attracted growing attention in computer vision due to its capability in model training without the need for excessive data. FSL is challenging because the training and testing categories (the base vs. novel sets) can be largely diversified. Conventional transfer-based solutions that aim to transfer knowledge learned from large labeled training sets to target testing set…

    Submitted 13 December, 2022; v1 submitted 3 September, 2022; originally announced September 2022.

    Comments: There are errors in the phase of testing, leading to the wrong results listed in the paper

  47. arXiv:2208.04951  [pdf]

    cs.ET physics.optics

    Programming Nonlinear Propagation for Efficient Optical Learning Machines

    Authors: Ilker Oguz, Jih-Liang Hsieh, Niyazi Ulas Dinc, Uğur Teğin, Mustafa Yildirim, Carlo Gigli, Christophe Moser, Demetri Psaltis

    Abstract: The ever-increasing demand for processing data with larger machine learning models requires more efficient hardware solutions due to limitations such as power dissipation and scalability. Optics is a promising contender for providing lower power computation since light propagation through a non-absorbing medium is a lossless operation. However, to carry out useful and efficient computations with l…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 32 pages, 11 figures

  48. arXiv:2208.00122  [pdf, ps, other]

    cs.DS cs.CC

    Polynomial-Time Power-Sum Decomposition of Polynomials

    Authors: Mitali Bafna, Jun-Ting Hsieh, Pravesh K. Kothari, Jeff Xu

    Abstract: We give efficient algorithms for finding power-sum decomposition of an input polynomial $P(x)= \sum_{i\leq m} p_i(x)^d$ with component $p_i$s. The case of linear $p_i$s is equivalent to the well-studied tensor decomposition problem while the quadratic case occurs naturally in studying identifiability of non-spherical Gaussian mixtures from low-order moments. Unlike tensor decomposition, both the…

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: To appear in FOCS 2022

  49. arXiv:2207.10850  [pdf, other]

    math.CO cs.DM cs.DS

    A simple and sharper proof of the hypergraph Moore bound

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Sidhanth Mohanty

    Abstract: The hypergraph Moore bound is an elegant statement that characterizes the extremal trade-off between the girth - the number of hyperedges in the smallest cycle or even cover (a subhypergraph with all degrees even) and size - the number of hyperedges in a hypergraph. For graphs (i.e., $2$-uniform hypergraphs), a bound tight up to the leading constant was proven in a classical work of Alon, Hoory an…

    Submitted 21 July, 2022; originally announced July 2022.

  50. arXiv:2206.09204  [pdf, ps, other]

    cs.DS cs.CC

    Approximating Max-Cut on Bounded Degree Graphs: Tighter Analysis of the FKL Algorithm

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari

    Abstract: In this note, we describe a $α_{GW} + \tilde{Ω}(1/d^2)$-factor approximation algorithm for Max-Cut on weighted graphs of degree $\leq d$. Here, $α_{GW}\approx 0.878$ is the worst-case approximation ratio of the Goemans-Williamson rounding for Max-Cut. This improves on previous results for unweighted graphs by Feige, Karpinski, and Langberg and Florén. Our guarantee is obtained by a tighter analysis…

    Submitted 18 June, 2022; originally announced June 2022.