Showing 1–28 of 28 results for author: Tsai, J

  1. arXiv:2507.01564

    eess.IV cs.CV

    Multi Source COVID-19 Detection via Kernel-Density-based Slice Sampling

    Authors: Chia-Ming Lee, Bo-Cheng Qiu, Ting-Yao Chen, Ming-Han Sun, Fang-Ying Lin, Jung-Tse Tsai, I-An Tsai, Yu-Fan Lin, Chih-Chung Hsu

    Abstract: We present our solution for the Multi-Source COVID-19 Detection Challenge, which classifies chest CT scans from four distinct medical centers. To address multi-source variability, we employ the Spatial-Slice Feature Learning (SSFL) framework with Kernel-Density-based Slice Sampling (KDS). Our preprocessing pipeline combines lung region extraction, quality control, and adaptive slice sampling to se…

    Submitted 2 July, 2025; originally announced July 2025.

  2. arXiv:2505.20199

    cs.CL

    Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking

    Authors: Pengxiang Li, Shilin Yan, Joey Tsai, Renrui Zhang, Ruichuan An, Ziyu Guo, Xiaowei Gao

    Abstract: Classifier-Free Guidance (CFG) significantly enhances controllability in generative models by interpolating conditional and unconditional predictions. However, standard CFG often employs a static unconditional input, which can be suboptimal for iterative generation processes where model uncertainty varies dynamically. We introduce Adaptive Classifier-Free Guidance (A-CFG), a novel method that tail…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Project page: https://github.com/pixeli99/A-CFG

  3. arXiv:2505.17020

    cs.CV

    CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms

    Authors: Shilin Yan, Jiaming Han, Joey Tsai, Hongwei Xue, Rongyao Fang, Lingyi Hong, Ziyu Guo, Ray Zhang

    Abstract: The advent of Large Multimodal Models (LMMs) has significantly enhanced Large Language Models (LLMs) to process and interpret diverse data modalities (e.g., image and video). However, as input complexity increases, particularly with long video sequences, the number of required tokens has grown significantly, leading to quadratic computational costs. This has made the efficient compression of v…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Project page: https://github.com/shilinyan99/CrossLMM

  4. arXiv:2505.09115

    cs.HC cs.AI

    PreCare: Designing AI Assistants for Advance Care Planning (ACP) to Enhance Personal Value Exploration, Patient Knowledge, and Decisional Confidence

    Authors: Yu Lun Hsu, Yun-Rung Chou, Chiao-Ju Chang, Yu-Cheng Chang, Zer-Wei Lee, Rokas Gipiškis, Rachel Li, Chih-Yuan Shih, Jen-Kuei Peng, Hsien-Liang Huang, Jaw-Shiun Tsai, Mike Y. Chen

    Abstract: Advance Care Planning (ACP) allows individuals to specify their preferred end-of-life life-sustaining treatments before they become incapacitated by injury or terminal illness (e.g., coma, cancer, dementia). While online ACP offers high accessibility, it lacks key benefits of clinical consultations, including personalized value exploration and immediate clarification of decision consequences. To brid…

    Submitted 13 May, 2025; originally announced May 2025.

  5. arXiv:2502.18596

    cs.DC

    Introducing JIRIAF: A Virtual Kubelet Integration for Optimizing HPC Resource Provisioning

    Authors: Vardan Gyurjyan, Graham Heyes, Christopher Larrieu, David Lawrence, Jeng-Yuan Tsai

    Abstract: The JIRIAF (JLab Integrated Research Infrastructure Across Facilities) framework is designed to streamline resource management and optimize high-performance computing (HPC) workloads across heterogeneous environments. Central to JIRIAF is the JIRIAF Resource Manager (JRM), which effectively leverages Kubernetes and Virtual Kubelet to manage resources dynamically, even in environments with restrict…

    Submitted 25 February, 2025; originally announced February 2025.

  6. arXiv:2412.12459

    cs.CL cs.AI cs.IR

    LITA: An Efficient LLM-assisted Iterative Topic Augmentation Framework

    Authors: Chia-Hsuan Chang, Jui-Tse Tsai, Yi-Hang Tsai, San-Yih Hwang

    Abstract: Topic modeling is widely used for uncovering thematic structures within text corpora, yet traditional models often struggle with specificity and coherence in domain-focused applications. Guided approaches, such as SeededLDA and CorEx, incorporate user-provided seed words to improve relevance but remain labor-intensive and static. Large language models (LLMs) offer potential for dynamic topic refin…

    Submitted 21 May, 2025; v1 submitted 16 December, 2024; originally announced December 2024.

    Comments: Accepted to PAKDD 2025

  7. arXiv:2411.16483

    cond-mat.mtrl-sci cs.LG

    Graph Transformer Networks for Accurate Band Structure Prediction: An End-to-End Approach

    Authors: Weiyi Gong, Tao Sun, Hexin Bai, Jeng-Yuan Tsai, Haibin Ling, Qimin Yan

    Abstract: Predicting electronic band structures from crystal structures is crucial for understanding structure-property correlations in materials science. First-principles approaches are accurate but computationally intensive. In recent years, machine learning (ML) has been extensively applied to this field, while existing ML models predominantly focus on band gap predictions or indirect band structure estimat…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 8 pages, 3 figures

  8. arXiv:2411.14652

    cs.CY cs.AI cs.HC cs.SI

    Social Media Algorithms Can Shape Affective Polarization via Exposure to Antidemocratic Attitudes and Partisan Animosity

    Authors: Tiziano Piccardi, Martin Saveski, Chenyan Jia, Jeffrey T. Hancock, Jeanne L. Tsai, Michael Bernstein

    Abstract: There is widespread concern about the negative impacts of social media feed ranking algorithms on political polarization. Leveraging advancements in large language models (LLMs), we develop an approach to re-rank feeds in real-time to test the effects of content that is likely to polarize: expressions of antidemocratic attitudes and partisan animosity (AAPA). In a preregistered 10-day field experi…

    Submitted 21 November, 2024; originally announced November 2024.

  9. arXiv:2409.04068

    cs.CV

    Site-Specific Color Features of Green Coffee Beans

    Authors: Shu-Min Tan, Shih-Hsun Hung, Je-Chiang Tsai

    Abstract: Coffee is one of the most valuable primary commodities. Despite this, the common selection technique of green coffee beans relies on personnel visual inspection, which is labor-intensive and subjective. Therefore, an efficient way to evaluate the quality of beans is needed. In this paper, we demonstrate a site-independent approach to find site-specific color features of the seed coat in qualified…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 21 pages, 7 figures

    ACM Class: I.5

  10. arXiv:2406.19571

    cs.SI cs.CY

    Reranking Social Media Feeds: A Practical Guide for Field Experiments

    Authors: Tiziano Piccardi, Martin Saveski, Chenyan Jia, Jeffrey Hancock, Jeanne L. Tsai, Michael S. Bernstein

    Abstract: Social media plays a central role in shaping public opinion and behavior, yet performing experiments on these platforms and, in particular, on feed algorithms is becoming increasingly challenging. This article offers practical recommendations to researchers developing and deploying field experiments focused on real-time re-ranking of social media feeds. This article is organized around two contrib…

    Submitted 27 June, 2024; originally announced June 2024.

  11. arXiv:2404.11770

    cs.CV cs.AI

    Event-Based Eye Tracking. AIS 2024 Challenge Survey

    Authors: Zuowen Wang, Chang Gao, Zongwei Wu, Marcos V. Conde, Radu Timofte, Shih-Chii Liu, Qinyu Chen, Zheng-jun Zha, Wei Zhai, Han Han, Bohao Liao, Yuliang Wu, Zengyu Wan, Zhong Wang, Yang Cao, Ganchao Tan, Jinze Chen, Yan Ru Pei, Sasskia Brüers, Sébastien Crouzet, Douglas McLelland, Oliver Coenen, Baoheng Zhang, Yizhao Gao, Jingyuan Li, et al. (14 additional authors not shown)

    Abstract: This survey reviews the AIS 2024 Event-Based Eye Tracking (EET) Challenge. The task of the challenge focuses on processing eye movement recorded with event cameras and predicting the pupil center of the eye. The challenge emphasizes efficient eye tracking with event cameras to achieve good task accuracy and efficiency trade-off. During the challenge period, 38 participants registered for the Kaggl…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Qinyu Chen is the corresponding author

  12. arXiv:2404.06581

    cs.DC

    NotNets: Accelerating Microservices by Bypassing the Network

    Authors: Peter Alvaro, Matthew Adiletta, Adrian Cockroft, Frank Hady, Ramesh Illikkal, Esteban Ramos, James Tsai, Robert Soulé

    Abstract: Remote procedure calls are the workhorse of distributed systems. However, as software engineering trends, such as micro-services and serverless computing, push applications towards ever finer-grained decompositions, the overhead of RPC-based communication is becoming too great to bear. In this paper, we argue that point solutions that attempt to optimize one aspect of RPC logic are unlikely to mit…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures

  13. How Culture Shapes What People Want From AI

    Authors: Xiao Ge, Chunchen Xu, Daigo Misaki, Hazel Rose Markus, Jeanne L Tsai

    Abstract: There is an urgent need to incorporate the perspectives of culturally diverse groups into AI developments. We present a novel conceptual framework for research that aims to expand, reimagine, and reground mainstream visions of AI using independent and interdependent cultural models of the self and the environment. Two survey studies support this framework and provide preliminary evidence that peop…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: To appear at CHI 2024

  14. Interactive Shape Sonification for Tumor Localization in Breast Cancer Surgery

    Authors: Laura Schütz, Trishia El Chemaly, Emmanuelle Weber, Anh Thien Doan, Jacqueline Tsai, Christoph Leuze, Bruce Daniel, Nassir Navab

    Abstract: About 20 percent of patients undergoing breast-conserving surgery require reoperation due to cancerous tissue remaining inside the breast. Breast cancer localization systems utilize auditory feedback to convey the distance between a localization probe and a small marker (seed) implanted into the breast tumor prior to surgery. However, no information on the location of the tumor margin is provided.…

    Submitted 28 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 15 pages, 9 figures

    ACM Class: H.5.2; H.5.5; J.3

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA. ACM, New York, NY, USA

  15. arXiv:2311.01729

    cs.SI cs.LG

    CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model

    Authors: Jui-Yi Tsai, Ya-Wen Teng, Ho Chiok Yew, De-Nian Yang, Lydia Y. Chen

    Abstract: The social graphs synthesized by the generative models are increasingly in demand due to data scarcity and concerns over user privacy. One of the key performance criteria for generating social networks is the fidelity to specified conditionals, such as users with certain membership and financial status. While recent diffusion models have shown remarkable performance in generating images, their eff…

    Submitted 5 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

  16. arXiv:2306.13075

    cs.CL cs.CV

    Semi-automated extraction of research topics and trends from NCI funding in radiological sciences from 2000-2020

    Authors: Mark Nguyen, Peter Beidler, Joseph Tsai, August Anderson, Daniel Chen, Paul Kinahan, John Kang

    Abstract: Investigators, funders, and the public desire knowledge on topics and trends in publicly funded research but current efforts in manual categorization are limited in scale and understanding. We developed a semi-automated approach to extract and name research topics, and applied this to $1.9B of NCI funding over 21 years in the radiological sciences to determine micro- and macro-scale research topi…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Presented at the American Society of Radiation Oncology annual meeting in 2021 (doi: 10.1016/j.ijrobp.2021.07.263) and the Practical Big Data Workshop 2022

    MSC Class: 68T50 (Primary); 68T10 (Secondary)

    ACM Class: I.2.7; I.5.3; J.3

  17. Associations Between Natural Language Processing (NLP) Enriched Social Determinants of Health and Suicide Death among US Veterans

    Authors: Avijit Mitra, Richeek Pradhan, Rachel D Melamed, Kun Chen, David C Hoaglin, Katherine L Tucker, Joel I Reisman, Zhichao Yang, Weisong Liu, Jack Tsai, Hong Yu

    Abstract: Importance: Social determinants of health (SDOH) are known to be associated with increased risk of suicidal behaviors, but few studies utilized SDOH from unstructured electronic health record (EHR) notes. Objective: To investigate associations between suicide and recent SDOH, identified using structured and unstructured data. Design: Nested case-control study. Setting: EHR data from the US V…

    Submitted 28 December, 2022; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: Submitted to JAMA Network Open

  18. Automated Identification of Eviction Status from Electronic Health Record Notes

    Authors: Zonghai Yao, Jack Tsai, Weisong Liu, David A. Levy, Emily Druhl, Joel I Reisman, Hong Yu

    Abstract: Objective: Evictions are important social and behavioral determinants of health. Evictions are associated with a cascade of negative events that can lead to unemployment, housing insecurity/homelessness, long-term poverty, and mental health problems. In this study, we developed a natural language processing system to automatically detect eviction status from electronic health record (EHR) notes.…

    Submitted 20 May, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: This article has been accepted for publication in Journal of the American Medical Informatics Association Published by Oxford University Press. https://doi.org/10.1093/jamia/ocad081

    Journal ref: Journal of the American Medical Informatics Association, ocad081, 2023

  19. Live Multi-Streaming and Donation Recommendations via Coupled Donation-Response Tensor Factorization

    Authors: Hsu-Chao Lai, Jui-Yi Tsai, Hong-Han Shuai, Jiun-Long Huang, Wang-Chien Lee, De-Nian Yang

    Abstract: In contrast to traditional online videos, live multi-streaming supports real-time social interactions between multiple streamers and viewers, such as donations. However, donation and multi-streaming channel recommendations are challenging due to complicated streamer and viewer relations, asymmetric communications, and the tradeoff between personal interests and group interactions. In this paper, w…

    Submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 29th ACM International Conference on Information & Knowledge Management 1 2020 665-674

  20. arXiv:2005.13352

    physics.comp-ph cs.LG

    Graph Neural Network for Hamiltonian-Based Material Property Prediction

    Authors: Hexin Bai, Peng Chu, Jeng-Yuan Tsai, Nathan Wilson, Xiaofeng Qian, Qimin Yan, Haibin Ling

    Abstract: Development of next-generation electronic devices for applications calls for the discovery of quantum materials hosting novel electronic, magnetic, and topological properties. Traditional electronic structure methods require expensive computation time and memory consumption, thus a fast and accurate prediction model is desired with increasing importance. Representing the interactions among atomic o…

    Submitted 27 May, 2020; originally announced May 2020.

    ACM Class: J.2; I.5.1

  21. arXiv:2004.10345

    cs.MM cs.SD eess.AS eess.IV

    MIDI-Sheet Music Alignment Using Bootleg Score Synthesis

    Authors: Thitaree Tanprasert, Teerapat Jenrungrot, Meinard Mueller, T. J. Tsai

    Abstract: MIDI-sheet music alignment is the task of finding correspondences between a MIDI representation of a piece and its corresponding sheet music images. Rather than using optical music recognition to bridge the gap between sheet music and MIDI, we explore an alternative approach: projecting the MIDI data into pixel space and performing alignment in the image domain. Our method converts the MIDI data i…

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: 8 pages, 6 figures, 1 table. Accepted paper at the International Society for Music Information Retrieval Conference (ISMIR) 2019

  22. arXiv:2003.13912

    cs.CV cs.GR cs.LG eess.IV

    Y-net: Multi-scale feature aggregation network with wavelet structure similarity loss function for single image dehazing

    Authors: Hao-Hsiang Yang, Chao-Han Huck Yang, Yi-Chang James Tsai

    Abstract: Single image dehazing is an ill-posed two-dimensional signal reconstruction problem. Recently, deep convolutional neural networks (CNN) have been successfully used in many computer vision problems. In this paper, we propose a Y-net that is named for its structure. This network reconstructs clear images by aggregating multi-scale feature maps. Additionally, we propose a Wavelet Structure SIMilari…

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted to IEEE ICASSP 2020

  23. Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding

    Authors: Yi-Chieh Liu, Yung-An Hsieh, Min-Hung Chen, Chao-Han Huck Yang, Jesper Tegner, Yi-Chang James Tsai

    Abstract: Performing driving behaviors based on causal reasoning is essential to ensure driving safety. In this work, we investigated how state-of-the-art 3D Convolutional Neural Networks (CNNs) perform on classifying driving behaviors based on causal reasoning. We proposed a perturbation-based visual explanation method to inspect the models' performance visually. By examining the video attention saliency,…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: Submitted to IEEE ICASSP 2020; Pytorch code will be released soon

    Journal ref: 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  24. arXiv:1902.04147

    cs.CV cs.AI cs.LG

    Synthesizing New Retinal Symptom Images by Multiple Generative Models

    Authors: Yi-Chieh Liu, Hao-Hsiang Yang, Chao-Han Huck Yang, Jia-Hong Huang, Meng Tian, Hiromasa Morikawa, Yi-Chang James Tsai, Jesper Tegner

    Abstract: Age-Related Macular Degeneration (AMD) is an asymptomatic retinal disease which may result in loss of vision. There is limited access to high-quality relevant retinal images and poor understanding of the features defining sub-classes of this disease. Motivated by recent advances in machine learning we specifically explore the potential of generative modeling, using Generative Adversarial Networks…

    Submitted 11 February, 2019; originally announced February 2019.

    Journal ref: AI for Retinal Image Analysis Workshop ACCV 2018

  25. arXiv:1902.03380

    cs.CV cs.AI cs.LG cs.SC

    When Causal Intervention Meets Adversarial Examples and Image Masking for Deep Neural Networks

    Authors: Chao-Han Huck Yang, Yi-Chieh Liu, Pin-Yu Chen, Xiaoli Ma, Yi-Chang James Tsai

    Abstract: Discovering and exploiting the causality in deep neural networks (DNNs) are crucial challenges for understanding and reasoning causal effects (CE) on an explainable visual model. "Intervention" has been widely used for recognizing a causal relation ontologically. In this paper, we propose a causal inference framework for visual reasoning via do-calculus. To study the intervention effects on pixel-…

    Submitted 25 June, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: Note that the camera-ready version changed the title: "When Causal Intervention Meets Adversarial Examples and Image Masking for Deep Neural Networks" is the official v3 paper title in the IEEE proceedings. Please use it in formal references. Accepted at IEEE ICIP 2019. PyTorch code has been released at https://github.com/jjaacckkyy63/Causal-Intervention-AE-wAdvImg

    Report number: page 3811--3815

    Journal ref: 2019 26th IEEE International Conference on Image Processing (ICIP). IEEE

  26. arXiv:0805.0903

    cs.OH

    Design And Fabrication of High Numerical Aperture And Low Aberration Bi-Convex Micro Lens Array

    Authors: Jhy-Cherng Tsai, Ming-Fong Chen, Hsiharng Yang

    Abstract: Micro lens array is crucial in various kinds of optical and electronic applications. A micro lens array with high numerical aperture (NA) and low aberration is in particular needed. This research is aimed to design and fabricate such a micro lens array with simple structure while keeping the same NA of a same-diameter hemisphere lens. A bi-convex semispherical micro lens array, with corresponding…

    Submitted 7 May, 2008; originally announced May 2008.

    Comments: Submitted on behalf of EDA Publishing Association (http://irevues.inist.fr/handle/2042/16838)

    Journal ref: In Symposium on Design, Test, Integration and Packaging of MEMS/MOEMS - DTIP 2008, Nice, France (2008)

  27. arXiv:0805.0856

    cs.OH

    Design And Fabrication of Condenser Microphone Using Wafer Transfer And Micro-electroplating Technique

    Authors: Zhen-Zhun Shu, Ming-Li Ke, Guan-Wei Chen, Ray Hua Horng, Chao-Chih Chang, Jean-Yih Tsai, Chung-Ching Lai, Ji-Liang Chen

    Abstract: A novel fabrication process, which uses wafer transfer and micro-electroplating technique, has been proposed and tested. In this paper, the effects of the diaphragm thickness and stress, the air-gap thickness, and the area ratio of acoustic holes to backplate on the sensitivity of the condenser microphone have been demonstrated since the performance of the microphone depends on these parameters.…

    Submitted 7 May, 2008; originally announced May 2008.

    Comments: Submitted on behalf of EDA Publishing Association (http://irevues.inist.fr/handle/2042/16838)

    Journal ref: In Symposium on Design, Test, Integration and Packaging of MEMS/MOEMS - DTIP 2008, Nice, France (2008)

  28. arXiv:0711.3329

    cs.OH

    Micro-Ball Lens Array Fabrication in Photoresist Using Ptfe Hydrophobic Effect

    Authors: Ruey-Fang Shyu, Hsiharng Yang, Wen-Ren Tsai, Jhy-Cherng Tsai

    Abstract: This paper presents a simple method to fabricate micro-ball lens and its array. The key technology is to use the hydrophobic characteristics of polytetrafluoroethylene (PTFE) substrate. High contact angle between melted photoresist pattern and PTFE can generate micro-ball lens and its array. PTFE thin film was spun onto a silicon wafer and dried in oven. Photoresist AZ4620 was used to pattern mic…

    Submitted 21 November, 2007; originally announced November 2007.

    Comments: Submitted on behalf of TIMA Editions (http://irevues.inist.fr/tima-editions)

    Journal ref: In Symposium on Design, Test, Integration and Packaging of MEMS/MOEMS - DTIP 2006, Stresa, Lago Maggiore, Italy (2006)