Skip to main content

Showing 1–24 of 24 results for author: Wen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.01420  [pdf, other

    cs.CL cs.AI

    FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations

    Authors: Athena Wen, Tanush Patil, Ansh Saxena, Yicheng Fu, Sean O'Brien, Kevin Zhu

    Abstract: In an era where AI-driven hiring is transforming recruitment practices, concerns about fairness and bias have become increasingly important. To explore these issues, we introduce a benchmark, FAIRE (Fairness Assessment In Resume Evaluation), to test for racial and gender bias in large language models (LLMs) used to evaluate resumes across different industries. We use two methods-direct scoring and… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2503.16419  [pdf, other

    cs.CL

    Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

    Authors: Yang Sui, Yu-Neng Chuang, Guanchu Wang, Jiamu Zhang, Tianyi Zhang, Jiayi Yuan, Hongyi Liu, Andrew Wen, Shaochen Zhong, Hanjie Chen, Xia Hu

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in complex tasks. Recent advancements in Large Reasoning Models (LRMs), such as OpenAI o1 and DeepSeek-R1, have further improved performance in System-2 reasoning domains like mathematics and programming by harnessing supervised fine-tuning (SFT) and reinforcement learning (RL) techniques to enhance the Chain-of-Thought (CoT) r… ▽ More

    Submitted 23 April, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

    Comments: Project Website: https://github.com/Eclipsess/Awesome-Efficient-Reasoning-LLMs

  3. arXiv:2502.09670  [pdf, other

    cs.CL cs.AI

    The Science of Evaluating Foundation Models

    Authors: Jiayi Yuan, Jiamu Zhang, Andrew Wen, Xia Hu

    Abstract: The emergent phenomena of large foundation models have revolutionized natural language processing. However, evaluating these models presents significant challenges due to their size, capabilities, and deployment across diverse applications. Existing literature often focuses on individual aspects, such as benchmark performance or specific tasks, but fails to provide a cohesive process that integrat… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  4. arXiv:2410.01855  [pdf, other

    cs.LG cs.AI

    Explainable Diagnosis Prediction through Neuro-Symbolic Integration

    Authors: Qiuhao Lu, Rui Li, Elham Sagheb, Andrew Wen, Jinlian Wang, Liwei Wang, Jungwei W. Fan, Hongfang Liu

    Abstract: Diagnosis prediction is a critical task in healthcare, where timely and accurate identification of medical conditions can significantly impact patient outcomes. Traditional machine learning and deep learning models have achieved notable success in this domain but often lack interpretability which is a crucial requirement in clinical settings. In this study, we explore the use of neuro-symbolic met… ▽ More

    Submitted 7 January, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Proceedings of AMIA Informatics Summit 2025

  5. arXiv:2407.20266  [pdf, other

    cs.LG

    Accelerating the Low-Rank Decomposed Models

    Authors: Habib Hajimolahoseini, Walid Ahmed, Austin Wen, Yang Liu

    Abstract: Tensor decomposition is a mathematically supported technique for data compression. It consists of applying some kind of a Low Rank Decomposition technique on the tensors or matrices in order to reduce the redundancy of the data. However, it is not a popular technique for compressing the AI models duo to the high number of new layers added to the architecture after decomposition. Although the numbe… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  6. arXiv:2407.16514  [pdf, other

    cs.CV cs.AI

    Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?

    Authors: Habib Hajimolahoseini, Walid Ahmed, Austin Wen, Yang Liu

    Abstract: In this paper, we present a comprehensive study and propose several novel techniques for implementing 3D convolutional blocks using 2D and/or 1D convolutions with only 4D and/or 3D tensors. Our motivation is that 3D convolutions with 5D tensors are computationally very expensive and they may not be supported by some of the edge devices used in real-time applications such as robots. The existing ap… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  7. arXiv:2407.14649  [pdf, other

    cs.CV

    The Collection of a Human Robot Collaboration Dataset for Cooperative Assembly in Glovebox Environments

    Authors: Shivansh Sharma, Mathew Huang, Sanat Nair, Alan Wen, Christina Petlowany, Juston Moore, Selma Wanna, Mitch Pryor

    Abstract: Industry 4.0 introduced AI as a transformative solution for modernizing manufacturing processes. Its successor, Industry 5.0, envisions humans as collaborators and experts guiding these AI-driven manufacturing solutions. Developing these techniques necessitates algorithms capable of safe, real-time identification of human positions in a scene, particularly their hands, during collaborative assembl… ▽ More

    Submitted 13 January, 2025; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: draft paper to be submitted to IJRR

  8. arXiv:2407.00731  [pdf, other

    cs.CL cs.AI cs.LG

    Large Language Models Struggle in Token-Level Clinical Named Entity Recognition

    Authors: Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu

    Abstract: Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial… ▽ More

    Submitted 16 August, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: AMIA 2024 Annual Symposium Proceedings

  9. arXiv:2405.12676  [pdf

    cs.CV math.NA

    Experimental investigation of trans-scale displacement responses of wrinkle defects in fiber reinforced composite laminates

    Authors: Li Ma, Shoulong Wang, Changchen Liu, Ange Wen, Kaidi Ying, Jing Guo

    Abstract: Wrinkle defects were found widely exist in the field of industrial products, i.e. wind turbine blades and filament-wound composite pressure vessels. The magnitude of wrinkle wavelength varies from several millimeters to over one hundred millimeters. Locating the wrinkle defects and measuring their responses are very important to the assessment of the structures that containing wrinkle defects. A m… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  10. arXiv:2401.15293  [pdf, other

    cs.CV cs.AI cs.LG

    SkipViT: Speeding Up Vision Transformers with a Token-Level Skip Connection

    Authors: Foozhan Ataiefard, Walid Ahmed, Habib Hajimolahoseini, Saina Asani, Farnoosh Javadi, Mohammad Hassanpour, Omar Mohamed Awad, Austin Wen, Kangling Liu, Yang Liu

    Abstract: Vision transformers are known to be more computationally and data-intensive than CNN models. These transformer models such as ViT, require all the input image tokens to learn the relationship among them. However, many of these tokens are not informative and may contain irrelevant information such as unrelated background or unimportant scenery. These tokens are overlooked by the multi-head self-att… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  11. arXiv:2311.15134  [pdf, other

    cs.LG cs.AI

    SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

    Authors: Habib Hajimolahoseini, Omar Mohamed Awad, Walid Ahmed, Austin Wen, Saina Asani, Mohammad Hassanpour, Farnoosh Javadi, Mehdi Ahmadi, Foozhan Ataiefard, Kangling Liu, Yang Liu

    Abstract: In this paper, we present SwiftLearn, a data-efficient approach to accelerate training of deep learning models using a subset of data samples selected during the warm-up stages of training. This subset is selected based on an importance criteria measured over the entire dataset during warm-up stages, aiming to preserve the model performance with fewer examples during the rest of training. The impo… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  12. arXiv:2311.03426  [pdf, other

    cs.LG cs.AI cs.CV

    GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values

    Authors: Farnoosh Javadi, Walid Ahmed, Habib Hajimolahoseini, Foozhan Ataiefard, Mohammad Hassanpour, Saina Asani, Austin Wen, Omar Mohamed Awad, Kangling Liu, Yang Liu

    Abstract: Massive transformer-based models face several challenges, including slow and computationally intensive pre-training and over-parametrization. This paper addresses these challenges by proposing a versatile method called GQKVA, which generalizes query, key, and value grouping techniques. GQKVA is designed to speed up transformer pre-training while reducing the model size. Our experiments with variou… ▽ More

    Submitted 13 December, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

  13. arXiv:2309.12412  [pdf, other

    cs.CV cs.LG

    Speeding up Resnet Architecture with Layers Targeted Low Rank Decomposition

    Authors: Walid Ahmed, Habib Hajimolahoseini, Austin Wen, Yang Liu

    Abstract: Compression of a neural network can help in speeding up both the training and the inference of the network. In this research, we study applying compression using low rank decomposition on network layers. Our research demonstrates that to acquire a speed up, the compression methodology should be aware of the underlying hardware as analysis should be done to choose which layers to compress. The adva… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  14. arXiv:2303.12848  [pdf, other

    cs.CV cs.LG

    Test-time Detection and Repair of Adversarial Samples via Masked Autoencoder

    Authors: Yun-Yun Tsai, Ju-Chin Chao, Albert Wen, Zhaoyuan Yang, Chengzhi Mao, Tapan Shah, Junfeng Yang

    Abstract: Training-time defenses, known as adversarial training, incur high training costs and do not generalize to unseen attacks. Test-time defenses solve these issues but most existing test-time defenses require adapting the model weights, therefore they do not work on frozen models and complicate model memory management. The only test-time defense that does not adapt model weights aims to adapt the inpu… ▽ More

    Submitted 2 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  15. arXiv:2208.07529  [pdf, other

    cs.HC

    Understanding the Challenges of Team-Based Live Streaming for First-person Shooter Games

    Authors: Jiaye Li, Minghao Li, Zikai Alex Wen, Wei Cai

    Abstract: First-person shooter (FPS) game tournaments take place across the globe. A growing number of people choose to watch FPS games online instead of attending the game events in person. However, live streaming might miss critical highlight moments in the game, including kills and tactics. We identify how and why the live streaming team fails to capture highlight moments to reduce such live streaming mi… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: Accepted by The IEEE CTSoc International Conference on Games Entertainment & Media 2022 (GEM 2022)

  16. arXiv:2208.06155  [pdf, other

    cs.HC

    What Features Influence Impact Feel? A Study of Impact Feedback in Action Games

    Authors: Zhonghao Lin, Haihan Duan, Zikai Alex Wen, Wei Cai

    Abstract: Making the hit effect satisfy players is a long-standing problem faced by action game designers. However, no research systematically analyzed which game design elements affect such game feel. There is not even a term to describe it. So, we propose to use impact feel to describe the player's feeling when receiving juicy impact feedback. After collecting player's comments on action games from Steam'… ▽ More

    Submitted 22 August, 2022; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: Accepted by The IEEE CTSoc International Conference on Games Entertainment & Media 2022 (GEM 2022)

  17. arXiv:2208.02759  [pdf, other

    cs.HC cs.CR

    New Differential Privacy Communication Pipeline and Design Framework

    Authors: Jingyu Jia, Zikai Alex Wen, Zheli Liu, Changyu Dong

    Abstract: Organizations started to adopt differential privacy (DP) techniques hoping to persuade more users to share personal data with them. However, many users do not understand DP techniques, thus may not be willing to share. Previous research suggested that the design of DP mechanism communication could influence users' willingness to share data. Based on the prior work, we propose a new communication p… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: poster

    Journal ref: The Eighteenth Symposium on Usable Privacy and Security (SOUPS 2022)

  18. arXiv:2110.10780  [pdf

    cs.CL cs.IR

    An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)

    Authors: Sijia Liu, Andrew Wen, Liwei Wang, Huan He, Sunyang Fu, Robert Miller, Andrew Williams, Daniel Harris, Ramakanth Kavuluru, Mei Liu, Noor Abu-el-rub, Dalton Schutte, Rui Zhang, Masoud Rouhizadeh, John D. Osborne, Yongqun He, Umit Topaloglu, Stephanie S Hong, Joel H Saltz, Thomas Schaffter, Emily Pfaff, Christopher G. Chute, Tim Duong, Melissa A. Haendel, Rafael Fuentes , et al. (7 additional authors not shown)

    Abstract: While we pay attention to the latest advances in clinical natural language processing (NLP), we can notice some resistance in the clinical and translational research community to adopt NLP models due to limited transparency, interpretability, and usability. In this study, we proposed an open natural language processing development framework. We evaluated it through the implementation of NLP algori… ▽ More

    Submitted 21 March, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Comments: update on contents

  19. arXiv:2103.16316  [pdf, other

    cs.LG

    Leveraging a Joint of Phenotypic and Genetic Features on Cancer Patient Subgrouping

    Authors: David Oniani, Chen Wang, Yiqing Zhao, Andrew Wen, Hongfang Liu, Feichen Shen

    Abstract: Cancer is responsible for millions of deaths worldwide every year. Although significant progress has been achieved in cancer medicine, many issues remain to be addressed for improving cancer therapy. Appropriate cancer patient stratification is the prerequisite for selecting appropriate treatment plan, as cancer patients are of known heterogeneous genetic make-ups and phenotypic differences. In th… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: text overlap with arXiv:2101.05866

  20. arXiv:2101.05866  [pdf, ps, other

    cs.LG q-bio.QM

    Comparisons of Graph Neural Networks on Cancer Classification Leveraging a Joint of Phenotypic and Genetic Features

    Authors: David Oniani, Chen Wang, Yiqing Zhao, Andrew Wen, Hongfang Liu, Feichen Shen

    Abstract: Cancer is responsible for millions of deaths worldwide every year. Although significant progress hasbeen achieved in cancer medicine, many issues remain to be addressed for improving cancer therapy.Appropriate cancer patient stratification is the prerequisite for selecting appropriate treatment plan, ascancer patients are of known heterogeneous genetic make-ups and phenotypic differences. In thiss… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

  21. Adapting and evaluating a deep learning language model for clinical why-question answering

    Authors: Andrew Wen, Mohamed Y. Elwazir, Sungrim Moon, Jungwei Fan

    Abstract: Objectives: To adapt and evaluate a deep learning language model for answering why-questions based on patient-specific clinical text. Materials and Methods: Bidirectional encoder representations from transformers (BERT) models were trained with varying data sources to perform SQuAD 2.0 style why-question answering (why-QA) on clinical notes. The evaluation focused on: 1) comparing the merits from… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  22. Clinical Concept Extraction: a Methodology Review

    Authors: Sunyang Fu, David Chen, Huan He, Sijia Liu, Sungrim Moon, Kevin J Peterson, Feichen Shen, Liwei Wang, Yanshan Wang, Andrew Wen, Yiqing Zhao, Sunghwan Sohn, Hongfang Liu

    Abstract: Background Concept extraction, a subdomain of natural language processing (NLP) with a focus on extracting concepts of interest, has been adopted to computationally extract clinical information from text for a wide range of applications ranging from clinical decision support to care quality improvement. Objectives In this literature review, we provide a methodology review of clinical concept ext… ▽ More

    Submitted 10 August, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Journal ref: Journal of Biomedical Informatics (2020): 103526

  23. arXiv:1906.09543  [pdf

    cs.IR cs.CL

    Cross-lingual Data Transformation and Combination for Text Classification

    Authors: Jun Jiang, Shumao Pang, Xia Zhao, Liwei Wang, Andrew Wen, Hongfang Liu, Qianjin Feng

    Abstract: Text classification is a fundamental task for text data mining. In order to train a generalizable model, a large volume of text must be collected. To address data insufficiency, cross-lingual data may occasionally be necessary. Cross-lingual data sources may however suffer from data incompatibility, as text written in different languages can hold distinct word sequences and semantic patterns. Mach… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    MSC Class: 68U15

  24. CREATE: Cohort Retrieval Enhanced by Analysis of Text from Electronic Health Records using OMOP Common Data Model

    Authors: Sijia Liu, Yanshan Wang, Andrew Wen, Liwei Wang, Na Hong, Feichen Shen, Steven Bedrick, William Hersh, Hongfang Liu

    Abstract: Background: Widespread adoption of electronic health records (EHRs) has enabled secondary use of EHR data for clinical research and healthcare delivery. Natural language processing (NLP) techniques have shown promise in their capability to extract the embedded information in unstructured clinical data, and information retrieval (IR) techniques provide flexible and scalable solutions that can augme… ▽ More

    Submitted 22 January, 2019; originally announced January 2019.