Skip to main content

Showing 1–18 of 18 results for author: Hu, S X

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.12649  [pdf, other

    cs.LG cs.AI

    FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization

    Authors: Hao Mark Chen, Shell Xu Hu, Wayne Luk, Timothy Hospedales, Hongxiang Fan

    Abstract: Model merging has emerged as a promising approach for multi-task learning (MTL), offering a data-efficient alternative to conventional fine-tuning. However, with the rapid development of the open-source AI ecosystem and the increasing availability of fine-tuned foundation models, existing model merging methods face two key limitations: (i) They are primarily designed for in-house fine-tuned models… ▽ More

    Submitted 25 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

  2. arXiv:2408.13933  [pdf, other

    cs.CL

    MobileQuant: Mobile-friendly Quantization for On-device Language Models

    Authors: Fuwen Tan, Royson Lee, Łukasz Dudziak, Shell Xu Hu, Sourav Bhattacharya, Timothy Hospedales, Georgios Tzimiropoulos, Brais Martinez

    Abstract: Large language models (LLMs) have revolutionized language processing, delivering outstanding results across multiple applications. However, deploying LLMs on edge devices poses several challenges with respect to memory, energy, and compute costs, limiting their widespread use in devices such as mobile phones. A promising solution is to reduce the number of bits used to represent weights and activa… ▽ More

    Submitted 4 October, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: EMNLP 2024 Findings. Code and models available: https://github.com/saic-fi/MobileQuant

  3. arXiv:2407.00466  [pdf, other

    cs.CL cs.AI

    BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science

    Authors: Xinna Lin, Siqi Ma, Junjie Shan, Xiaojing Zhang, Shell Xu Hu, Tiannan Guo, Stan Z. Li, Kaicheng Yu

    Abstract: Pursuing artificial intelligence for biomedical science, a.k.a. AI Scientist, draws increasing attention, where one common approach is to build a copilot agent driven by Large Language Models (LLMs). However, to evaluate such systems, people either rely on direct Question-Answering (QA) to the LLM itself, or in a biomedical experimental manner. How to precisely benchmark biomedical agents from an… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  4. arXiv:2405.14791  [pdf, other

    cs.LG cs.CV cs.DC

    Recurrent Early Exits for Federated Learning with Heterogeneous Clients

    Authors: Royson Lee, Javier Fernandez-Marques, Shell Xu Hu, Da Li, Stefanos Laskaridis, Łukasz Dudziak, Timothy Hospedales, Ferenc Huszár, Nicholas D. Lane

    Abstract: Federated learning (FL) has enabled distributed learning of a model across multiple clients in a privacy-preserving manner. One of the main challenges of FL is to accommodate clients with varying hardware capacities; clients have differing compute and memory requirements. To tackle this challenge, recent state-of-the-art approaches leverage the use of early exits. Nonetheless, these approaches fal… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted at the 41st International Conference on Machine Learning (ICML 2024)

  5. arXiv:2310.02426  [pdf, other

    cs.CV

    EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods

    Authors: Samyadeep Basu, Mehrdad Saberi, Shweta Bhardwaj, Atoosa Malemir Chegini, Daniela Massiceti, Maziar Sanjabi, Shell Xu Hu, Soheil Feizi

    Abstract: A plethora of text-guided image editing methods have recently been developed by leveraging the impressive capabilities of large-scale diffusion-based generative models such as Imagen and Stable Diffusion. A standardized evaluation protocol, however, does not exist to compare methods across different types of fine-grained edits. To address this gap, we introduce EditVal, a standardized benchmark fo… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  6. arXiv:2307.09233  [pdf, other

    cs.CV

    Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP

    Authors: Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi

    Abstract: Image-text contrastive models like CLIP have wide applications in zero-shot classification, image-text retrieval, and transfer learning. However, they often struggle on compositional visio-linguistic tasks (e.g., attribute-binding or object-relationships) where their performance is no better than random chance. To address this, we introduce SDS-CLIP, a lightweight and sample-efficient distillation… ▽ More

    Submitted 1 July, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Short paper

  7. arXiv:2304.01917  [pdf, other

    cs.CV

    Strong Baselines for Parameter Efficient Few-Shot Fine-tuning

    Authors: Samyadeep Basu, Daniela Massiceti, Shell Xu Hu, Soheil Feizi

    Abstract: Few-shot classification (FSC) entails learning novel classes given only a few examples per class after a pre-training (or meta-training) phase on a set of base classes. Recent works have shown that simply fine-tuning a pre-trained Vision Transformer (ViT) on new test classes is a strong approach for FSC. Fine-tuning ViTs, however, is expensive in time, compute and storage. This has motivated the d… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  8. arXiv:2212.04084  [pdf, other

    cs.LG cs.DC

    Federated Learning for Inference at Anytime and Anywhere

    Authors: Zicheng Liu, Da Li, Javier Fernandez-Marques, Stefanos Laskaridis, Yan Gao, Łukasz Dudziak, Stan Z. Li, Shell Xu Hu, Timothy Hospedales

    Abstract: Federated learning has been predominantly concerned with collaborative training of deep networks from scratch, and especially the many challenges that arise, such as communication cost, robustness to heterogeneous data, and support for diverse device capabilities. However, there is no unified framework that addresses all these problems together. This paper studies the challenges and opportunities… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 14 pages, 3 figures

  9. arXiv:2209.00383  [pdf, other

    cs.CV stat.ML

    TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Yuan Yuan, Yuming Du, Maomao Li, Shell Xu Hu, James L Crowley, Dominique Vaufreydaz

    Abstract: In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or video are organised into a fully connected graph, where the edge between each pair of patches is labeled with a similarity score between patches using features l… ▽ More

    Submitted 5 December, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.11539

  10. arXiv:2207.07624  [pdf, other

    cs.LG stat.ML

    Feed-Forward Latent Domain Adaptation

    Authors: Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy Hospedales

    Abstract: We study a new highly-practical problem setting that enables resource-constrained edge devices to adapt a pre-trained model to their local data distributions. Recognizing that device's data are likely to come from multiple latent domains that include a mixture of unlabelled domain-relevant and domain-irrelevant examples, we focus on the comparatively under-studied problem of latent domain adaptati… ▽ More

    Submitted 31 January, 2024; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted at WACV 2024. Project page: https://ondrejbohdal.github.io/cxda

  11. arXiv:2206.13140  [pdf, other

    cs.LG stat.ML

    Compressing Features for Learning with Noisy Labels

    Authors: Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A. K. Suykens

    Abstract: Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper,… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted to TNNLS 2022. Project page: https://yingyichen-cyy.github.io/CompressFeatNoisyLabels/

  12. arXiv:2206.04920  [pdf, other

    cs.LG

    Fisher SAM: Information Geometry and Sharpness Aware Minimisation

    Authors: Minyoung Kim, Da Li, Shell Xu Hu, Timothy M. Hospedales

    Abstract: Recent sharpness-aware minimisation (SAM) is known to find flat minima which is beneficial for better generalisation with improved robustness. SAM essentially modifies the loss function by reporting the maximum loss value within the small neighborhood around the current iterate. However, it uses the Euclidean ball to define the neighborhood, which can be inaccurate since loss functions for neural… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  13. arXiv:2204.07305  [pdf, other

    cs.CV cs.LG

    Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference

    Authors: Shell Xu Hu, Da Li, Jan Stühmer, Minyoung Kim, Timothy M. Hospedales

    Abstract: Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated meta-learning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for more realistic and practical settings of few-shot image classification. To this end, we explore few-shot l… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022

  14. arXiv:2105.00609  [pdf, other

    cs.SD eess.AS

    AvaTr: One-Shot Speaker Extraction with Transformers

    Authors: Shell Xu Hu, Md Rifat Arefin, Viet-Nhat Nguyen, Alish Dipani, Xaq Pitkow, Andreas Savas Tolias

    Abstract: To extract the voice of a target speaker when mixed with a variety of other sounds, such as white and ambient noises or the voices of interfering speakers, we extend the Transformer network to attend the most relevant information with respect to the target speaker given the characteristics of his or her voices as a form of contextual information. The idea has a natural interpretation in terms of t… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: 6 pages, 4 main figures, 2 supplemental figures

  15. arXiv:2104.13766  [pdf, other

    cs.CV

    Boosting Co-teaching with Compression Regularization for Label Noise

    Authors: Yingyi Chen, Xi Shen, Shell Xu Hu, Johan A. K. Suykens

    Abstract: In this paper, we study the problem of learning image classification models in the presence of label noise. We revisit a simple compression regularization named Nested Dropout. We find that Nested Dropout, though originally proposed to perform fast information retrieval and adaptive data compression, can properly regularize a neural network to combat label noise. Moreover, owing to its simplicity,… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR Workshop 2021. Project page: https://github.com/yingyichen-cyy/Nested-Co-teaching

  16. arXiv:2004.12696  [pdf, other

    cs.LG stat.ML

    Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

    Authors: Shell Xu Hu, Pablo G. Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas Damianou

    Abstract: We propose a meta-learning approach that learns from multiple tasks in a transductive setting, by leveraging the unlabeled query set in addition to the support set to generate a more powerful model for each task. To develop our framework, we revisit the empirical Bayes formulation for multi-task learning. The evidence lower bound of the marginal log-likelihood of empirical Bayes decomposes as a su… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ICLR 2020

  17. arXiv:1904.05835  [pdf, other

    cs.CV cs.AI cs.LG

    Variational Information Distillation for Knowledge Transfer

    Authors: Sungsoo Ahn, Shell Xu Hu, Andreas Damianou, Neil D. Lawrence, Zhenwen Dai

    Abstract: Transferring knowledge from a teacher neural network pretrained on the same or a similar task to a student neural network can significantly improve the performance of the student neural network. Existing knowledge transfer approaches match the activations or the corresponding hand-crafted features of the teacher and the student networks. We propose an information-theoretic framework for knowledge… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

    Comments: To appear at CVPR 2019

  18. arXiv:1506.03852  [pdf, other

    stat.ML cs.CV

    Tree-Cut for Probabilistic Image Segmentation

    Authors: Shell X. Hu, Christopher K. I. Williams, Sinisa Todorovic

    Abstract: This paper presents a new probabilistic generative model for image segmentation, i.e. the task of partitioning an image into homogeneous regions. Our model is grounded on a mid-level image representation, called a region tree, in which regions are recursively split into subregions until superpixels are reached. Given the region tree, image segmentation is formalized as sampling cuts in the tree fr… ▽ More

    Submitted 11 June, 2015; originally announced June 2015.