Skip to main content

Showing 1–13 of 13 results for author: Thong, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.08179  [pdf, other

    eess.IV cs.LG eess.SP stat.AP stat.ML

    Do Bayesian imaging methods report trustworthy probabilities?

    Authors: David Y. W. Thong, Charlesquin Kemajou Mbakam, Marcelo Pereyra

    Abstract: Bayesian statistics is a cornerstone of imaging sciences, underpinning many and varied approaches from Markov random fields to score-based denoising diffusion models. In addition to powerful image estimation methods, the Bayesian paradigm also provides a framework for uncertainty quantification and for using image data as quantitative evidence. These probabilistic capabilities are important for th… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    MSC Class: 65J22 (Primary); 62F15 (Secondary); 68U10

  2. arXiv:2311.13895  [pdf, other

    cs.CV

    Query by Activity Video in the Wild

    Authors: Tao Hu, William Thong, Pascal Mettes, Cees G. M. Snoek

    Abstract: This paper focuses on activity retrieval from a video query in an imbalanced scenario. In current query-by-activity-video literature, a common assumption is that all activities have sufficient labelled examples when learning an embedding. This assumption does however practically not hold, as only a portion of activities have many examples, while other activities are only described by few examples.… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: An extended version of ICIP 2023

  3. arXiv:2309.05148  [pdf, other

    cs.CV

    Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color

    Authors: William Thong, Przemyslaw Joniak, Alice Xiang

    Abstract: This paper strives to measure apparent skin color in computer vision, beyond a unidimensional scale on skin tone. In their seminal paper Gender Shades, Buolamwini and Gebru have shown how gender classification systems can be biased against women with darker skin tones. Subsequently, fairness researchers and practitioners have adopted the Fitzpatrick skin type classification as a common measure to… ▽ More

    Submitted 3 October, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted at the International Conference on Computer Vision (ICCV) 2023

  4. Augmented Datasheets for Speech Datasets and Ethical Decision-Making

    Authors: Orestis Papakyriakopoulos, Anna Seo Gyeong Choi, Jerone Andrews, Rebecca Bourke, William Thong, Dora Zhao, Alice Xiang, Allison Koenecke

    Abstract: Speech datasets are crucial for training Speech Language Technologies (SLT); however, the lack of diversity of the underlying training data can lead to serious limitations in building equitable and robust SLT products, especially along dimensions of language, accent, dialect, variety, and speech impairment - and the intersectionality of speech features with socioeconomic and demographic features.… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: To appear in 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23), June 12-15, Chicago, IL, USA

  5. arXiv:2302.03629  [pdf, ps, other

    cs.CV cs.AI cs.DB cs.LG

    Ethical Considerations for Responsible Data Curation

    Authors: Jerone T. A. Andrews, Dora Zhao, William Thong, Apostolos Modas, Orestis Papakyriakopoulos, Alice Xiang

    Abstract: Human-centric computer vision (HCCV) data curation practices often neglect privacy and bias concerns, leading to dataset retractions and unfair models. HCCV datasets constructed through nonconsensual web scraping lack crucial metadata for comprehensive fairness and robustness evaluations. Current remedies are post hoc, lack persuasive justification for adoption, or fail to provide proper contextua… ▽ More

    Submitted 10 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023 Track on Datasets and Benchmarks (Oral)

  6. arXiv:2211.05215  [pdf, other

    cs.CV

    Content-Diverse Comparisons improve IQA

    Authors: William Thong, Jose Costa Pereira, Sarah Parisot, Ales Leonardis, Steven McDonagh

    Abstract: Image quality assessment (IQA) forms a natural and often straightforward undertaking for humans, yet effective automation of the task remains highly challenging. Recent metrics from the deep learning community commonly compare image pairs during training to improve upon traditional metrics such as PSNR or SSIM. However, current comparisons ignore the fact that image content affects quality assessm… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2022

  7. arXiv:2110.14336  [pdf, other

    cs.CV

    Feature and Label Embedding Spaces Matter in Addressing Image Classifier Bias

    Authors: William Thong, Cees G. M. Snoek

    Abstract: This paper strives to address image classifier bias, with a focus on both feature and label embedding spaces. Previous works have shown that spurious correlations from protected attributes, such as age, gender, or skin tone, can cause adverse decisions. To balance potential harms, there is a growing need to identify and mitigate image classifier bias. First, we identify in the feature space a bias… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2021

  8. arXiv:2105.03072  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Perceptual Image Quality Assessment

    Authors: Jinjin Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, Sungjun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, Jingyu Guo, Zirui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang , et al. (25 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2021. As a new type of image processing technology, perceptual image processing algorithms based on Generative Adversarial Networks (GAN) have produced images with more realistic textures. These o… ▽ More

    Submitted 28 June, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

  9. arXiv:2104.04715  [pdf, other

    cs.CV

    Object Priors for Classifying and Localizing Unseen Actions

    Authors: Pascal Mettes, William Thong, Cees G. M. Snoek

    Abstract: This work strives for the classification and localization of human actions in videos, without the need for any labeled video training examples. Where existing work relies on transferring global attribute or object information from seen to unseen action videos, we seek to classify and spatio-temporally localize unseen actions in videos from image-based object information only. We propose three spat… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: Accepted to IJCV

  10. arXiv:2008.11185  [pdf, other

    cs.CV

    Bias-Awareness for Zero-Shot Learning the Seen and Unseen

    Authors: William Thong, Cees G. M. Snoek

    Abstract: Generalized zero-shot learning recognizes inputs from both seen and unseen classes. Yet, existing methods tend to be biased towards the classes seen during training. In this paper, we strive to mitigate this bias. We propose a bias-aware learner to map inputs to a semantic embedding space for generalized zero-shot learning. During training, the model learns to regress to real-valued class prototyp… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Accepted at British Machine Vision Conference (BMVC) 2020

  11. arXiv:1911.08621  [pdf, other

    cs.CV

    Open Cross-Domain Visual Search

    Authors: William Thong, Pascal Mettes, Cees G. M. Snoek

    Abstract: This paper addresses cross-domain visual search, where visual queries retrieve category samples from a different domain. For example, we may want to sketch an airplane and retrieve photographs of airplanes. Despite considerable progress, the search occurs in a closed setting between two pre-defined domains. In this paper, we make the step towards an open setting where multiple visual domains are a… ▽ More

    Submitted 28 July, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted at Computer Vision and Image Understanding (CVIU)

  12. arXiv:1904.01421  [pdf, other

    cs.CV

    Cooperative Embeddings for Instance, Attribute and Category Retrieval

    Authors: William Thong, Cees G. M. Snoek, Arnold W. M. Smeulders

    Abstract: The goal of this paper is to retrieve an image based on instance, attribute and category similarity notions. Different from existing works, which usually address only one of these entities in isolation, we introduce a cooperative embedding to integrate them while preserving their specific level of semantic representation. An algebraic structure defines a superspace filled with instances. Attribute… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

  13. arXiv:1902.00671  [pdf, other

    cs.CV

    A Layer-Based Sequential Framework for Scene Generation with GANs

    Authors: Mehmet Ozgur Turkoglu, William Thong, Luuk Spreeuwers, Berkay Kicanaoglu

    Abstract: The visual world we sense, interpret and interact everyday is a complex composition of interleaved physical entities. Therefore, it is a very challenging task to generate vivid scenes of similar complexity using computers. In this work, we present a scene generation framework based on Generative Adversarial Networks (GANs) to sequentially compose a scene, breaking down the underlying problem into… ▽ More

    Submitted 2 February, 2019; originally announced February 2019.

    Comments: This paper was accepted at AAAI 2019