Showing 1–22 of 22 results for author: SA, R

  1. arXiv:2409.00240

    cs.CV cs.AI cs.LG

    One-Frame Calibration with Siamese Network in Facial Action Unit Recognition

    Authors: Shuangquan Feng, Virginia R. de Sa

    Abstract: Automatic facial action unit (AU) recognition is used widely in facial expression analysis. Most existing AU recognition systems aim for cross-participant non-calibrated generalization (NCG) to unseen faces without further calibration. However, due to the diversity of facial attributes across different identities, accurately inferring AU activation from single images of an unseen face is sometimes…

    Submitted 30 August, 2024; originally announced September 2024.

  2. arXiv:2407.15046

    cs.CV cs.CL cs.MM

    Audio-visual training for improved grounding in video-text LLMs

    Authors: Shivprasad Sagare, Hemachandran S, Kinshuk Sarabhai, Prashant Ullegaddi, Rajeshkumar SA

    Abstract: Recent advances in multimodal LLMs have led to several video-text models being proposed for critical video-related tasks. However, most previous works support visual input only, essentially muting the audio signal in the video. The few models that support both audio and visual input are not explicitly trained on audio data. Hence, the effect of audio towards video understanding is largely une…

    Submitted 20 July, 2024; originally announced July 2024.

  3. arXiv:2404.01250

    q-bio.NC cs.HC

    Perceptogram: Reconstructing Visual Percepts from EEG

    Authors: Teng Fei, Abhinav Uppal, Ian Jackson, Srinivas Ravishankar, David Wang, Virginia R. de Sa

    Abstract: Visual neural decoding from EEG has improved significantly due to diffusion models that can reconstruct high-quality images from decoded latents. While recent works have focused on relatively complex architectures to achieve good reconstruction performance from EEG, less attention has been paid to the source of this information. In this work, we attempt to discover EEG features that represent perc…

    Submitted 24 February, 2025; v1 submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2312.03187

    cs.CV cs.AI cs.HC cs.LG

    FERGI: Automatic Scoring of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction

    Authors: Shuangquan Feng, Junhua Ma, Virginia R. de Sa

    Abstract: Researchers have proposed to use data of human preference feedback to fine-tune text-to-image generative models. However, the scalability of human feedback collection has been limited by its reliance on manual annotation. Therefore, we develop and test a method to automatically score user preferences from their spontaneous facial expression reaction to the generated images. We collect a dataset of…

    Submitted 25 May, 2025; v1 submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2311.06964

    cs.CV cs.LG

    Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels

    Authors: Vijay Veerabadran, Srinivas Ravishankar, Yuan Tang, Ritik Raina, Virginia R. de Sa

    Abstract: Humans solving algorithmic (or) reasoning problems typically exhibit solution times that grow as a function of problem difficulty. Adaptive recurrent neural networks have been shown to exhibit this property for various language-processing tasks. However, little work has been performed to assess whether such adaptive computation can also enable vision models to extrapolate solutions beyond their tr…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  6. arXiv:2110.06139

    eess.SP cs.LG

    Classification of anomalous gait using Machine Learning techniques and embedded sensors

    Authors: T. R. D. Sa, C. M. S. Figueiredo

    Abstract: Human gait can be a predictive factor for detecting pathologies that affect human locomotion according to studies. In addition, it is known that a high investment is demanded in order to raise a traditional clinical infrastructure able to provide human gait examinations, making them unaffordable for economically vulnerable patients. In face of this scenario, this work proposes an accessible and mo…

    Submitted 8 October, 2021; originally announced October 2021.

  7. COVID-19 Monitoring System using Social Distancing and Face Mask Detection on Surveillance video datasets

    Authors: Sahana Srinivasan, Rujula Singh R, Ruchita R Biradar, Revathi SA

    Abstract: In the current times, the fear and danger of COVID-19 virus still stands large. Manual monitoring of social distancing norms is impractical with a large population moving about and with insufficient task force and resources to administer them. There is a need for a lightweight, robust and 24X7 video-monitoring system that automates this process. This paper proposes a comprehensive and effective so…

    Submitted 16 December, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: I, Rujula Singh R, would like to apologize to the research community for the confusion caused by the inconsistency in author lists between multiple versions of this paper. I take full responsibility for this error and will be more diligent in the future to ensure the accuracy and consistency of our research publications

    Journal ref: 2021 International Conference on Emerging Smart Computing and Informatics (ESCI), 2021, pp. 449-455

  8. arXiv:2006.11716

    cs.CV

    Learning compact generalizable neural representations supporting perceptual grouping

    Authors: Vijay Veerabadran, Virginia R. de Sa

    Abstract: Work at the intersection of vision science and deep learning is starting to explore the efficacy of deep convolutional networks (DCNs) and recurrent networks in solving perceptual grouping problems that underlie primate visual recognition and segmentation. Here, we extend this line of work to investigate the compactness and generalizability of DCN solutions to learning low-level perceptual groupin…

    Submitted 21 June, 2020; originally announced June 2020.

  9. arXiv:2006.06791

    cs.LG stat.ML

    Deep Transfer Learning with Ridge Regression

    Authors: Shuai Tang, Virginia R. de Sa

    Abstract: The large amount of online data and vast array of computing resources enable current researchers in both industry and academia to employ the power of deep learning with neural networks. While deep models trained with massive amounts of data demonstrate promising generalisation ability on unseen data from relevant domains, the computational cost of finetuning gradually becomes a bottleneck in trans…

    Submitted 11 June, 2020; originally announced June 2020.

  10. arXiv:2006.01169

    eess.SP cs.HC cs.LG

    RNNs on Monitoring Physical Activity Energy Expenditure in Older People

    Authors: Stylianos Paraschiakos, Cláudio Rebelo de Sá, Jeremiah Okai, Eline P. Slagboom, Marian Beekman, Arno Knobbe

    Abstract: Through the quantification of physical activity energy expenditure (PAEE), health care monitoring has the potential to stimulate vital and healthy ageing, inducing behavioural changes in older people and linking these to personal health gains. To be able to measure PAEE in a monitoring environment, methods from wearable accelerometers have been developed, however, mainly targeted towards younger p…

    Submitted 11 January, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: For a revised, updated and published version (Jan 2022, open access) refer to the Journal of Data Mining and Knowledge Discovery, DOI https://doi.org/10.1007/s10618-021-00817-w. To make our experiments, scripts and results accessible to other researchers (open-source) we shared our scripts with the published version (Jan 2022)

    Journal ref: Data Mining Knowledge Discovery (2022)

  11. arXiv:2005.04258

    cs.CV cs.LG eess.IV

    View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors

    Authors: Walid Bekhtaoui, Ruhan Sa, Brian Teixeira, Vivek Singh, Klaus Kirchberg, Yao-jen Chang, Ankur Kapoor

    Abstract: Point cloud based methods have produced promising results in areas such as 3D object detection in autonomous driving. However, most of the recent point cloud work focuses on single depth sensor data, whereas less work has been done on indoor monitoring applications, such as operation room monitoring in hospitals or indoor surveillance. In these scenarios multiple cameras are often used to tackle o…

    Submitted 8 May, 2020; originally announced May 2020.

  12. arXiv:1905.10971

    cs.LG cs.CL stat.ML

    An Empirical Study on Post-processing Methods for Word Embeddings

    Authors: Shuai Tang, Mahta Mousavi, Virginia R. de Sa

    Abstract: Word embeddings learnt from large corpora have been adopted in various applications in natural language processing and served as the general input representations to learning systems. Recently, a series of post-processing methods have been proposed to boost the performance of word embeddings on similarity comparison and analogy retrieval tasks, and some have been adapted to compose sentence repres…

    Submitted 23 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

  13. Preference rules for label ranking: Mining patterns in multi-target relations

    Authors: Cláudio Rebelo de Sá, Paulo Azevedo, Carlos Soares, Alípio Mário Jorge, Arno Knobbe

    Abstract: In this paper we investigate two variants of association rules for preference data, Label Ranking Association Rules and Pairwise Association Rules. Label Ranking Association Rules (LRAR) are the equivalent of Class Association Rules (CAR) for the Label Ranking task. In CAR, the consequent is a single class, to which the example is expected to belong to. In LRAR, the consequent is a ranking of the…

    Submitted 20 March, 2019; originally announced March 2019.

    Journal ref: Information Fusion, Volume 40, March 2018, Pages 112-125

  14. arXiv:1810.12456

    cs.NE cs.LG cs.SC

    A Simple Recurrent Unit with Reduced Tensor Product Representations

    Authors: Shuai Tang, Paul Smolensky, Virginia R. de Sa

    Abstract: Widely used recurrent units, including Long-short Term Memory (LSTM) and the Gated Recurrent Unit (GRU), perform well on natural language tasks, but their ability to learn structured representations is still questionable. Exploiting reduced Tensor Product Representations (TPRs) --- distributed representations of symbolic structure in which vector-embedded symbols are bound to vector-embedded struct…

    Submitted 5 November, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

  15. arXiv:1810.01064

    cs.CL cs.LG cs.NE

    Improving Sentence Representations with Consensus Maximisation

    Authors: Shuai Tang, Virginia R. de Sa

    Abstract: Consensus maximisation learning can provide self-supervision when different views are available of the same data. The distributional hypothesis provides another form of useful self-supervision from adjacent sentences which are plentiful in large unlabelled corpora. Motivated by the observation that different learning architectures tend to emphasise different aspects of sentence meaning, we present…

    Submitted 6 May, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.07443

  16. arXiv:1809.02731

    cs.NE cs.CL cs.LG

    Exploiting Invertible Decoders for Unsupervised Sentence Representation Learning

    Authors: Shuai Tang, Virginia R. de Sa

    Abstract: The encoder-decoder models for unsupervised sentence representation learning tend to discard the decoder after being trained on a large unlabelled corpus, since only the encoder is needed to map the input sentence into a vector representation. However, parameters learnt in the decoder also contain useful information about language. In order to utilise the decoder after learning, we present two typ…

    Submitted 31 May, 2019; v1 submitted 7 September, 2018; originally announced September 2018.

  17. arXiv:1805.07443

    cs.CL cs.LG cs.NE stat.ML

    Multi-view Sentence Representation Learning

    Authors: Shuai Tang, Virginia R. de Sa

    Abstract: Multi-view learning can provide self-supervision when different views are available of the same data. The distributional hypothesis provides another form of useful self-supervision from adjacent sentences which are plentiful in large unlabelled corpora. Motivated by the asymmetry in the two hemispheres of the human brain as well as the observation that different learning architectures tend to emph…

    Submitted 18 May, 2018; originally announced May 2018.

  18. arXiv:1804.05544

    cs.LG stat.ML

    Building robust prediction models for defective sensor data using Artificial Neural Networks

    Authors: Arvind Kumar Shekar, Cláudio Rebelo de Sá, Hugo Ferreira, Carlos Soares

    Abstract: Predicting the health of components in complex dynamic systems such as an automobile poses numerous challenges. The primary aim of such predictive systems is to use the high-dimensional data acquired from different sensors and predict the state-of-health of a particular component, e.g., brake pad. The classical approach involves selecting a smaller set of relevant sensor signals using feature sele…

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: 16 pages, 7 figures. Currently under review. This research has obtained funding from the Electronic Components and Systems for European Leadership (ECSEL) Joint Undertaking, the framework programme for research and innovation Horizon 2020 (2014-2020) under grant agreement number 662189-MANTIS-2014-1

  19. arXiv:1802.04128

    cs.CY cs.LG

    Smart energy management as a means towards improved energy efficiency

    Authors: Dylan te Lindert, Cláudio Rebelo de Sá, Carlos Soares, Arno J. Knobbe

    Abstract: The costs associated with refrigerator equipment often represent more than half of the total energy costs in supermarkets. This presents a good motivation for running these systems efficiently. In this study, we investigate different ways to construct a reference behavior, which can serve as a baseline for judging the performance of energy consumption. We used 3 distinct learning models: Multiple…

    Submitted 8 February, 2018; originally announced February 2018.

  20. arXiv:1710.10380

    cs.NE cs.CL cs.LG

    Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding

    Authors: Shuai Tang, Hailin Jin, Chen Fang, Zhaowen Wang, Virginia R. de Sa

    Abstract: Context plays an important role in human language understanding, thus it may also be useful for machines learning vector representations of language. In this paper, we explore an asymmetric encoder-decoder structure for unsupervised context-based sentence representation learning. We carefully designed experiments to show that neither an autoregressive decoder nor an RNN decoder is required. After…

    Submitted 31 May, 2018; v1 submitted 27 October, 2017; originally announced October 2017.

  21. arXiv:1706.03148

    cs.CL

    Trimming and Improving Skip-thought Vectors

    Authors: Shuai Tang, Hailin Jin, Chen Fang, Zhaowen Wang, Virginia R. de Sa

    Abstract: The skip-thought model has been proven to be effective at learning sentence representations and capturing sentence semantics. In this paper, we propose a suite of techniques to trim and improve it. First, we validate a hypothesis that, given a current sentence, inferring the previous and inferring the next sentence provide similar supervision power, therefore only one decoder for predicting the ne…

    Submitted 9 June, 2017; originally announced June 2017.

  22. arXiv:1706.03146

    cs.CL cs.AI cs.NE

    Rethinking Skip-thought: A Neighborhood based Approach

    Authors: Shuai Tang, Hailin Jin, Chen Fang, Zhaowen Wang, Virginia R. de Sa

    Abstract: We study the skip-thought model with neighborhood information as weak supervision. More specifically, we propose a skip-thought neighbor model to consider the adjacent sentences as a neighborhood. We train our skip-thought neighbor model on a large corpus with continuous sentences, and then evaluate the trained model on 7 tasks, which include semantic relatedness, paraphrase detection, and classif…

    Submitted 9 June, 2017; originally announced June 2017.