Showing 1–12 of 12 results for author: Muscat, A

  1. arXiv:2305.15084  [pdf, other]

    cs.CV

    Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos

    Authors: Błażej Leporowski, Arian Bakhtiarnia, Nicole Bonnici, Adrian Muscat, Luca Zanella, Yiming Wang, Alexandros Iosifidis

    Abstract: We introduce the first audio-visual dataset for traffic anomaly detection taken from real-world scenes, called MAVAD, with a diverse range of weather and illumination conditions. In addition, we propose a novel method named AVACA that combines visual and audio features extracted from video sequences by means of cross-attention to detect anomalies. We demonstrate that the addition of audio improves…

    Submitted 24 May, 2023; originally announced May 2023.

  2. arXiv:2302.03729  [pdf, other]

    cs.CV

    KENGIC: KEyword-driven and N-Gram Graph based Image Captioning

    Authors: Brandon Birmingham, Adrian Muscat

    Abstract: This paper presents a Keyword-driven and N-gram Graph based approach for Image Captioning (KENGIC). Most current state-of-the-art image caption generators are trained end-to-end on large scale paired image-caption datasets which are very laborious and expensive to collect. Such models are limited in terms of their explainability and their applicability across different domains. To address these li…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: Published in the Digital Image Computing: Techniques and Applications, 2022 (DICTA 2022)

  3. arXiv:2205.12342  [pdf, other]

    cs.CV cs.NE

    Face2Text revisited: Improved data set and baseline results

    Authors: Marc Tanti, Shaun Abdilla, Adrian Muscat, Claudia Borg, Reuben A. Farrugia, Albert Gatt

    Abstract: Current image description generation models do not transfer well to the task of describing human faces. To encourage the development of more human-focused descriptions, we developed a new data set of facial descriptions based on the CelebA image data set. We describe the properties of this data set, and present results from a face description generator trained on it, which explores the feasibility…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: 7 pages, 5 figures, 4 tables, to appear in LREC 2022 (P-VLAM workshop)

  4. Automated segmentation of microtomography imaging of Egyptian mummies

    Authors: Marc Tanti, Camille Berruyer, Paul Tafforeau, Adrian Muscat, Reuben Farrugia, Kenneth Scerri, Gianluca Valentino, V. Armando Solé, Johann A. Briffa

    Abstract: Propagation Phase Contrast Synchrotron Microtomography (PPC-SR$μ$CT) is the gold standard for non-invasive and non-destructive access to internal structures of archaeological remains. In this analysis, the virtual specimen needs to be segmented to separate different parts or materials, a process that normally requires considerable human effort. In the Automated SEgmentation of Microtomography Imag…

    Submitted 16 December, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Journal ref: PLOS ONE, vol. 16, no. 12, p. e0260707, 2021

  5. arXiv:2101.04626  [pdf, other]

    cs.CV cs.AI

    Predicting Relative Depth between Objects from Semantic Features

    Authors: Stefan Cassar, Adrian Muscat, Dylan Seychell

    Abstract: Vision and language tasks such as Visual Relation Detection and Visual Question Answering benefit from semantic features that afford proper grounding of language. The 3D depth of objects depicted in 2D images is one such feature. However it is very difficult to obtain accurate depth information without learning the appropriate features, which are scene dependent. The state of the art in this area…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 9 pages, 2 figures

  6. arXiv:2101.01294  [pdf, other]

    cs.LG

    One vs Previous and Similar Classes Learning -- A Comparative Study

    Authors: Daniel Cauchi, Adrian Muscat

    Abstract: When dealing with multi-class classification problems, it is common practice to build a model consisting of a series of binary classifiers using a learning paradigm which dictates how the classifiers are built and combined to discriminate between the individual classes. As new data enters the system and the model needs updating, these models would often need to be retrained from scratch. This work…

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: 10 pages, 6 figures

  7. arXiv:2008.05760  [pdf, other]

    cs.CL cs.LG

    MASRI-HEADSET: A Maltese Corpus for Speech Recognition

    Authors: Carlos Mena, Albert Gatt, Andrea DeMarco, Claudia Borg, Lonneke van der Plas, Amanda Muscat, Ian Padovani

    Abstract: Maltese, the national language of Malta, is spoken by approximately 500,000 people. Speech processing for Maltese is still in its early stages of development. In this paper, we present the first spoken Maltese corpus designed purposely for Automatic Speech Recognition (ASR). The MASRI-HEADSET corpus was developed by the MASRI project at the University of Malta. It consists of 8 hours of speech pai…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 8 pages, 2 figures, 4 tables, 1 appendix. Appears in Proceedings of the 12th edition of the Language Resources and Evaluation Conference (LREC'20)

  8. arXiv:1908.10725  [pdf, other]

    cs.IR cs.CY

    VJAGG -- A Thick-Client Smart-Phone Journey Detection Algorithm

    Authors: Michael P. J. Camilleri, Adrian Muscat, Victor Buttigieg, Maria Attard

    Abstract: In this paper we describe $Vja\dot{g}\dot{g}$, a battery-aware journey detection algorithm that executes on the mobile device. The algorithm can be embedded in the client app of the transport service provider or in a general purpose mobility data collector. The thick client setup allows the customer/participant to select which journeys are transferred to the server, keeping customers in control of…

    Submitted 1 April, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

  9. arXiv:1903.11029  [pdf, other]

    cs.CV

    Optimising the Input Image to Improve Visual Relationship Detection

    Authors: Noel Mizzi, Adrian Muscat

    Abstract: Visual Relationship Detection is defined as, given an image composed of a subject and an object, the correct relation is predicted. To improve the visual part of this difficult problem, ten preprocessing methods were tested to determine whether the widely used Union method yields the optimal results. Therefore, focusing solely on predicate prediction, no object detection and linguistic knowledge w…

    Submitted 26 March, 2019; originally announced March 2019.

  10. Pre-gen metrics: Predicting caption quality metrics without generating captions

    Authors: Marc Tanti, Albert Gatt, Adrian Muscat

    Abstract: Image caption generation systems are typically evaluated against reference outputs. We show that it is possible to predict output quality without generating the captions, based on the probability assigned by the neural model to the reference captions. Such pre-gen metrics are strongly correlated to standard evaluation metrics.

    Submitted 12 October, 2018; originally announced October 2018.

    Comments: 13 pages, 6 figures. This publication will appear in the Proceedings of the First Workshop on Shortcomings in Vision and Language (2018). DOI to be inserted later

  11. arXiv:1803.03827  [pdf, other]

    cs.CL cs.AI cs.CV

    Face2Text: Collecting an Annotated Image Description Corpus for the Generation of Rich Face Descriptions

    Authors: Albert Gatt, Marc Tanti, Adrian Muscat, Patrizia Paggio, Reuben A. Farrugia, Claudia Borg, Kenneth P. Camilleri, Mike Rosner, Lonneke van der Plas

    Abstract: The past few years have witnessed renewed interest in NLP tasks at the interface between vision and language. One intensively-studied problem is that of automatically generating text from images. In this paper, we extend this problem to the more specific domain of face description. Unlike scene descriptions, face descriptions are more fine-grained and rely on attributes extracted from the image, r…

    Submitted 5 March, 2021; v1 submitted 10 March, 2018; originally announced March 2018.

    Comments: Proceedings of the 11th edition of the Language Resources and Evaluation Conference (LREC'18)

  12. arXiv:1601.03896  [pdf, ps, other]

    cs.CL cs.CV

    Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

    Authors: Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat, Barbara Plank

    Abstract: Automatic description generation from natural images is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities. In this survey, we classify the existing approaches based on how they conceptualize this problem, viz., models that cast description as either generation problem or as a retrieval problem over a vis…

    Submitted 24 April, 2017; v1 submitted 15 January, 2016; originally announced January 2016.

    Comments: Journal of Artificial Intelligence Research 55, 409-442, 2016