Skip to main content

Showing 1–15 of 15 results for author: Khan, A S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.10879  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Multi-Stage Speaker Diarization for Noisy Classrooms

    Authors: Ali Sartaz Khan, Tolulope Ogunremi, Ahmed Adel Attia, Dorottya Demszky

    Abstract: Speaker diarization, the process of identifying "who spoke when" in audio recordings, is essential for understanding classroom dynamics. However, classroom settings present distinct challenges, including poor recording quality, high levels of background noise, overlapping speech, and the difficulty of accurately capturing children's voices. This study investigates the effectiveness of multi-stage… ▽ More

    Submitted 27 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2503.16565  [pdf, other

    cs.LG cs.AI cs.CL q-bio.GN

    Gene42: Long-Range Genomic Foundation Model With Dense Attention

    Authors: Kirill Vishniakov, Boulbaba Ben Amor, Engin Tekin, Nancy A. ElNaker, Karthik Viswanathan, Aleksandr Medvedev, Aahan Singh, Maryam Nadeem, Mohammad Amaan Sayeed, Praveenkumar Kanithi, Tiago Magalhaes, Natalia Vassilieva, Dwarikanath Mahapatra, Marco Pimentel, and Shadab Khan

    Abstract: We introduce Gene42, a novel family of Genomic Foundation Models (GFMs) designed to manage context lengths of up to 192,000 base pairs (bp) at a single-nucleotide resolution. Gene42 models utilize a decoder-only (LLaMA-style) architecture with a dense self-attention mechanism. Initially trained on fixed-length sequences of 4,096 bp, our models underwent continuous pretraining to extend the context… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  3. arXiv:2407.20003  [pdf, other

    cs.LG stat.ML

    On the Effects of Irrelevant Variables in Treatment Effect Estimation with Deep Disentanglement

    Authors: Ahmad Saeed Khan, Erik Schaffernicht, Johannes Andreas Stork

    Abstract: Estimating treatment effects from observational data is paramount in healthcare, education, and economics, but current deep disentanglement-based methods to address selection bias are insufficiently handling irrelevant variables. We demonstrate in experiments that this leads to prediction errors. We disentangle pre-treatment variables with a deep embedding method and explicitly identify and repres… ▽ More

    Submitted 26 August, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

    Comments: Paper is accepted at ECAI-2024

  4. Offshore Software Maintenance Outsourcing Predicting Clients Proposal using Supervised Learning

    Authors: Atif Ikram, Masita Abdul Jalil, Amir Bin Ngah, Ahmad Salman Khan, Tahir Iqbal

    Abstract: In software engineering, software maintenance is the process of correction, updating, and improvement of software products after handed over to the customer. Through offshore software maintenance outsourcing clients can get advantages like reduce cost, save time, and improve quality. In most cases, the OSMO vendor generates considerable revenue. However, the selection of an appropriate proposal am… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: 10 pages, 2 figures

    Journal ref: International Journal of Advanced Trends in Computer Science and Engineering, 2021

  5. arXiv:2101.10658  [pdf

    cs.SE

    Software Effort Estimation Accuracy Prediction of Machine Learning Techniques: A Systematic Performance Evaluation

    Authors: Yasir Mahmood, Nazri Kama, Azri Azmi, Ahmad Salman Khan, Mazlan Ali

    Abstract: Software effort estimation accuracy is a key factor in effective planning, controlling and to deliver a successful software project within budget and schedule. The overestimation and underestimation both are the key challenges for future software development, henceforth there is a continuous need for accuracy in software effort estimation (SEE). The researchers and practitioners are striving to id… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: Pages: 27 Figures: 15 Tables: 8

  6. arXiv:1906.02728  [pdf, other

    cs.CV

    Feature-level and Model-level Audiovisual Fusion for Emotion Recognition in the Wild

    Authors: Jie Cai, Zibo Meng, Ahmed Shehab Khan, Zhiyuan Li, James O'Reilly, Shizhong Han, Ping Liu, Min Chen, Yan Tong

    Abstract: Emotion recognition plays an important role in human-computer interaction (HCI) and has been extensively studied for decades. Although tremendous improvements have been achieved for posed expressions, recognizing human emotions in "close-to-real-world" environments remains a challenge. In this paper, we proposed two strategies to fuse information extracted from different modalities, i.e., audio an… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  7. arXiv:1903.08051  [pdf, other

    cs.CV

    Identity-Free Facial Expression Recognition using conditional Generative Adversarial Network

    Authors: Jie Cai, Zibo Meng, Ahmed Shehab Khan, Zhiyuan Li, James O'Reilly, Shizhong Han, Yan Tong

    Abstract: A novel Identity-Free conditional Generative Adversarial Network (IF-GAN) was proposed for Facial Expression Recognition (FER) to explicitly reduce high inter-subject variations caused by identity-related facial attributes, e.g., age, race, and gender. As part of an end-to-end system, a cGAN was designed to transform a given input facial expression image to an "average" identity face with the same… ▽ More

    Submitted 20 May, 2021; v1 submitted 19 March, 2019; originally announced March 2019.

  8. arXiv:1812.07067  [pdf, other

    cs.CV

    Probabilistic Attribute Tree in Convolutional Neural Networks for Facial Expression Recognition

    Authors: Jie Cai, Zibo Meng, Ahmed Shehab Khan, Zhiyuan Li, James O'Reilly, Yan Tong

    Abstract: In this paper, we proposed a novel Probabilistic Attribute Tree-CNN (PAT-CNN) to explicitly deal with the large intra-class variations caused by identity-related attributes, e.g., age, race, and gender. Specifically, a novel PAT module with an associated PAT loss was proposed to learn features in a hierarchical tree structure organized according to attributes, where the final features are less aff… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 10 pages

  9. arXiv:1710.03144  [pdf, other

    cs.CV

    Island Loss for Learning Discriminative Features in Facial Expression Recognition

    Authors: Jie Cai, Zibo Meng, Ahmed Shehab Khan, Zhiyuan Li, James O'Reilly, Yan Tong

    Abstract: Over the past few years, Convolutional Neural Networks (CNNs) have shown promise on facial expression recognition. However, the performance degrades dramatically under real-world settings due to variations introduced by subtle facial appearance changes, head pose variations, illumination changes, and occlusions. In this paper, a novel island loss is proposed to enhance the discriminative power o… ▽ More

    Submitted 23 October, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 8 pages, 3 figures

  10. arXiv:1707.05395  [pdf, other

    cs.CV

    Incremental Boosting Convolutional Neural Network for Facial Action Unit Recognition

    Authors: Shizhong Han, Zibo Meng, Ahmed Shehab Khan, Yan Tong

    Abstract: Recognizing facial action units (AUs) from spontaneous facial expressions is still a challenging problem. Most recently, CNNs have shown promise on facial AU recognition. However, the learned CNNs are often overfitted and do not generalize well to unseen subjects due to limited AU-coded training images. We proposed a novel Incremental Boosting CNN (IB-CNN) to integrate boosting into the CNN via an… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: NIPS2016

  11. arXiv:1707.00860  [pdf, other

    cs.LG cs.AI cs.CV

    Conditional generation of multi-modal data using constrained embedding space mapping

    Authors: Subhajit Chaudhury, Sakyasingha Dasgupta, Asim Munawar, Md. A. Salam Khan, Ryuki Tachibana

    Abstract: We present a conditional generative model that maps low-dimensional embeddings of multiple modalities of data to a common latent space hence extracting semantic relationships between them. The embedding specific to a modality is first extracted and subsequently a constrained optimization procedure is performed to project the two embedding spaces to a common manifold. The individual embeddings are… ▽ More

    Submitted 25 July, 2017; v1 submitted 4 July, 2017; originally announced July 2017.

    Comments: 7 pages, 4 figures, ICML 2017 Workshop on Implicit Models

  12. Non-Orthogonal Multiple Access combined with Random Linear Network Coded Cooperation

    Authors: Amjad Saeed Khan, Ioannis Chatzigeorgiou

    Abstract: This letter considers two groups of source nodes. Each group transmits packets to its own designated destination node over single-hop links and via a cluster of relay nodes shared by both groups. In an effort to boost reliability without sacrificing throughput, a scheme is proposed, whereby packets at the relay nodes are combined using two methods; packets delivered by different groups are mixed u… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

  13. Improved bounds on the decoding failure probability of network coding over multi-source multi-relay networks

    Authors: Amjad Saeed Khan, Ioannis Chatzigeorgiou

    Abstract: This paper considers a multi-source multi-relay network, in which relay nodes employ a coding scheme based on random linear network coding on source packets and generate coded packets. If a destination node collects enough coded packets, it can recover the packets of all source nodes. The links between source-to-relay nodes and relay-to-destination nodes are modeled as packet erasure channels. Imp… ▽ More

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: 4 pages, 5 figures, accepted for publication in IEEE Communications Letters

  14. arXiv:1508.03664  [pdf, ps, other

    cs.CR cs.IT cs.NI cs.PF

    Rethinking the Intercept Probability of Random Linear Network Coding

    Authors: Amjad Saeed Khan, Andrea Tassi, Ioannis Chatzigeorgiou

    Abstract: This letter considers a network comprising a transmitter, which employs random linear network coding to encode a message, a legitimate receiver, which can recover the message if it gathers a sufficient number of linearly independent coded packets, and an eavesdropper. Closed-form expressions for the probability of the eavesdropper intercepting enough coded packets to recover the message are derive… ▽ More

    Submitted 14 August, 2015; originally announced August 2015.

    Comments: IEEE Communications Letters, to appear

  15. arXiv:1503.05696  [pdf, ps, other

    cs.IT cs.NI cs.PF

    Performance Analysis of Random Linear Network Coding in Two-Source Single-Relay Networks

    Authors: Amjad Saeed Khan, Ioannis Chatzigeorgiou

    Abstract: This paper considers the multiple-access relay channel in a setting where two source nodes transmit packets to a destination node, both directly and via a relay node, over packet erasure channels. Intra-session network coding is used at the source nodes and inter-session network coding is employed at the relay node to combine the recovered source packets of both source nodes. In this work, we inve… ▽ More

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: Proc. ICC 2015, Workshop on Cooperative and Cognitive Mobile Networks (CoCoNet), to appear