Skip to main content

Showing 1–26 of 26 results for author: Amjad, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.13504  [pdf, other

    cs.IR cs.AI cs.MA

    An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents

    Authors: Ayesha Amjad, Saurav Sthapit, Tahir Qasim Syed

    Abstract: Extracting alphanumeric data from form-like documents such as invoices, purchase orders, bills, and financial documents is often performed via vision (OCR) and learning algorithms or monolithic pipelines with limited potential for systemic improvements. We propose an agentic AI system that leverages Large Language Model (LLM) agents and a reinforcement learning (RL) driver agent to automate consis… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2504.17974  [pdf, other

    cs.CL

    Optimism, Expectation, or Sarcasm? Multi-Class Hope Speech Detection in Spanish and English

    Authors: Sabur Butt, Fazlourrahman Balouchzahi, Ahmad Imam Amjad, Maaz Amjad, Hector G. Ceballos, Salud Maria Jimenez-Zafra

    Abstract: Hope is a complex and underexplored emotional state that plays a significant role in education, mental health, and social interaction. Unlike basic emotions, hope manifests in nuanced forms ranging from grounded optimism to exaggerated wishfulness or sarcasm, making it difficult for Natural Language Processing systems to detect accurately. This study introduces PolyHope V2, a multilingual, fine-gr… ▽ More

    Submitted 5 May, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  3. arXiv:2504.04372  [pdf, other

    cs.SE cs.AI cs.LG

    How Accurately Do Large Language Models Understand Code?

    Authors: Sabaat Haroon, Ahmad Faraz Khan, Ahmad Humayun, Waris Gill, Abdul Haddi Amjad, Ali R. Butt, Mohammad Taha Khan, Muhammad Ali Gulzar

    Abstract: Large Language Models (LLMs) are increasingly used in post-development tasks such as code repair and testing. A key factor in these tasks' success is the model's deep understanding of code. However, the extent to which LLMs truly understand code remains largely unevaluated. Quantifying code comprehension is challenging due to its abstract nature and the lack of a standardized metric. Previously, t… ▽ More

    Submitted 9 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

    Comments: This paper is currently Under Review. It consists of 11 pages, 12 Figures, and 5 Tables

  4. arXiv:2502.08767  [pdf, other

    cs.CL cs.AI

    SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence

    Authors: Zhining Liu, Rana Ali Amjad, Ravinarayana Adkathimar, Tianxin Wei, Hanghang Tong

    Abstract: Providing Language Models (LMs) with relevant evidence in the context (either via retrieval or user-provided) can significantly improve their ability to provide better-grounded responses. However, recent studies have found that LMs often struggle to fully comprehend and utilize key evidence from the context, especially when it contains noise and irrelevant information, an issue common in real-worl… ▽ More

    Submitted 25 May, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

    Comments: Accepted by ACL 2025. 21 pages, 5 figures, 13 tables

  5. arXiv:2409.18590  [pdf, other

    cs.SE

    Accessibility Issues in Ad-Driven Web Applications

    Authors: Abdul Haddi Amjad, Muhammad Danish, Bless Jah, Muhammad Ali Gulzar

    Abstract: Website accessibility is essential for inclusiveness and regulatory compliance. Although third-party advertisements (ads) are a vital revenue source for free web services, they introduce significant accessibility challenges. Leasing a websiteś space to ad-serving technologies like DoubleClick results in developers losing control over ad content accessibility. Even on highly accessible websites, th… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

  6. arXiv:2405.18385  [pdf, other

    cs.CR

    Blocking Tracking JavaScript at the Function Granularity

    Authors: Abdul Haddi Amjad, Shaoor Munir, Zubair Shafiq, Muhammad Ali Gulzar

    Abstract: Modern websites extensively rely on JavaScript to implement both functionality and tracking. Existing privacy enhancing content blocking tools struggle against mixed scripts, which simultaneously implement both functionality and tracking, because blocking the script would break functionality and not blocking it would allow tracking. We propose Not.js, a fine grained JavaScript blocking tool that o… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.00988  [pdf, other

    cs.CL cs.LG

    Context-Aware Clustering using Large Language Models

    Authors: Sindhu Tipirneni, Ravinarayana Adkathimar, Nurendra Choudhary, Gaurush Hiranandani, Rana Ali Amjad, Vassilis N. Ioannidis, Changhe Yuan, Chandan K. Reddy

    Abstract: Despite the remarkable success of Large Language Models (LLMs) in text understanding and generation, their potential for text clustering tasks remains underexplored. We observed that powerful closed-source LLMs provide good quality clusterings of entity sets but are not scalable due to the massive compute power required and the associated costs. Thus, we propose CACTUS (Context-Aware ClusTering wi… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 16 pages

    ACM Class: I.2.7; I.2.m

  8. arXiv:2302.01182  [pdf, other

    cs.CR cs.SE

    Blocking JavaScript without Breaking the Web: An Empirical Investigation

    Authors: Abdul Haddi Amjad, Zubair Shafiq, Muhammad Ali Gulzar

    Abstract: Modern websites heavily rely on JavaScript (JS) to implement legitimate functionality as well as privacy-invasive advertising and tracking. Browser extensions such as NoScript block any script not loaded by a trusted list of endpoints, thus hoping to block privacy-invasive scripts while avoiding breaking legitimate website functionality. In this paper, we investigate whether blocking JS on the web… ▽ More

    Submitted 23 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: petsymposium 2023

  9. arXiv:2109.12561  [pdf, other

    eess.SP cs.IT cs.LG stat.ML

    Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

    Authors: Kumar Pratik, Rana Ali Amjad, Arash Behboodi, Joseph B. Soriaga, Max Welling

    Abstract: We propose Hypernetwork Kalman Filter (HKF) for tracking applications with multiple different dynamics. The HKF combines generalization power of Kalman filters with expressive power of neural networks. Instead of keeping a bank of Kalman filters and choosing one based on approximating the actual dynamics, HKF adapts itself to each dynamics based on the observed sequence. Through extensive experime… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: Accepted at IEEE Globecom 2021. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  10. arXiv:2108.13923  [pdf, other

    cs.NI

    TrackerSift: Untangling Mixed Tracking and Functional Web Resources

    Authors: Abdul Haddi Amjad, Danial Saleem, Fareed Zaffar, Muhammad Ali Gulzar, Zubair Shafiq

    Abstract: Trackers have recently started to mix tracking and functional resources to circumvent privacy-enhancing content blocking tools. Such mixed web resources put content blockers in a bind: risk breaking legitimate functionality if they act and risk missing privacy-invasive advertising and tracking if they do not. In this paper, we propose TrackerSift to progressively classify and untangle mixed web re… ▽ More

    Submitted 29 September, 2021; v1 submitted 28 August, 2021; originally announced August 2021.

  11. arXiv:2106.08295  [pdf, other

    cs.LG cs.AI cs.CV

    A White Paper on Neural Network Quantization

    Authors: Markus Nagel, Marios Fournarakis, Rana Ali Amjad, Yelysei Bondarenko, Mart van Baalen, Tijmen Blankevoort

    Abstract: While neural networks have advanced the frontiers in many applications, they often come at a high computational cost. Reducing the power and latency of neural network inference is key if we want to integrate modern networks into edge devices with strict power and compute requirements. Neural network quantization is one of the most effective ways of achieving these savings but the additional noise… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  12. arXiv:2010.10583  [pdf, ps, other

    cs.IT

    Invertible Low-Divergence Coding

    Authors: Patrick Schulte, Rana Ali Amjad, Thomas Wiegart, Gerhard Kramer

    Abstract: Several applications in communication, control, and learning require approximating target distributions to within small informational divergence (I-divergence). The additional requirement of invertibility usually leads to using encoders that are one-to-one mappings, also known as distribution matchers. However, even the best one-to-one encoders have I-divergences that grow logarithmically with the… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 13 pages, 6 figures

  13. arXiv:2005.07093  [pdf, other

    cs.LG cs.CV stat.ML

    Bayesian Bits: Unifying Quantization and Pruning

    Authors: Mart van Baalen, Christos Louizos, Markus Nagel, Rana Ali Amjad, Ying Wang, Tijmen Blankevoort, Max Welling

    Abstract: We introduce Bayesian Bits, a practical method for joint mixed precision quantization and pruning through gradient based optimization. Bayesian Bits employs a novel decomposition of the quantization operation, which sequentially considers doubling the bit width. At each new bit width, the residual error between the full precision value and the previously rounded value is quantized. We then decide… ▽ More

    Submitted 27 October, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

  14. arXiv:2004.10568  [pdf, other

    cs.LG cs.CV stat.ML

    Up or Down? Adaptive Rounding for Post-Training Quantization

    Authors: Markus Nagel, Rana Ali Amjad, Mart van Baalen, Christos Louizos, Tijmen Blankevoort

    Abstract: When quantizing neural networks, assigning each floating-point weight to its nearest fixed-point value is the predominant approach. We find that, perhaps surprisingly, this is not the best we can do. In this paper, we propose AdaRound, a better weight-rounding mechanism for post-training quantization that adapts to the data and the task loss. AdaRound is fast, does not require fine-tuning of the n… ▽ More

    Submitted 30 June, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Published as a conference paper at ICML 2020

  15. arXiv:1906.02576  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Class-Conditional Compression and Disentanglement: Bridging the Gap between Neural Networks and Naive Bayes Classifiers

    Authors: Rana Ali Amjad, Bernhard C. Geiger

    Abstract: In this draft, which reports on work in progress, we 1) adapt the information bottleneck functional by replacing the compression term by class-conditional compression, 2) relax this functional using a variational bound related to class-conditional disentanglement, 3) consider this functional as a training objective for stochastic neural networks, and 4) show that the latent representations are lea… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: draft; work in progress

  16. arXiv:1804.06679  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

    Authors: Rana Ali Amjad, Kairen Liu, Bernhard C. Geiger

    Abstract: In this work, we investigate the use of three information-theoretic quantities -- entropy, mutual information with the class variable, and a class selectivity measure based on Kullback-Leibler divergence -- to understand and study the behavior of already trained fully-connected feed-forward neural networks. We analyze the connection between these information-theoretic quantities and classification… ▽ More

    Submitted 9 June, 2021; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 12 pages; accepted for publication in IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: IEEE Trans. Neural Networks and Learning Systems 33(12):7842-7852

  17. arXiv:1803.04459  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.SI

    Extended Affinity Propagation: Global Discovery and Local Insights

    Authors: Rayyan Ahmad Khan, Rana Ali Amjad, Martin Kleinsteuber

    Abstract: We propose a new clustering algorithm, Extended Affinity Propagation, based on pairwise similarities. Extended Affinity Propagation is developed by modifying Affinity Propagation such that the desirable features of Affinity Propagation, e.g., exemplars, reasonable computational complexity and no need to specify number of clusters, are preserved while the shortcomings, e.g., the lack of global stru… ▽ More

    Submitted 15 April, 2019; v1 submitted 12 March, 2018; originally announced March 2018.

    Comments: Submitted to TKDE

  18. Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle

    Authors: Rana Ali Amjad, Bernhard C. Geiger

    Abstract: In this theory paper, we investigate training deep neural networks (DNNs) for classification via minimizing the information bottleneck (IB) functional. We show that the resulting optimization problem suffers from two severe issues: First, for deterministic DNNs, either the IB functional is infinite for almost all values of network parameters, making the optimization problem ill-posed, or it is pie… ▽ More

    Submitted 11 April, 2019; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: 16 pages, to appear in IEEE Trans. Pattern Analysis and Machine Intelligence

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 42(9):2225-2239, 2020. (c) IEEE

  19. arXiv:1802.05973  [pdf, other

    cs.IT

    Information Rates and Error Exponents for Probabilistic Amplitude Shaping

    Authors: Rana Ali Amjad

    Abstract: Probabilistic Amplitude Shaping (PAS) is a coded-modulation scheme in which the encoder is a concatenation of a distribution matcher with a systematic Forward Error Correction (FEC) code. For reduced computational complexity the decoder can be chosen as a concatenation of a mismatched FEC decoder and dematcher. This work studies the theoretic limits of PAS. The classical joint source-channel codin… ▽ More

    Submitted 4 June, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

    Comments: Shortened version submitted to Information Theory Workshop (ITW) 2018

  20. Co-Clustering via Information-Theoretic Markov Aggregation

    Authors: Clemens Bloechl, Rana Ali Amjad, Bernhard C. Geiger

    Abstract: We present an information-theoretic cost function for co-clustering, i.e., for simultaneous clustering of two sets based on similarities between their elements. By constructing a simple random walk on the corresponding bipartite graph, our cost function is derived from a recently proposed generalized framework for information-theoretic Markov chain aggregation. The goal of our cost function is to… ▽ More

    Submitted 15 June, 2018; v1 submitted 2 January, 2018; originally announced January 2018.

    Comments: accepted for publication in IEEE Trans. on Knowledge and Data Engineering; (c) 2018 IEEE

  21. arXiv:1709.05907  [pdf, ps, other

    eess.SY cs.IT math.OC

    A Generalized Framework for Kullback-Leibler Markov Aggregation

    Authors: Rana Ali Amjad, Clemens Blöchl, Bernhard C. Geiger

    Abstract: This paper proposes an information-theoretic cost function for aggregating a Markov chain via a (possibly stochastic) mapping. The cost function is motivated by two objectives: 1) The process obtained by observing the Markov chain through the mapping should be close to a Markov chain, and 2) the aggregated Markov chain should retain as much of the temporal dependence structure of the original Mark… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: 12 pages, 3 figures; submitted to a journal

  22. arXiv:1608.04872  [pdf, ps, other

    cs.IT cs.IR cs.LG

    Hard Clusters Maximize Mutual Information

    Authors: Bernhard C. Geiger, Rana Ali Amjad

    Abstract: In this paper, we investigate mutual information as a cost function for clustering, and show in which cases hard, i.e., deterministic, clusters are optimal. Using convexity properties of mutual information, we show that certain formulations of the information bottleneck problem are solved by hard clusters. Similarly, hard clusters are optimal for the information-theoretic co-clustering problem tha… ▽ More

    Submitted 17 August, 2016; originally announced August 2016.

  23. arXiv:1310.2882  [pdf, ps, other

    cs.IT

    Informational Divergence and Entropy Rate on Rooted Trees with Probabilities

    Authors: Georg Böcherer, Rana Ali Amjad

    Abstract: Rooted trees with probabilities are used to analyze properties of a variable length code. A bound is derived on the difference between the entropy rates of the code and a memoryless source. The bound is in terms of normalized informational divergence. The bound is used to derive converses for exact random number generation, resolution coding, and distribution matching.

    Submitted 10 October, 2013; originally announced October 2013.

    Comments: 5 pages. With proofs and illustrating example

  24. arXiv:1306.2550  [pdf, other

    cs.IT

    Fixed-to-Variable Length Resolution Coding for Target Distributions

    Authors: Georg Böcherer, Rana Ali Amjad

    Abstract: The number of random bits required to approximate a target distribution in terms of un-normalized informational divergence is considered. It is shown that for a variable-to-variable length encoder, this number is lower bounded by the entropy of the target distribution. A fixed-to-variable length encoder is constructed using M-type quantization and Tunstall coding. It is shown that the encoder achi… ▽ More

    Submitted 1 August, 2013; v1 submitted 11 June, 2013; originally announced June 2013.

    Comments: Essentially the ITW 2013 final version. Compared to v1, minor typos were corrected and Fig. 1 with an example variable length encoder was added

  25. arXiv:1302.1020  [pdf, other

    cs.IT

    Block-to-Block Distribution Matching

    Authors: Georg Böcherer, Rana Ali Amjad

    Abstract: In this work, binary block-to-block distribution matching is considered. m independent and uniformly distributed bits are mapped to n output bits resembling a target product distribution. A rate R is called achieved by a sequence of encoder-decoder pairs, if for m,n to infinity, (1) m/n approaches R, (2) the informational divergence per bit of the output distribution and the target distribution go… ▽ More

    Submitted 5 February, 2013; originally announced February 2013.

    Comments: 5 pages

  26. arXiv:1302.0019  [pdf, other

    cs.IT

    Fixed-to-Variable Length Distribution Matching

    Authors: Rana Ali Amjad, Georg Böcherer

    Abstract: Fixed-to-variable length (f2v) matchers are used to reversibly transform an input sequence of independent and uniformly distributed bits into an output sequence of bits that are (approximately) independent and distributed according to a target distribution. The degree of approximation is measured by the informational divergence between the output distribution and the target distribution. An algori… ▽ More

    Submitted 1 July, 2013; v1 submitted 31 January, 2013; originally announced February 2013.

    Comments: 5 pages, essentially the ISIT 2013 version