
Showing 1–6 of 6 results for author: Assem, H

  1. arXiv:2405.19967  [pdf, other]

    cs.CL cs.AI cs.LG

    Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification

    Authors: Hossam M. Zawbaa, Wael Rashwan, Sourav Dutta, Haytham Assem

    Abstract: Detecting out-of-scope user utterances is essential for task-oriented dialogues and intent classification. Current methodologies face difficulties with the unpredictable distribution of outliers and often rely on assumptions about data distributions. We present the Dual Encoder for Threshold-Based Re-Classification (DETER) to address these challenges. This end-to-end framework efficiently detects…

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.02750  [pdf, other]

    cs.CL cs.AI

    Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

    Authors: Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

    Abstract: Large language models (LLMs) tend to inadequately integrate input context during text generation, relying excessively on encoded prior knowledge in model parameters, potentially resulting in generated text with factual inconsistencies or contextually unfaithful content. LLMs utilize two primary knowledge sources: 1) prior (parametric) knowledge from pretraining, and 2) contextual (non-parametric)…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024

  3. arXiv:2204.01385  [pdf, other]

    cs.CL cs.LG

    Aligned Weight Regularizers for Pruning Pretrained Neural Networks

    Authors: James O' Neill, Sourav Dutta, Haytham Assem

    Abstract: While various avenues of research have been explored for iterative pruning, little is known about what effect pruning has on zero-shot test performance and its potential implications for the choice of pruning criteria. This pruning setup is particularly important for cross-lingual models that implicitly learn alignment between language representations during pretraining, which if distorted via pruning, n…

    Submitted 5 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to ACL Findings 2022

  4. arXiv:2109.15014  [pdf, other]

    cs.LG cs.CL

    Deep Neural Compression Via Concurrent Pruning and Self-Distillation

    Authors: James O' Neill, Sourav Dutta, Haytham Assem

    Abstract: Pruning aims to reduce the number of parameters while maintaining performance close to the original network. This work proposes a novel self-distillation based pruning strategy, whereby the representational similarity between the pruned and unpruned versions of the same network is maximized. Unlike previous approaches that treat distillation and pruning separately, we use distillation to in…

    Submitted 30 September, 2021; originally announced September 2021.

  5. arXiv:2108.10019  [pdf, other]

    cs.IR

    Sequence-to-Sequence Learning on Keywords for Efficient FAQ Retrieval

    Authors: Sourav Dutta, Haytham Assem, Edward Burgin

    Abstract: Frequently-Asked-Question (FAQ) retrieval provides an effective procedure for responding to users' natural-language queries. Such platforms are becoming common in enterprise chatbots, product question answering, and preliminary technical support for customers. However, the challenge in such scenarios lies in bridging the lexical and semantic gap between varied query formulations and the corr…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 6 pages

    Journal ref: Published at the IJCAI 2021 Workshop on Applied Semantics Extraction and Analytics (ASEA)

  6. arXiv:2011.13200  [pdf, other]

    cs.CL cs.AI

    Unsupervised Word Translation Pairing using Refinement based Point Set Registration

    Authors: Silviu Oprea, Sourav Dutta, Haytham Assem

    Abstract: Cross-lingual alignment of word embeddings plays an important role in knowledge transfer across languages, for improving machine translation and other multi-lingual applications. Current unsupervised approaches rely on similarities in the geometric structure of word embedding spaces across languages to learn structure-preserving linear transformations using adversarial networks and refinement strategi…

    Submitted 26 November, 2020; originally announced November 2020.