Skip to main content

Showing 1–23 of 23 results for author: Yousefi, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.00859  [pdf, ps, other

    cs.CL

    How Bidirectionality Helps Language Models Learn Better via Dynamic Bottleneck Estimation

    Authors: Md Kowsher, Nusrat Jahan Prottasha, Shiyun Xu, Shetu Mohanto, Chen Chen, Ozlem Garibay, Niloofar Yousefi

    Abstract: Bidirectional language models have better context understanding and perform better than unidirectional models on natural language understanding tasks, yet the theoretical reasons behind this advantage remain unclear. In this work, we investigate this disparity through the lens of the Information Bottleneck (IB) principle, which formalizes a trade-off between compressing input information and prese… ▽ More

    Submitted 2 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

  2. arXiv:2502.17817  [pdf, other

    cs.CL

    Predicting Through Generation: Why Generation Is Better for Prediction

    Authors: Md Kowsher, Nusrat Jahan Prottasha, Prakash Bhat, Chun-Nam Yu, Mojtaba Soltanalian, Ivan Garibay, Ozlem Garibay, Chen Chen, Niloofar Yousefi

    Abstract: This paper argues that generating output tokens is more effective than using pooled representations for prediction tasks because token-level generation retains more mutual information. Since LLMs are trained on massive text corpora using next-token prediction, generation aligns naturally with their learned behavior. Using the Data Processing Inequality (DPI), we provide both theoretical and empiri… ▽ More

    Submitted 26 May, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: ACL Accepted paper

  3. arXiv:2502.05729  [pdf, other

    cs.CL

    BnTTS: Few-Shot Speaker Adaptation in Low-Resource Setting

    Authors: Mohammad Jahid Ibna Basher, Md Kowsher, Md Saiful Islam, Rabindra Nath Nandi, Nusrat Jahan Prottasha, Mehadi Hasan Menon, Tareq Al Muntasir, Shammur Absar Chowdhury, Firoj Alam, Niloofar Yousefi, Ozlem Ozmen Garibay

    Abstract: This paper introduces BnTTS (Bangla Text-To-Speech), the first framework for Bangla speaker adaptation-based TTS, designed to bridge the gap in Bangla speech synthesis using minimal training data. Building upon the XTTS architecture, our approach integrates Bangla into a multilingual TTS pipeline, with modifications to account for the phonetic and linguistic characteristics of the language. We pre… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: Accepted paper in NAACL 2025

  4. arXiv:2412.00359  [pdf, other

    cs.CL

    Does Self-Attention Need Separate Weights in Transformers?

    Authors: Md Kowsher, Nusrat Jahan Prottasha, Chun-Nam Yu, Ozlem Ozmen Garibay, Niloofar Yousefi

    Abstract: The success of self-attention lies in its ability to capture long-range dependencies and enhance context understanding, but it is limited by its computational complexity and challenges in handling sequential data with inherent directionality. This work introduces a shared weight self-attention-based BERT model that only learns one weight matrix for (Key, Value, and Query) representations instead o… ▽ More

    Submitted 2 May, 2025; v1 submitted 29 November, 2024; originally announced December 2024.

    Comments: Preprint paper

  5. arXiv:2410.11674  [pdf, ps, other

    cs.LG cs.CL

    LLM-Mixer: Multiscale Mixing in LLMs for Time Series Forecasting

    Authors: Md Kowsher, Md. Shohanur Islam Sobuj, Nusrat Jahan Prottasha, E. Alejandro Alanis, Ozlem Ozmen Garibay, Niloofar Yousefi

    Abstract: Time series forecasting remains a challenging task, particularly in the context of complex multiscale temporal patterns. This study presents LLM-Mixer, a framework that improves forecasting accuracy through the combination of multiscale time-series decomposition with pre-trained LLMs (Large Language Models). LLM-Mixer captures both short-term fluctuations and long-term trends by decomposing the da… ▽ More

    Submitted 1 June, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: Time series forecasting using LLMs

  6. arXiv:2410.10075  [pdf, ps, other

    cs.CL

    RoCoFT: Efficient Finetuning of Large Language Models with Row-Column Updates

    Authors: Md Kowsher, Tara Esmaeilbeig, Chun-Nam Yu, Chen Chen, Mojtaba Soltanalian, Niloofar Yousefi

    Abstract: We propose RoCoFT, a parameter-efficient fine-tuning method for large-scale language models (LMs) based on updating only a few rows and columns of the weight matrices in transformers. Through extensive experiments with medium-size LMs like BERT and RoBERTa, and larger LMs like Bloom-7B, Llama2-7B, and Llama2-13B, we show that our method gives comparable or better accuracies than state-of-art PEFT… ▽ More

    Submitted 1 June, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: RoCoFT is a parameter-efficient method

  7. arXiv:2410.08598  [pdf, other

    cs.CL

    Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning

    Authors: Nusrat Jahan Prottasha, Asif Mahmud, Md. Shohanur Islam Sobuj, Prakash Bhat, Md Kowsher, Niloofar Yousefi, Ozlem Ozmen Garibay

    Abstract: Large Language Models (LLMs) are gaining significant popularity in recent years for specialized tasks using prompts due to their low computational cost. Standard methods like prefix tuning utilize special, modifiable tokens that lack semantic meaning and require extensive training for best performance, often falling short. In this context, we propose a novel method called Semantic Knowledge Tuning… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted in Nature Scientific Reports

  8. arXiv:2409.13688  [pdf, other

    cs.CV cs.AI stat.AP stat.ME

    Morphological Detection and Classification of Microplastics and Nanoplastics Emerged from Consumer Products by Deep Learning

    Authors: Hadi Rezvani, Navid Zarrabi, Ishaan Mehta, Christopher Kolios, Hussein Ali Jaafar, Cheng-Hao Kao, Sajad Saeedi, Nariman Yousefi

    Abstract: Plastic pollution presents an escalating global issue, impacting health and environmental systems, with micro- and nanoplastics found across mediums from potable water to air. Traditional methods for studying these contaminants are labor-intensive and time-consuming, necessitating a shift towards more efficient technologies. In response, this paper introduces micro- and nanoplastics (MiNa), a nove… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  9. arXiv:2402.18702  [pdf

    cs.MM

    Characterizing Multimedia Information Environment through Multi-modal Clustering of YouTube Videos

    Authors: Niloofar Yousefi, Mainuddin Shaik, Nitin Agarwal

    Abstract: This study aims to investigate the comprehensive characterization of information content in multimedia (videos), particularly on YouTube. The research presents a multi-method framework for characterizing multimedia content by clustering signals from various modalities, such as audio, video, and text. With a focus on South China Sea videos as a case study, this approach aims to enhance our understa… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 14 pages, In the 4th International Conference on SMART MULTIMEDIA, 2024

  10. arXiv:2311.11892  [pdf

    cs.MM

    Multimodal Characterization of Emotion within Multimedia Space

    Authors: Dayo Samuel Banjo, Connice Trimmingham, Niloofar Yousefi, Nitin Agarwal

    Abstract: Technological advancement and its omnipresent connection have pushed humans past the boundaries and limitations of a computer screen, physical state, or geographical location. It has provided a depth of avenues that facilitate human-computer interaction that was once inconceivable such as audio and body language detection. Given the complex modularities of emotions, it becomes vital to study human… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 8 pages, Published in International Conference on Computers and Computation (COMPUTE 2022), November 03-04, 2022, San Francisco, United States

  11. arXiv:2311.02326  [pdf, other

    cs.LG cs.AI

    FragXsiteDTI: Revealing Responsible Segments in Drug-Target Interaction with Transformer-Driven Interpretation

    Authors: Ali Khodabandeh Yalabadi, Mehdi Yazdani-Jahromi, Niloofar Yousefi, Aida Tayebi, Sina Abdidizaji, Ozlem Ozmen Garibay

    Abstract: Drug-Target Interaction (DTI) prediction is vital for drug discovery, yet challenges persist in achieving model interpretability and optimizing performance. We propose a novel transformer-based model, FragXsiteDTI, that aims to address these challenges in DTI prediction. Notably, FragXsiteDTI is the first DTI model to simultaneously leverage drug molecule fragments and protein pockets. Our informa… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: Accepted at the NeurIPS workshop (AI4D3) - 2023

  12. arXiv:2302.14270  [pdf

    cs.SI

    Comparing Toxicity Across Social Media Platforms for COVID-19 Discourse

    Authors: Nahiyan Bin Noor, Niloofar Yousefi, Billy Spann, Nitin Agarwal

    Abstract: The emergence of toxic information on social networking sites, such as Twitter, Parler, and Reddit, has become a growing concern. Consequently, this study aims to assess the level of toxicity in COVID-19 discussions on Twitter, Parler, and Reddit. Using data analysis from January 1 through December 31, 2020, we examine the development of toxicity over time and compare the findings across the three… ▽ More

    Submitted 26 April, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Journal ref: IARIA. (2023) 21-26

  13. arXiv:2012.15330  [pdf, other

    q-fin.RM cs.LG

    Sequential Deep Learning for Credit Risk Monitoring with Tabular Financial Data

    Authors: Jillian M. Clements, Di Xu, Nooshin Yousefi, Dmitry Efimov

    Abstract: Machine learning plays an essential role in preventing financial losses in the banking industry. Perhaps the most pertinent prediction task that can result in billions of dollars in losses each year is the assessment of credit risk (i.e., the risk of default on debt). Today, much of the gains from machine learning to predict credit risk are driven by gradient boosted decision tree models. However,… ▽ More

    Submitted 30 December, 2020; originally announced December 2020.

    ACM Class: I.2.1

  14. arXiv:2004.06793  [pdf, other

    cs.SI cs.CL cs.IR

    Probabilistic Model of Narratives Over Topical Trends in Social Media: A Discrete Time Model

    Authors: Toktam A. Oghaz, Ece C. Mutlu, Jasser Jasser, Niloofar Yousefi, Ivan Garibay

    Abstract: Online social media platforms are turning into the prime source of news and narratives about worldwide events. However,a systematic summarization-based narrative extraction that can facilitate communicating the main underlying events is lacking. To address this issue, we propose a novel event-based narrative summary extraction framework. Our proposed framework is designed as a probabilistic topic… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 9 pages, 4 figures

  15. arXiv:2003.11611  [pdf, other

    cs.SI physics.soc-ph

    Deep Agent: Studying the Dynamics of Information Spread and Evolution in Social Networks

    Authors: Ivan Garibay, Toktam A. Oghaz, Niloofar Yousefi, Ece C. Mutlu, Madeline Schiappa, Steven Scheinert, Georgios C. Anagnostopoulos, Christina Bouwens, Stephen M. Fiore, Alexander Mantzaris, John T. Murphy, William Rand, Anastasia Salter, Mel Stanfill, Gita Sukthankar, Nisha Baral, Gabriel Fair, Chathika Gunaratne, Neda B. Hajiakhoond, Jasser Jasser, Chathura Jayalath, Olivia Newton, Samaneh Saadat, Chathurani Senevirathna, Rachel Winter , et al. (1 additional authors not shown)

    Abstract: This paper explains the design of a social network analysis framework, developed under DARPA's SocialSim program, with novel architecture that models human emotional, cognitive and social factors. Our framework is both theory and data-driven, and utilizes domain expertise. Our simulation effort helps in understanding how information flows and evolves in social media platforms. We focused on modeli… ▽ More

    Submitted 29 May, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: 16 pages

  16. arXiv:2003.08759  [pdf, other

    cs.CV

    Facial Expression Phoenix (FePh): An Annotated Sequenced Dataset for Facial and Emotion-Specified Expressions in Sign Language

    Authors: Marie Alaghband, Niloofar Yousefi, Ivan Garibay

    Abstract: Facial expressions are important parts of both gesture and sign language recognition systems. Despite the recent advances in both fields, annotated facial expression datasets in the context of sign language are still scarce resources. In this manuscript, we introduce an annotated sequenced facial expression dataset in the context of sign language, comprising over $3000$ facial images extracted fro… ▽ More

    Submitted 8 September, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  17. arXiv:2003.02820  [pdf

    cs.DC cs.NI

    Workload Scheduling on heterogeneous Mobile Edge Cloud in 5G networks to Minimize SLA Violation

    Authors: Mostafa Hadadian Nejad Yousefi, Amirmasoud Ghiassi, Boshra Sadat Hashemi, Maziar Goudarzi

    Abstract: Smart devices have become an indispensable part of our lives and gain increasing applicability in almost every area. Latency-aware applications such as Augmented Reality (AR), autonomous driving, and online gaming demand more resources such as network bandwidth and computational capabilities. Since the traditional mobile networks cannot fulfill the required bandwidth and latency, Mobile Edge Cloud… ▽ More

    Submitted 21 March, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 12 pages, 8 figures, 4 tables contact: hadadian AT ce DOT sharif DOT edu

  18. arXiv:2001.03507  [pdf

    cs.LG eess.SP stat.ML

    A storage expansion planning framework using reinforcement learning and simulation-based optimization

    Authors: S. Tsianikas, N. Yousefi, J. Zhou, M. Rodgers, D. W. Coit

    Abstract: In the wake of the highly electrified future ahead of us, the role of energy storage is crucial wherever distributed generation is abundant, such as in microgrid settings. Given the variety of storage options that are becoming more and more economical, determining which type of storage technology to invest in, along with the appropriate timing and capacity becomes a critical research question. It… ▽ More

    Submitted 24 March, 2021; v1 submitted 10 January, 2020; originally announced January 2020.

    Journal ref: Applied Energy; Volume 290; 2021; Pages 116778;

  19. arXiv:1912.02629  [pdf, ps, other

    cs.LG cs.CR cs.CY

    A Comprehensive Survey on Machine Learning Techniques and User Authentication Approaches for Credit Card Fraud Detection

    Authors: Niloofar Yousefi, Marie Alaghband, Ivan Garibay

    Abstract: With the increase of credit card usage, the volume of credit card misuse also has significantly increased. As a result, financial organizations are working hard on developing and deploying credit card fraud detection methods, in order to adapt to ever-evolving, increasingly sophisticated defrauding strategies and identifying illicit transactions as quickly as possible to protect themselves and the… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

  20. arXiv:1910.07999  [pdf

    cs.SI cs.AI cs.LG cs.MA

    DeepFork: Supervised Prediction of Information Diffusion in GitHub

    Authors: Ramya Akula, Niloofar Yousefi, Ivan Garibay

    Abstract: Information spreads on complex social networks extremely fast, in other words, a piece of information can go viral within no time. Often it is hard to barricade this diffusion prior to the significant occurrence of chaos, be it a social media or an online coding platform. GitHub is one such trending online focal point for any business to reach their potential contributors and customers, simultaneo… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: 12 Pages, 7 Figures, 2 Tables

  21. arXiv:1707.03426  [pdf, ps, other

    cs.LG stat.ML

    Multi-Task Learning Using Neighborhood Kernels

    Authors: Niloofar Yousefi, Cong Li, Mansooreh Mollaghasemi, Georgios Anagnostopoulos, Michael Georgiopoulos

    Abstract: This paper introduces a new and effective algorithm for learning kernels in a Multi-Task Learning (MTL) setting. Although, we consider a MTL scenario here, our approach can be easily applied to standard single task learning, as well. As shown by our empirical results, our algorithm consistently outperforms the traditional kernel learning algorithms such as uniform combination solution, convex comb… ▽ More

    Submitted 11 July, 2017; originally announced July 2017.

  22. arXiv:1602.05916  [pdf, ps, other

    cs.LG

    Local Rademacher Complexity-based Learning Guarantees for Multi-Task Learning

    Authors: Niloofar Yousefi, Yunwen Lei, Marius Kloft, Mansooreh Mollaghasemi, Georgios Anagnostopoulos

    Abstract: We show a Talagrand-type concentration inequality for Multi-Task Learning (MTL), using which we establish sharp excess risk bounds for MTL in terms of distribution- and data-dependent versions of the Local Rademacher Complexity (LRC). We also give a new bound on the LRC for norm regularized as well as strongly convex hypothesis classes, which applies not only to MTL but also to the standard i.i.d.… ▽ More

    Submitted 9 February, 2017; v1 submitted 18 February, 2016; originally announced February 2016.

    Comments: In this version, some arguments and results (of the previous version) have been corrected, or modified

  23. arXiv:1508.03329  [pdf, ps, other

    cs.LG

    Multi-Task Learning with Group-Specific Feature Space Sharing

    Authors: Niloofar Yousefi, Michael Georgiopoulos, Georgios C. Anagnostopoulos

    Abstract: When faced with learning a set of inter-related tasks from a limited amount of usable data, learning each task independently may lead to poor generalization performance. Multi-Task Learning (MTL) exploits the latent relations between tasks and overcomes data scarcity limitations by co-learning all these tasks simultaneously to offer improved performance. We propose a novel Multi-Task Multiple Kern… ▽ More

    Submitted 13 August, 2015; originally announced August 2015.