Showing 1–50 of 85 results for author: Zaki, M

  1. arXiv:2505.21937  [pdf, ps, other]

    cs.CL

    Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic Languages

    Authors: Pratik Rakesh Singh, Kritarth Prasad, Mohammadi Zaki, Pankaj Wasnik

    Abstract: Translating multi-word expressions (MWEs) and idioms requires a deep understanding of the cultural nuances of both the source and target languages. This challenge is further amplified by the one-to-many nature of idiomatic translations, where a single source idiom can have multiple target-language equivalents depending on cultural references and contextual variations. Traditional static knowledge… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Journal ref: ACL Findings 2025

  2. arXiv:2505.21777  [pdf, other]

    cs.LG cond-mat.dis-nn cs.CV q-bio.NC stat.ML

    Memorization to Generalization: Emergence of Diffusion Models from Associative Memory

    Authors: Bao Pham, Gabriel Raya, Matteo Negri, Mohammed J. Zaki, Luca Ambrogioni, Dmitry Krotov

    Abstract: Hopfield networks are associative memory (AM) systems, designed for storing and retrieving patterns as local minima of an energy landscape. In the classical Hopfield model, an interesting phenomenon occurs when the amount of training data reaches its critical memory load: spurious states, or unintended stable points, emerge at the end of the retrieval dynamics, leading to incorrect recall. I…

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2505.15069  [pdf, ps, other]

    cs.CL

    In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

    Authors: Pratik Rakesh Singh, Kritarth Prasad, Mohammadi Zaki, Pankaj Wasnik

    Abstract: Neural Machine Translation (NMT) systems face significant challenges when working with low-resource languages, particularly in domain adaptation tasks. These difficulties arise due to limited training data and suboptimal model generalization. As a result, selecting an optimal model for translation is crucial for achieving strong performance on in-domain data, particularly in scenarios where fine-t…

    Submitted 20 May, 2025; originally announced May 2025.

    Journal ref: AfricaNLP Workshop at ACL 2025

  4. arXiv:2505.14629  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.CV

    KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models

    Authors: Fnu Mohbat, Mohammed J Zaki

    Abstract: Recent advances in large language models (LLMs) and the abundance of food data have resulted in studies to improve food understanding using LLMs. Despite several recommendation systems utilizing LLMs and Knowledge Graphs (KGs), there has been limited research on integrating food-related KGs with LLMs. We introduce KERL, a unified system that leverages food KGs and LLMs to provide personalized food…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted at ACL 2025

  5. arXiv:2504.06036  [pdf, other]

    cs.CL

    Multi-Sense Embeddings for Language Models and Knowledge Distillation

    Authors: Qitong Wang, Mohammed J. Zaki, Georgios Kollias, Vasileios Kalantzis

    Abstract: Transformer-based large language models (LLMs) rely on contextual embeddings which generate different (continuous) representations for the same token depending on its surrounding context. Nonetheless, words and tokens typically have a limited number of senses (or meanings). We propose multi-sense embeddings as a drop-in replacement for each token in order to capture the range of their uses in a la… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 16 pages, 4 figures

  6. arXiv:2503.14801  [pdf, other]

    eess.SY

    Towards Connected Smart Work Zones: Advancing Work Zone Management through Improved Connectivity

    Authors: Mariam Nour, Mohamed H. Zaki, Mohamed Abdel-Aty

    Abstract: Work zones play a key role in road and highway maintenance but can lead to significant risks to both drivers and workers. Smart Work Zones (SWZs) have emerged as a potential solution, offering decision-makers real-time insights into the status of the work zone. By utilizing work zone barrels equipped with sensors and communication nodes, SWZs facilitate collecting and transmitting critical data, i… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  7. arXiv:2501.15219  [pdf, other]

    cs.CL

    Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction

    Authors: Kritarth Prasad, Mohammadi Zaki, Pratik Singh, Pankaj Wasnik

    Abstract: Ensembling neural machine translation (NMT) models to produce higher-quality translations than the $L$ individual models has been extensively studied. Recent methods typically employ a candidate selection block (CSB) and an encoder-decoder fusion block (FB), requiring inference across all candidate models, leading to significant computational overhead, generally $\Omega(L)$. This paper introdu…

    Submitted 25 January, 2025; originally announced January 2025.

  8. arXiv:2501.10385  [pdf]

    cs.CY cond-mat.mtrl-sci cs.AI physics.ins-det

    Autonomous Microscopy Experiments through Large Language Model Agents

    Authors: Indrajeet Mandal, Jitendra Soni, Mohd Zaki, Morten M. Smedskjaer, Katrin Wondraczek, Lothar Wondraczek, Nitya Nand Gosvami, N. M. Anoop Krishnan

    Abstract: The emergence of large language models (LLMs) has accelerated the development of self-driving laboratories (SDLs) for materials research. Despite their transformative potential, current SDL implementations rely on rigid, predefined protocols that limit their adaptability to dynamic experimental scenarios across different labs. A significant challenge persists in measuring how effectively AI agents… ▽ More

    Submitted 18 December, 2024; originally announced January 2025.

  9. arXiv:2412.20440  [pdf, other]

    cs.CL

    Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs

    Authors: Pratik Rakesh Singh, Mohammadi Zaki, Pankaj Wasnik

    Abstract: We address the challenging task of neural machine translation (NMT) in the entertainment domain, where the objective is to automatically translate a given dialogue from a source language content to a target language. This task has various applications, particularly in automatic dubbing, subtitling, and other content localization tasks, enabling source content to reach a wider audience. Traditional… ▽ More

    Submitted 29 December, 2024; originally announced December 2024.

    Comments: Accepted to AAAI'25

  10. arXiv:2412.09560  [pdf, other]

    cond-mat.mtrl-sci cs.CL cs.IR

    Foundational Large Language Models for Materials Research

    Authors: Vaibhav Mishra, Somaditya Singh, Dhruv Ahlawat, Mohd Zaki, Vaibhav Bihani, Hargun Singh Grover, Biswajit Mishra, Santiago Miret, Mausam, N. M. Anoop Krishnan

    Abstract: Materials discovery and development are critical for addressing global challenges. Yet, the exponential growth in materials science literature comprising vast amounts of textual data has created significant bottlenecks in knowledge extraction, synthesis, and scientific reasoning. Large Language Models (LLMs) offer unprecedented opportunities to accelerate materials research through automated analy… ▽ More

    Submitted 28 January, 2025; v1 submitted 12 December, 2024; originally announced December 2024.

  11. arXiv:2411.15221  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry

    Authors: Yoel Zimmermann, Adib Bazgir, Zartashia Afzal, Fariha Agbere, Qianxiang Ai, Nawaf Alampara, Alexander Al-Feghali, Mehrad Ansari, Dmytro Antypov, Amro Aswad, Jiaru Bai, Viktoriia Baibakova, Devi Dutta Biswajeet, Erik Bitzek, Joshua D. Bocarsly, Anna Borisova, Andres M Bran, L. Catherine Brinson, Marcel Moran Calderon, Alessandro Canalicchio, Victor Chen, Yuan Chiang, Defne Circi, Benjamin Charmes, Vikrant Chaudhary, et al. (119 additional authors not shown)

    Abstract: Here, we present the outcomes from the second Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry, which engaged participants across global hybrid locations, resulting in 34 team submissions. The submissions spanned seven key application areas and demonstrated the diverse utility of LLMs for applications in (1) molecular and material property prediction; (2) mo… ▽ More

    Submitted 2 January, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: Updating author information, the submission remains largely unchanged. 98 pages total

  12. arXiv:2411.05031  [pdf, other]

    cs.CL

    On-Device Emoji Classifier Trained with GPT-based Data Augmentation for a Mobile Keyboard

    Authors: Hossam Amer, Joe Osborne, Michael Zaki, Mohamed Afify

    Abstract: Emojis improve communication quality among smartphone users who use mobile keyboards to exchange text. To predict emojis for users based on input text, we should consider the on-device low memory and time constraints, ensure that the on-device emoji classifier covers a wide range of emoji classes even though the emoji dataset is typically imbalanced, and adapt the emoji classifier output to user…

    Submitted 13 February, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: 8 pages

  13. arXiv:2410.02024  [pdf, other]

    cs.CE cs.AI cs.CL cs.LG

    FLAG: Financial Long Document Classification via AMR-based GNN

    Authors: Bolun "Namir" Xia, Aparna Gupta, Mohammed J. Zaki

    Abstract: The advent of large language models (LLMs) has initiated much research into their various financial applications. However, in applying LLMs on long documents, semantic relations are not explicitly incorporated, and a full or arbitrarily sparse attention operation is employed. In recent years, progress has been made in Abstract Meaning Representation (AMR), which is a graph-based representation of… ▽ More

    Submitted 22 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: 8 pages, 3 figures, to be published in CIFEr Conference 2024 as "Semantic Graph Learning for Trend Prediction from Long Financial Documents"

  14. arXiv:2410.00876  [pdf, other]

    cs.LG

    Replacing Paths with Connection-Biased Attention for Knowledge Graph Completion

    Authors: Sharmishtha Dutta, Alex Gittens, Mohammed J. Zaki, Charu C. Aggarwal

    Abstract: Knowledge graph (KG) completion aims to identify additional facts that can be inferred from the existing facts in the KG. Recent developments in this field have explored this task in the inductive setting, where at test time one sees entities that were not present during training; the most performant models in the inductive setting have employed path encoding modules in addition to standard subgra… ▽ More

    Submitted 8 April, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

  15. LLaVA-Chef: A Multi-modal Generative Model for Food Recipes

    Authors: Fnu Mohbat, Mohammed J. Zaki

    Abstract: In the rapidly evolving landscape of online recipe sharing within a globalized context, there has been a notable surge in research towards comprehending and generating food recipes. Recent advancements in large language models (LLMs) like GPT-2 and LLaVA have paved the way for Natural Language Processing (NLP) approaches to delve deeper into various facets of food-related tasks, encompassing ingre… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  16. arXiv:2407.00520  [pdf, other]

    hep-ph

    Effects of Family Non-universal $Z^{\prime}$ Model in the angular observables of $B\to(\rho,a_{1})\mu^{+}\mu^{-}$ decays

    Authors: Nimra Farooq, Marwah Zaki, M. Ali Paracha, Faisal Munir Bhutta

    Abstract: We present the angular distribution of the four-fold $B\to\rho(\to\pi\pi)\mu^{+}\mu^{-}$ and $B\to a_{1}(\to\rho_{\parallel, \perp}\pi)\mu^{+}\mu^{-}$ decays both in the Standard Model and the family non-universal $Z^{\prime}$ model. At the quark level, these decays are governed by the $b\to d\mu^{+}\mu^{-}$ transition. Along with different angular observables, we also give predictions of differential branching ratios, f…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 39 pages, 6 figures, 38 tables; version to be published in Chinese Physics C

  17. arXiv:2406.08530  [pdf, other]

    cs.DB

    Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ehab E. Hassanein, Ahmed Awad

    Abstract: Process mining extracts valuable insights from event data to help organizations improve their business processes, which is essential for their growth and success. By leveraging process mining techniques, organizations gain a comprehensive understanding of their processes' execution, enabling the discovery of process models, detection of deviations, identification of bottlenecks, and assessment of… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  18. arXiv:2405.20587  [pdf, ps, other]

    cs.NI eess.SP

    Quality-Aware Task Offloading for Cooperative Perception in Vehicular Edge Computing

    Authors: Amr M. Zaki, Sara A. Elsayed, Khalid Elgazzar, Hossam S. Hassanein

    Abstract: Task offloading in Vehicular Edge Computing (VEC) can advance cooperative perception (CP) to improve traffic awareness in Autonomous Vehicles. In this paper, we propose the Quality-aware Cooperative Perception Task Offloading (Q-CPTO) scheme. Q-CPTO is the first task offloading scheme that enhances traffic awareness by prioritizing the quality rather than the quantity of cooperative perception. Q-C…

    Submitted 30 May, 2024; originally announced May 2024.

  19. arXiv:2405.07269  [pdf, other]

    physics.optics

    The comparative study of high efficiency of Tm$^{3+}$-doped fiber laser at 1.72 μm for different pump schemes

    Authors: Mohamed Zaki, Mostafa Abouricha, Said Amrane

    Abstract: In this study, we revealed the impact of the pumping scheme, fiber length, pumping power, and reflectivity of the output fiber Bragg grating on the performance of a Tm$^{3+}$-doped fiber laser (TDFL) operating at a wavelength of 1.72 μm. Using numerical simulations, we optimized the output power and reduced losses due to reabsorption, as well as amplified spontaneous emission (ASE) at approximately 1…

    Submitted 12 May, 2024; originally announced May 2024.

  20. arXiv:2403.15469  [pdf, other]

    cs.CL cs.LG eess.AS

    Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

    Authors: Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar P. Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah

    Abstract: The traditional Automatic Video Dubbing (AVD) pipeline consists of three key modules, namely, Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS). Within AVD pipelines, isometric-NMT algorithms are employed to regulate the length of the synthesized output text. This is done to guarantee synchronization with respect to the alignment of video and audio subseque…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted in NAACL2024 Findings

  21. arXiv:2402.06185  [pdf, other]

    cs.CV cs.AI cs.LG

    Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters

    Authors: Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, Todd C. Hollon, Paul Park

    Abstract: Objective. Achieving appropriate spinopelvic alignment has been shown to be associated with improved clinical symptoms. However, measurement of spinopelvic radiographic parameters is time-intensive and interobserver reliability is a concern. Automated measurement tools have the promise of rapid and consistent measurements, but existing tools are still limited by some degree of manual user-entry re… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, to appear in Journal of Neurosurgery: Spine

  22. arXiv:2402.04538  [pdf, other]

    cs.LG

    Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: Graph transformers typically lack third-order interactions, limiting their geometric understanding which is crucial for tasks like molecular geometry prediction. We propose the Triplet Graph Transformer (TGT) that enables direct communication between pairs within a 3-tuple of nodes via novel triplet attention and aggregation mechanisms. TGT is applied to molecular property prediction by first pred… ▽ More

    Submitted 9 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: ICML'24 Accepted Version, 25 pages, 10 figures, 18 tables

  23. arXiv:2310.08383  [pdf, other]

    cs.CL cond-mat.mtrl-sci

    Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction

    Authors: Kausik Hira, Mohd Zaki, Dhruvil Sheth, Mausam, N M Anoop Krishnan

    Abstract: The discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature… ▽ More

    Submitted 26 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Journal ref: Digital Discovery, 2024, Advance Article

  24. arXiv:2308.09115  [pdf]

    cs.CL cond-mat.mtrl-sci

    MaScQA: A Question Answering Dataset for Investigating Materials Science Knowledge of Large Language Models

    Authors: Mohd Zaki, Jayadeva, Mausam, N. M. Anoop Krishnan

    Abstract: Information extraction and textual comprehension from materials literature are vital for developing an exhaustive knowledge base that enables accelerated materials discovery. Language models have demonstrated their capability to answer domain-specific questions and retrieve information from knowledge bases. However, there are no benchmark datasets in the materials domain that can evaluate the unde… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  25. arXiv:2306.03209  [pdf, other]

    cs.LG

    End-to-end Differentiable Clustering with Associative Memories

    Authors: Bishwajit Saha, Dmitry Krotov, Mohammed J. Zaki, Parikshit Ram

    Abstract: Clustering is a widely used unsupervised learning technique involving an intensive discrete optimization problem. Associative Memory models or AMs are differentiable neural networks defining a recursive dynamical system, which have been integrated with various deep learning architectures. We uncover a novel connection between the AM dynamics and the inherent discrete assignment necessary in cluste… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  26. The Information Pathways Hypothesis: Transformers are Dynamic Self-Ensembles

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: Transformers use the dense self-attention mechanism which gives a lot of flexibility for long-range connectivity. Over multiple layers of a deep transformer, the number of possible connectivity patterns increases exponentially. However, very few of these contribute to the performance of the network, and even fewer are essential. We hypothesize that there are sparsely connected sub-networks within… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: KDD23 preprint, 12 pages, 7 figures, 10 tables

  27. arXiv:2305.17219  [pdf]

    cs.CV cs.CL cs.LG

    GVdoc: Graph-based Visual Document Classification

    Authors: Fnu Mohbat, Mohammed J. Zaki, Catherine Finegan-Dollak, Ashish Verma

    Abstract: The robustness of a model for real-world deployment is decided by how well it performs on unseen data and distinguishes between in-domain and out-of-domain samples. Visual document classifiers have shown impressive performance on in-distribution test sets. However, they tend to have a hard time correctly classifying and differentiating out-of-distribution examples. Image-based classifiers lack the… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  28. Footprints of New Physics in the angular distribution of $B_{c}\to D_{s}^{\ast}(\to D_{s}\gamma,(D_{s}\pi))\ell^{+}\ell^{-}$ decays

    Authors: Marwah Zaki, M. Ali Paracha, Faisal Munir Bhutta

    Abstract: We investigate the angular decay distribution of the four-fold $B_{c}\to D^{\ast}_{s}(\to D_{s}\gamma)\mu^{+}\mu^{-}$ and $B_{c}\to D^{\ast}_{s}(\to D_{s}\pi)\mu^{+}\mu^{-}$ decays that proceed through the $b\to s\mu^{+}\mu^{-}$ quark-level transition. We use the model independent effective Hamiltonian with vector and axial vector new physics operators to formulate the angular observables and study the implications of…

    Submitted 29 July, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 24 pages, 7 figures, 18 tables; version matching publication in NPB

  29. arXiv:2302.07253  [pdf, other]

    cs.LG cond-mat.dis-nn cs.CV q-bio.NC stat.ML

    Energy Transformer

    Authors: Benjamin Hoover, Yuchen Liang, Bao Pham, Rameswar Panda, Hendrik Strobelt, Duen Horng Chau, Mohammed J. Zaki, Dmitry Krotov

    Abstract: Our work combines aspects of three promising paradigms in machine learning, namely, attention mechanism, energy-based models, and associative memory. Attention is the power-house driving modern deep learning successes, but it lacks clear theoretical foundations. Energy-based models allow a principled approach to discriminative and generative tasks, but the design of the energy functional is not st… ▽ More

    Submitted 31 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  30. arXiv:2301.08073  [pdf]

    cond-mat.mtrl-sci

    Glass Hardness: Predicting Composition and Load Effects via Symbolic Reasoning-Informed Machine Learning

    Authors: Sajid Mannan, Mohd Zaki, Suresh Bishnoi, Daniel R. Cassar, Jeanini Jiusti, Julio Cesar Ferreira Faria, Johan F. S. Christensen, Nitya Nand Gosvami, Morten M. Smedskjaer, Edgar Dutra Zanotto, N. M. Anoop Krishnan

    Abstract: Glass hardness varies in a non-linear fashion with the chemical composition and applied load, a phenomenon known as the indentation size effect (ISE), which is challenging to predict quantitatively. Here, using a curated dataset of over approx. 3000 inorganic glasses from the literature comprising the composition, indentation load, and hardness, we develop machine learning (ML) models to predict t… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

  31. arXiv:2211.03223  [pdf]

    cs.CV cond-mat.mtrl-sci eess.IV

    Cementron: Machine Learning the Constituent Phases in Cement Clinker from Optical Images

    Authors: Mohd Zaki, Siddhant Sharma, Sunil Kumar Gurjar, Raju Goyal, Jayadeva, N. M. Anoop Krishnan

    Abstract: Cement is the most used construction material. The performance of cement hydrate depends on the constituent phases, viz. alite, belite, aluminate, and ferrites present in the cement clinker, both qualitatively and quantitatively. Traditionally, clinker phases are analyzed from optical images relying on a domain expert and simple image processing techniques. However, the non-uniformity of the image… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  32. arXiv:2211.00691  [pdf]

    cond-mat.mtrl-sci

    Accelerated Design of Chalcogenide Glasses through Interpretable Machine Learning for Composition Property Relationships

    Authors: Sayam Singla, Sajid Mannan, Mohd Zaki, N. M. Anoop Krishnan

    Abstract: Chalcogenide glasses possess several outstanding properties that enable several groundbreaking applications, such as optical discs, infrared cameras, and thermal imaging systems. Despite the ubiquitous usage of these glasses, the composition-property relationships in these materials remain poorly understood. Here, we use a large experimental dataset comprising approx. 24000 glass compositions made…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 17 pages, 8 figures

  33. arXiv:2208.14376  [pdf, other]

    cs.LG cs.NE cs.SI q-bio.NC stat.ML

    Associative Learning for Network Embedding

    Authors: Yuchen Liang, Dmitry Krotov, Mohammed J. Zaki

    Abstract: The network embedding task is to represent the node in the network as a low-dimensional vector while incorporating the topological and structural information. Most existing approaches solve this problem by factorizing a proximity matrix, either directly or implicitly. In this work, we introduce a network embedding method from a new perspective, which leverages Modern Hopfield Networks (MHN) for as… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at the Eighth International Workshop on Deep Learning on Graphs: Methods and Applications (DLG-KDD 2022), Washington DC

  34. arXiv:2207.09090  [pdf, other]

    cs.LG cs.AI eess.SY

    Actor-Critic based Improper Reinforcement Learning

    Authors: Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform each of the base ones. This can be useful in tuning across controllers, learnt possibly in mismatched or simulated environments, to obtain a good controller for a… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.08201

  35. arXiv:2207.05194  [pdf, other]

    cs.CL

    Towards Neural Numeric-To-Text Generation From Temporal Personal Health Data

    Authors: Jonathan Harris, Mohammed J. Zaki

    Abstract: With an increased interest in the production of personal health technologies designed to track user data (e.g., nutrient intake, step counts), there is now more opportunity than ever to surface meaningful behavioral insights to everyday users in the form of natural language. This knowledge can increase their behavioral awareness and allow them to take action to meet their health goals. It can also… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: 5 pages, 2 figures, 1 table

  36. arXiv:2207.01079  [pdf, other]

    cs.CL cond-mat.mtrl-sci cs.IR

    DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles

    Authors: Tanishq Gupta, Mohd Zaki, Devanshi Khatsuriya, Kausik Hira, N. M. Anoop Krishnan, Mausam

    Abstract: A crucial component in the curation of KB for a scientific domain (e.g., materials science, foods & nutrition, fuels) is information extraction from tables in the domain's published research articles. To facilitate research in this direction, we define a novel NLP task of extracting compositions of materials (e.g., glasses) from tables in materials science papers. The task involves solving several… ▽ More

    Submitted 28 January, 2024; v1 submitted 3 July, 2022; originally announced July 2022.

    Comments: Accepted long paper at ACL 2023 (https://2023.aclweb.org/program/accepted_main_conference/)

  37. arXiv:2206.09336  [pdf, other]

    cs.DB

    Efficient Checking of Timed Order Compliance Rules over Graph-encoded Event Logs

    Authors: Nesma M. Zaki, Iman M. A. Helal, Ahmed Awad, Ehab E. Hassanein

    Abstract: Validation of compliance rules against process data is a fundamental functionality for business process management. Over the years, the problem has been addressed for different types of process data, i.e., process models, process event data at runtime, and event logs representing historical execution. Several approaches have been proposed to tackle compliance checking over process logs. These appr… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 18 pages, 5 figures, 6 tables

    MSC Class: 68

  38. arXiv:2206.06952  [pdf, other]

    cs.CL cs.AI cs.LG

    FETILDA: An Effective Framework For Fin-tuned Embeddings For Long Financial Text Documents

    Authors: Bolun "Namir" Xia, Vipula D. Rawte, Mohammed J. Zaki, Aparna Gupta

    Abstract: Unstructured data, especially text, continues to grow rapidly in various domains. In particular, in the financial sphere, there is a wealth of accumulated unstructured financial data, such as the textual disclosure documents that companies submit on a regular basis to regulatory agencies, such as the Securities and Exchange Commission (SEC). These documents are typically very long and tend to cont… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: 10 pages, 9 figures, 7 tables

    ACM Class: I.2.7

  39. arXiv:2111.07198  [pdf, ps, other]

    cs.CL cs.IR cs.LG

    Keyphrase Extraction Using Neighborhood Knowledge Based on Word Embeddings

    Authors: Yuchen Liang, Mohammed J. Zaki

    Abstract: Keyphrase extraction is the task of finding several interesting phrases in a text document, which provide a list of the main topics within the document. Most existing graph-based models use co-occurrence links as cohesion indicators to model the relationship of syntactic elements. However, a word may have different forms of expression within the document, and may have several synonyms as well. Sim… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  40. arXiv:2110.06208  [pdf, other]

    cs.CY eess.SY

    Towards formalization and monitoring of microscopic traffic parameters using temporal logic

    Authors: Mariam Nour, Mohamed H. Zaki

    Abstract: Smart cities are revolutionizing the transportation infrastructure by the integration of technology. However, ensuring that various transportation system components are operating as expected and in a safe manner is a great challenge. In this work, we propose the use of formal methods as a means to specify and reason about the traffic network's complex properties. Formal methods provide a flexible… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

  41. arXiv:2109.15290  [pdf]

    cs.CL cond-mat.mtrl-sci

    MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction

    Authors: Tanishq Gupta, Mohd Zaki, N. M. Anoop Krishnan, Mausam

    Abstract: An overwhelmingly large amount of knowledge in the materials domain is generated and stored as text published in peer-reviewed scientific literature. Recent developments in natural language processing, such as bidirectional encoder representations from transformers (BERT) models, provide promising tools to extract information from these texts. However, direct application of these models in the mat… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  42. Global Self-Attention as a Replacement for Graph Convolution

    Authors: Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian

    Abstract: We propose an extension to the transformer neural network architecture for general-purpose graph learning by adding a dedicated pathway for pairwise structural information, called edge channels. The resultant framework - which we call Edge-augmented Graph Transformer (EGT) - can directly accept, process and output structural information of arbitrary form, which is important for effective learning…

    Submitted 3 June, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: The accepted version in KDD '22

  43. arXiv:2107.06369  [pdf, other]

    eess.SY

    Exploring DMD-type Algorithms for Modeling Signalised Intersections

    Authors: Kazi Redwan Shabab, Shakib Mustavee, Shaurya Agarwal, Mohamed H. Zaki, Sajal Das

    Abstract: This paper explores a novel data-driven approach based on recent developments in Koopman operator theory and dynamic mode decomposition (DMD) for modeling signalized intersections. Vehicular flow and queue formation on signalized intersections have complex nonlinear dynamics, making system identification, modeling, and controller design tasks challenging. We employ a Koopman theoretic approach to…

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: 11 pages, 8 figures, Submitted to: Journal of Intelligent Transportation Systems

    Report number: GITS-2021-0219

  44. arXiv:2105.00210  [pdf, other]

    cs.LG

    Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling

    Authors: Mohammadi Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay. Modern communication systems are becoming increasingly complex, and are required to handle multiple types of traffic with widely varying characteristics such as arrival rates and service times. This, coupled with the need for rapid network deployment, renders a bottom-up approach of first…

    Submitted 1 May, 2021; originally announced May 2021.

    Comments: 4 pages, 5 figures, RLNQ workshop at the SIGMETRICS 2021

  45. arXiv:2103.12050  [pdf]

    cond-mat.mtrl-sci

    Revealing the Compositional Control of Electrical, Mechanical, Optical, and Physical Properties of Inorganic Glasses

    Authors: R. Ravinder, Suresh Bishnoi, Mohd Zaki, N. M. Anoop Krishnan

    Abstract: Inorganic glasses, produced by the melt-quenching of a concoction of minerals, compounds, and elements, can possess unique optical and elastic properties along with excellent chemical and thermal durability. Despite the ubiquitous use of glasses for critical applications such as touchscreen panels, windshields, bioactive implants, optical fibers and sensors, kitchen and laboratory glassware, ther…

    Submitted 23 March, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  46. arXiv:2103.03633  [pdf]

    physics.optics cond-mat.mtrl-sci physics.data-an

    Unveiling the Glass Veil: Elucidating the Optical Properties in Glasses with Interpretable Machine Learning

    Authors: Mohd Zaki, Vineeth Venugopal, R. Ravinder, Suresh Bishnoi, Sourabh Kumar Singh, Amarnath R. Allu, Jayadeva, N. M. Anoop Krishnan

    Abstract: Due to their excellent optical properties, glasses are used for various applications ranging from smartphone screens to telescopes. Developing compositions with tailored Abbe number (Vd) and refractive index (nd), two crucial optical properties, is a major challenge. To this end, machine learning (ML) approaches have been successfully used to develop composition-property models. However, these…

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: 13 pages, 5 figures

  47. arXiv:2102.08201  [pdf, other]

    cs.LG eess.SY

    Improper Reinforcement Learning with Gradient-based Policy Optimization

    Authors: Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

    Abstract: We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform each of the base ones. This can be useful in tuning across controllers, learnt possibly in mismatched or simulated environments, to obtain a good controller for a…

    Submitted 3 July, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  48. arXiv:2102.05571  [pdf, other]

    cs.CR cs.AI cs.IR cs.LG

    TINKER: A framework for Open source Cyberthreat Intelligence

    Authors: Nidhi Rastogi, Sharmishtha Dutta, Mohammed J. Zaki, Alex Gittens, Charu Aggarwal

    Abstract: Threat intelligence on malware attacks and campaigns is increasingly being shared with other security experts for a cost or for free. Other security analysts use this intelligence to inform them of indicators of compromise, attack techniques, and preventative actions. Security analysts prepare threat analysis reports after investigating an attack, an emerging cyber threat, or a recently discovered…

    Submitted 19 January, 2023; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 9 pages

  49. arXiv:2101.06887  [pdf, other]

    cs.CL cs.LG cs.NE q-bio.NC stat.ML

    Can a Fruit Fly Learn Word Embeddings?

    Authors: Yuchen Liang, Chaitanya K. Ryali, Benjamin Hoover, Leopold Grinberg, Saket Navlakha, Mohammed J. Zaki, Dmitry Krotov

    Abstract: The mushroom body of the fruit fly brain is one of the best studied systems in neuroscience. At its core it consists of a population of Kenyon cells, which receive inputs from multiple sensory modalities. These cells are inhibited by the anterior paired lateral neuron, thus creating a sparse high dimensional representation of the inputs. In this work we study a mathematical formalization of this n…

    Submitted 14 March, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ICLR 2021

  50. Personalized Food Recommendation as Constrained Question Answering over a Large-scale Food Knowledge Graph

    Authors: Yu Chen, Ananya Subburathinam, Ching-Hua Chen, Mohammed J. Zaki

    Abstract: Food recommendation has become an important means to help guide users to adopt healthy dietary habits. Previous works on food recommendation either i) fail to consider users' explicit requirements, ii) ignore crucial health factors (e.g., allergies and nutrition needs), or iii) do not utilize the rich food knowledge for recommending healthy recipes. To address these limitations, we propose a novel…

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 9 pages. Accepted by WSDM 2021. Final version