Skip to main content

Showing 1–19 of 19 results for author: Shehu, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.11610  [pdf, ps, other

    cs.AI cs.LG q-bio.BM q-bio.GN

    Foundation Models for AI-Enabled Biological Design

    Authors: Asher Moldwin, Amarda Shehu

    Abstract: This paper surveys foundation models for AI-enabled biological design, focusing on recent developments in applying large-scale, self-supervised models to tasks such as protein engineering, small molecule design, and genomic sequence design. Though this domain is evolving rapidly, this survey presents and discusses a taxonomy of current models and methods. The focus is on challenges and solutions i… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Published as part of the workshop proceedings at AAAI 2025 in the workshop "Foundation Models for Biological Discoveries"

  2. arXiv:2411.01030  [pdf, other

    cs.CL cs.AI cs.LG

    Birdie: Advancing State Space Models with Reward-Driven Objectives and Curricula

    Authors: Sam Blouir, Jimmy T. H. Smith, Antonios Anastasopoulos, Amarda Shehu

    Abstract: Efficient state space models (SSMs), such as linear recurrent neural networks and linear attention variants, offer computational advantages over Transformers but struggle with tasks requiring long-range in-context retrieval-like text copying, associative recall, and question answering over long contexts. Previous efforts to address these challenges have focused on architectural modifications, ofte… ▽ More

    Submitted 21 February, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: Accepted to EMNLP 2024 (Main Conference)

  3. arXiv:2409.03624  [pdf

    cs.CR

    On the Compliance of Self-Sovereign Identity with GDPR Principles: A Critical Review

    Authors: Abubakar-Sadiq Shehu

    Abstract: Identity Management Systems (IdMs) have complemented how users are identified, authenticated, and authorised on e-services. Among the methods used for this purpose are traditional IdMs (isolated, centralised and federated) that mostly rely on identity providers (IdPs) to broker trust between a user and service-providers (SPs). An IdP also identifies and authenticates a user on-behalf of the SP, wh… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

  4. Towards a Knowledge Graph for Models and Algorithms in Applied Mathematics

    Authors: Björn Schembera, Frank Wübbeling, Hendrik Kleikamp, Burkhard Schmidt, Aurela Shehu, Marco Reidelbach, Christine Biedinger, Jochen Fiedler, Thomas Koprucki, Dorothea Iglezakis, Dominik Göddeke

    Abstract: Mathematical models and algorithms are an essential part of mathematical research data, as they are epistemically grounding numerical data. In order to represent models and algorithms as well as their relationship semantically to make this research data FAIR, two previously distinct ontologies were merged and extended, becoming a living knowledge graph. The link between the two ontologies is estab… ▽ More

    Submitted 26 February, 2025; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: Preprint submitted to the 18th International Conference on Metadata and Semantics Research 2024 and published as a full, revised article

    Journal ref: Sfakakis, M., Garoufallou, E., Damigos, M., Salaba, A., Papatheodorou, C. (eds) Metadata and Semantic Research. MTSR 2024. Communications in Computer and Information Science, vol 2331. Springer, Cham

  5. arXiv:2407.11407  [pdf, other

    cs.LG

    Accounting for Work Zone Disruptions in Traffic Flow Forecasting

    Authors: Yuanjie Lu, Amarda Shehu, David Lattanzi

    Abstract: Traffic speed forecasting is an important task in intelligent transportation system management. The objective of much of the current computational research is to minimize the difference between predicted and actual speeds, but information modalities other than speed priors are largely not taken into account. In particular, though state of the art performance is achieved on speed forecasting with g… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Traffic speed prediction, graph neural network, spatio-temporal correlation, hypergraph, work zone, maintenance downtime. arXiv admin note: text overlap with arXiv:2110.01535

  6. arXiv:2403.00574  [pdf, other

    cs.LG

    Beyond Single-Model Views for Deep Learning: Optimization versus Generalizability of Stochastic Optimization Algorithms

    Authors: Toki Tahmid Inan, Mingrui Liu, Amarda Shehu

    Abstract: Despite an extensive body of literature on deep learning optimization, our current understanding of what makes an optimization algorithm effective is fragmented. In particular, we do not understand well whether enhanced optimization translates to improved generalizability. Current research overlooks the inherent stochastic nature of stochastic gradient descent (SGD) and its variants, resulting in… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  7. arXiv:2310.20443  [pdf, other

    cs.AI cs.DB cs.DL cs.IR

    Ontologies for Models and Algorithms in Applied Mathematics and Related Disciplines

    Authors: Björn Schembera, Frank Wübbeling, Hendrik Kleikamp, Christine Biedinger, Jochen Fiedler, Marco Reidelbach, Aurela Shehu, Burkhard Schmidt, Thomas Koprucki, Dorothea Iglezakis, Dominik Göddeke

    Abstract: In applied mathematics and related disciplines, the modeling-simulation-optimization workflow is a prominent scheme, with mathematical models and numerical algorithms playing a crucial role. For these types of mathematical research data, the Mathematical Research Data Initiative has developed, merged and implemented ontologies and knowledge graphs. This contributes to making mathematical research… ▽ More

    Submitted 31 July, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    ACM Class: H.3; H.4; I.2.4

    Journal ref: In: Metadata and Semantic Research. MTSR 2023. Communications in Computer and Information Science, vol 2048. Springer, Cham (2024)

  8. arXiv:2210.01796  [pdf, other

    cs.LG cs.AI

    Multi-objective Deep Data Generation with Correlated Property Control

    Authors: Shiyu Wang, Xiaojie Guo, Xuanyang Lin, Bo Pan, Yuanqi Du, Yinkai Wang, Yanfang Ye, Ashley Ann Petersen, Austin Leitgeb, Saleh AlKhalifa, Kevin Minbiole, William Wuest, Amarda Shehu, Liang Zhao

    Abstract: Developing deep generative models has been an emerging field due to the ability to model and generate complex data for various purposes, such as image synthesis and molecular design. However, the advancement of deep generative models is limited by challenges to generate objects that possess multiple desired properties: 1) the existence of complex correlation among real-world properties is common b… ▽ More

    Submitted 17 October, 2022; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: This paper has been accepted by NeurIPS 2022

  9. arXiv:2210.01707  [pdf, other

    cs.LG

    Multiple Instance Learning for Detecting Anomalies over Sequential Real-World Datasets

    Authors: Parastoo Kamranfar, David Lattanzi, Amarda Shehu, Daniel Barbará

    Abstract: Detecting anomalies over real-world datasets remains a challenging task. Data annotation is an intensive human labor problem, particularly in sequential datasets, where the start and end time of anomalies are not known. As a result, data collected from sequential real-world processes can be largely unlabeled or contain inaccurate labels. These characteristics challenge the application of anomaly d… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 9 pages,5 figures, Anomaly and Novelty Detection, Explanation and Accommodation (ANDEA 2022)

  10. arXiv:2207.04459  [pdf

    cs.CR

    A Decentralised Real Estate Transfer Verification Based on Self-Sovereign Identity and Smart Contracts

    Authors: Abubakar-Sadiq Shehu, Antonio Pinto, Manuel E. Correia

    Abstract: Since its first introduction in late 90s, the use of marketplaces has continued to grow, today virtually everything from physical assets to services can be purchased on digital marketplaces, real estate is not an exception. Some marketplaces allow acclaimed asset owners to advertise their products, to which the services gets commission/percentage from proceeds of sale/lease. Despite the success re… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: Shehu, A-S.; Pinto, A. and Correia, M. (2022). A Decentralised Real Estate Transfer Verification based on Self-Sovereign Identity and Smart Contracts. This article has been accepted for publication In Proceedings of the 19th International Conference on Security and Cryptography

    Report number: ISBN 978-989-758-590-6, ISSN 2184-7711, pages 469-476

  11. arXiv:2206.11057  [pdf, other

    cs.LG cs.AI q-bio.QM

    Transformer Neural Networks Attending to Both Sequence and Structure for Protein Prediction Tasks

    Authors: Anowarul Kabir, Amarda Shehu

    Abstract: The increasing number of protein sequences decoded from genomes is opening up new avenues of research on linking protein sequence to function with transformer neural networks. Recent research has shown that the number of known protein sequences supports learning useful, task-agnostic sequence representations via transformers. In this paper, we posit that learning joint sequence-structure represent… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 8 pages, 4 figures, 3 tables

  12. arXiv:2203.00412  [pdf, other

    cs.LG cs.AI

    Interpretable Molecular Graph Generation via Monotonic Constraints

    Authors: Yuanqi Du, Xiaojie Guo, Amarda Shehu, Liang Zhao

    Abstract: Designing molecules with specific properties is a long-lasting research problem and is central to advancing crucial domains such as drug discovery and material science. Recent advances in deep graph generative models treat molecule design as graph generation problems which provide new opportunities toward the breakthrough of this long-lasting problem. Existing models, however, have many shortcomin… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: In SIAM International Conference on Data Mining (SDM22)

  13. arXiv:2110.01535  [pdf, other

    cs.LG cs.AI cs.CV

    Traffic Flow Forecasting with Maintenance Downtime via Multi-Channel Attention-Based Spatio-Temporal Graph Convolutional Networks

    Authors: Yuanjie Lu, Parastoo Kamranfar, David Lattanzi, Amarda Shehu

    Abstract: Forecasting traffic flows is a central task in intelligent transportation system management. Graph structures have shown promise as a modeling framework, with recent advances in spatio-temporal modeling via graph convolution neural networks, improving the performance or extending the prediction horizon on traffic flows. However, a key shortcoming of state-of-the-art methods is their inability to t… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 10 pages, 6 figures

  14. arXiv:2104.10103  [pdf, other

    stat.ML cs.LG stat.CO

    Space Partitioning and Regression Mode Seeking via a Mean-Shift-Inspired Algorithm

    Authors: Wanli Qiao, Amarda Shehu

    Abstract: The mean shift (MS) algorithm is a nonparametric method used to cluster sample points and find the local modes of kernel density estimates, using an idea based on iterative gradient ascent. In this paper we develop a mean-shift-inspired algorithm to estimate the modes of regression functions and partition the sample points in the input space. We prove convergence of the sequences generated by the… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: 44 pages, 4 figures

    MSC Class: 62G08

  15. arXiv:2010.01441  [pdf, other

    q-bio.BM cs.LG

    Decoy Selection for Protein Structure Prediction Via Extreme Gradient Boosting and Ranking

    Authors: Nasrin Akhter, Gopinath Chennupati, Hristo Djidjev, Amarda Shehu

    Abstract: Identifying one or more biologically-active/native decoys from millions of non-native decoys is one of the major challenges in computational structural biology. The extreme lack of balance in positive and negative samples (native and non-native decoys) in a decoy set makes the problem even more complicated. Consensus methods show varied success in handling the challenge of decoy selection despite… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: Accepted for BMC Bioinformatics

  16. Interpretable Deep Graph Generation with Node-Edge Co-Disentanglement

    Authors: Xiaojie Guo, Liang Zhao, Zhao Qin, Lingfei Wu, Amarda Shehu, Yanfang Ye

    Abstract: Disentangled representation learning has recently attracted a significant amount of attention, particularly in the field of image representation learning. However, learning the disentangled representations behind a graph remains largely unexplored, especially for the attributed graph with both node and edge features. Disentanglement learning for graph generation has substantial new challenges incl… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: This paper has been accepted by KDD 2020

  17. arXiv:2004.07119  [pdf, other

    q-bio.BM cs.LG stat.ML

    Generating Tertiary Protein Structures via an Interpretative Variational Autoencoder

    Authors: Xiaojie Guo, Yuanqi Du, Sivani Tadepalli, Liang Zhao, Amarda Shehu

    Abstract: Much scientific enquiry across disciplines is founded upon a mechanistic treatment of dynamic systems that ties form to function. A highly visible instance of this is in molecular biology, where an important goal is to determine functionally-relevant forms/structures that a protein molecule employs to interact with molecular partners in the living cell. This goal is typically pursued under the umb… ▽ More

    Submitted 16 June, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

  18. arXiv:1905.08331  [pdf, other

    q-bio.BM cs.HC

    ROMEO: A Plug-and-play Software Platform of Robotics-inspired Algorithms for Modeling Biomolecular Structures and Motions

    Authors: Kevin Molloy, Erion Plaku, Amarda Shehu

    Abstract: Motivation: Due to the central role of protein structure in molecular recognition, great computational efforts are devoted to modeling protein structures and motions that mediate structural rearrangements. The size, dimensionality, and non-linearity of the protein structure space present outstanding challenges. Such challenges also arise in robot motion planning, and robotics-inspired treatments o… ▽ More

    Submitted 20 May, 2019; originally announced May 2019.

    Comments: 6 pages, 5 figures

  19. Probabilistically Perfect Cloning of Two Pure States: A Geometric Approach

    Authors: Vadim Yerokhin, Andi Shehu, Edgar Feldman, Emilio Bagan, Janos A. Bergou

    Abstract: We solve the long-standing problem of making n perfect clones from m copies of one of two known pure states with minimum failure probability in the general case where the known states have arbitrary a priori probabilities. The solution emerges from a geometric formulation of the problem. This formulation also reveals a deeper connection between cloning and state discrimination. The convergence of… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.

    Journal ref: Phys. Rev. Lett. 116, 200401 (2016)