Skip to main content

Showing 1–17 of 17 results for author: Behrouz, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23735  [pdf, ps, other

    cs.CL cs.AI

    ATLAS: Learning to Optimally Memorize the Context at Test Time

    Authors: Ali Behrouz, Zeman Li, Praneeth Kacham, Majid Daliri, Yuan Deng, Peilin Zhong, Meisam Razaviyayn, Vahab Mirrokni

    Abstract: Transformers have been established as the most popular backbones in sequence modeling, mainly due to their effectiveness in in-context retrieval tasks and the ability to learn at scale. Their quadratic memory and time complexity, however, bound their applicability in longer sequences and so has motivated researchers to explore effective alternative architectures such as modern recurrent neural net… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2504.13173  [pdf, other

    cs.LG cs.AI

    It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

    Authors: Ali Behrouz, Meisam Razaviyayn, Peilin Zhong, Vahab Mirrokni

    Abstract: Designing efficient and effective architectural backbones has been in the core of research efforts to enhance the capability of foundation models. Inspired by the human cognitive phenomenon of attentional bias-the natural tendency to prioritize certain events or stimuli-we reconceptualize neural architectures, including Transformers, Titans, and modern linear recurrent neural networks as associati… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2501.00663  [pdf, other

    cs.LG cs.AI cs.CL

    Titans: Learning to Memorize at Test Time

    Authors: Ali Behrouz, Peilin Zhong, Vahab Mirrokni

    Abstract: Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size memory (called hidden state), attention allows attending to the entire context window, capturing the direct dependencies of all tokens. This more accurate modeling of dependencies, however, comes with a q… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  4. arXiv:2411.15671  [pdf, other

    cs.LG cs.SI

    Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

    Authors: Ali Behrouz, Ali Parviz, Mahdi Karami, Clayton Sanford, Bryan Perozzi, Vahab Mirrokni

    Abstract: Modern sequence models (e.g., Transformers, linear RNNs, etc.) emerged as dominant backbones of recent deep learning frameworks, mainly due to their efficiency, representational power, and/or ability to capture long-range dependencies. Adopting these sequence models for graph-structured data has recently gained popularity as the alternative to Message Passing Neural Networks (MPNNs). There is, how… ▽ More

    Submitted 23 November, 2024; originally announced November 2024.

  5. arXiv:2406.13734  [pdf, other

    cs.SI

    A Unified Core Structure in Multiplex Networks: From Finding the Densest Subgraph to Modeling User Engagement

    Authors: Farnoosh Hashemi, Ali Behrouz

    Abstract: In many complex systems, the interactions between objects span multiple aspects. Multiplex networks are accurate paradigms to model such systems, where each edge is associated with a type. A key graph mining primitive is extracting dense subgraphs, and this has led to interesting notions such as K-cores, known as building blocks of complex networks. Despite recent attempts to extend the notion of… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.04320  [pdf, other

    cs.LG cs.AI

    Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models

    Authors: Ali Behrouz, Michele Santacatterina, Ramin Zabih

    Abstract: Modeling multivariate time series is a well-established problem with a wide range of applications from healthcare to financial markets. Traditional State Space Models (SSMs) are classical approaches for univariate time series modeling due to their simplicity and expressive power to represent linear dependencies. They, however, have fundamentally limited expressive power to capture non-linear depen… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2403.19888  [pdf, other

    cs.LG cs.AI cs.CV

    MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

    Authors: Ali Behrouz, Michele Santacatterina, Ramin Zabih

    Abstract: Recent advances in deep learning have mainly relied on Transformers due to their data dependency and ability to learn at scale. The attention module in these architectures, however, exhibits quadratic time and space in input size, limiting their scalability for long-sequence modeling. Despite recent attempts to design efficient and effective architecture backbone for multi-dimensional data, such a… ▽ More

    Submitted 23 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  8. arXiv:2402.08678  [pdf, other

    cs.LG

    Graph Mamba: Towards Learning on Graphs with State Space Models

    Authors: Ali Behrouz, Farnoosh Hashemi

    Abstract: Graph Neural Networks (GNNs) have shown promising potential in graph representation learning. The majority of GNNs define a local message-passing mechanism, propagating information over the graph by stacking multiple layers. These methods, however, are known to suffer from two major limitations: over-squashing and poor capturing of long-range dependencies. Recently, Graph Transformers (GTs) emerge… ▽ More

    Submitted 19 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  9. arXiv:2310.04893  [pdf, ps, other

    cs.SI cs.DS

    Generalized Densest Subgraph in Multiplex Networks

    Authors: Ali Behrouz, Farnoosh Hashemi

    Abstract: Finding dense subgraphs of a large network is a fundamental problem in graph mining that has been studied extensively both for its theoretical richness and its many practical applications over the last five decades. However, most existing studies have focused on graphs with a single type of connection. In applications such as biological, social, and transportation networks, interactions between ob… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  10. arXiv:2306.11147  [pdf, other

    cs.LG cs.SI

    CAT-Walk: Inductive Hypergraph Learning via Set Walks

    Authors: Ali Behrouz, Farnoosh Hashemi, Sadaf Sadeghian, Margo Seltzer

    Abstract: Temporal hypergraphs provide a powerful paradigm for modeling time-dependent, higher-order interactions in complex systems. Representation learning for hypergraphs is essential for extracting patterns of the higher-order interactions that are critically important in real-world problems in social network analysis, neuroscience, finance, etc. However, existing methods are typically designed only for… ▽ More

    Submitted 3 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  11. CS-TGN: Community Search via Temporal Graph Neural Networks

    Authors: Farnoosh Hashemi, Ali Behrouz, Milad Rezaei Hajidehi

    Abstract: Searching for local communities is an important research challenge that allows for personalized community discovery and supports advanced data analysis in various complex networks, such as the World Wide Web, social networks, and brain networks. The evolution of these networks over time has motivated several recent studies to identify local communities in temporal networks. Given any query nodes,… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: This is the author's version of the paper. Published in companion proceedings of the ACM Web Conference 2023 (WWW '23 Companion)

  12. arXiv:2211.08378  [pdf, other

    cs.LG cs.AI cs.SI

    Anomaly Detection in Multiplex Dynamic Networks: from Blockchain Security to Brain Disease Prediction

    Authors: Ali Behrouz, Margo Seltzer

    Abstract: The problem of identifying anomalies in dynamic networks is a fundamental task with a wide range of applications. However, it raises critical challenges due to the complex nature of anomalies, lack of ground truth knowledge, and complex and dynamic interactions in the network. Most existing approaches usually study networks with a single type of connection between vertices, while in many applicati… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 Temporal Graph Learning Workshop (Spotlight)

  13. CS-MLGCN : Multiplex Graph Convolutional Networks for Community Search in Multiplex Networks

    Authors: Ali Behrouz, Farnoosh Hashemi

    Abstract: Community Search (CS) is one of the fundamental tasks in network science and has attracted much attention due to its ability to discover personalized communities with a wide range of applications. Given any query nodes, CS seeks to find a densely connected subgraph containing query nodes. Most existing approaches usually study networks with a single type of proximity between nodes, which defines a… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: CIKM 2022

  14. arXiv:2210.06825  [pdf, other

    cs.LG cs.AI

    Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design

    Authors: Ali Behrouz, Mathias Lecuyer, Cynthia Rudin, Margo Seltzer

    Abstract: Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be… ▽ More

    Submitted 25 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Advances in Interpretable Machine Learning, AIMLAI 2022. arXiv admin note: text overlap with arXiv:2112.00798

  15. FirmCore Decomposition of Multilayer Networks

    Authors: Farnoosh Hashemi, Ali Behrouz, Laks V. S. Lakshmanan

    Abstract: A key graph mining primitive is extracting dense structures from graphs, and this has led to interesting notions such as $k$-cores which subsequently have been employed as building blocks for capturing the structure of complex networks and for designing efficient approximation algorithms for challenging problems such as finding the densest subgraph. In applications such as biological, social, and… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: This is the author's version of the paper. Published in The ACM Web Conference (WWW), 2022

  16. arXiv:2205.00742  [pdf, other

    cs.SI

    FirmTruss Community Search in Multilayer Networks

    Authors: Ali Behrouz, Farnoosh Hashemi, Laks V. S. Lakshmanan

    Abstract: In applications such as biological, social, and transportation networks, interactions between objects span multiple aspects. For accurately modeling such applications, multilayer networks have been proposed. Community search allows for personalized community discovery and has a wide range of applications in large real-world networks. While community search has been widely explored for single-layer… ▽ More

    Submitted 17 November, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted to VLDB 2023 (PVLDB 2022)

  17. arXiv:1805.09104  [pdf, other

    cs.CR cs.SE

    Secure Credit Reporting on the Blockchain

    Authors: Amir Kafshdar Goharshady, Ali Behrouz, Krishnendu Chatterjee

    Abstract: We present a secure approach for maintaining and reporting credit history records on the Blockchain. Our approach removes third-parties such as credit reporting agencies from the lending process and replaces them with smart contracts. This allows customers to interact directly with the lenders or banks while ensuring the integrity, unmalleability and privacy of their credit data. Most importantly,… ▽ More

    Submitted 24 May, 2018; v1 submitted 23 May, 2018; originally announced May 2018.