Skip to main content

Showing 1–15 of 15 results for author: Ibrahimi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.15706  [pdf, other

    cs.NI eess.SP

    Multi-Failure Localization in High-Degree ROADM-based Optical Networks using Rules-Informed Neural Networks

    Authors: Ruikun Wang, Qiaolun Zhang, Jiawei Zhang, Zhiqun Gu, Memedhe Ibrahimi, Hao Yu, Bojun Zhang, Francesco Musumeci, Yuefeng Ji, Massimo Tornatore

    Abstract: To accommodate ever-growing traffic, network operators are actively deploying high-degree reconfigurable optical add/drop multiplexers (ROADMs) to build large-capacity optical networks. High-degree ROADM-based optical networks have multiple parallel fibers between ROADM nodes, requiring the adoption of ROADM nodes with a large number of inter-/intra-node components. However, this large number of i… ▽ More

    Submitted 20 January, 2025; originally announced February 2025.

    Comments: This is the author's version of the work. This work was accepted by IEEE Journal on Selected Areas in Communications

    Journal ref: IEEE Journal on Selected Areas in Communications, 2025

  2. arXiv:2502.02874  [pdf, other

    cs.NI cs.AI cs.DC cs.LG

    Vertical Federated Learning for Failure-Cause Identification in Disaggregated Microwave Networks

    Authors: Fatih Temiz, Memedhe Ibrahimi, Francesco Musumeci, Claudio Passera, Massimo Tornatore

    Abstract: Machine Learning (ML) has proven to be a promising solution to provide novel scalable and efficient fault management solutions in modern 5G-and-beyond communication networks. In the context of microwave networks, ML-based solutions have received significant attention. However, current solutions can only be applied to monolithic scenarios in which a single entity (e.g., an operator) manages the ent… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 6 pages, 7 figure, IEEE ICC 2025

  3. arXiv:2302.09205  [pdf, other

    cs.LG cs.AI

    Approximate Thompson Sampling via Epistemic Neural Networks

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

    Abstract: Thompson sampling (TS) is a popular heuristic for action selection, but it requires sampling from a posterior distribution. Unfortunately, this can become computationally intractable in complex environments, such as those modeled using neural networks. Approximate posterior samples can produce effective actions, but only if they reasonably approximate joint predictive distributions of outputs acro… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  4. arXiv:2110.04629  [pdf, other

    cs.LG cs.AI stat.ML

    The Neural Testbed: Evaluating Joint Predictions

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

    Abstract: Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a… ▽ More

    Submitted 1 November, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

  5. arXiv:2107.09224  [pdf, ps, other

    cs.LG stat.ML

    From Predictions to Decisions: The Importance of Joint Predictive Distributions

    Authors: Zheng Wen, Ian Osband, Chao Qin, Xiuyuan Lu, Morteza Ibrahimi, Vikranth Dwaracherla, Mohammad Asghari, Benjamin Van Roy

    Abstract: A fundamental challenge for any intelligent system is prediction: given some inputs, can you predict corresponding outcomes? Most work on supervised learning has focused on producing accurate marginal predictions for each input. However, we show that for a broad class of decision problems, accurate joint predictions are required to deliver good performance. In particular, we establish several resu… ▽ More

    Submitted 23 May, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

  6. arXiv:2107.08924  [pdf, other

    cs.LG cs.AI stat.ML

    Epistemic Neural Networks

    Authors: Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

    Abstract: Intelligence relies on an agent's knowledge of what it does not know. This capability can be assessed based on the quality of joint predictions of labels across multiple inputs. In principle, ensemble-based approaches produce effective joint predictions, but the computational costs of training large ensembles can become prohibitive. We introduce the epinet: an architecture that can supplement any… ▽ More

    Submitted 17 May, 2023; v1 submitted 19 July, 2021; originally announced July 2021.

  7. arXiv:2103.04047  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning, Bit by Bit

    Authors: Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

    Abstract: Reinforcement learning agents have demonstrated remarkable achievements in simulated environments. Data efficiency poses an impediment to carrying this success over to real environments. The design of data-efficient agents calls for a deeper understanding of information acquisition and representation. We discuss concepts and regret analysis that together offer principled guidance. This line of thi… ▽ More

    Submitted 4 May, 2023; v1 submitted 6 March, 2021; originally announced March 2021.

  8. arXiv:2006.07464  [pdf, other

    cs.LG math.OC stat.ML

    Hypermodels for Exploration

    Authors: Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

    Abstract: We study the use of hypermodels to represent epistemic uncertainty and guide exploration. This generalizes and extends the use of ensembles to approximate Thompson sampling. The computational cost of training an ensemble grows with its size, and as such, prior work has typically been limited to ensembles with tens of elements. We show that alternative hypermodels can enjoy dramatic efficiency gain… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: Published as a conference paper at ICLR 2020

  9. arXiv:1308.4077  [pdf, other

    cs.IT cs.LG math.PR math.ST

    Support Recovery for the Drift Coefficient of High-Dimensional Diffusions

    Authors: Jose Bento, Morteza Ibrahimi

    Abstract: Consider the problem of learning the drift coefficient of a $p$-dimensional stochastic differential equation from a sample path of length $T$. We assume that the drift is parametrized by a high-dimensional vector, and study the support recovery problem when both $p$ and $T$ can tend to infinity. In particular, we prove a general lower bound on the sample-complexity $T$ by using a characterization… ▽ More

    Submitted 19 August, 2013; v1 submitted 19 August, 2013; originally announced August 2013.

    Comments: 24 pages, 12 figures

    MSC Class: 60J60; 60H10; 94A15; 62B10

  10. arXiv:1303.5984  [pdf, ps, other

    stat.ML cs.LG math.OC

    Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

    Authors: Morteza Ibrahimi, Adel Javanmard, Benjamin Van Roy

    Abstract: We study the problem of adaptive control of a high dimensional linear quadratic (LQ) system. Previous work established the asymptotic convergence to an optimal controller for various adaptive control schemes. More recently, for the average cost LQ problem, a regret bound of ${O}(\sqrt{T})$ was shown, apart form logarithmic factors. However, this bound scales exponentially with $p$, the dimension o… ▽ More

    Submitted 24 March, 2013; originally announced March 2013.

    Comments: 16 pages

    Journal ref: Advances in Neural Information Processing Systems (NIPS) 2012: 2645-2653

  11. arXiv:1212.4269  [pdf, other

    math.OC cs.CE stat.ML

    Accelerated Time-of-Flight Mass Spectrometry

    Authors: Morteza Ibrahimi, Andrea Montanari, George S Moore

    Abstract: We study a simple modification to the conventional time of flight mass spectrometry (TOFMS) where a \emph{variable} and (pseudo)-\emph{random} pulsing rate is used which allows for traces from different pulses to overlap. This modification requires little alteration to the currently employed hardware. However, it requires a reconstruction method to recover the spectrum from highly aliased traces.… ▽ More

    Submitted 28 July, 2013; v1 submitted 18 December, 2012; originally announced December 2012.

    Comments: 14 pages, 18 figures. This paper is submitted to IEEE Transaction on Signal Processing

  12. arXiv:1111.6214  [pdf, other

    cs.CE cs.LG math.OC

    Robust Max-Product Belief Propagation

    Authors: Morteza Ibrahimi, Adel Javanmard, Yashodhan Kanoria, Andrea Montanari

    Abstract: We study the problem of optimizing a graph-structured objective function under \emph{adversarial} uncertainty. This problem can be modeled as a two-persons zero-sum game between an Engineer and Nature. The Engineer controls a subset of the variables (nodes in the graph), and tries to assign their values to maximize an objective function. Nature controls the complementary subset of variables and tr… ▽ More

    Submitted 26 November, 2011; originally announced November 2011.

    Comments: 7 pages, 4 figures

  13. arXiv:1107.5377  [pdf, ps, other

    cs.DM cond-mat.dis-nn math.PR

    The set of solutions of random XORSAT formulae

    Authors: Morteza Ibrahimi, Yash Kanoria, Matt Kraning, Andrea Montanari

    Abstract: The XOR-satisfiability (XORSAT) problem requires finding an assignment of $n$ Boolean variables that satisfy $m$ exclusive OR (XOR) clauses, whereby each clause constrains a subset of the variables. We consider random XORSAT instances, drawn uniformly at random from the ensemble of formulae containing $n$ variables and $m$ clauses of size $k$. This model presents several structural similarities to… ▽ More

    Submitted 9 September, 2015; v1 submitted 26 July, 2011; originally announced July 2011.

    Comments: Published at http://dx.doi.org/10.1214/14-AAP1060 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP1060

    Journal ref: Annals of Applied Probability 2015, Vol. 25, No. 5, 2743-2808

  14. arXiv:1103.1689  [pdf, other

    cs.IT cs.LG math.ST q-fin.ST stat.ML

    Information Theoretic Limits on Learning Stochastic Differential Equations

    Authors: José Bento, Morteza Ibrahimi, Andrea Montanari

    Abstract: Consider the problem of learning the drift coefficient of a stochastic differential equation from a sample path. In this paper, we assume that the drift is parametrized by a high dimensional vector. We address the question of how long the system needs to be observed in order to learn this vector of parameters. We prove a general lower bound on this time complexity by using a characterization of mu… ▽ More

    Submitted 8 March, 2011; originally announced March 2011.

    Comments: 6 pages, 2 figures, conference version

  15. arXiv:1011.0415  [pdf, ps, other

    math.ST cond-mat.stat-mech cs.IT cs.LG

    Learning Networks of Stochastic Differential Equations

    Authors: José Bento, Morteza Ibrahimi, Andrea Montanari

    Abstract: We consider linear models for stochastic dynamics. To any such model can be associated a network (namely a directed graph) describing which degrees of freedom interact under the dynamics. We tackle the problem of learning such a network from observation of the system trajectory over a time interval $T$. We analyze the $\ell_1$-regularized least squares algorithm and, in the setting in which the… ▽ More

    Submitted 1 November, 2010; originally announced November 2010.

    Comments: This publication is to appear in NIPS 2010