Showing 1–18 of 18 results for author: Marina, M

  1. arXiv:2507.01803  [pdf, ps, other]

    cs.LG

    Towards Decentralized and Sustainable Foundation Model Training with the Edge

    Authors: Leyang Xue, Meghana Madhyastha, Randal Burns, Myungjin Lee, Mahesh K. Marina

    Abstract: Foundation models are at the forefront of AI research, appealing for their ability to learn from vast datasets and cater to diverse tasks. Yet, their significant computational demands raise issues of environmental impact and the risk of centralized control in their development. We put forward a vision towards decentralized and sustainable foundation model training that leverages the collective com…

    Submitted 2 July, 2025; originally announced July 2025.

  2. arXiv:2506.23740  [pdf, ps, other]

    cs.NI

    Campus5G: A Campus Scale Private 5G Open RAN Testbed

    Authors: Andrew E. Ferguson, Ujjwal Pawar, Tianxin Wang, Mahesh K. Marina

    Abstract: Mobile networks are embracing disaggregation, reflected by the industry trend towards Open RAN. Private 5G networks are viewed as particularly suitable contenders as early adopters of Open RAN, owing to their setting, high degree of control, and opportunity for innovation they present. Motivated by this, we have recently deployed Campus5G, the first of its kind campus-wide, O-RAN-compliant private…

    Submitted 30 June, 2025; originally announced June 2025.

    ACM Class: C.2.1

  3. arXiv:2505.21115  [pdf, other]

    cs.CL

    Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

    Authors: Sergey Pletenev, Maria Marina, Nikolay Ivanov, Daria Galimzianova, Nikita Krayko, Mikhail Salnikov, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

    Abstract: Large Language Models (LLMs) often hallucinate in question answering (QA) tasks. A key yet underexplored factor contributing to this is the temporality of questions -- whether they are evergreen (answers remain stable over time) or mutable (answers change). In this work, we introduce EverGreenQA, the first multilingual QA dataset with evergreen labels, supporting both evaluation and training. Usin…

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.12566  [pdf, ps, other]

    cs.LG

    HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing

    Authors: Leyang Xue, Yao Fu, Luo Mai, Mahesh K. Marina

    Abstract: Giant Deep Neural Networks (DNNs) have become indispensable for accurate and robust support of large-scale cloud-based AI services. However, serving giant DNNs is prohibitively expensive from an energy consumption viewpoint, easily exceeding that of training, due to the enormous scale of GPU clusters needed to hold giant DNN model partitions and replicas. Existing approaches can either optimize en…

    Submitted 18 May, 2025; originally announced May 2025.

  5. arXiv:2505.04253  [pdf, other]

    cs.CL cs.LG

    LLM-Independent Adaptive RAG: Let the Question Speak for Itself

    Authors: Maria Marina, Nikolay Ivanov, Sergey Pletenev, Mikhail Salnikov, Daria Galimzianova, Nikita Krayko, Vasily Konovalov, Alexander Panchenko, Viktor Moskvoretskii

    Abstract: Large Language Models (LLMs) are prone to hallucinations, and Retrieval-Augmented Generation (RAG) helps mitigate this, but at a high computational cost while risking misinformation. Adaptive retrieval aims to retrieve only when necessary, but existing approaches rely on LLM-based uncertainty estimation, which remains inefficient and impractical. In this study, we introduce lightweight LLM-independ…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 figures, 2 tables

  6. arXiv:2502.14502  [pdf, other]

    cs.CL

    How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

    Authors: Sergey Pletenev, Maria Marina, Daniil Moskovskiy, Vasily Konovalov, Pavel Braslavski, Alexander Panchenko, Mikhail Salnikov

    Abstract: The performance of Large Language Models (LLMs) on many tasks is greatly limited by the knowledge learned during pre-training and stored in the model's parameters. Low-rank adaptation (LoRA) is a popular and efficient training technique for updating or domain-specific adaptation of LLMs. In this study, we investigate how new facts can be incorporated into the LLM using LoRA without compromising th…

    Submitted 24 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  7. arXiv:2411.10108  [pdf, other]

    physics.ao-ph cs.AI

    Identifying Key Drivers of Heatwaves: A Novel Spatio-Temporal Framework for Extreme Event Detection

    Authors: J. Pérez-Aracil, C. Peláez-Rodríguez, Ronan McAdam, Antonello Squintu, Cosmin M. Marina, Eugenio Lorente-Ramos, Niklas Luther, Veronica Torralba, Enrico Scoccimarro, Leone Cavicchia, Matteo Giuliani, Eduardo Zorita, Felicitas Hansen, David Barriopedro, Ricardo Garcia-Herrera, Pedro A. Gutiérrez, Jürg Luterbacher, Elena Xoplaki, Andrea Castelletti, S. Salcedo-Sanz

    Abstract: Heatwaves (HWs) are extreme atmospheric events that produce significant societal and environmental impacts. Predicting these extreme events remains challenging, as their complex interactions with large-scale atmospheric and climatic variables are difficult to capture with traditional statistical and dynamical models. This work presents a general method for driver identification in extreme climate…

    Submitted 15 November, 2024; originally announced November 2024.

    Comments: 28 pages, 10 figures, 4 tables

  8. arXiv:2410.05468  [pdf, other]

    cs.CV

    PH-Dropout: Practical Epistemic Uncertainty Quantification for View Synthesis

    Authors: Chuanhao Sun, Thanos Triantafyllou, Anthos Makris, Maja Drmač, Kai Xu, Luo Mai, Mahesh K. Marina

    Abstract: View synthesis using Neural Radiance Fields (NeRF) and Gaussian Splatting (GS) has demonstrated impressive fidelity in rendering real-world scenarios. However, practical methods for accurate and efficient epistemic Uncertainty Quantification (UQ) in view synthesis are lacking. Existing approaches for NeRF either introduce significant computational overhead (e.g., "10x increase in training time" o…

    Submitted 11 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: 21 pages, in submission

  9. arXiv:2407.09370  [pdf, other]

    cs.LG

    Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding

    Authors: Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina

    Abstract: Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels. Despite their effectiveness, existing PEs require manual, empirical adjustment of crucial hyperparameters, specifically the Fourier features, tailored t…

    Submitted 17 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 16 pages, Conference, Accepted by ICML 2024

  10. arXiv:2401.14361  [pdf, other]

    cs.LG cs.PF

    MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache

    Authors: Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, Mahesh Marina

    Abstract: This paper presents MoE-Infinity, an efficient MoE inference system designed for personal machines with limited GPU memory capacity. The key idea for MoE-Infinity is that on personal machines, which are often single-user environments, MoE-based LLMs typically operate with a batch size of one. In this setting, MoE models exhibit a high degree of activation sparsity, meaning a small number of expert…

    Submitted 12 March, 2025; v1 submitted 25 January, 2024; originally announced January 2024.

  11. A Multifaceted Look at Starlink Performance

    Authors: Nitinder Mohan, Andrew Ferguson, Hendrik Cech, Prakita Rayyan Renatin, Rohan Bose, Mahesh Marina, Jörg Ott

    Abstract: In recent years, Low-Earth Orbit (LEO) mega-constellations have emerged as a promising network technology and have ushered in a new era for democratizing Internet access. The Starlink network from SpaceX stands out as the only consumer-facing LEO network with over 2M customers and more than 4000 operational satellites. In this paper, we conduct the first-of-its-kind extensive multi-faceted analys…

    Submitted 22 February, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted in ACM Web Conference 2024 (WWW 24)

    Journal ref: In Proceedings of ACM Web Conference 2024 (WWW 24)

  12. arXiv:2212.00742  [pdf, other]

    cs.NE cs.AI math.OC

    New Probabilistic-Dynamic Multi-Method Ensembles for Optimization based on the CRO-SL

    Authors: Jorge Pérez-Aracil, Carlos Camacho-Gómez, Eugenio Lorente-Ramos, Cosmin M. Marina, Sancho Salcedo-Sanz

    Abstract: In this paper we propose new probabilistic and dynamic (adaptive) strategies to create multi-method ensembles based on the Coral Reefs Optimization with Substrate Layers (CRO-SL) algorithm. The CRO-SL is an evolutionary-based ensemble approach, able to combine different search procedures within a single population. In this work we discuss two different probabilistic strategies to improve the algor…

    Submitted 2 December, 2022; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: 18 pages, 6 figures, 5 tables

    MSC Class: 68T01 and 68T20

  13. arXiv:2009.02473  [pdf, other]

    cs.NI cs.LG

    Examining Machine Learning for 5G and Beyond through an Adversarial Lens

    Authors: Muhammad Usama, Rupendra Nath Mitra, Inaam Ilahi, Junaid Qadir, Mahesh K. Marina

    Abstract: Spurred by the recent advances in deep learning to harness rich information hidden in large volumes of data and to tackle problems that are hard to model/solve (e.g., resource allocation problems), there is currently tremendous excitement in the mobile networks domain around the transformative potential of data-driven AI/ML based network automation, control and analytics for 5G and beyond. In this…

    Submitted 5 September, 2020; originally announced September 2020.

  14. arXiv:2007.11472  [pdf, other]

    cs.NI cs.LG

    Characterization and Identification of Cloudified Mobile Network Performance Bottlenecks

    Authors: G. Patounas, X. Foukas, A. Elmokashfi, M. K. Marina

    Abstract: This study is a first attempt to experimentally explore the range of performance bottlenecks that 5G mobile networks can experience. To this end, we leverage a wide range of measurements obtained with a prototype testbed that captures the key aspects of a cloudified mobile network. We investigate the relevance of the metrics and a number of approaches to accurately and efficiently identify bottlen…

    Submitted 23 July, 2020; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: 17 pages, 16 figures, documentclass[journal,comsoc]{IEEEtran}, corrected title

  15. Urban Vibes and Rural Charms: Analysis of Geographic Diversity in Mobile Service Usage at National Scale

    Authors: Rajkarn Singh, Marco Fiore, Mahesh K. Marina, Alessandro Nordio, Alberto Tarable

    Abstract: We investigate spatial patterns in mobile service consumption that emerge at national scale. Our investigation focuses on a representative case study, i.e., France, where we find that: (i) the demand for popular mobile services is fairly uniform across the whole country, and only a reduced set of peculiar services (mainly operating system updates and long-lived video streaming) yields geographic d…

    Submitted 1 March, 2019; originally announced March 2019.

    Comments: to be published in Proceedings of the 2019 World Wide Web Conference (WWW'19), May 13-17 2019, San Francisco, CA, USA. 11 pages, 11 figures, 2 tables

  16. Iris: Deep Reinforcement Learning Driven Shared Spectrum Access Architecture for Indoor Neutral-Host Small Cells

    Authors: Xenofon Foukas, Mahesh K. Marina, Kimon Kontovasilis

    Abstract: We consider indoor mobile access, a vital use case for current and future mobile networks. For this key use case, we outline a vision that combines a neutral-host based shared small-cell infrastructure with a common pool of spectrum for dynamic sharing as a way forward to proliferate indoor small-cell deployments and open up the mobile operator ecosystem. Towards this vision, we focus on the chall…

    Submitted 24 July, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

  17. Holistic Small Cell Traffic Balancing across Licensed and Unlicensed Bands

    Authors: Ursula Challita, Mahesh K. Marina

    Abstract: Due to the dramatic growth in mobile data traffic on one hand and the scarcity of the licensed spectrum on the other hand, mobile operators are considering the use of unlicensed bands (especially those in 5 GHz) as complementary spectrum for providing higher system capacity and better user experience. This approach is currently being standardized by 3GPP under the name of LTE Licensed-Assisted Acc…

    Submitted 13 September, 2016; v1 submitted 17 August, 2016; originally announced August 2016.

    Comments: Accepted for publication at MSWiM 2016

  18. arXiv:1307.0962  [pdf, ps, other]

    cs.NI

    Auctioning based Coordinated TV White Space Spectrum Sharing for Home Networks

    Authors: Saravana Manickam, Mahesh K. Marina, Sofia Pediaditaki, Maziar Nekovee

    Abstract: The idea of having the geolocation database monitor the secondary use of TV white space (TVWS) spectrum and assist in coordinating the secondary usage is gaining ground. Considering the home networking use case, we leverage the geolocation database for interference-aware coordinated TVWS sharing among secondary users (home networks) using short-term auctions, thereby realize a dynamic second…

    Submitted 5 December, 2013; v1 submitted 3 July, 2013; originally announced July 2013.