-
Latent Guidance in Diffusion Models for Perceptual Evaluations
Authors:
Shreshth Saini,
Ru-Ling Liao,
Yan Ye,
Alan C. Bovik
Abstract:
Despite recent advancements in latent diffusion models that generate high-dimensional image data and perform various downstream tasks, there has been little exploration into perceptual consistency within these models on the task of No-Reference Image Quality Assessment (NR-IQA). In this paper, we hypothesize that latent diffusion models implicitly exhibit perceptually consistent local regions within the data manifold. We leverage this insight to guide on-manifold sampling using perceptual features and input measurements. Specifically, we propose Perceptual Manifold Guidance (PMG), an algorithm that utilizes pretrained latent diffusion models and perceptual quality features to obtain perceptually consistent multi-scale and multi-timestep feature maps from the denoising U-Net. We empirically demonstrate that these hyperfeatures exhibit high correlation with human perception in IQA tasks. Our method can be applied to any existing pretrained latent diffusion model and is straightforward to integrate. To the best of our knowledge, this paper is the first work on guiding diffusion models with perceptual features for NR-IQA. Extensive experiments on IQA datasets show that our method, LGDM, achieves state-of-the-art performance, underscoring the superior generalization capabilities of diffusion models for NR-IQA tasks.
Submitted 30 May, 2025;
originally announced June 2025.
-
Power law $α$-Starobinsky inflation
Authors:
Saisandri Saini,
Akhilesh Nautiyal
Abstract:
In this work we consider a generalization of Starobinsky inflation obtained by combining power law ($R^β$) and $α$-Starobinsky inflation ($E$-model). The Einstein frame potential for this model is that of power law Starobinsky inflation modified by a parameter $α$ in the exponential. After computing power spectra for scalar and tensor perturbations numerically, we perform MCMC analysis to put constraints on the potential parameters $α$, $β$ and $M$, and the number of e-foldings $N_{pivot}$ during inflation, using Planck-2018, BICEP/Keck (BK18) and other LSS observations. We find $\log_{10}α= 0.37^{+0.82}_{-0.85}$, $β= 1.969^{+0.020}_{-0.023}$, $M=\left(3.54^{+2.62}_{-1.73}\right)\times 10^{-5}$ and $N_{pivot} = 47\pm 10$. We compute the Bayesian evidences for our proposed model, power law Starobinsky inflation, $α$-Starobinsky inflation and Starobinsky inflation. Considering the Starobinsky model as the base model, we calculate the Bayes factor and find that our proposed model is preferred by the CMB and LSS observations.
Submitted 22 May, 2025;
originally announced May 2025.
-
The Muon Collider
Authors:
Carlotta Accettura,
Simon Adrian,
Rohit Agarwal,
Claudia Ahdida,
Chiara Aimé,
Avni Aksoy,
Gian Luigi Alberghi,
Siobhan Alden,
Luca Alfonso,
Muhammad Ali,
Anna Rita Altamura,
Nicola Amapane,
Kathleen Amm,
David Amorim,
Paolo Andreetto,
Fabio Anulli,
Ludovica Aperio Bella,
Rob Appleby,
Artur Apresyan,
Pouya Asadi,
Mohammed Attia Mahmoud,
Bernhard Auchmann,
John Back,
Anthony Badea,
Kyu Jung Bae
, et al. (433 additional authors not shown)
Abstract:
Muons offer a unique opportunity to build a compact high-energy electroweak collider at the 10 TeV scale. A Muon Collider enables direct access to the underlying simplicity of the Standard Model and unparalleled reach beyond it. It will be a paradigm-shifting tool for particle physics representing the first collider to combine the high-energy reach of a proton collider and the high precision of an electron-positron collider, yielding a physics potential significantly greater than the sum of its individual parts. A high-energy muon collider is the natural next step in the exploration of fundamental physics after the HL-LHC and a natural complement to a future low-energy Higgs factory. Such a facility would significantly broaden the scope of particle colliders, engaging the many frontiers of the high energy community.
The last European Strategy for Particle Physics Update and later the Particle Physics Project Prioritisation Panel in the US requested a study of the muon collider, which is being carried out by the International Muon Collider Collaboration. In this comprehensive document we present the physics case and the state of the work on accelerator design and technology, and propose an R&D project that can make the muon collider a reality.
Submitted 30 April, 2025;
originally announced April 2025.
-
Effect of pressure on the transport properties and thermoelectric performance of Dirac semimetal ZrTe5
Authors:
Sanskar Mishra,
Nagendra Singh,
V. K. Gangwar,
Rajan Walia,
Manindra Kumar,
Udai Bhan Singh,
Deepash Sekhar Saini,
Jianping Sun,
Genfu Chen,
Dilip Bhoi,
Sandip Chatterjee,
Yoshiya Uwatoko,
Jinguang Cheng,
Prashant Shahi
Abstract:
In this study, we have investigated and compared the effect of hydrostatic pressure up to ~20 kbar on the transport properties of ZrTe5 single crystals grown by chemical vapor transport (CVT) and flux methods. With the application of pressure, the electrical resistivity ρ(T) and thermopower S(T) of both crystals were found to increase over the whole temperature range, unlike other known thermoelectric materials such as Bi2Te3 and SnSe. This observation is supported by complementary first-principles band structure calculations, which show that the application of pressure widens the direct bandgap at the Γ point. Moreover, the analysis of the pressure-dependent magneto-transport and Shubnikov-de Haas oscillation results revealed an increase in carrier concentration and effective mass, along with a reduction of mobility, as pressure rises. Furthermore, with the application of pressure, the flux-grown ZrTe5 crystals display a transition from unipolar to bipolar charge transport, as evidenced by the emergence of a resistivity peak at T* under high pressure, unlike the CVT-grown ZrTe5 crystals, where the bipolar charge transport near the characteristic resistivity peak (Tp) remains unaffected.
Submitted 23 April, 2025;
originally announced April 2025.
-
Prototype Guided Backdoor Defense
Authors:
Venkat Adithya Amula,
Sunayana Samavedam,
Saurabh Saini,
Avani Gupta,
Narayanan P J
Abstract:
Deep learning models are susceptible to backdoor attacks, in which malicious attackers perturb a small subset of training data with a trigger to cause misclassifications. Various triggers have been used, including semantic triggers that are easily realizable without requiring the attacker to manipulate the image. The emergence of generative AI has eased the generation of varied poisoned samples. Robustness across types of triggers is crucial to effective defense. We propose Prototype Guided Backdoor Defense (PGBD), a robust post-hoc defense that scales across different trigger types, including previously unsolved semantic triggers. PGBD exploits displacements in the geometric spaces of activations to penalize movements toward the trigger. This is done using a novel sanitization loss of a post-hoc fine-tuning step. The geometric approach scales easily to all types of attacks. PGBD achieves better performance across all settings. We also present the first defense against a new semantic attack on celebrity face images. Project page: https://venkatadithya9.github.io/pgbd.github.io/
Submitted 26 March, 2025;
originally announced March 2025.
-
Towards Efficient Large Scale Spatial-Temporal Time Series Forecasting via Improved Inverted Transformers
Authors:
Jiarui Sun,
Chin-Chia Michael Yeh,
Yujie Fan,
Xin Dai,
Xiran Fan,
Zhimeng Jiang,
Uday Singh Saini,
Vivian Lai,
Junpeng Wang,
Huiyuan Chen,
Zhongfang Zhuang,
Yan Zheng,
Girish Chowdhary
Abstract:
Time series forecasting at scale presents significant challenges for modern prediction systems, particularly when dealing with large sets of synchronized series, such as in a global payment network. In such systems, three key challenges must be overcome for accurate and scalable predictions: 1) emergence of new entities, 2) disappearance of existing entities, and 3) the large number of entities present in the data. The recently proposed Inverted Transformer (iTransformer) architecture has shown promising results by effectively handling variable entities. However, its practical application in large-scale settings is limited by quadratic time and space complexity ($O(N^2)$) with respect to the number of entities $N$. In this paper, we introduce EiFormer, an improved inverted transformer architecture that maintains the adaptive capabilities of iTransformer while reducing computational complexity to linear scale ($O(N)$). Our key innovation lies in restructuring the attention mechanism to eliminate redundant computations without sacrificing model expressiveness. Additionally, we incorporate a random projection mechanism that not only enhances efficiency but also improves prediction accuracy through better feature representation. Extensive experiments on the public LargeST benchmark dataset and a proprietary large-scale time series dataset demonstrate that EiFormer significantly outperforms existing methods in both computational efficiency and forecasting accuracy. Our approach enables practical deployment of transformer-based forecasting in industrial applications where handling time series at scale is essential.
Submitted 13 March, 2025;
originally announced March 2025.
-
Can KAN CANs? Input-convex Kolmogorov-Arnold Networks (KANs) as hyperelastic constitutive artificial neural networks (CANs)
Authors:
Prakash Thakolkaran,
Yaqi Guo,
Shivam Saini,
Mathias Peirlinck,
Benjamin Alheit,
Siddhant Kumar
Abstract:
Traditional constitutive models rely on hand-crafted parametric forms with limited expressivity and generalizability, while neural network-based models can capture complex material behavior but often lack interpretability. To balance these trade-offs, we present monotonic Input-Convex Kolmogorov-Arnold Networks (ICKANs) for learning polyconvex hyperelastic constitutive laws. ICKANs leverage the Kolmogorov-Arnold representation, decomposing the model into compositions of trainable univariate spline-based activation functions for rich expressivity. We introduce trainable monotonic input-convex splines within the KAN architecture, ensuring physically admissible polyconvex models for isotropic compressible hyperelasticity. The resulting models are both compact and interpretable, enabling explicit extraction of analytical constitutive relationships through a monotonic input-convex symbolic regression technique. Through unsupervised training on full-field strain data and limited global force measurements, ICKANs accurately capture nonlinear stress-strain behavior across diverse strain states. Finite element simulations of unseen geometries with trained ICKAN hyperelastic constitutive models confirm the framework's robustness and generalization capability.
Submitted 4 June, 2025; v1 submitted 7 March, 2025;
originally announced March 2025.
-
Visual Attention Exploration in Vision-Based Mamba Models
Authors:
Junpeng Wang,
Chin-Chia Michael Yeh,
Uday Singh Saini,
Mahashweta Das
Abstract:
State space models (SSMs) have emerged as an efficient alternative to transformer-based models, offering linear complexity that scales better than transformers. One of the latest advances in SSMs, Mamba, introduces a selective scan mechanism that assigns trainable weights to input tokens, effectively mimicking the attention mechanism. Mamba has also been successfully extended to the vision domain by decomposing 2D images into smaller patches and arranging them as 1D sequences. However, it remains unclear how these patches interact with (or attend to) each other in relation to their original 2D spatial location. Additionally, the order used to arrange the patches into a sequence also significantly impacts their attention distribution. To better understand the attention between patches and explore the attention patterns, we introduce a visual analytics tool specifically designed for vision-based Mamba models. This tool enables a deeper understanding of how attention is distributed across patches in different Mamba blocks and how it evolves throughout a Mamba model. Using the tool, we also investigate the impact of different patch-ordering strategies on the learned attention, offering further insights into the model's behavior.
Submitted 28 February, 2025;
originally announced February 2025.
-
A Compact Model for Large-Scale Time Series Forecasting
Authors:
Chin-Chia Michael Yeh,
Xiran Fan,
Zhimeng Jiang,
Yujie Fan,
Huiyuan Chen,
Uday Singh Saini,
Vivian Lai,
Xin Dai,
Junpeng Wang,
Zhongfang Zhuang,
Liang Wang,
Yan Zheng
Abstract:
Spatio-temporal data, which commonly arise in real-world applications such as traffic monitoring, financial transactions, and ride-share demands, represent a special category of multivariate time series. They exhibit two distinct characteristics: high dimensionality and commensurability across spatial locations. These attributes call for computationally efficient modeling approaches and facilitate the use of univariate forecasting models in a channel-independent fashion. SparseTSF, a recently introduced competitive univariate forecasting model, harnesses periodicity to achieve compactness by concentrating on cross-period dynamics, thereby extending the Pareto frontier with respect to model size and predictive performance. Nonetheless, it underperforms on spatio-temporal data due to an inadequate capture of intra-period temporal dependencies. To address this shortcoming, we propose UltraSTF, which integrates a cross-period forecasting module with an ultra-compact shape bank component. Our model effectively detects recurring patterns in time series through the attention mechanism of the shape bank component, thereby strengthening its ability to learn intra-period dynamics. UltraSTF achieves state-of-the-art performance on the LargeST benchmark while employing fewer than 0.2% of the parameters required by the second-best approaches, thus further extending the Pareto frontier of existing methods.
Submitted 27 February, 2025;
originally announced February 2025.
-
Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation
Authors:
Shubham Agarwal,
Sai Sundaresan,
Subrata Mitra,
Debabrata Mahapatra,
Archit Gupta,
Rounak Sharma,
Nirmal Joshua Kapu,
Tong Yu,
Shiv Saini
Abstract:
Retrieval-Augmented Generation (RAG) is often used with Large Language Models (LLMs) to infuse domain knowledge or user-specific information. In RAG, given a user query, a retriever extracts chunks of relevant text from a knowledge base. These chunks are sent to an LLM as part of the input prompt. Typically, any given chunk is repeatedly retrieved across user questions. However, for every question, the attention layers in LLMs currently recompute the key values (KVs) for the input chunks in full, as state-of-the-art methods cannot reuse KV-caches when chunks appear at arbitrary locations with arbitrary contexts, and naive reuse degrades output quality. This leads to potentially redundant computations on expensive GPUs and increases latency. In this work, we propose Cache-Craft, a system for managing and reusing precomputed KVs corresponding to the text chunks (which we call chunk-caches) in RAG-based systems. We present how to identify chunk-caches that are reusable, how to efficiently perform a small fraction of recomputation to fix the cache to maintain output quality, and how to efficiently store and evict chunk-caches in the hardware for maximizing reuse while masking any overheads. With real production workloads as well as synthetic datasets, we show that Cache-Craft reduces redundant computation by 51% over SOTA prefix-caching and 75% over full recomputation. Additionally, with continuous batching on a real production workload, we get a 1.6X speed up in throughput and a 2X reduction in end-to-end response latency over prefix-caching while maintaining quality, for both the LLaMA-3-8B and LLaMA-3-70B models.
Submitted 5 February, 2025;
originally announced February 2025.
-
Exploring generalized Starobinsky Model of Inflation: Observational Constraints
Authors:
Saisandri Saini,
Akhilesh Nautiyal
Abstract:
We examine the power-law Starobinsky model, a generalized version of the Starobinsky inflation model, characterized by a power-law correction to Einstein gravity. Employing the $f(R)$ formalism, the scalar and tensor power spectra were numerically computed as functions of the dimensionless parameters $M$ and $β$. A Markov Chain Monte Carlo (MCMC) analysis was conducted using Planck-2018, BICEP3 and BAO observational data, yielding precise constraints of $β= 1.987^{+0.013}_{-0.016}$ (95\% C.L.) and $\log_{10}M = -4.72^{+0.21}_{-0.20}$. The derived scalar spectral index $n_s=0.9676^{+0.0069}_{-0.0068}$ and tensor-to-scalar ratio $r=0.0074^{+0.0061}_{-0.0044}$ lie within the bounds set by Planck observations. We analyse a general reheating scenario while keeping the number of e-folds during inflation, $N_{pivot}$, fixed. The analysis confirms that deviations from the Starobinsky $R^2$ model are observationally viable, with implications for high-energy physics and supergravity-based inflationary models.
Submitted 6 February, 2025;
originally announced February 2025.
-
Tab-Shapley: Identifying Top-k Tabular Data Quality Insights
Authors:
Manisha Padala,
Lokesh Nagalapatti,
Atharv Tyagi,
Ramasuri Narayanam,
Shiv Kumar Saini
Abstract:
We present an unsupervised method for aggregating anomalies in tabular datasets by identifying the top-k tabular data quality insights. Each insight consists of a set of anomalous attributes and the corresponding subsets of records that serve as evidence to the user. The process of identifying these insight blocks is challenging due to (i) the absence of labeled anomalies, (ii) the exponential size of the subset search space, and (iii) the complex dependencies among attributes, which obscure the true sources of anomalies. Simple frequency-based methods fail to capture these dependencies, leading to inaccurate results. To address this, we introduce Tab-Shapley, a cooperative game theory based framework that uses Shapley values to quantify the contribution of each attribute to the data's anomalous nature. While calculating Shapley values typically requires exponential time, we show that our game admits a closed-form solution, making the computation efficient. We validate the effectiveness of our approach through empirical analysis on real-world tabular datasets with ground-truth anomaly labels.
Submitted 11 January, 2025;
originally announced January 2025.
-
A Review of the Duality of Adversarial Learning in Network Intrusion: Attacks and Countermeasures
Authors:
Shalini Saini,
Anitha Chennamaneni,
Babatunde Sawyerr
Abstract:
Deep learning solutions are instrumental in cybersecurity, harnessing their ability to analyze vast datasets, identify complex patterns, and detect anomalies. However, malevolent actors can exploit these capabilities to orchestrate sophisticated attacks, posing significant challenges to defenders and traditional security measures. Adversarial attacks, particularly those targeting vulnerabilities in deep learning models, present a nuanced and substantial threat to cybersecurity. Our study delves into adversarial learning threats such as Data Poisoning, Test Time Evasion, and Reverse Engineering, specifically impacting Network Intrusion Detection Systems. Our research explores the intricacies and countermeasures of attacks to deepen understanding of network security challenges amidst adversarial threats. In our study, we present insights into the dynamic realm of adversarial learning and its implications for network intrusion. The intersection of adversarial attacks and defenses within network traffic data, coupled with advances in machine learning and deep learning techniques, represents a relatively underexplored domain. Our research lays the groundwork for strengthening defense mechanisms to address the potential breaches in network security and privacy posed by adversarial attacks. Through our in-depth analysis, we identify domain-specific research gaps, such as the scarcity of real-life attack data and the evaluation of AI-based solutions for network traffic. Our focus on these challenges aims to stimulate future research efforts toward the development of resilient network defense strategies.
Submitted 18 December, 2024;
originally announced December 2024.
-
Ekpyrosis in Quantum Gravitational Anisotropic Bouncing Models
Authors:
Rachel Brown,
A. Meenakshi McNamara,
Sahil Saini,
Parampreet Singh
Abstract:
We explore the isotropization of the universe starting from potentially large anisotropies in the bouncing models using the ekpyrotic mechanism. As an example of a concrete non-singular bouncing mechanism, we consider the effective description of loop quantum cosmology for Bianchi-I and Bianchi-IX spacetimes for ekpyrotic and ekpyrotic-like potentials. For both of these spacetimes the cosmological singularity is resolved via multiple short-duration non-singular bounces. We perform a large number of numerical simulations for a wide range of initial conditions and find that the relative strength of the anisotropies at the end of the bounce regime is noticeably reduced in more than $90\%$ of the simulations, providing strong evidence for the isotropization ability of the ekpyrotic potentials. While the ekpyrosis phase in all the simulations is found to be rather short-lived, isotropization occurs over cycles of rapid non-singular bounces in the Planck regime via enhancement of the contribution of the (isotropic) energy density relative to the anisotropies at the bounces. Achieving isotropization is found to be easier in Bianchi-I spacetimes when compared to Bianchi-IX spacetimes. Our results demonstrate that, while ekpyrosis might itself be insufficient to tame anisotropies at a single bounce, it can be significant when coupled with non-singular cycles in the bounce regime.
Submitted 16 December, 2024;
originally announced December 2024.
-
Iteration-Free Cooperative Distributed MPC through Multiparametric Programming
Authors:
Radhe S. T. Saini,
Parth R. Brahmbhatt,
Styliani Avraamidou,
Hari S. Ganesh
Abstract:
Cooperative Distributed Model Predictive Control (DiMPC) architecture employs local MPC controllers to control different subsystems, exchanging information with each other through an iterative procedure to enhance overall control performance compared to the decentralized architecture. However, this method can result in high communication between the controllers and computational costs. In this work, the amount of information exchanged and the computational costs of DiMPC are reduced significantly by developing novel iteration-free solution algorithms based on multiparametric (mp) programming. These algorithms replace the iterative procedure with simultaneous solutions of explicit mpDiMPC control law functions. The reduced communication among local controllers decreases system latency, which is crucial for real-time control applications. The effectiveness of the proposed iteration-free mpDiMPC algorithms is demonstrated through comprehensive numerical simulations involving groups of coupled linear subsystems, which are interconnected through their inputs and a cooperative plant-wide cost function.
Submitted 3 December, 2024; v1 submitted 21 November, 2024;
originally announced November 2024.
-
HARP: A Large-Scale Higher-Order Ambisonic Room Impulse Response Dataset
Authors:
Shivam Saini,
Jürgen Peissig
Abstract:
This contribution introduces a dataset of 7th-order Ambisonic Room Impulse Responses (HOA-RIRs), created using the Image Source Method. By employing higher-order Ambisonics, our dataset enables precise spatial audio reproduction, a critical requirement for realistic immersive audio applications. Leveraging the virtual simulation, we present a unique microphone configuration, based on the superposition principle, designed to optimize sound field coverage while addressing the limitations of traditional microphone arrays. The presented 64-microphone configuration allows us to capture RIRs directly in the Spherical Harmonics domain. The dataset features a wide range of room configurations, encompassing variations in room geometry, acoustic absorption materials, and source-receiver distances. A detailed description of the simulation setup is provided to support accurate reproduction. The dataset serves as a vital resource for researchers working on spatial audio, particularly in applications involving machine learning to improve room acoustics modeling and sound field synthesis. It further provides a very high level of spatial resolution and realism crucial for tasks such as source localization, reverberation prediction, and immersive sound reproduction.
Submitted 19 January, 2025; v1 submitted 21 November, 2024;
originally announced November 2024.
-
MuCol Milestone Report No. 5: Preliminary Parameters
Authors:
Carlotta Accettura,
Simon Adrian,
Rohit Agarwal,
Claudia Ahdida,
Chiara Aimé,
Avni Aksoy,
Gian Luigi Alberghi,
Siobhan Alden,
Luca Alfonso,
Nicola Amapane,
David Amorim,
Paolo Andreetto,
Fabio Anulli,
Rob Appleby,
Artur Apresyan,
Pouya Asadi,
Mohammed Attia Mahmoud,
Bernhard Auchmann,
John Back,
Anthony Badea,
Kyu Jung Bae,
E. J. Bahng,
Lorenzo Balconi,
Fabrice Balli,
Laura Bandiera
, et al. (369 additional authors not shown)
Abstract:
This document comprises a collection of updated preliminary parameters for the key parts of the muon collider. The updated preliminary parameters follow on from the October 2023 Tentative Parameters Report. Particular attention has been given to regions of the facility that are believed to hold greater technical uncertainty in their design and that have a strong impact on the cost and power consumption of the facility. The data is collected from a collaborative spreadsheet and transferred to Overleaf.
Submitted 5 November, 2024;
originally announced November 2024.
-
Multiscale Encoder and Omni-Dimensional Dynamic Convolution Enrichment in nnU-Net for Brain Tumor Segmentation
Authors:
Sahaj K. Mistry,
Sourav Saini,
Aashray Gupta,
Aayush Gupta,
Sunny Rai,
Vinit Jakhetiya,
Ujjwal Baid,
Sharath Chandra Guntuku
Abstract:
Brain tumor segmentation plays a crucial role in computer-aided diagnosis. This study introduces a novel segmentation algorithm utilizing a modified nnU-Net architecture. Within the nnU-Net architecture's encoder section, we enhance conventional convolution layers by incorporating omni-dimensional dynamic convolution layers, resulting in improved feature representation. Simultaneously, we propose a multi-scale attention strategy that harnesses contemporary insights from various scales. Our model's efficacy is demonstrated on diverse datasets from the BraTS-2023 challenge. Integrating omni-dimensional dynamic convolution (ODConv) layers and multi-scale features yields substantial improvement in the nnU-Net architecture's performance across multiple tumor segmentation datasets. Remarkably, our proposed model attains good accuracy during validation for the BraTS Africa dataset. The ODConv source code, along with the full training code, is available on GitHub.
Submitted 20 September, 2024;
originally announced September 2024.
-
Matrix Profile for Anomaly Detection on Multidimensional Time Series
Authors:
Chin-Chia Michael Yeh,
Audrey Der,
Uday Singh Saini,
Vivian Lai,
Yan Zheng,
Junpeng Wang,
Xin Dai,
Zhongfang Zhuang,
Yujie Fan,
Huiyuan Chen,
Prince Osei Aboagye,
Liang Wang,
Wei Zhang,
Eamonn Keogh
Abstract:
The Matrix Profile (MP), a versatile tool for time series data mining, has been shown to be effective in time series anomaly detection (TSAD). This paper delves into the problem of anomaly detection in multidimensional time series, a common occurrence in real-world applications. For instance, in a manufacturing factory, multiple sensors installed across the site collect time-varying data for analysis. The Matrix Profile, named for its role in profiling the matrix storing pairwise distances between subsequences of a univariate time series, becomes complex in multidimensional scenarios. If the input univariate time series has n subsequences, the pairwise distance matrix is an n x n matrix. In a multidimensional time series with d dimensions, the pairwise distance information must be stored in an n x n x d tensor. In this paper, we first analyze different strategies for condensing this tensor into a profile vector. We then investigate the potential of extending the MP to efficiently find k-nearest neighbors for anomaly detection. Finally, we benchmark the multidimensional MP against 19 baseline methods on 119 multidimensional TSAD datasets. The experiments cover three learning setups: unsupervised, supervised, and semi-supervised. MP is the only method that consistently delivers high performance across all setups.
Submitted 14 September, 2024;
originally announced September 2024.
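The n x n x d distance tensor described in the abstract above, and its reduction to a profile vector, can be sketched as follows. This is a minimal brute-force illustration, not the authors' implementation: the function names, the z-normalized Euclidean distance, the half-window exclusion zone, and the "average over dimensions, then take the nearest neighbor" condensing rule are all assumptions chosen for clarity, and the paper compares several condensing strategies that are not reproduced here.

```python
import numpy as np


def znorm_subsequences(ts, m):
    """Return an (n, m) array of z-normalized length-m subsequences of a 1-D series."""
    n = len(ts) - m + 1
    subs = np.stack([ts[i:i + m] for i in range(n)]).astype(float)
    mean = subs.mean(axis=1, keepdims=True)
    std = subs.std(axis=1, keepdims=True)
    std[std == 0] = 1.0  # guard against flat subsequences
    return (subs - mean) / std


def distance_tensor(X, m):
    """X has shape (length, d); return the (n, n, d) pairwise distance tensor."""
    length, d = X.shape
    n = length - m + 1
    D = np.empty((n, n, d))
    for k in range(d):
        subs = znorm_subsequences(X[:, k], m)
        diff = subs[:, None, :] - subs[None, :, :]
        D[:, :, k] = np.sqrt((diff ** 2).sum(axis=2))
    return D


def profile_from_tensor(D, m):
    """Condense (n, n, d) to a length-n profile (assumed rule: mean over d, then min)."""
    n = D.shape[0]
    avg = D.mean(axis=2)
    idx = np.arange(n)
    # Exclude trivial matches: a subsequence compared with itself or close neighbors.
    trivial = np.abs(idx[:, None] - idx[None, :]) <= max(1, m // 2)
    avg[trivial] = np.inf
    return avg.min(axis=1)


# Hypothetical demo: two periodic channels, with a flat anomaly injected in channel 0.
t = np.arange(200)
X = np.column_stack([np.sin(2 * np.pi * t / 20), np.cos(2 * np.pi * t / 20)])
X[100:110, 0] = 0.0
m = 20
profile = profile_from_tensor(distance_tensor(X, m), m)
anomaly_start = int(np.argmax(profile))  # subsequence with the most distant nearest neighbor
```

A high profile value means a subsequence has no close match elsewhere in the series, which is why the largest entry is read as the anomaly candidate. The brute-force tensor costs O(n² · d) memory, so this sketch is only suitable for short series.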
-
Observational constraints on $α$-Starobinsky inflation
Authors:
Saisandri Saini,
Akhilesh Nautiyal
Abstract:
In this work we revisit $α$-Starobinsky inflation, also known as the $E$-model, in the light of current CMB and LSS observations. The inflaton potential in the Einstein frame for this model contains a parameter $α$ in the exponential, which alters the predictions for the scalar and tensor power spectra of Starobinsky inflation. We obtain these power spectra numerically without using the slow-roll approximation and perform an MCMC analysis to put constraints on the parameters $M$ and $α$ from Planck-2018, BICEP/Keck (BK18) and other LSS observations. We consider a general reheating scenario by varying the number of e-foldings during inflation, $N_{pivot}$, along with the other parameters. We find $\log_{10}α= 0.0^{+1.6}_{-5.6}$, $\log_{10}M= -4.91^{+0.69}_{-2.7}$ and $N_{pivot} = 53.2^{+3.9}_{-5}$ at $95\%$ C.L. This implies that the present CMB and LSS observations are insufficient to constrain the parameter $α$. We also find that there is no correlation between $N_{pivot}$ and $α$.
Submitted 9 September, 2024;
originally announced September 2024.
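For orientation, the abstract above does not write out the potential. A commonly quoted form of the $E$-model potential in the $α$-attractor literature is given below as an assumption; the overall normalization $V_0$ and the reduced Planck mass $M_{\rm Pl}$ are symbols introduced here, and the paper's own parametrization in terms of $M$ may differ.

$$V(\varphi) = V_0\left(1 - e^{-\sqrt{\frac{2}{3α}}\,\varphi/M_{\rm Pl}}\right)^{2}$$

In this convention, setting $α = 1$ recovers the Einstein-frame potential of Starobinsky inflation, which is why $α$ is described as modifying the Starobinsky predictions.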
-
Preserving Individuality while Following the Crowd: Understanding the Role of User Taste and Crowd Wisdom in Online Product Rating Prediction
Authors:
Liang Wang,
Shubham Jain,
Yingtong Dou,
Junpeng Wang,
Chin-Chia Michael Yeh,
Yujie Fan,
Prince Aboagye,
Yan Zheng,
Xin Dai,
Zhongfang Zhuang,
Uday Singh Saini,
Wei Zhang
Abstract:
Numerous algorithms have been developed for online product rating prediction, but the specific influence of user and product information in determining the final prediction score remains largely unexplored. Existing research often relies on narrowly defined data settings, which overlooks real-world challenges such as the cold-start problem, cross-category information utilization, and scalability and deployment issues. To delve deeper into these aspects, and particularly to uncover the roles of individual user taste and collective wisdom, we propose a unique and practical approach that emphasizes historical ratings at both the user and product levels, encapsulated using a continuously updated dynamic tree representation. This representation effectively captures the temporal dynamics of users and products, leverages user information across product categories, and provides a natural solution to the cold-start problem. Furthermore, we have developed an efficient data processing strategy that makes this approach highly scalable and easily deployable. Comprehensive experiments in real industry settings demonstrate the effectiveness of our approach. Notably, our findings reveal that individual taste dominates over collective wisdom in online product rating prediction, a perspective that contrasts with the commonly observed wisdom of the crowd phenomenon in other domains. This dominance of individual user taste is consistent across various model types, including the boosting tree model, recurrent neural network (RNN), and transformer-based architectures. This observation holds true across the overall population, within individual product categories, and in cold-start scenarios. Our findings underscore the significance of individual user tastes in the context of online product rating prediction and the robustness of our approach across different model architectures.
Submitted 6 September, 2024;
originally announced September 2024.
-
Spherically symmetric loop quantum gravity: Schwarzschild spacetimes with a cosmological constant
Authors:
Esteban Mato,
Javier Olmedo,
Sahil Saini
Abstract:
We provide a quantization of the Schwarzschild spacetime in the presence of a cosmological constant, based on midisuperspace methods developed in the spherically symmetric sector of loop quantum gravity, using in particular the 'improved dynamics' scheme. We include both the de Sitter and anti-de Sitter cases. We find that the quantization puts a Planckian positive upper limit on the possible values of the cosmological constant similar to the bounds obtained earlier from studies of homogeneous spacetimes. This means that, for negative cosmological constant, no negative bound is found. Moreover, using semiclassical physical states, we obtain the effective metric and demonstrate the causal structure for various cases. Quantum gravity modifications ensure that the singularity is replaced by a transition surface in all the cases, where the curvature invariants approach mass-independent Planckian bounds. Analysis of the effective stress-energy tensor shows that the null energy condition is strongly violated in the vicinity of the transition surface. Moreover, it shows a weaker asymptotic fall off for a nonvanishing cosmological constant, which could have interesting phenomenological implications.
Submitted 8 March, 2025; v1 submitted 9 August, 2024;
originally announced August 2024.
-
Interim report for the International Muon Collider Collaboration (IMCC)
Authors:
C. Accettura,
S. Adrian,
R. Agarwal,
C. Ahdida,
C. Aimé,
A. Aksoy,
G. L. Alberghi,
S. Alden,
N. Amapane,
D. Amorim,
P. Andreetto,
F. Anulli,
R. Appleby,
A. Apresyan,
P. Asadi,
M. Attia Mahmoud,
B. Auchmann,
J. Back,
A. Badea,
K. J. Bae,
E. J. Bahng,
L. Balconi,
F. Balli,
L. Bandiera,
C. Barbagallo
, et al. (362 additional authors not shown)
Abstract:
The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the European LDG roadmap. The Muon Collider Study (MuC) covers the accelerator complex, detectors and physics for a future muon collider. In 2023, European Commission support was obtained for a design study of a muon collider (MuCol) [3]. This project started on 1st March 2023, with work-packages aligned with the overall muon collider studies. In preparation for and during the 2021-22 U.S. Snowmass process, the muon collider project parameters, technical studies and physics performance studies were performed and presented in great detail. Recently, the P5 panel [4] in the U.S. recommended muon collider R&D, proposed to join the IMCC, and envisaged that the U.S. should prepare to host a muon collider, calling this their "muon shot". In the past, the U.S. Muon Accelerator Programme (MAP) [5] has been instrumental in studies of concepts and technologies for a muon collider.
Submitted 28 January, 2025; v1 submitted 17 July, 2024;
originally announced July 2024.
-
Illustrating an Effective Workflow for Accelerated Materials Discovery
Authors:
Mrinalini Mulukutla,
A. Nicole Person,
Sven Voigt,
Lindsey Kuettner,
Branden Kappes,
Danial Khatamsaz,
Robert Robinson,
Daniel Salas,
Wenle Xu,
Daniel Lewis,
Hongkyu Eoh,
Kailu Xiao,
Haoren Wang,
Jaskaran Singh Saini,
Raj Mahat,
Trevor Hastings,
Matthew Skokan,
Vahid Attari,
Michael Elverud,
James D. Paramore,
Brady Butler,
Kenneth Vecchio,
Surya R. Kalidindi,
Douglas Allaire,
Ibrahim Karaman
, et al. (4 additional authors not shown)
Abstract:
Algorithmic materials discovery is a multi-disciplinary domain that integrates insights from specialists in alloy design, synthesis, characterization, experimental methodologies, computational modeling, and optimization. Central to this effort is a robust data management system paired with an interactive work platform. This platform should empower users not only to access others' data but also to integrate their analyses, paving the way for sophisticated data pipelines. To realize this vision, there is a need for an integrative collaboration platform, streamlined data sharing and analysis tools, and efficient communication channels. Such a collaborative mechanism should transcend geographical barriers, facilitating remote interaction and fostering a challenge-response dynamic. In this paper, we present our ongoing efforts in addressing the critical challenges related to an accelerated Materials Discovery Framework as a part of the High-Throughput Materials Discovery for Extreme Conditions Initiative. Our BIRDSHOT Center has successfully harnessed various tools and strategies, including the utilization of cloud-based storage, a standardized sample naming convention, a structured file system, the implementation of sample travelers, a robust sample tracking method, and the incorporation of knowledge graphs for efficient data management. Additionally, we present the development of a data collection platform, reinforcing seamless collaboration among our team members. In summary, this paper provides an illustration and insight into the various elements of an efficient and effective workflow within an accelerated materials discovery framework while highlighting the dynamic and adaptable nature of the data management tools and sharing platforms.
Submitted 21 May, 2024;
originally announced May 2024.
-
Deep Learning Descriptor Hybridization with Feature Reduction for Accurate Cervical Cancer Colposcopy Image Classification
Authors:
Saurabh Saini,
Kapil Ahuja,
Siddartha Chennareddy,
Karthik Boddupalli
Abstract:
Cervical cancer stands as a predominant cause of female mortality, underscoring the need for regular screenings to enable early diagnosis and preemptive treatment of pre-cancerous conditions. The transformation zone in the cervix, where cellular differentiation occurs, plays a critical role in the detection of abnormalities. Colposcopy has emerged as a pivotal tool in cervical cancer prevention since it provides a meticulous examination of cervical abnormalities. However, challenges in visual evaluation necessitate the development of Computer Aided Diagnosis (CAD) systems.
We propose a novel CAD system that combines the strengths of various deep-learning descriptors (ResNet50, ResNet101, and ResNet152) with appropriate feature normalization (min-max) as well as feature reduction technique (LDA). The combination of different descriptors ensures that all the features (low-level like edges and colour, high-level like shape and texture) are captured, feature normalization prevents biased learning, and feature reduction avoids overfitting. We do experiments on the IARC dataset provided by WHO. The dataset is initially segmented and balanced. Our approach achieves exceptional performance in the range of 97%-100% for both the normal-abnormal and the type classification. A competitive approach for type classification on the same dataset achieved 81%-91% performance.
Submitted 1 May, 2024;
originally announced May 2024.
-
Privacy and Security of Women's Reproductive Health Apps in a Changing Legal Landscape
Authors:
Shalini Saini,
Nitesh Saxena
Abstract:
FemTech, a rising trend in mobile apps, empowers women to digitally manage their health and family planning. However, privacy and security vulnerabilities in period-tracking and fertility-monitoring apps present significant risks, such as unintended pregnancies and legal consequences. Our approach involves manual observations of privacy policies and app permissions, along with dynamic and static analysis using multiple evaluation frameworks. Our research reveals that many of these apps gather personally identifiable information (PII) and sensitive healthcare data. Furthermore, our analysis identifies that 61% of the code vulnerabilities found in the apps are classified under the top-ten Open Web Application Security Project (OWASP) vulnerabilities. Our research emphasizes the significance of tackling the privacy and security vulnerabilities present in period-tracking and fertility-monitoring mobile apps. By highlighting these crucial risks, we aim to initiate a vital discussion and advocate for increased accountability and transparency of digital tools for women's health. We encourage the industry to prioritize user privacy and security, ultimately promoting a safer and more secure environment for women's health management.
Submitted 8 April, 2024;
originally announced April 2024.
-
Specularity Factorization for Low-Light Enhancement
Authors:
Saurabh Saini,
P J Narayanan
Abstract:
We present a new additive image factorization technique that treats images as composed of multiple latent specular components, which can be simply estimated recursively by modulating the sparsity during decomposition. Our model-driven RSFNet estimates these factors by unrolling the optimization into network layers requiring only a few scalars to be learned. The resultant factors are interpretable by design and can be fused for different image enhancement tasks via a network or combined directly by the user in a controllable fashion. Based on RSFNet, we detail a zero-reference Low Light Enhancement (LLE) application trained without paired or unpaired supervision. Our system improves the state-of-the-art performance on standard benchmarks and achieves better generalization on multiple other datasets. We also integrate our factors with other task-specific fusion networks for applications like deraining, deblurring and dehazing with negligible overhead, thereby highlighting the multi-domain and multi-task generalizability of our proposed RSFNet. The code and data are released for reproducibility on the project homepage.
Submitted 2 April, 2024;
originally announced April 2024.
-
Workshop on the limiting compactness objects: Black holes and Buchdahl stars
Authors:
Dawood Kothawala,
Sahil Saini
Abstract:
The workshop was organized at IUCAA on Oct 30 - Nov 3, 2023 as a compact discussion and discourse meeting with a threadbare exposition and discussion of the various aspects and the questions arising. It was occasioned by the visit of Professor Hakan Andreasson of the Gothenburg Technical University, Sweden. He has been exploring with his collaborators the Einstein - Vlasov system for over a decade and a half as a possible matter source for compact objects. This system characterizes itself by free particles in motion and interacting only through gravity. For a limiting compactness, this may be the most appropriate state. The main thrust of the workshop was to understand this new object, Buchdahl Star (BS), of limiting compactness without a horizon. It is almost as compact as a black hole (BH) and yet has no horizon and hence is open for interaction with the outside world. Ever since the proposal of the membrane paradigm envisaging a timelike fiducial surface near BH horizon, BS offers an excellent possibility of the existence of such a real astrophysical object. It could very well compete with BH as a mimicker for various physical and astrophysical phenomena. Thus, it opens up a new vista of study and investigation of all the questions that one asks for BH, for this new creature, BS. The workshop was intended to identify certain interesting questions as well as the people interested in studying them. On this count, the workshop has been a huge success as several interesting questions have been identified, a few groups have been formed to take up different problems, and the work has already started. Nothing more could one have asked from such an exercise. A brief summary of some of the talks is included, followed by a brief discussion of the projects identified as a result of the discussions during the workshop.
Submitted 11 March, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
A Systematic Literature Review on Task Allocation and Performance Management Techniques in Cloud Data Center
Authors:
Nidhika Chauhan,
Navneet Kaur,
Kamaljit Singh Saini,
Sahil Verma,
Abdulatif Alabdulatif,
Ruba Abu Khurma,
Maribel Garcia-Arenas,
Pedro A. Castillo
Abstract:
As cloud computing usage grows, cloud data centers play an increasingly important role. To maximize resource utilization, ensure service quality, and enhance system performance, it is crucial to allocate tasks and manage performance effectively. The purpose of this study is to provide an extensive analysis of task allocation and performance management techniques employed in cloud data centers. The aim is to systematically categorize and organize previous research by identifying the cloud computing methodologies, categories, and gaps. A literature review was conducted, which included the analysis of 463 task allocation and 480 performance management papers. The review revealed three task allocation research topics and seven performance management methods. Task allocation research areas are resource allocation, load balancing, and scheduling. Performance management includes monitoring and control, power and energy management, resource utilization optimization, quality of service management, fault management, virtual machine management, and network management. The study proposes new techniques to enhance cloud computing work allocation and performance management. Shortcomings in each approach can guide future research. The research's findings on cloud data center task allocation and performance management can assist academics, practitioners, and cloud service providers in optimizing their systems for dependability, cost-effectiveness, and scalability. Innovative methodologies can steer future research to fill gaps in the literature.
Submitted 20 February, 2024;
originally announced February 2024.
-
RPMixer: Shaking Up Time Series Forecasting with Random Projections for Large Spatial-Temporal Data
Authors:
Chin-Chia Michael Yeh,
Yujie Fan,
Xin Dai,
Uday Singh Saini,
Vivian Lai,
Prince Osei Aboagye,
Junpeng Wang,
Huiyuan Chen,
Yan Zheng,
Zhongfang Zhuang,
Liang Wang,
Wei Zhang
Abstract:
Spatial-temporal forecasting systems play a crucial role in addressing numerous real-world challenges. In this paper, we investigate the potential of addressing spatial-temporal forecasting problems using general time series forecasting models, i.e., models that do not leverage the spatial relationships among the nodes. We propose an all-Multi-Layer Perceptron (all-MLP) time series forecasting architecture called RPMixer. The all-MLP architecture was chosen due to its recent success in time series forecasting benchmarks. Furthermore, our method capitalizes on the ensemble-like behavior of deep neural networks, where each individual block within the network behaves like a base learner in an ensemble model, particularly when identity mapping residual connections are incorporated. By integrating random projection layers into our model, we increase the diversity among the blocks' outputs, thereby improving the overall performance of the network. Extensive experiments conducted on the largest spatial-temporal forecasting benchmark datasets demonstrate that the proposed method outperforms alternative methods, including both spatial-temporal graph models and general forecasting models.
Submitted 12 June, 2024; v1 submitted 16 February, 2024;
originally announced February 2024.
-
Dynamic Multi Color Switching using Ultrathin Vanadium Oxide on Aluminium based Asymmetric Fabry-Perot Resonant Structure
Authors:
Shubhangi Saini,
Ashok P,
Amit Verma
Abstract:
Vanadium dioxide ($VO_{2}$) exhibits strong infrared optical switching due to its insulator-metal phase-transition property. However, in the visible wavelengths, its intrinsic optical switching is quite low. Current research explores solutions like multilayering, intricate structural patterning, high thermal budget processes and costly metals for improved color switching. Nonetheless, the color gamut coverage with these methodologies remains notably limited. This work overcomes these limitations and demonstrates dynamic multi-colour switching covering a large color gamut using a simple, unpatterned, ultrathin ($\sim \frac{λ}{14}$, where the wavelength $λ$ is taken as 575 nm at the center of the visible spectrum) asymmetric Fabry-Pérot structure of $VO_{2}$ on Aluminium (Al). We use the transfer matrix method to design the $VO_{2}/Aluminium\,(Al)/Sapphire$ structure for maximum visible reflectance switching. $VO_{2}$ films are synthesized using a simple, low thermal budget atmospheric oxidation of Vanadium (V). With varying oxidation durations, different colors of the oxidized samples are observed. Consistent and reversible color-switching is observed visibly and in reflectance measurements with the change in temperature from low (RT $\sim$ 30$^{\circ}$C) to high (HT $\sim$ 100$^{\circ}$C) or vice versa due to the phase transition property of the $VO_{2}$ layer in the structure. Compared to the existing studies, this work shows a significant change in chromaticities and covers a large color gamut when plotted on the CIE chromaticity diagram. This work has potential applications in the fields of display, thermochromic structures, and visible camouflage.
Submitted 7 January, 2024;
originally announced January 2024.
-
Has Your Pretrained Model Improved? A Multi-head Posterior Based Approach
Authors:
Prince Aboagye,
Yan Zheng,
Junpeng Wang,
Uday Singh Saini,
Xin Dai,
Michael Yeh,
Yujie Fan,
Zhongfang Zhuang,
Shubham Jain,
Liang Wang,
Wei Zhang
Abstract:
The emergence of pre-trained models has significantly impacted fields ranging from Natural Language Processing (NLP) and Computer Vision to relational datasets. Traditionally, these models are assessed through fine-tuned downstream tasks. However, this raises the question of how to evaluate these models more efficiently and more effectively. In this study, we explore a novel approach where we leverage the meta-features associated with each entity as a source of worldly knowledge and employ entity representations from the models. We propose using the consistency between these representations and the meta-features as a metric for evaluating pre-trained models. Our method's effectiveness is demonstrated across various domains, including models with relational datasets, large language models and image models.
Submitted 14 February, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Approximate Caching for Efficiently Serving Diffusion Models
Authors:
Shubham Agarwal,
Subrata Mitra,
Sarthak Chakraborty,
Srikrishna Karanam,
Koyel Mukherjee,
Shiv Saini
Abstract:
Text-to-image generation using diffusion models has seen explosive popularity owing to their ability to produce high quality images adhering to text prompts. However, production-grade diffusion model serving is a resource intensive task that not only requires expensive high-end GPUs but also incurs considerable latency. In this paper, we introduce a technique called approximate-caching that can reduce such iterative denoising steps for an image generation based on a prompt by reusing intermediate noise states created during a prior image generation for similar prompts. Based on this idea, we present an end-to-end text-to-image system, Nirvana, that uses approximate-caching with a novel cache management policy, Least Computationally Beneficial and Frequently Used (LCBFU), to provide % GPU compute savings, 19.8% end-to-end latency reduction and 19% dollar savings, on average, on two real production workloads. We further present an extensive characterization of real production text-to-image prompts from the perspective of caching, popularity and reuse of intermediate states in a large production environment.
Submitted 7 December, 2023;
originally announced December 2023.
-
SICKLE: A Multi-Sensor Satellite Imagery Dataset Annotated with Multiple Key Cropping Parameters
Authors:
Depanshu Sani,
Sandeep Mahato,
Sourabh Saini,
Harsh Kumar Agarwal,
Charu Chandra Devshali,
Saket Anand,
Gaurav Arora,
Thiagarajan Jayaraman
Abstract:
The availability of well-curated datasets has driven the success of Machine Learning (ML) models. Despite greater access to earth observation data in agriculture, there is a scarcity of curated and labelled datasets, which limits the potential of its use in training ML models for remote sensing (RS) in agriculture. To this end, we introduce a first-of-its-kind dataset called SICKLE, which constitutes a time-series of multi-resolution imagery from 3 distinct satellites: Landsat-8, Sentinel-1 and Sentinel-2. Our dataset comprises data from multi-spectral, thermal and microwave sensors for the January 2018 - March 2021 period. We construct each temporal sequence by considering the cropping practices followed by farmers primarily engaged in paddy cultivation in the Cauvery Delta region of Tamil Nadu, India, and annotate the corresponding imagery with key cropping parameters at multiple resolutions (i.e., 3m, 10m and 30m). Our dataset comprises 2,370 season-wise samples from 388 unique plots, having an average size of 0.38 acres, for classifying 21 crop types across 4 districts in the Delta, which amounts to approximately 209,000 satellite images. Out of the 2,370 samples, 351 paddy samples from 145 plots are annotated with multiple crop parameters, such as the variety of paddy, its growing season and productivity in terms of per-acre yields. Ours is also among the first studies that consider the growing season activities pertinent to crop phenology (spanning sowing, transplanting and harvesting dates) as parameters of interest. We benchmark SICKLE on three tasks: crop type, crop phenology (sowing, transplanting, harvesting), and yield prediction.
Submitted 29 November, 2023;
originally announced December 2023.
-
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement
Authors:
Avani Gupta,
Saurabh Saini,
P J Narayanan
Abstract:
Humans use abstract concepts for understanding instead of hard features. Recent interpretability research has focused on human-centered concept explanations of neural networks. Concept Activation Vectors (CAVs) estimate a model's sensitivity and possible biases to a given concept. In this paper, we extend CAVs from post-hoc analysis to ante-hoc training in order to reduce model bias through fine-tuning using an additional Concept Loss. Concepts were defined on the final layer of the network in the past. We generalize it to intermediate layers using class prototypes. This facilitates class learning in the last convolution layer, which is known to be most informative. We also introduce Concept Distillation to create richer concepts using a pre-trained knowledgeable model as the teacher. Our method can sensitize or desensitize a model towards concepts. We show applications of concept-sensitive training to debias several classification problems. We also use concepts to induce prior knowledge into IID, a reconstruction problem. Concept-sensitive training can improve model interpretability, reduce biases, and induce prior knowledge. Please visit https://avani17101.github.io/Concept-Distilllation/ for code and more details.
Submitted 26 November, 2023;
originally announced November 2023.
-
HIDRO-VQA: High Dynamic Range Oracle for Video Quality Assessment
Authors:
Shreshth Saini,
Avinab Saha,
Alan C. Bovik
Abstract:
We introduce HIDRO-VQA, a no-reference (NR) video quality assessment model designed to provide precise quality evaluations of High Dynamic Range (HDR) videos. HDR videos exhibit a broader spectrum of luminance, detail, and color than Standard Dynamic Range (SDR) videos. As HDR content becomes increasingly popular, there is a growing demand for video quality assessment (VQA) algorithms that effectively address distortions unique to HDR content. To address this challenge, we propose a self-supervised contrastive fine-tuning approach to transfer quality-aware features from the SDR to the HDR domain, utilizing unlabeled HDR videos. Our findings demonstrate that self-supervised pre-trained neural networks on SDR content can be further fine-tuned in a self-supervised setting using limited unlabeled HDR videos to achieve state-of-the-art performance on the only publicly available VQA database for HDR content, the LIVE-HDR VQA database. Moreover, our algorithm can be extended to the Full Reference VQA setting, also achieving state-of-the-art performance. Our code is available publicly at https://github.com/avinabsaha/HIDRO-VQA.
Submitted 20 December, 2023; v1 submitted 18 November, 2023;
originally announced November 2023.
-
CHIMERA Occultation Constraints on the Abundance of Kilometer-scale Kuiper Belt Objects
Authors:
Qicheng Zhang,
Gregg W. Hallinan,
Navtej S. Saini,
Hilke E. Schlichting,
Leon K. Harding,
Jennifer W. Milburn
Abstract:
Occultations provide indirect sensitivity to the number density of small Kuiper Belt objects (KBOs) too faint to directly detect telescopically. We present results from the Caltech HI-speed Multicolor camERA (CHIMERA) survey with the Palomar Hale Telescope, which monitored stars over the central 5'x5' of the M22 globular cluster along the ecliptic plane for serendipitous occultations by kilometer-scale KBOs over 63 hr across 24 nights at a 33 Hz frame rate simultaneously in i' and g'. We adapted dense-field photometry and occultation template fitting techniques to this dataset, finding a 95% confidence upper limit on the occultation rate corresponding to an ecliptic sky density of <10^7 deg^-2 of >1 km diameter classical KBOs. We discuss a few of the occultation-like light curve signatures at the edge of the sensitivity limit responsible for setting the upper bounds, and their likely nonviability as true occultations.
Submitted 7 November, 2023;
originally announced November 2023.
-
Mirror Symmetry in three-dimensional Multiple-Scattering Media
Authors:
Sudhir K. Saini,
Evangelos Marakis,
Kayleigh Start,
Gerwin Osnabrugge,
Ivo M. Vellekoop,
Pepijn W. H. Pinkse
Abstract:
We investigate the effect of a mirror-symmetry plane in multiple-scattering media under plane-wave illumination along the symmetry plane. Designed and fabricated samples' optical transport properties are compared quantitatively with three-dimensional modeling. Strong polarization-dependent deviations of the bulk speckle-averaged intensity distribution at the symmetry plane are observed, showing either up to a factor two enhancement or complete suppression of the ensemble-averaged intensities. We derive analytical expressions for the ensemble-averaged intensity profiles near the symmetry plane. Apart from their interest in fundamental light propagation studies, applications of mirror-symmetric scattering media are envisioned in anti-counterfeiting.
Submitted 24 September, 2024; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Outage-Watch: Early Prediction of Outages using Extreme Event Regularizer
Authors:
Shubham Agarwal,
Sarthak Chakraborty,
Shaddy Garg,
Sumit Bisht,
Chahat Jain,
Ashritha Gonuguntla,
Shiv Saini
Abstract:
Cloud services are omnipresent and critical cloud service failure is a fact of life. In order to retain customers and prevent revenue loss, it is important to provide high reliability guarantees for these services. One way to do this is by predicting outages in advance, which can help in reducing the severity as well as the time to recovery. It is difficult to forecast critical failures due to the rarity of these events. Moreover, critical failures are ill-defined in terms of observable data. Our proposed method, Outage-Watch, defines critical service outages as deteriorations in the Quality of Service (QoS) captured by a set of metrics. Outage-Watch detects such outages in advance by using the current system state to predict whether the QoS metrics will cross a threshold and initiate an extreme event. A mixture of Gaussians is used to model the distribution of the QoS metrics for flexibility, and an extreme event regularizer helps improve learning in the tail of the distribution. An outage is predicted if the probability of any one of the QoS metrics crossing its threshold changes significantly. Our evaluation on a real-world SaaS company dataset shows that Outage-Watch significantly outperforms traditional methods with an average AUC of 0.98. Additionally, Outage-Watch detects all the outages exhibiting a change in service metrics and reduces the Mean Time To Detection (MTTD) of outages by up to 88% when deployed in an enterprise cloud-service system, demonstrating the efficacy of our proposed method.
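As a rough illustration of the threshold-crossing idea in this abstract, the sketch below computes the probability that a single metric exceeds a threshold when its distribution is a mixture of Gaussians. It is not the Outage-Watch implementation: the function names and all numeric values are hypothetical, and the abstract does not say how the mixture parameters are predicted or how a "significant" change in probability is defined.

```python
import math


def gaussian_tail(threshold, mean, std):
    """Return P(X > threshold) for X ~ N(mean, std^2)."""
    return 0.5 * math.erfc((threshold - mean) / (std * math.sqrt(2.0)))


def mixture_exceedance(threshold, weights, means, stds):
    """Return P(X > threshold) under a Gaussian mixture.

    weights, means and stds are equal-length sequences; weights are
    assumed (not checked) to be non-negative and to sum to 1.
    """
    return sum(
        w * gaussian_tail(threshold, m, s)
        for w, m, s in zip(weights, means, stds)
    )


# Hypothetical two-component mixture for one QoS metric (illustrative values).
p_cross = mixture_exceedance(
    threshold=0.0,
    weights=[0.5, 0.5],
    means=[-1.0, 1.0],
    stds=[1.0, 1.0],
)
```

With these symmetric illustrative components the exceedance probability at threshold 0 is one half; raising the threshold lowers it, which is the quantity a detector of this kind would track over time.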
Submitted 10 November, 2023; v1 submitted 29 September, 2023;
originally announced September 2023.
-
End-to-end numerical modeling of the Roman Space Telescope coronagraph
Authors:
John E. Krist,
John B. Steeves,
Brandon D. Dube,
A. J. Eldorado Riggs,
Brian D. Kern,
David S. Marx,
Eric J. Cady,
Hanying Zhou,
Ilya Y. Poberezhskiy,
Caleb W. Baker,
James P. McGuire,
Bijan Nemati,
Gary M. Kuan,
Bertrand Mennesson,
John T. Trauger,
Navtej S. Saini,
Sergi Hildebrandt Rafels
Abstract:
The Roman Space Telescope will have the first advanced coronagraph in space, with deformable mirrors for wavefront control, low-order wavefront sensing and maintenance, and a photon-counting detector. It is expected to be able to detect and characterize mature, giant exoplanets in reflected visible light. Over the past decade the performance of the coronagraph in its flight environment has been simulated with increasingly detailed diffraction and structural/thermal finite element modeling. With the instrument now being integrated in preparation for launch within the next few years, the present state of the end-to-end modeling is described, including the measured flight components such as deformable mirrors. The coronagraphic modes are thoroughly described, including characteristics most readily derived from modeling. The methods for diffraction propagation, wavefront control, and structural and thermal finite-element modeling are detailed. The techniques and procedures developed for the instrument will serve as a foundation for future coronagraphic missions such as the Habitable Worlds Observatory.
Submitted 27 September, 2023;
originally announced September 2023.
-
ESRO: Experience Assisted Service Reliability against Outages
Authors:
Sarthak Chakraborty,
Shubham Agarwal,
Shaddy Garg,
Abhimanyu Sethia,
Udit Narayan Pandey,
Videh Aggarwal,
Shiv Saini
Abstract:
Modern cloud services are prone to failures due to their complex architecture, making diagnosis a critical process. Site Reliability Engineers (SREs) spend hours leveraging multiple sources of data, including the alerts, error logs, and domain expertise through past experiences to locate the root cause(s). These experiences are documented as natural language text in outage reports for previous outages. However, utilizing the raw yet rich semi-structured information in the reports systematically is time-consuming. Structured information, on the other hand, such as alerts that are often used during fault diagnosis, is voluminous and requires expert knowledge to discern. Several strategies have been proposed to use each source of data separately for root cause analysis. In this work, we build a diagnostic service called ESRO that recommends root causes and remediation for failures by utilizing structured as well as semi-structured sources of data systematically. ESRO constructs a causal graph using alerts and a knowledge graph using outage reports, and merges them in a novel way to form a unified graph during training. A retrieval-based mechanism is then used to search the unified graph and rank the likely root causes and remediation techniques based on the alerts fired during an outage at inference time. Not only the individual alerts, but their respective importance in predicting an outage group is taken into account during recommendation. We evaluated our model on several cloud service outages of a large SaaS enterprise over the course of ~2 years, and obtained an average improvement of 27% in rouge scores after comparing the likely root causes against the ground truth over state-of-the-art baselines. We further establish the effectiveness of ESRO through qualitative analysis on multiple real outage examples.
Submitted 13 September, 2023;
originally announced September 2023.
-
Investigation of charge carrier dynamics in Ti3C2Tx MXene for ultrafast photonics applications
Authors:
Ankita Rawat,
Nitesh K. Chourasia,
Saurabh K. Saini,
Gaurav Rajput,
Aditya Yadav,
Ritesh Kumar Chourasia,
Govind Gupta,
P. K. Kulriya
Abstract:
The rapid advancement of nanomaterials has paved the way for various technological breakthroughs, and MXenes, in particular, have gained substantial attention due to their unique properties such as high conductivity, broad-spectrum absorption strength, and tunable band gap. This article presents the impact of the process parameters on the structural and optical properties of Ti3C2Tx MXene for application in ultrafast dynamics. XRD, along with Raman spectroscopy studies, confirmed the synthesis of a single phase from the MAX phase Ti3AlC2. Complete etching of Al and an increase in the interplanar distance are also observed on centrifugation at very high speed. Ultrafast transient absorption spectroscopy, used to understand the effect of centrifuge speed on the charge carrier dynamics and ultrafast spectrum of MXene, showed that the carrier lifetime is critically influenced by the rotation speed in revolutions per minute (rpm), e.g., a faster decay lifetime at 10k rpm than at 7k rpm. The electronic relaxation probed using the time-resolved photoluminescence (TRPL) technique exhibits an average decay time of 5.13 ns and 5.35 ns at 7k and 10k rpm, respectively, which confirms that the optical properties of the MXene are strongly affected by the centrifuge speed. The longer decay lifetime of the MXene synthesized at 10k rpm suggests more prominent radiative processes and fewer nonradiative losses, resulting in enhanced luminescence properties.
Submitted 16 August, 2023;
originally announced August 2023.
-
CARL-G: Clustering-Accelerated Representation Learning on Graphs
Authors:
William Shiao,
Uday Singh Saini,
Yozen Liu,
Tong Zhao,
Neil Shah,
Evangelos E. Papalexakis
Abstract:
Self-supervised learning on graphs has made large strides in achieving great performance in various downstream tasks. However, many state-of-the-art methods suffer from a number of impediments, which prevent them from realizing their full potential. For instance, contrastive methods typically require negative sampling, which is often computationally costly. While non-contrastive methods avoid this expensive step, most existing methods rely on either overly complex architectures or dataset-specific augmentations. In this paper, we ask: Can we borrow from classical unsupervised machine learning literature in order to overcome those obstacles? We are guided by our key insight that the goal of distance-based clustering closely resembles that of contrastive learning: both attempt to pull representations of similar items together and push dissimilar items apart. As a result, we propose CARL-G, a novel clustering-based framework for graph representation learning that uses a loss inspired by Cluster Validation Indices (CVIs), i.e., internal measures of cluster quality (no ground truth required). CARL-G is adaptable to different clustering methods and CVIs, and we show that with the right choice of clustering method and CVI, CARL-G outperforms node classification baselines on 4/5 datasets with up to a 79x training speedup compared to the best-performing baseline. CARL-G also performs on par with or better than baselines in node clustering and similarity search tasks, training up to 1,500x faster than the best-performing baseline. Finally, we also provide theoretical foundations for the use of CVI-inspired losses in graph representation learning.
Submitted 31 July, 2023; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Towards Optimizing Storage Costs on the Cloud
Authors:
Koyel Mukherjee,
Raunak Shah,
Shiv Kumar Saini,
Karanpreet Singh,
Khushi,
Harsh Kesarwani,
Kavya Barnwal,
Ayush Chauhan
Abstract:
We study the problem of optimizing data storage and access costs on the cloud while ensuring that the desired performance or latency is unaffected. We first propose an optimizer that optimizes the data placement tier (on the cloud) and the choice of compression schemes to apply, for given data partitions with temporal access predictions. Secondly, we propose a model to learn the compression performance of multiple algorithms across data partitions in different formats to generate compression performance predictions on the fly, as inputs to the optimizer. Thirdly, we propose to approach the data partitioning problem fundamentally differently than the current default in most data lakes where partitioning is in the form of ingestion batches. We propose access pattern aware data partitioning and formulate an optimization problem that optimizes the size and reading costs of partitions subject to access patterns.
We study the various optimization problems theoretically as well as empirically, and provide theoretical bounds as well as hardness results. We propose a unified pipeline of cost minimization, called SCOPe that combines the different modules. We extensively compare the performance of our methods with related baselines from the literature on TPC-H data as well as enterprise datasets (ranging from GB to PB in volume) and show that SCOPe substantially improves over the baselines. We show significant cost savings compared to platform baselines, of the order of 50% to 83% on enterprise Data Lake datasets that range from terabytes to petabytes in volume.
Submitted 6 July, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Observational constraints on power law Starobinsky inflation
Authors:
Saisandri Saini,
Akhilesh Nautiyal
Abstract:
In this work we revisit power law, $\frac{1}{M^2}R^β$, inflation to find the deviations from $R^2$ inflation allowed by current CMB and LSS observations. We compute the power spectra for scalar and tensor perturbations numerically and perform MCMC analysis to put constraints on parameters $M$ and $β$ from Planck-2018, BICEP3 and other LSS observations. We consider general reheating scenario and also vary the number of e-foldings during inflation, $N_{pivot}$, along with the other parameters. We find $β= 1.966^{+0.035}_{-0.042}$, $M= \left(3.31^{+5}_{-2}\right)\times 10^{-5}$ and $N_{pivot} = 41^{+10}_{-10}$ with $95\%\, C.\, L.$. This indicates that the current observations allow deviation from Starobinsky inflation. The scalar spectral index, $n_s$, and tensor-to-scalar ratio, $r$, derived from these parameters, are consistent with the Planck and BICEP3 observations.
Submitted 15 November, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
CausIL: Causal Graph for Instance Level Microservice Data
Authors:
Sarthak Chakraborty,
Shaddy Garg,
Shubham Agarwal,
Ayush Chauhan,
Shiv Kumar Saini
Abstract:
AI-based monitoring has become crucial for cloud-based services due to its scale. A common approach to AI-based monitoring is to detect causal relationships among service components and build a causal graph. Availability of domain information makes cloud systems even better suited for such causal detection approaches. In modern cloud systems, however, auto-scalers dynamically change the number of microservice instances, and a load-balancer manages the load on each instance. This poses a challenge for off-the-shelf causal structure detection techniques as they neither incorporate the system architectural domain information nor provide a way to model distributed compute across varying numbers of service instances. To address this, we develop CausIL, which detects a causal structure among service metrics by considering compute distributed across dynamic instances and incorporating domain knowledge derived from system architecture. Towards the application in cloud systems, CausIL estimates a causal graph using instance-specific variations in performance metrics, modeling multiple instances of a service as independent, conditional on system assumptions. Simulation study shows the efficacy of CausIL over baselines by improving graph estimation accuracy by ~25% as measured by Structural Hamming Distance whereas the real-world dataset demonstrates CausIL's applicability in deployment settings.
Submitted 19 March, 2023; v1 submitted 1 March, 2023;
originally announced March 2023.
-
Evolution of ion acoustic solitary waves in pulsar wind
Authors:
Kuldeep Singh,
Amar Kakad,
Bharati Kakad,
N. S. Saini
Abstract:
We have studied the evolution of ion-acoustic solitary waves (IASWs) in pulsar wind. The pulsar wind is modeled by considering a weakly relativistic unmagnetized collisionless plasma comprised of relativistic ions and superthermal electrons and positrons. Through fluid simulations, we have demonstrated that the localized ion density perturbations generated in the polar wind plasma can evolve into relativistic IASW pulses. It is found that the concentration of positrons, the relativistic factor, and the superthermality of electrons and positrons have a significant influence on the dynamical evolution of IASW pulses. Our results may provide insight into the evolution of IASW pulses and their role in astrophysical plasmas, especially in relativistic pulsar winds with supernova outflow, which is responsible for the production of superthermal particles and relativistic ions.
Submitted 18 February, 2023;
originally announced February 2023.
-
Interactive Segmentation of Radiance Fields
Authors:
Rahul Goel,
Dhawal Sirikonda,
Saurabh Saini,
PJ Narayanan
Abstract:
Radiance Fields (RF) are popular to represent casually-captured scenes for new view synthesis and several applications beyond it. Mixed reality on personal spaces needs understanding and manipulating scenes represented as RFs, with semantic segmentation of objects as an important step. Prior segmentation efforts show promise but don't scale to complex objects with diverse appearance. We present the ISRF method to interactively segment objects with fine structure and appearance. Nearest neighbor feature matching using distilled semantic features identifies high-confidence seed regions. Bilateral search in a joint spatio-semantic space grows the region to recover accurate segmentation. We show state-of-the-art results of segmenting objects from RFs and compositing them to another scene, changing appearance, etc., and an interactive segmentation tool that others can use.
Project Page: https://rahul-goel.github.io/isrf/
Submitted 25 March, 2023; v1 submitted 27 December, 2022;
originally announced December 2022.
-
StyleTRF: Stylizing Tensorial Radiance Fields
Authors:
Rahul Goel,
Sirikonda Dhawal,
Saurabh Saini,
P. J. Narayanan
Abstract:
Stylized view generation of scenes captured casually using a camera has received much attention recently. The geometry and appearance of the scene are typically captured as neural point sets or neural radiance fields in the previous work. An image stylization method is used to stylize the captured appearance by training its network jointly or iteratively with the structure capture network. The state-of-the-art SNeRF method trains the NeRF and stylization network in an alternating manner. These methods have high training time and require joint optimization. In this work, we present StyleTRF, a compact, quick-to-optimize strategy for stylized view generation using TensoRF. The appearance part is fine-tuned using sparse stylized priors of a few views rendered using the TensoRF representation for a few iterations. Our method thus effectively decouples style-adaption from view capture and is much faster than the previous methods. We show state-of-the-art results on several scenes used for this purpose.
Submitted 19 December, 2022;
originally announced December 2022.
-
Rarefied gas flow past a liquid droplet: interplay between internal and external flows
Authors:
Rahul Bhattacharjee,
Sonu Saini,
Vinay Kumar Gupta,
Anirudh S. Rana
Abstract:
Experimental and theoretical studies on millimetre-sized droplets suggest that at low Reynolds number the difference between the drag force on a circulating water droplet and that on a rigid sphere is very small (less than 1 %) (LeClair et al., J. Atmos. Sci., vol. 29, 1972, pp. 728-740). While the drag force on a spherical liquid droplet at high viscosity ratios (of the liquid to the gas) is approximately the same as that on a rigid sphere of the same size, the other quantities of interest (e.g. the temperature) in the case of a rarefied gas flow over a liquid droplet differ from the same quantities in the case of a rarefied gas flow over a rigid sphere. The goal of this article is to study the effects of internal motion within a spherical microdroplet/nanodroplet -- such that its diameter is comparable to the mean free path of the surrounding gas -- on the drag force and its overall dynamics. To this end, the problem of a slow rarefied gas flowing over an incompressible liquid droplet is investigated analytically by considering the internal motion of the liquid inside the droplet and also by accounting for kinetic effects in the gas. Detailed results for different values of the Knudsen number, the ratio of the thermal conductivities and the ratio of viscosities are presented for the pressure and temperature profiles inside and outside the liquid droplet. The results for the drag force obtained in the present work are in good agreement with the theoretical and experimental results existing in the literature.
Submitted 26 January, 2024; v1 submitted 19 October, 2022;
originally announced October 2022.