Skip to main content

Showing 1–37 of 37 results for author: Trivedi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07925  [pdf

    cs.CV cs.LG eess.IV

    A Comparative Study of U-Net Architectures for Change Detection in Satellite Images

    Authors: Yaxita Amin, Naimisha S Trivedi, Rashmi Bhattad

    Abstract: Remote sensing change detection is essential for monitoring the everchanging landscapes of the Earth. The U-Net architecture has gained popularity for its capability to capture spatial information and perform pixel-wise classification. However, their application in the Remote sensing field remains largely unexplored. Therefore, this paper fill the gap by conducting a comprehensive analysis of 34 p… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.04462  [pdf, ps, other

    cs.CL cs.CR cs.LG

    Watermarking Degrades Alignment in Language Models: Analysis and Mitigation

    Authors: Apurv Verma, NhatHai Phan, Shubhendu Trivedi

    Abstract: Watermarking techniques for large language models (LLMs) can significantly impact output quality, yet their effects on truthfulness, safety, and helpfulness remain critically underexamined. This paper presents a systematic analysis of how two popular watermarking approaches-Gumbel and KGW-affect these core alignment properties across four aligned LLMs. Our experiments reveal two distinct degradati… ▽ More

    Submitted 10 July, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

    Comments: Published at the 1st Workshop on GenAI Watermarking, collocated with ICLR 2025. OpenReview: https://openreview.net/forum?id=SIBkIV48gF

    ACM Class: I.2.7

    Journal ref: 1st Workshop on GenAI Watermarking, ICLR 2025

  3. arXiv:2506.03037  [pdf, ps, other

    cs.LG stat.ME stat.ML

    On the Need to Align Intent and Implementation in Uncertainty Quantification for Machine Learning

    Authors: Shubhendu Trivedi, Brian D. Nord

    Abstract: Quantifying uncertainties for machine learning (ML) models is a foundational challenge in modern data analysis. This challenge is compounded by at least two key aspects of the field: (a) inconsistent terminology surrounding uncertainty and estimation across disciplines, and (b) the varying technical requirements for establishing trustworthy uncertainties in diverse problem contexts. In this positi… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  4. arXiv:2506.02293  [pdf, ps, other

    cs.LG

    On Universality Classes of Equivariant Networks

    Authors: Marco Pacini, Gabriele Santin, Bruno Lepri, Shubhendu Trivedi

    Abstract: Equivariant neural networks provide a principled framework for incorporating symmetry into learning architectures and have been extensively analyzed through the lens of their separation power, that is, the ability to distinguish inputs modulo symmetry. This notion plays a central role in settings such as graph learning, where it is often formalized via the Weisfeiler-Leman hierarchy. In contrast,… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Preprint. Under review. 22 pages

  5. arXiv:2503.22034  [pdf, other

    cs.NI

    Reliability and Availability in Virtualized Networks: A Survey on Standards, Modeling Approaches, and Research Challenges

    Authors: Mario Di Mauro, Walter Cerroni, Fabio Postiglione, Massimo Tornatore, Kishor S. Trivedi

    Abstract: The rise of Network Function Virtualization (NFV) has transformed network infrastructures by replacing fixed hardware with software-based Virtualized Network Functions (VNFs), enabling greater agility, scalability, and cost efficiency. Virtualization increases the distribution of system components and introduces stronger interdependencies. As a result, failures become harder to predict, monitor, a… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  6. arXiv:2502.14268  [pdf, other

    cs.CL cs.AI

    MCQA-Eval: Efficient Confidence Evaluation in NLG with Gold-Standard Correctness Labels

    Authors: Xiaoou Liu, Zhen Lin, Longchao Da, Chacha Chen, Shubhendu Trivedi, Hua Wei

    Abstract: Large Language Models (LLMs) require robust confidence estimation, particularly in critical domains like healthcare and law where unreliable outputs can lead to significant consequences. Despite much recent work in confidence estimation, current evaluation frameworks rely on correctness functions -- various heuristics that are often noisy, expensive, and possibly introduce systematic biases. These… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  7. arXiv:2410.10042  [pdf, other

    cs.CL

    LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering

    Authors: Saikrishna Sanniboina, Shiv Trivedi, Sreenidhi Vijayaraghavan

    Abstract: Retrieval-based question answering systems often suffer from positional bias, leading to suboptimal answer generation. We propose LoRE (Logit-Ranked Retriever Ensemble), a novel approach that improves answer accuracy and relevance by mitigating positional bias. LoRE employs an ensemble of diverse retrievers, such as BM25 and sentence transformers with FAISS indexing. A key innovation is a logit-ba… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

  8. arXiv:2409.11772  [pdf, other

    stat.ML cs.LG

    Symmetry-Based Structured Matrices for Efficient Approximately Equivariant Networks

    Authors: Ashwin Samudre, Mircea Petrache, Brian D. Nord, Shubhendu Trivedi

    Abstract: There has been much recent interest in designing neural networks (NNs) with relaxed equivariance, which interpolate between exact equivariance and full flexibility for consistent performance gains. In a separate line of work, structured parameter matrices with low displacement rank (LDR) -- which permit fast function and gradient evaluation -- have been used to create compact NNs, though primarily… ▽ More

    Submitted 18 April, 2025; v1 submitted 18 September, 2024; originally announced September 2024.

    Comments: 19 pages. Updated to published version of the paper in the proceedings of the 28th International Conference on Artificial Intelligence and Statistics (AISTATS) 2025

  9. arXiv:2408.13242  [pdf, other

    cs.LG

    Improving Equivariant Model Training via Constraint Relaxation

    Authors: Stefanos Pertigkiozoglou, Evangelos Chatzipantazis, Shubhendu Trivedi, Kostas Daniilidis

    Abstract: Equivariant neural networks have been widely used in a variety of applications due to their ability to generalize well in tasks where the underlying data symmetries are known. Despite their successes, such networks can be difficult to optimize and require careful hyperparameter tuning to train successfully. In this work, we propose a novel framework for improving the optimization of such models by… ▽ More

    Submitted 2 January, 2025; v1 submitted 23 August, 2024; originally announced August 2024.

  10. arXiv:2406.01806  [pdf, other

    cs.CL cs.AI

    Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: The advent of large language models (LLMs) has dramatically advanced the state-of-the-art in numerous natural language generation tasks. For LLMs to be applied reliably, it is essential to have an accurate measure of their confidence. Currently, the most commonly used confidence score function is the likelihood of the generated sequence, which, however, conflates semantic and syntactic components.… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  11. arXiv:2405.19328  [pdf, other

    cs.MA

    Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation

    Authors: Atrisha Sarkar, Andrei Ioan Muresanu, Carter Blair, Aaryam Sharma, Rakshit S Trivedi, Gillian K Hadfield

    Abstract: Generative agents, which implement behaviors using a large language model (LLM) to interpret and evaluate an environment, has demonstrated the capacity to solve complex tasks across many social and technological domains. However, when these agents interact with other agents and humans in presence of social structures such as existing norms, fostering cooperation between them is a fundamental chall… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  12. arXiv:2402.01629  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Position Paper: Generalized grammar rules and structure-based generalization beyond classical equivariance for lexical tasks and transduction

    Authors: Mircea Petrache, Shubhendu Trivedi

    Abstract: Compositional generalization is one of the main properties which differentiates lexical learning in humans from state-of-art neural networks. We propose a general framework for building models that can generalize compositionally using the concept of Generalized Grammar Rules (GGRs), a class of symmetry-based compositional constraints for transduction tasks, which we view as a transduction analogue… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 12 pages

  13. arXiv:2308.12784  [pdf

    cs.SE

    Understanding Container-based Services under Software Aging: Dependability and Performance Views

    Authors: Jing Bai, Xiaolin Chang, Fumio Machida, Kishor S. Trivedi

    Abstract: Container technology, as the key enabler behind microservice architectures, is widely applied in Cloud and Edge Computing. A long and continuous running of operating system (OS) host-ing container-based services can encounter software aging that leads to performance deterioration and even causes system fail-ures. OS rejuvenation techniques can mitigate the impact of software aging but the rejuvena… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  14. arXiv:2308.12748  [pdf

    cs.SE

    Towards Semi-Markov Model-based Dependability Evaluation of VM-based Multi-Domain Service Function Chain

    Authors: Lina Liu, Jing Bai, Xiaolin Chang, Fumio Machida, Kishor S. Trivedi, Haoran Zhu

    Abstract: In NFV networks, service functions (SFs) can be deployed on virtual machines (VMs) across multiple domains and then form a service function chain (MSFC) for end-to-end network service provision. However, any software component in a VM-based MSFC must experience software aging issue after a long period of operation. This paper quantitatively investigates the capability of proactive rejuvenation tec… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  15. arXiv:2305.19187  [pdf, other

    cs.CL cs.LG stat.ML

    Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: Large language models (LLMs) specializing in natural language generation (NLG) have recently started exhibiting promising capabilities across a variety of domains. However, gauging the trustworthiness of responses generated by LLMs remains an open challenge, with limited research on uncertainty quantification (UQ) for NLG. Furthermore, existing literature typically assumes white-box access to lang… ▽ More

    Submitted 19 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Published in Transactions on Machine Learning Research (05/2024)

  16. arXiv:2305.17592  [pdf, other

    cs.LG stat.ML

    Approximation-Generalization Trade-offs under (Approximate) Group Equivariance

    Authors: Mircea Petrache, Shubhendu Trivedi

    Abstract: The explicit incorporation of task-specific inductive biases through symmetry has emerged as a general design precept in the development of high-performance machine learning models. For example, group equivariant neural networks have demonstrated impressive performance across various domains and applications such as protein and drug design. A prevalent intuition about such models is that the integ… ▽ More

    Submitted 18 April, 2025; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 23 Pages. Updated to the published version. Advances in Neural Information Processing Systems 36, 61936-61959

  17. arXiv:2302.00839  [pdf, other

    cs.LG stat.ME stat.ML

    Fast Online Value-Maximizing Prediction Sets with Conformal Cost Control

    Authors: Zhen Lin, Shubhendu Trivedi, Cao Xiao, Jimeng Sun

    Abstract: Many real-world multi-label prediction problems involve set-valued predictions that must satisfy specific requirements dictated by downstream usage. We focus on a typical scenario where such requirements, separately encoding $\textit{value}$ and $\textit{cost}$, compete with each other. For instance, a hospital might expect a smart diagnosis system to capture as many severe, often co-morbid, disea… ▽ More

    Submitted 25 April, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023. 11 pages (main paper, including references) + 10 pages (supplementary material)

  18. arXiv:2207.03529  [pdf

    cs.LG eess.SP

    A Novel IoT-based Framework for Non-Invasive Human Hygiene Monitoring using Machine Learning Techniques

    Authors: Md Jobair Hossain Faruk, Shashank Trivedi, Mohammad Masum, Maria Valero, Hossain Shahriar, Sheikh Iqbal Ahamed

    Abstract: People's personal hygiene habits speak volumes about the condition of taking care of their bodies and health in daily lifestyle. Maintaining good hygiene practices not only reduces the chances of contracting a disease but could also reduce the risk of spreading illness within the community. Given the current pandemic, daily habits such as washing hands or taking regular showers have taken primary… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Journal ref: ICHI 2022 : 10th IEEE International Conference on Healthcare Informatics

  19. arXiv:2205.12940  [pdf, other

    stat.ML cs.LG stat.ME

    Conformal Prediction Intervals with Temporal Dependence

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: Cross-sectional prediction is common in many domains such as healthcare, including forecasting tasks using electronic health records, where different patients form a cross-section. We focus on the task of constructing valid prediction intervals (PIs) in time series regression with a cross-section. A prediction interval is considered valid if it covers the true response with (a pre-specified) high… ▽ More

    Submitted 2 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: 16 pages (main paper, including references) + 6 pages (supplementary material). Transactions of Machine Learning Research (September 2022). Code is available at https://github.com/zlin7/CPTD

  20. arXiv:2205.09940  [pdf, other

    stat.ML cs.LG stat.ME

    Conformal Prediction with Temporal Quantile Adjustments

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: We develop Temporal Quantile Adjustment (TQA), a general method to construct efficient and valid prediction intervals (PIs) for regression on cross-sectional time series data. Such data is common in many domains, including econometrics and healthcare. A canonical example in healthcare is predicting patient outcomes using physiological time-series data, where a population of patients composes a cro… ▽ More

    Submitted 23 May, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 12 pages (main paper, including references) + 11 pages (supplementary material)

  21. arXiv:2202.07679  [pdf, other

    stat.ML cs.LG stat.ME

    Taking a Step Back with KCal: Multi-Class Kernel-Based Calibration for Deep Neural Networks

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: Deep neural network (DNN) classifiers are often overconfident, producing miscalibrated class probabilities. In high-risk applications like healthcare, practitioners require $\textit{fully calibrated}$ probability predictions for decision-making. That is, conditioned on the prediction $\textit{vector}$, $\textit{every}$ class' probability should be close to the predicted value. Most existing calibr… ▽ More

    Submitted 8 December, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  22. arXiv:2110.07472  [pdf, other

    cs.LG cs.CV stat.ML

    Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?

    Authors: Matthew Farrell, Blake Bordelon, Shubhendu Trivedi, Cengiz Pehlevan

    Abstract: Equivariance has emerged as a desirable property of representations of objects subject to identity-preserving transformations that constitute a group, such as translations and rotations. However, the expressivity of a representation constrained by group equivariance is still not fully understood. We address this gap by providing a generalization of Cover's Function Counting Theorem that quantifies… ▽ More

    Submitted 5 February, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Version accepted to ICLR 2022

  23. arXiv:2106.00225  [pdf, other

    cs.LG stat.ME stat.ML

    Locally Valid and Discriminative Prediction Intervals for Deep Learning Models

    Authors: Zhen Lin, Shubhendu Trivedi, Jimeng Sun

    Abstract: Crucial for building trust in deep learning models for critical real-world applications is efficient and theoretically sound uncertainty quantification, a task that continues to be challenging. Useful uncertainty information is expected to have two key properties: It should be valid (guaranteeing coverage) and discriminative (more uncertain when the expected risk is high). Moreover, when combined… ▽ More

    Submitted 26 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Advances in Neural Information Processing Systems 34 (NeurIPS 2021). Code is available at https://github.com/zlin7/LVD

  24. arXiv:2102.13123  [pdf, other

    astro-ph.CO cs.CV cs.LG

    DeepSZ: Identification of Sunyaev-Zel'dovich Galaxy Clusters using Deep Learning

    Authors: Zhen Lin, Nicholas Huang, Camille Avestruz, W. L. Kimmy Wu, Shubhendu Trivedi, João Caldeira, Brian Nord

    Abstract: Galaxy clusters identified from the Sunyaev Zel'dovich (SZ) effect are a key ingredient in multi-wavelength cluster-based cosmology. We present a comparison between two methods of cluster identification: the standard Matched Filter (MF) method in SZ cluster finding and a method using Convolutional Neural Networks (CNN). We further implement and show results for a `combined' identifier. We apply th… ▽ More

    Submitted 8 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Report number: FERMILAB-PUB-21-077-SCD

  25. arXiv:2012.04474  [pdf, other

    cs.CV cs.LG

    Rotation-Invariant Autoencoders for Signals on Spheres

    Authors: Suhas Lohit, Shubhendu Trivedi

    Abstract: Omnidirectional images and spherical representations of $3D$ shapes cannot be processed with conventional 2D convolutional neural networks (CNNs) as the unwrapping leads to large distortion. Using fast implementations of spherical and $SO(3)$ convolutions, researchers have recently developed deep learning methods better suited for classifying spherical images. These newly proposed convolutional la… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  26. arXiv:2006.03550  [pdf, ps, other

    stat.ML cs.LG

    The Expected Jacobian Outerproduct: Theory and Empirics

    Authors: Shubhendu Trivedi, J. Wang

    Abstract: The expected gradient outerproduct (EGOP) of an unknown regression function is an operator that arises in the theory of multi-index regression, and is known to recover those directions that are most relevant to predicting the output. However, work on the EGOP, including that on its cheap estimators, is restricted to the regression setting. In this work, we adapt this operator to the multi-class se… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Technical Report

  27. arXiv:1911.05796  [pdf, ps, other

    astro-ph.IM cs.AI physics.soc-ph

    Response to NITRD, NCO, NSF Request for Information on "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan"

    Authors: J. Amundson, J. Annis, C. Avestruz, D. Bowring, J. Caldeira, G. Cerati, C. Chang, S. Dodelson, D. Elvira, A. Farahi, K. Genser, L. Gray, O. Gutsche, P. Harris, J. Kinney, J. B. Kowalkowski, R. Kutschke, S. Mrenna, B. Nord, A. Para, K. Pedro, G. N. Perdue, A. Scheinker, P. Spentzouris, J. St. John , et al. (5 additional authors not shown)

    Abstract: We present a response to the 2018 Request for Information (RFI) from the NITRD, NCO, NSF regarding the "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan." Through this document, we provide a response to the question of whether and how the National Artificial Intelligence Research and Development Strategic Plan (NAIRDSP) should be updated from the perspect… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

    Report number: FERMILAB-FN-1092-SCD

  28. arXiv:1910.05132  [pdf, other

    math.NA cs.LG stat.ML

    Asymmetric Multiresolution Matrix Factorization

    Authors: Pramod Kaushik Mudrakarta, Shubhendu Trivedi, Risi Kondor

    Abstract: Multiresolution Matrix Factorization (MMF) was recently introduced as an alternative to the dominant low-rank paradigm in order to capture structure in matrices at multiple different scales. Using ideas from multiresolution analysis (MRA), MMF teased out hierarchical structure in symmetric matrices by constructing a sequence of wavelet bases. While effective for such matrices, there is plenty of d… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: preliminary work

  29. arXiv:1909.04648  [pdf, other

    cond-mat.soft cond-mat.dis-nn cond-mat.mtrl-sci cs.LG stat.ML

    Deep Learning for Automated Classification and Characterization of Amorphous Materials

    Authors: Kirk Swanson, Shubhendu Trivedi, Joshua Lequieu, Kyle Swanson, Risi Kondor

    Abstract: It is difficult to quantify structure-property relationships and to identify structural features of complex materials. The characterization of amorphous materials is especially challenging because their lack of long-range order makes it difficult to define structural metrics. In this work, we apply deep learning algorithms to accurately classify amorphous materials and characterize their structura… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

  30. arXiv:1810.01483  [pdf, other

    astro-ph.CO cs.CV stat.ML

    DeepCMB: Lensing Reconstruction of the Cosmic Microwave Background with Deep Neural Networks

    Authors: João Caldeira, W. L. Kimmy Wu, Brian Nord, Camille Avestruz, Shubhendu Trivedi, Kyle T. Story

    Abstract: Next-generation cosmic microwave background (CMB) experiments will have lower noise and therefore increased sensitivity, enabling improved constraints on fundamental physics parameters such as the sum of neutrino masses and the tensor-to-scalar ratio r. Achieving competitive constraints on these parameters requires high signal-to-noise extraction of the projected gravitational potential from the C… ▽ More

    Submitted 12 June, 2020; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: 19 pages; LaTeX; 12 figures; changes to match published version

    Report number: FERMILAB-PUB-18-515-A-CD

    Journal ref: Astronomy and Computing 28 100307 (2019)

  31. arXiv:1808.10078  [pdf, other

    stat.ML cs.LG

    Discriminative Learning of Similarity and Group Equivariant Representations

    Authors: Shubhendu Trivedi

    Abstract: One of the most fundamental problems in machine learning is to compare examples: Given a pair of objects we want to return a value which indicates degree of (dis)similarity. Similarity is often task specific, and pre-defined distances can perform poorly, leading to work in metric learning. However, being able to learn a similarity-sensitive distance function also presupposes access to a rich, disc… ▽ More

    Submitted 30 August, 2022; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: PhD thesis, September 2018 [Previous version had a compile error that was spotted recently, which is fixed. The uploaded version is the final thesis that was submitted in September 2018]

  32. arXiv:1806.09231  [pdf, other

    stat.ML cs.LG

    Clebsch-Gordan Nets: a Fully Fourier Space Spherical Convolutional Neural Network

    Authors: Risi Kondor, Zhen Lin, Shubhendu Trivedi

    Abstract: Recent work by Cohen \emph{et al.} has achieved state-of-the-art results for learning spherical images in a rotation invariant way by using ideas from group representation theory and noncommutative harmonic analysis. In this paper we propose a generalization of this work that generally exhibits improved performace, but from an implementation point of view is actually simpler. An unusual feature of… ▽ More

    Submitted 10 November, 2018; v1 submitted 24 June, 2018; originally announced June 2018.

    Comments: Camera ready version for the proceedings of the thirty-second conference on Neural Information Processing Systems (NIPS), Montreal, Canada, 2018

  33. arXiv:1802.03690  [pdf, ps, other

    stat.ML cs.LG

    On the Generalization of Equivariance and Convolution in Neural Networks to the Action of Compact Groups

    Authors: Risi Kondor, Shubhendu Trivedi

    Abstract: Convolutional neural networks have been extremely successful in the image recognition domain because they ensure equivariance to translations. There have been many recent attempts to generalize this framework to other domains, including graphs and data lying on manifolds. In this paper we give a rigorous, theoretical treatment of convolution and equivariance in neural networks with respect to not… ▽ More

    Submitted 10 November, 2018; v1 submitted 10 February, 2018; originally announced February 2018.

    Comments: Final version that appeared in the proceedings of the 35th International Conference on Machine Learning (ICML 2018), Stockholm, Sweden

  34. arXiv:1801.02144  [pdf, other

    cs.LG

    Covariant Compositional Networks For Learning Graphs

    Authors: Risi Kondor, Hy Truong Son, Horace Pan, Brandon Anderson, Shubhendu Trivedi

    Abstract: Most existing neural networks for learning graphs address permutation invariance by conceiving of the network as a message passing scheme, where each node sums the feature vectors coming from its neighbors. We argue that this imposes a limitation on their representation power, and instead propose a new general architecture for representing objects consisting of a hierarchy of parts, which we call… ▽ More

    Submitted 7 January, 2018; originally announced January 2018.

  35. arXiv:1509.06163  [pdf

    cs.LG

    The Utility of Clustering in Prediction Tasks

    Authors: Shubhendu Trivedi, Zachary A. Pardos, Neil T. Heffernan

    Abstract: We explore the utility of clustering in reducing error in various prediction tasks. Previous work has hinted at the improvement in prediction accuracy attributed to clustering algorithms if used to pre-process the data. In this work we more deeply investigate the direct utility of using clustering to improve prediction accuracy and provide explanations for why this may be so. We look at a number o… ▽ More

    Submitted 21 September, 2015; originally announced September 2015.

    Comments: An experimental research report, dated 11 September 2011

  36. NDTAODV: Neighbor Defense Technique for Ad Hoc On-Demand Distance Vector(AODV) to mitigate flood attack in MANETS

    Authors: Akshai Aggarwal, Savita Gandhi, Nirbhay Chaubey, Naren Tada, Srushti Trivedi

    Abstract: Mobile Ad Hoc Networks (MANETs) are collections of mobile nodes that can communicate with one another using multihop wireless links. MANETs are often deployed in the environments, where there is no fixed infrastructure and centralized management. The nodes of mobile ad hoc networks are susceptible to compromise. In such a scenario, designing an efficient, reliable and secure routing protocol has b… ▽ More

    Submitted 10 February, 2014; originally announced May 2014.

    Comments: 14 Pages, 13 Figure. arXiv admin note: text overlap with arXiv:1202.4628 by other authors

  37. arXiv:1209.6540  [pdf, other

    math.CO cs.DS

    A Practical Regularity Partitioning Algorithm and its Applications in Clustering

    Authors: Gábor N. Sárközy, Fei Song, Endre Szemerédi, Shubhendu Trivedi

    Abstract: In this paper we introduce a new clustering technique called Regularity Clustering. This new technique is based on the practical variants of the two constructive versions of the Regularity Lemma, a very useful tool in graph theory. The lemma claims that every graph can be partitioned into pseudo-random graphs. While the Regularity Lemma has become very important in proving theoretical results, it… ▽ More

    Submitted 28 September, 2012; originally announced September 2012.