
Showing 1–30 of 30 results for author: Desai, D

  1. arXiv:2506.12103  [pdf, other]

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue, et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents…

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  2. arXiv:2505.02007  [pdf, other]

    cs.CV

    Efficient Noise Calculation in Deep Learning-based MRI Reconstructions

    Authors: Onat Dalmaz, Arjun D. Desai, Reinhard Heckel, Tolga Çukur, Akshay S. Chaudhari, Brian A. Hargreaves

    Abstract: Accelerated MRI reconstruction involves solving an ill-posed inverse problem where noise in acquired data propagates to the reconstructed images. Noise analyses are central to MRI reconstruction for providing an explicit measure of solution fidelity and for guiding the design and deployment of novel reconstruction methods. However, deep learning (DL)-based reconstruction methods have often overloo…

    Submitted 4 May, 2025; originally announced May 2025.

    Comments: Accepted ICML 2025. Supplementary material included

    MSC Class: 65C60; 94A08; 68T07 ACM Class: I.4.5; I.2.10; G.1.2

  3. arXiv:2504.16075  [pdf, other]

    stat.ML cs.LG

    Explainable Unsupervised Anomaly Detection with Random Forest

    Authors: Joshua S. Harvey, Joshua Rosaler, Mingshu Li, Dhruv Desai, Dhagash Mehta

    Abstract: We describe the use of an unsupervised Random Forest for similarity learning and improved unsupervised anomaly detection. By training a Random Forest to discriminate between real data and synthetic data sampled from a uniform distribution over the real data bounds, a distance measure is obtained that anisometrically transforms the data, expanding distances at the boundary of the data manifold. We…

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 14 pages, 5 figures

  4. arXiv:2502.18359  [pdf]

    cs.CY

    Responsible AI Agents

    Authors: Deven R. Desai, Mark O. Riedl

    Abstract: Thanks to advances in large language models, a new type of software agent, the artificial intelligence (AI) agent, has entered the marketplace. Companies such as OpenAI, Google, Microsoft, and Salesforce promise their AI Agents will go from generating passive text to executing tasks. Instead of a travel itinerary, an AI Agent would book all aspects of your trip. Instead of generating text or image…

    Submitted 25 February, 2025; originally announced February 2025.

  5. arXiv:2502.08849  [pdf, other]

    cs.NI

    Geofeed Adoption and Authentication

    Authors: Dipsy Desai, Kicho Yu, Sulyab Thottungal Valapu

    Abstract: IP Geofeed is a recently proposed informational standard that allows network operators to publish the geographical location of deployed IPv4 and IPv6 prefixes. In this work we study the adoption of IP geofeed, assess deployment of geofeed at Regional Internet Registry and Autonomous System levels, and analyze adherence to RFC 8805 and RFC 9092 in deployed geofeeds. We evaluate the authentication m…

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: To be published in IEEE/IFIP NOMS 2025

  6. arXiv:2408.10340  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.AP

    Can an unsupervised clustering algorithm reproduce a categorization system?

    Authors: Nathalia Castellanos, Dhruv Desai, Sebastian Frank, Stefano Pasquali, Dhagash Mehta

    Abstract: Peer analysis is a critical component of investment management, often relying on expert-provided categorization systems. These systems' consistency is questioned when they do not align with cohorts from unsupervised clustering algorithms optimized for various metrics. We investigate whether unsupervised clustering can reproduce ground truth classes in a labeled dataset, showing that success depend…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 tables, 28 figures

  7. arXiv:2408.06679  [pdf, other]

    cs.LG q-fin.ST stat.ML

    Case-based Explainability for Random Forest: Prototypes, Critics, Counter-factuals and Semi-factuals

    Authors: Gregory Yampolsky, Dhruv Desai, Mingshu Li, Stefano Pasquali, Dhagash Mehta

    Abstract: The explainability of black-box machine learning algorithms, commonly known as Explainable Artificial Intelligence (XAI), has become crucial for financial and other regulated industrial applications due to regulatory requirements and the need for transparency in business practices. Among the various paradigms of XAI, Explainable Case-Based Reasoning (XCBR) stands out as a pragmatic approach that e…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 8 pages, 2 figures, 5 tables

  8. arXiv:2408.02684  [pdf]

    cs.LG stat.ML

    Open Set Recognition for Random Forest

    Authors: Guanchao Feng, Dhruv Desai, Stefano Pasquali, Dhagash Mehta

    Abstract: In many real-world classification or recognition tasks, it is often difficult to collect training examples that exhaust all possible classes due to, for example, incomplete knowledge during training or ever changing regimes. Therefore, samples from unknown/novel classes may be encountered in testing/deployment. In such scenarios, the classifiers should be able to i) perform classification on known…

    Submitted 1 August, 2024; originally announced August 2024.

  9. arXiv:2408.02355  [pdf, other]

    stat.ML cs.LG q-fin.ST q-fin.TR

    Quantile Regression using Random Forest Proximities

    Authors: Mingshu Li, Bhaskarjit Sarmah, Dhruv Desai, Joshua Rosaler, Snigdha Bhagat, Philip Sommer, Dhagash Mehta

    Abstract: Due to the dynamic nature of financial markets, maintaining models that produce precise predictions over time is difficult. Often the goal isn't just point prediction but determining uncertainty. Quantifying uncertainty, especially the aleatoric uncertainty due to the unpredictable nature of market drivers, helps investors understand varying risk levels. Recently, quantile regression forests (QRF)…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 9 pages, 5 figures, 3 tables

  10. arXiv:2404.19134  [pdf, other]

    cs.CV

    Evaluating Deep Clustering Algorithms on Non-Categorical 3D CAD Models

    Authors: Siyuan Xiang, Chin Tseng, Congcong Wen, Deshana Desai, Yifeng Kou, Binil Starly, Daniele Panozzo, Chen Feng

    Abstract: We introduce the first work on benchmarking and evaluating deep clustering algorithms on large-scale non-categorical 3D CAD models. We first propose a workflow to allow expert mechanical engineers to efficiently annotate 252,648 carefully sampled pairwise CAD model similarities, from a subset of the ABC dataset with 22,968 shapes. Using seven baseline deep clustering methods, we then investigate t…

    Submitted 29 April, 2024; originally announced April 2024.

  11. arXiv:2403.14653  [pdf]

    cs.CY

    Between Copyright and Computer Science: The Law and Ethics of Generative AI

    Authors: Deven R. Desai, Mark Riedl

    Abstract: Copyright and computer science continue to intersect and clash, but they can coexist. The advent of new technologies such as digitization of visual and aural creations, sharing technologies, search engines, social media offerings, and more challenge copyright-based industries and reopen questions about the reach of copyright law. Breakthroughs in artificial intelligence research, especially Large…

    Submitted 5 September, 2024; v1 submitted 24 February, 2024; originally announced March 2024.

    Comments: Northwestern Journal of Technology and Intellectual Property, Vol. 22

  12. arXiv:2310.12428  [pdf, other]

    stat.ML cs.AI cs.LG q-fin.ST stat.ME

    Enhanced Local Explainability and Trust Scores with Random Forest Proximities

    Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

    Abstract: We initiate a novel approach to explain the predictions and out of sample performance of random forest (RF) regression and classification models by exploiting the fact that any RF can be mathematically formulated as an adaptive weighted K nearest-neighbors model. Specifically, we employ a recent result that, for both regression and classification tasks, any RF prediction can be rewritten exactly a…

    Submitted 5 August, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 5 pages, 6 figures

  13. arXiv:2308.06882  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Quantifying Outlierness of Funds from their Categories using Supervised Similarity

    Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

    Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. H…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 tables, 8 figures

  14. arXiv:2210.07936  [pdf, other]

    eess.IV cs.CV

    Data-Limited Tissue Segmentation using Inpainting-Based Self-Supervised Learning

    Authors: Jeffrey Dominic, Nandita Bhaskhar, Arjun D. Desai, Andrew Schmidt, Elka Rubin, Beliz Gunel, Garry E. Gold, Brian A. Hargreaves, Leon Lenchik, Robert Boutin, Akshay S. Chaudhari

    Abstract: Although supervised learning has enabled high performance for image segmentation, it requires a large amount of labeled training data, which can be difficult to obtain in the medical imaging field. Self-supervised learning (SSL) methods involving pretext tasks have shown promise in overcoming this requirement by first pretraining models using unlabeled data. In this work, we evaluate the efficacy…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Submitted to Radiology: Artificial Intelligence

  15. arXiv:2207.08393  [pdf, other]

    eess.IV cs.CV

    GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction

    Authors: Batu Ozturkler, Arda Sahiner, Tolga Ergen, Arjun D Desai, Christopher M Sandino, Shreyas Vasanawala, John M Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction. These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization. However, they require several iterations of a large neural network to handle high-dimensional imaging tasks such as 3D MRI. This limits traditional training…

    Submitted 18 July, 2022; originally announced July 2022.

  16. arXiv:2204.10436  [pdf, other]

    eess.IV cs.CV cs.LG

    Scale-Equivariant Unrolled Neural Networks for Data-Efficient Accelerated MRI Reconstruction

    Authors: Beliz Gunel, Arda Sahiner, Arjun D. Desai, Akshay S. Chaudhari, Shreyas Vasanawala, Mert Pilanci, John Pauly

    Abstract: Unrolled neural networks have enabled state-of-the-art reconstruction performance and fast inference times for the accelerated magnetic resonance imaging (MRI) reconstruction task. However, these approaches depend on fully-sampled scans as ground truth data which is either costly or not possible to acquire in many clinical medical imaging applications; hence, reducing dependence on data is desirab…

    Submitted 21 April, 2022; originally announced April 2022.

  17. arXiv:2203.06823  [pdf, other]

    eess.IV cs.CV

    SKM-TEA: A Dataset for Accelerated MRI Reconstruction with Dense Image Labels for Quantitative Clinical Evaluation

    Authors: Arjun D Desai, Andrew M Schmidt, Elka B Rubin, Christopher M Sandino, Marianne S Black, Valentina Mazzoli, Kathryn J Stevens, Robert Boutin, Christopher Ré, Garry E Gold, Brian A Hargreaves, Akshay S Chaudhari

    Abstract: Magnetic resonance imaging (MRI) is a cornerstone of modern medical imaging. However, long image acquisition times, the need for qualitative expert analysis, and the lack of (and difficulty extracting) quantitative indicators that are sensitive to tissue health have curtailed widespread clinical and research studies. While recent machine learning methods for MRI reconstruction and analysis have sh…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Accepted to NeurIPS Datasets & Benchmarks (2021)

  18. Don't let Ricci v. DeStefano Hold You Back: A Bias-Aware Legal Solution to the Hiring Paradox

    Authors: Jad Salem, Deven R. Desai, Swati Gupta

    Abstract: Companies that try to address inequality in employment face a hiring paradox. Failing to address workforce imbalance can result in legal sanctions and scrutiny, but proactive measures to address these issues might result in the same legal conflict. Recent run-ins of Microsoft and Wells Fargo with the Labor Department's Office of Federal Contract Compliance Programs (OFCCP) are not isolated and are…

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 29 pages, 9 figures

    ACM Class: K.4.3; J.0

  19. arXiv:2110.00075  [pdf, other]

    eess.IV cs.CV

    Noise2Recon: Enabling Joint MRI Reconstruction and Denoising with Semi-Supervised and Self-Supervised Learning

    Authors: Arjun D Desai, Batu M Ozturkler, Christopher M Sandino, Robert Boutin, Marc Willis, Shreyas Vasanawala, Brian A Hargreaves, Christopher M Ré, John M Pauly, Akshay S Chaudhari

    Abstract: Deep learning (DL) has shown promise for faster, high quality accelerated MRI reconstruction. However, supervised DL methods depend on extensive amounts of fully-sampled (labeled) data and are sensitive to out-of-distribution (OOD) shifts, particularly low signal-to-noise ratio (SNR) acquisitions. To alleviate this challenge, we propose Noise2Recon, a model-agnostic, consistency training method fo…

    Submitted 7 October, 2022; v1 submitted 30 September, 2021; originally announced October 2021.

  20. arXiv:2106.12987  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Fund2Vec: Mutual Funds Similarity using Graph Learning

    Authors: Vipul Satone, Dhruv Desai, Dhagash Mehta

    Abstract: Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) am…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 2 column format, 8 pages, 8 figures, 5 tables

  21. arXiv:2010.01421  [pdf, other]

    cs.CV

    Early Bird: Loop Closures from Opposing Viewpoints for Perceptually-Aliased Indoor Environments

    Authors: Satyajit Tourani, Dhagash Desai, Udit Singh Parihar, Sourav Garg, Ravi Kiran Sarvadevabhatla, Michael Milford, K. Madhava Krishna

    Abstract: Significant advances have been made recently in Visual Place Recognition (VPR), feature correspondence, and localization due to the proliferation of deep-learning-based methods. However, existing approaches tend to address, partially or fully, only one of two key challenges: viewpoint change and perceptual aliasing. In this paper, we present novel research that simultaneously addresses both challe…

    Submitted 20 December, 2020; v1 submitted 3 October, 2020; originally announced October 2020.

    Comments: Accepted to VISAPP 2021. Video Link: https://youtu.be/q6cKYW0kX4s

  22. ACORNS: An Easy-To-Use Code Generator for Gradients and Hessians

    Authors: Deshana Desai, Etai Shuchatowitz, Zhongshi Jiang, Teseo Schneider, Daniele Panozzo

    Abstract: The computation of first and second-order derivatives is a staple in many computing applications, ranging from machine learning to scientific computing. We propose an algorithm to automatically differentiate algorithms written in a subset of C99 code and its efficient implementation as a Python script. We demonstrate that our algorithm enables automatic, reliable, and efficient differentiation of…

    Submitted 9 July, 2020; originally announced July 2020.

    Journal ref: SoftwareX, Volume 17, 2022

  23. arXiv:2006.00123  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.ML

    Machine Learning Fund Categorizations

    Authors: Dhagash Mehta, Dhruv Desai, Jithin Pradeep

    Abstract: Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many appli…

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 8 pages, 2-column format, 5 figures

  24. arXiv:2004.14003  [pdf, other]

    eess.IV cs.CV

    The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset

    Authors: Arjun D. Desai, Francesco Caliva, Claudia Iriondo, Naji Khosravan, Aliasghar Mortazi, Sachin Jambawalikar, Drew Torigian, Jutta Ellermann, Mehmet Akcakaya, Ulas Bagci, Radhika Tibrewala, Io Flament, Matthew O`Brien, Sharmila Majumdar, Mathias Perslev, Akshay Pai, Christian Igel, Erik B. Dam, Sibaji Gaj, Mingrui Yang, Kunio Nakamura, Xiaojuan Li, Cem M. Deniz, Vladimir Juras, Ravinder Regatte, et al. (4 additional authors not shown)

    Abstract: Purpose: To organize a knee MRI segmentation challenge for characterizing the semantic and clinical efficacy of automatic segmentation methods relevant for monitoring osteoarthritis progression. Methods: A dataset partition consisting of 3D knee MRI from 88 subjects at two timepoints with ground-truth articular (femoral, tibial, patellar) cartilage and meniscus segmentations was standardized. Ch…

    Submitted 26 May, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Submitted to Radiology: Artificial Intelligence; Fixed typos

  25. arXiv:1902.01977  [pdf, other]

    eess.IV cs.CV

    Technical Considerations for Semantic Segmentation in MRI using Convolutional Neural Networks

    Authors: Arjun D. Desai, Garry E. Gold, Brian A. Hargreaves, Akshay S. Chaudhari

    Abstract: High-fidelity semantic segmentation of magnetic resonance volumes is critical for estimating tissue morphometry and relaxation parameters in both clinical and research applications. While manual segmentation is accepted as the gold-standard, recent advances in deep learning and convolutional neural networks (CNNs) have shown promise for efficient automatic segmentation of soft tissues. However, du…

    Submitted 5 February, 2019; originally announced February 2019.

    Comments: Submitted to Magnetic Resonance in Medicine

  26. arXiv:1611.03298  [pdf, other]

    cs.SI

    Role of Temporal Diversity in Inferring Social Ties Based on Spatio-Temporal Data

    Authors: Deshana Desai, Harsh Nisar, Rishab Bhardawaj

    Abstract: The last two decades have seen a tremendous surge in research on social networks and their implications. The studies includes inferring social relationships, which in turn have been used for target advertising, recommendations, search customization etc. However, the offline experiences of human, the conversations with people and face-to-face interactions that govern our lives interactions have rec…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: 7 pages, 3 figures

  27. arXiv:1211.3439  [pdf, ps, other]

    cs.CC

    Optimal Hitting Sets for Combinatorial Shapes

    Authors: Aditya Bhaskara, Devendra Desai, Srikanth Srinivasan

    Abstract: We consider the problem of constructing explicit Hitting sets for Combinatorial Shapes, a class of statistical tests first studied by Gopalan, Meka, Reingold, and Zuckerman (STOC 2011). These generalize many well-studied classes of tests, including symmetric functions and combinatorial rectangles. Generalizing results of Linial, Luby, Saks, and Zuckerman (Combinatorica 1997) and Rabani and Shpilka…

    Submitted 14 November, 2012; originally announced November 2012.

    Comments: 24 pages

  28. arXiv:1111.3048  [pdf, ps, other]

    cs.SI cs.CC physics.soc-ph

    On a Connection Between Small Set Expansions and Modularity Clustering in Social Networks

    Authors: Bhaskar DasGupta, Devendra Desai

    Abstract: In this paper we explore a connection between two seemingly different problems from two different domains: the small-set expansion problem studied in unique games conjecture, and a popular community finding approach for social networks known as the modularity clustering approach. We show that a sub-exponential time algorithm for the small-set expansion problem leads to a sub-exponential time const…

    Submitted 11 February, 2014; v1 submitted 13 November, 2011; originally announced November 2011.

    Comments: Information Processing Letters, 2014

    MSC Class: 68Q25; 68W25 ACM Class: F.2.2; J.4

    Journal ref: Information Processing Letters, 114(7), 349-352, 2014

  29. arXiv:1102.0969  [pdf, ps, other]

    physics.soc-ph cs.CC cs.DM cs.SI

    On the Complexity of Newman's Community Finding Approach for Biological and Social Networks

    Authors: Bhaskar DasGupta, Devendra Desai

    Abstract: Given a graph of interactions, a module (also called a community or cluster) is a subset of nodes whose fitness is a function of the statistical significance of the pairwise interactions of nodes in the module. The topic of this paper is a model-based community finding approach, commonly referred to as modularity clustering, that was originally proposed by Newman and has subsequently been extremel…

    Submitted 10 April, 2012; v1 submitted 4 February, 2011; originally announced February 2011.

    Comments: Journal of Computer and System Sciences, 2012

    MSC Class: 68Q25; 68R01; 05C85 ACM Class: F.2.2; G.2.2

    Journal ref: Journal of Computer & System Sciences, 79, 50-67, 2013

  30. arXiv:1002.3864  [pdf, other]

    cs.CC cs.DS

    Limits of Approximation Algorithms: PCPs and Unique Games (DIMACS Tutorial Lecture Notes)

    Authors: Prahladh Harsha, Moses Charikar, Matthew Andrews, Sanjeev Arora, Subhash Khot, Dana Moshkovitz, Lisa Zhang, Ashkan Aazami, Dev Desai, Igor Gorodezky, Geetha Jagannathan, Alexander S. Kulikov, Darakhshan J. Mir, Alantha Newman, Aleksandar Nikolov, David Pritchard, Gwen Spencer

    Abstract: These are the lecture notes for the DIMACS Tutorial "Limits of Approximation Algorithms: PCPs and Unique Games" held at the DIMACS Center, CoRE Building, Rutgers University on 20-21 July, 2009. This tutorial was jointly sponsored by the DIMACS Special Focus on Hardness of Approximation, the DIMACS Special Focus on Algorithmic Foundations of the Internet, and the Center for Computational Intracta…

    Submitted 20 February, 2010; originally announced February 2010.

    Comments: 74 pages, lecture notes

    Report number: DIMACS Technical Report 2010-02