Search | arXiv e-print repository

Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi

Authors: Monojit Choudhury, Shivam Chauhan, Rocktim Jyoti Das, Dhruv Sahnan, Xudong Han, Haonan Li, Aaryamonvikram Singh, Alok Anil Jadhav, Utkarsh Agarwal, Mukund Choudhary, Debopriyo Banerjee, Fajri Koto, Junaid Bhat, Awantika Shukla, Samujjwal Ghosh, Samta Kamboj, Onkar Pandit, Lalit Pradhan, Rahul Pal, Sunil Sahu, Soundar Doraiswamy, Parvez Mullah, Ali El Filali, Neha Sengupta, Gokul Ramakrishnan , et al. (5 additional authors not shown)

Abstract: Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorp… ▽ More Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorporates continuous pre-training with expanded transformer blocks, leveraging the Llama Pro methodology. A key challenge was the limited availability of high-quality Hindi text data; we addressed this through rigorous data curation, augmentation, and strategic bilingual training, balancing Hindi and English corpora to optimize cross-linguistic knowledge transfer. With 10 billion parameters, Nanda stands among the top-performing open-source Hindi and multilingual models of similar scale, demonstrating significant advantages over many existing models. We provide an in-depth discussion of training strategies, fine-tuning techniques, safety alignment, and evaluation metrics, demonstrating how these approaches enabled Nanda to achieve state-of-the-art results. By open-sourcing Nanda, we aim to advance research in Hindi LLMs and support a wide range of real-world applications across academia, industry, and public services. △ Less

Submitted 8 April, 2025; originally announced April 2025.

arXiv:2407.21107 [pdf, ps, other]

doi 10.1103/PhysRevD.110.064082

Toward Singularity Theorems with Torsion

Authors: Armin van de Venn, Ujjwal Agarwal, David Vasak

Abstract: This study examines the formulation of a singularity theorem for timelike curves including torsion, and establishes the foundational framework necessary for its derivation. We begin by deriving the relative acceleration for an arbitrary congruence of timelike curves. The resulting ``deviation equation'' offers an alternative pathway to the well-known Raychaudhuri equation with torsion. Conjugate p… ▽ More This study examines the formulation of a singularity theorem for timelike curves including torsion, and establishes the foundational framework necessary for its derivation. We begin by deriving the relative acceleration for an arbitrary congruence of timelike curves. The resulting ``deviation equation'' offers an alternative pathway to the well-known Raychaudhuri equation with torsion. Conjugate points are then introduced and analyzed in relation to the behavior of the scalar expansion. Together with the sensible requirement of hypersurface orthogonality, the Raychaudhuri equation is examined for several specific cases of torsion that are prominent in the literature. Our findings indicate that a totally antisymmetric torsion tensor does not influence the behavior of the congruence of timelike curves. Finally, we formulate a singularity theorem for timelike curves and highlight the critical requirement of non-autoparallel curves. △ Less

Submitted 27 September, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

Comments: 13 pages, 3 figures; Updated to published version

Journal ref: Phys. Rev. D 110, 064082 (2024)

arXiv:2404.18460 [pdf, other]

Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in

Authors: Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury

Abstract: Ethical reasoning is a crucial skill for Large Language Models (LLMs). However, moral values are not universal, but rather influenced by language and culture. This paper explores how three prominent LLMs -- GPT-4, ChatGPT, and Llama2-70B-Chat -- perform ethical reasoning in different languages and if their moral judgement depend on the language in which they are prompted. We extend the study of et… ▽ More Ethical reasoning is a crucial skill for Large Language Models (LLMs). However, moral values are not universal, but rather influenced by language and culture. This paper explores how three prominent LLMs -- GPT-4, ChatGPT, and Llama2-70B-Chat -- perform ethical reasoning in different languages and if their moral judgement depend on the language in which they are prompted. We extend the study of ethical reasoning of LLMs by Rao et al. (2023) to a multilingual setup following their framework of probing LLMs with ethical dilemmas and policies from three branches of normative ethics: deontology, virtue, and consequentialism. We experiment with six languages: English, Spanish, Russian, Chinese, Hindi, and Swahili. We find that GPT-4 is the most consistent and unbiased ethical reasoner across languages, while ChatGPT and Llama2-70B-Chat show significant moral value bias when we move to languages other than English. Interestingly, the nature of this bias significantly vary across languages for all LLMs, including GPT-4. △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2402.02135 [pdf, other]

Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test

Authors: Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury

Abstract: This paper explores the moral judgment and moral reasoning abilities exhibited by Large Language Models (LLMs) across languages through the Defining Issues Test. It is a well known fact that moral judgment depends on the language in which the question is asked. We extend the work of beyond English, to 5 new languages (Chinese, Hindi, Russian, Spanish and Swahili), and probe three LLMs -- ChatGPT,… ▽ More This paper explores the moral judgment and moral reasoning abilities exhibited by Large Language Models (LLMs) across languages through the Defining Issues Test. It is a well known fact that moral judgment depends on the language in which the question is asked. We extend the work of beyond English, to 5 new languages (Chinese, Hindi, Russian, Spanish and Swahili), and probe three LLMs -- ChatGPT, GPT-4 and Llama2Chat-70B -- that shows substantial multilingual text processing and generation abilities. Our study shows that the moral reasoning ability for all models, as indicated by the post-conventional score, is substantially inferior for Hindi and Swahili, compared to Spanish, Russian, Chinese and English, while there is no clear trend for the performance of the latter four languages. The moral judgments too vary considerably by the language. △ Less

Submitted 3 February, 2024; originally announced February 2024.

Comments: Accepted to EACL 2024 (main)

arXiv:2310.07251 [pdf, other]

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

Authors: Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury

Abstract: In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale. When provided with an ethical policy, an LLM should be capable of making decisions that are ethically consistent to the policy. We develop a framework that integrate… ▽ More In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical principles, we should infuse generic ethical reasoning capabilities into them so that they can handle value pluralism at a global scale. When provided with an ethical policy, an LLM should be capable of making decisions that are ethically consistent to the policy. We develop a framework that integrates moral dilemmas with moral principles pertaining to different foramlisms of normative ethics, and at different levels of abstractions. Initial experiments with GPT-x models shows that while GPT-4 is a nearly perfect ethical reasoner, the models still have bias towards the moral values of Western and English speaking societies. △ Less

Submitted 11 October, 2023; originally announced October 2023.

arXiv:2310.00782 [pdf, ps, other]

Distributed Deterministic Exact Minimum Weight Cycle and Multi Source Shortest Paths in Near Linear Rounds in CONGEST model

Authors: Udit Agarwal

Abstract: We present new deterministic algorithms for computing distributed weighted minimum weight cycle (MWC) in undirected and directed graphs and distributed weighted all nodes shortest cycle (ANSC) in directed graphs. Our algorithms for these problems run in $\tilde{O}(n)$ rounds in the CONGEST model on graphs with arbitrary non-negative edge weights, matching the lower bound up to polylogarithmic fact… ▽ More We present new deterministic algorithms for computing distributed weighted minimum weight cycle (MWC) in undirected and directed graphs and distributed weighted all nodes shortest cycle (ANSC) in directed graphs. Our algorithms for these problems run in $\tilde{O}(n)$ rounds in the CONGEST model on graphs with arbitrary non-negative edge weights, matching the lower bound up to polylogarithmic factors. Before our work, no near linear rounds deterministic algorithms were known for these problems. The previous best bound for solving these problems deterministically requires an initial computation of all pairs shortest paths (APSP) on the given graph, followed by post-processing of $O(n)$ rounds, and in total takes $\tilde{O}(n^{4/3})$ rounds, using deterministic APSP~\cite{AR-SPAA20}. The main component of our new $\tilde{O}(n)$ rounds algorithms is a deterministic technique for constructing a sequence of successive blocker sets. These blocker sets are then treated as source nodes to compute $h$-hop shortest paths, which can then be used to compute candidate shortest cycles whose hop length lies in a particular range. The shortest cycles can then be obtained by selecting the cycle with the minimum weight from all these candidate cycles. Additionally using the above blocker set sequence technique, we also obtain $\tilde{O}(n)$ rounds deterministic algorithm for the multi-source shortest paths problem (MSSP) for both directed and undirected graphs, given that the size of the source set is at most $\sqrt{n}$. This new result for MSSP can be a step towards obtaining a $o(n^{4/3})$ rounds algorithm for deterministic APSP. We also believe that our new blocker set sequence technique may have potential applications for other distributed algorithms. △ Less

Submitted 1 October, 2023; originally announced October 2023.

arXiv:2309.13356 [pdf, other]

Probing the Moral Development of Large Language Models through Defining Issues Test

Authors: Kumar Tanmay, Aditi Khandelwal, Utkarsh Agarwal, Monojit Choudhury

Abstract: In this study, we measure the moral reasoning ability of LLMs using the Defining Issues Test - a psychometric instrument developed for measuring the moral development stage of a person according to the Kohlberg's Cognitive Moral Development Model. DIT uses moral dilemmas followed by a set of ethical considerations that the respondent has to judge for importance in resolving the dilemma, and then r… ▽ More In this study, we measure the moral reasoning ability of LLMs using the Defining Issues Test - a psychometric instrument developed for measuring the moral development stage of a person according to the Kohlberg's Cognitive Moral Development Model. DIT uses moral dilemmas followed by a set of ethical considerations that the respondent has to judge for importance in resolving the dilemma, and then rank-order them by importance. A moral development stage score of the respondent is then computed based on the relevance rating and ranking. Our study shows that early LLMs such as GPT-3 exhibit a moral reasoning ability no better than that of a random baseline, while ChatGPT, Llama2-Chat, PaLM-2 and GPT-4 show significantly better performance on this task, comparable to adult humans. GPT-4, in fact, has the highest post-conventional moral reasoning score, equivalent to that of typical graduate school students. However, we also observe that the models do not perform consistently across all dilemmas, pointing to important gaps in their understanding and reasoning abilities. △ Less

Submitted 7 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

Comments: First three authors contributed equally

arXiv:2210.15593 [pdf]

An Investigation into Neuromorphic ICs using Memristor-CMOS Hybrid Circuits

Authors: Udit Kumar Agarwal, Shikhar Makhija, Varun Tripathi, Kunwar Singh

Abstract: The memristance of a memristor depends on the amount of charge flowing through it and when current stops flowing through it, it remembers the state. Thus, memristors are extremely suited for implementation of memory units. Memristors find great application in neuromorphic circuits as it is possible to couple memory and processing, compared to traditional Von-Neumann digital architectures where mem… ▽ More The memristance of a memristor depends on the amount of charge flowing through it and when current stops flowing through it, it remembers the state. Thus, memristors are extremely suited for implementation of memory units. Memristors find great application in neuromorphic circuits as it is possible to couple memory and processing, compared to traditional Von-Neumann digital architectures where memory and processing are separate. Neural networks have a layered structure where information passes from one layer to another and each of these layers have the possibility of a high degree of parallelism. CMOS-Memristor based neural network accelerators provide a method of speeding up neural networks by making use of this parallelism and analog computation. In this project we have conducted an initial investigation into the current state of the art implementation of memristor based programming circuits. Various memristor programming circuits and basic neuromorphic circuits have been simulated. The next phase of our project revolved around designing basic building blocks which can be used to design neural networks. A memristor bridge based synaptic weighting block, a operational transconductor based summing block were initially designed. We then designed activation function blocks which are used to introduce controlled non-linearity. Blocks for a basic rectified linear unit and a novel implementation for tan-hyperbolic function have been proposed. An artificial neural network has been designed using these blocks to validate and test their performance. We have also used these fundamental blocks to design basic layers of Convolutional Neural Networks. Convolutional Neural Networks are heavily used in image processing applications. The core convolutional block has been designed and it has been used as an image processing kernel to test its performance. △ Less

Submitted 19 August, 2022; originally announced October 2022.

Comments: Bachelor's thesis

arXiv:2107.10533 [pdf, other]

CGuard: Efficient Spatial Safety for C

Authors: Piyus Kedia, Rahul Purandare, Udit Kumar Agarwal, Rishabh

Abstract: Spatial safety violations are the root cause of many security attacks and unexpected behavior of applications. Existing techniques to enforce spatial safety work broadly at either object or pointer granularity. Object-based approaches tend to incur high CPU overheads, whereas pointer-based approaches incur both high CPU and memory overheads. SGXBounds, an object-based approach, is so far the most… ▽ More Spatial safety violations are the root cause of many security attacks and unexpected behavior of applications. Existing techniques to enforce spatial safety work broadly at either object or pointer granularity. Object-based approaches tend to incur high CPU overheads, whereas pointer-based approaches incur both high CPU and memory overheads. SGXBounds, an object-based approach, is so far the most efficient technique that provides complete out-of-bounds protection for objects. However, a major drawback of this approach is that it can't support address space larger than 32-bit. In this paper, we present CGuard, a tool that provides object-bounds protection for C applications with comparable overheads to SGXBounds without restricting the application address space. CGuard stores the bounds information just before the base address of an object and encodes the relative offset of the base address in the spare bits of the virtual address available in x86_64 architecture. For an object that can't fit in the spare bits, CGuard uses a custom memory layout that enables it to find the base address of the object in just one memory access. Our study revealed spatial safety violations in the gcc and x264 benchmarks from the SPEC CPU2017 benchmark suite and the string_match benchmark from the Phoenix benchmark suite. The execution time overheads for the SPEC CPU2017 and Phoenix benchmark suites were 42% and 26% respectively, whereas the reduction in the throughput for the Apache webserver when the CPUs were fully saturated was 30%. These results indicate that CGuard can be highly effective while maintaining a reasonable degree of efficiency. △ Less

Submitted 29 August, 2023; v1 submitted 22 July, 2021; originally announced July 2021.

arXiv:2106.08500 [pdf, other]

Optimizing Graph Transformer Networks with Graph-based Techniques

Authors: Loc Hoang, Udit Agarwal, Gurbinder Gill, Roshan Dathathri, Abhik Seal, Brian Martin, Keshav Pingali

Abstract: Graph transformer networks (GTN) are a variant of graph convolutional networks (GCN) that are targeted to heterogeneous graphs in which nodes and edges have associated type information that can be exploited to improve inference accuracy. GTNs learn important metapaths in the graph, create weighted edges for these metapaths, and use the resulting graph in a GCN. Currently, the only available implem… ▽ More Graph transformer networks (GTN) are a variant of graph convolutional networks (GCN) that are targeted to heterogeneous graphs in which nodes and edges have associated type information that can be exploited to improve inference accuracy. GTNs learn important metapaths in the graph, create weighted edges for these metapaths, and use the resulting graph in a GCN. Currently, the only available implementation of GTNs uses dense matrix multiplication to find metapaths. Unfortunately, the space overhead of this approach can be large, so in practice it is used only for small graphs. In addition, the matrix-based implementation is not fine-grained enough to use random-walk based methods to optimize metapath finding. In this paper, we present a graph-based formulation and implementation of the GTN metapath finding problem. This graph-based formulation has two advantages over the matrix-based approach. First, it is more space efficient than the original GTN implementation and more compute-efficient for metapath sizes of practical interest. Second, it permits us to implement a sampling method that reduces the number of metapaths that must be enumerated, allowing the implementation to be used for larger graphs and larger metapath sizes. Experimental results show that our implementation is $6.5\times$ faster than the original GTN implementation on average for a metapath length of 4, and our sampling implementation is $155\times$ faster on average than this implementation without compromising on the accuracy of the GTN. △ Less

Submitted 15 June, 2021; originally announced June 2021.

arXiv:2012.13618 [pdf, other]

doi 10.1145/3437801.3441611

BiPart: A Parallel and Deterministic Multilevel Hypergraph Partitioner

Authors: Sepideh Maleki, Udit Agarwal, Martin Burtscher, Keshav Pingali

Abstract: Hypergraph partitioning is used in many problem domains including VLSI design, linear algebra, Boolean satisfiability, and data mining. Most versions of this problem are NP-complete or NP-hard, so practical hypergraph partitioners generate approximate partitioning solutions for all but the smallest inputs. One way to speed up hypergraph partitioners is to exploit parallelism. However, existing par… ▽ More Hypergraph partitioning is used in many problem domains including VLSI design, linear algebra, Boolean satisfiability, and data mining. Most versions of this problem are NP-complete or NP-hard, so practical hypergraph partitioners generate approximate partitioning solutions for all but the smallest inputs. One way to speed up hypergraph partitioners is to exploit parallelism. However, existing parallel hypergraph partitioners are not deterministic, which is considered unacceptable in domains like VLSI design where the same partitions must be produced every time a given hypergraph is partitioned. In this paper, we describe BiPart, the first deterministic, parallel hypergraph partitioner. Experimental results show that BiPart outperforms state-of-the-art hypergraph partitioners in runtime and partition quality while generating partitions deterministically. △ Less

Submitted 25 December, 2020; originally announced December 2020.

Comments: Accepted for publication at PPoPP21

arXiv:2005.09588 [pdf, ps, other]

Faster Deterministic All Pairs Shortest Paths in Congest Model

Authors: Udit Agarwal, Vijaya Ramachandran

Abstract: We present a new deterministic algorithm for distributed weighted all pairs shortest paths (APSP) in both undirected and directed graphs. Our algorithm runs in $\tilde{O}(n^{4/3})$ rounds in the Congest models on graphs with arbitrary edge weights, and it improves on the previous $\tilde{O}(n^{3/2})$ bound of Agarwal et al. [ARKP18]. The main components of our new algorithm are a new faster techni… ▽ More We present a new deterministic algorithm for distributed weighted all pairs shortest paths (APSP) in both undirected and directed graphs. Our algorithm runs in $\tilde{O}(n^{4/3})$ rounds in the Congest models on graphs with arbitrary edge weights, and it improves on the previous $\tilde{O}(n^{3/2})$ bound of Agarwal et al. [ARKP18]. The main components of our new algorithm are a new faster technique for constructing blocker set deterministically and a new pipelined method for deterministically propagating distance values from source nodes to the blocker set nodes in the network. Both of these techniques have potential applications to other distributed algorithms. Our new deterministic algorithm for computing blocker set adapts the NC approximate hypergraph set cover algorithm in [BRS94] to the distributed construction of a blocker set. It follows the two-step process of first designing a randomized algorithm that uses only pairwise independence, and then derandomizes this algorithm using a sample space of linear size. This algorithm runs in almost the same number of rounds as the initial step in our APSP algorithm that computes $h$-hops shortest paths, and significantly improves on the deterministic blocker set algorithms in [ARKP18, AR19] by removing an additional $n\cdot |Q|$ term in the round bound, where Q is the blocker set. The other new component in our APSP algorithm is a deterministic pipelined approach to propagate distance values from source nodes to blocker nodes. We use a simple natural round-robin method for this step, and we show using a suitable progress measure that it achieve the $\tilde{O}(n^{4/3})$ bound on the number of rounds. It appears that the standard deterministic methods for efficiently broadcasting multiple values, and for sending or receiving messages using the routing schedule in [HPDG+19,LSP19] do not apply to this setting. △ Less

Submitted 19 May, 2020; originally announced May 2020.

Comments: An extended abstract of this paper will appear in the Proceedings of the ACM Symposium on Parallel Algorithms and Architectures (SPAA) 2020

arXiv:2002.11617

A Polynomial Time Algorithm for Almost Optimal Vertex Fault Tolerant Spanners

Authors: Udit Agarwal

Abstract: We present the first polynomial time algorithm for the f vertex fault tolerant spanner problem, which achieves almost optimal spanner size. Our algorithm for constructing f vertex fault tolerant spanner takes $O(k\cdot n\cdot m^2 \cdot W)$ time, where W is the maximum edge weight, and constructs a spanner of size $O(n^{1+1/k}f^{1-1/k}\cdot (\log n)^{1-1/k})$. Our spanner has almost optimal size an… ▽ More We present the first polynomial time algorithm for the f vertex fault tolerant spanner problem, which achieves almost optimal spanner size. Our algorithm for constructing f vertex fault tolerant spanner takes $O(k\cdot n\cdot m^2 \cdot W)$ time, where W is the maximum edge weight, and constructs a spanner of size $O(n^{1+1/k}f^{1-1/k}\cdot (\log n)^{1-1/k})$. Our spanner has almost optimal size and is at most a $\log n$ factor away from the upper bound on the worst-case size. Prior to this work, no other polynomial time algorithm was known for constructing f vertex fault tolerant spanner with optimal size. Our algorithm is based on first greedily constructing a hitting set for the collection of paths of weight at most $k \cdot w(u,v)$ between the endpoints u and v of an edge (u,v) and then using this set to decide whether the edge (u,v) needs to be added to the growing spanner. △ Less

Submitted 6 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: Incorrect statement of Lemma 5.1

arXiv:1911.12234 [pdf]

Structural, Electronic and Elastic Properties of zincblende III-Arsenide Binary Compounds: First-Principles Study

Authors: Umang Agarwal, Satish Chandra, Virendra Kumar

Abstract: First-principles calculations were performed, and the results from the study of structural, electronic and elastic properties of zincblende III-arsenide binary compounds (BAs, AlAs, GaAs and InAs) are presented. These properties have been calculated using an ab initio pseudopotential method based on density functional theory (DFT) with the local density approximation (LDA) for the exchange-correla… ▽ More First-principles calculations were performed, and the results from the study of structural, electronic and elastic properties of zincblende III-arsenide binary compounds (BAs, AlAs, GaAs and InAs) are presented. These properties have been calculated using an ab initio pseudopotential method based on density functional theory (DFT) with the local density approximation (LDA) for the exchange-correlation potential. The results obtained for the calculated properties have been compared with experimental data and other computational works. It has also been found that our results with LDA are in good agreement with other computational work wherever these are available. △ Less

Submitted 27 November, 2019; originally announced November 2019.

Comments: 6 pages, 12 figures, 2 tables

arXiv:1810.08544 [pdf, ps, other]

New and Simplified Distributed Algorithms for Weighted All Pairs Shortest Paths

Authors: Udit Agarwal, Vijaya Ramachandran

Abstract: We consider the problem of computing all pairs shortest paths (APSP) and shortest paths for k sources in a weighted graph in the distributed CONGEST model. For graphs with non-negative integer edge weights (including zero weights) we build on a recent pipelined algorithm to obtain $\tilde{O}(λ^{1/4}\cdot n^{5/4})$ in graphs with non-negative integer edge-weight at most $λ$, and… ▽ More We consider the problem of computing all pairs shortest paths (APSP) and shortest paths for k sources in a weighted graph in the distributed CONGEST model. For graphs with non-negative integer edge weights (including zero weights) we build on a recent pipelined algorithm to obtain $\tilde{O}(λ^{1/4}\cdot n^{5/4})$ in graphs with non-negative integer edge-weight at most $λ$, and $\tilde{O}(n \cdot \bigtriangleup^{1/3})$ rounds for shortest path distances at most $\bigtriangleup$. Additionally, we simplify some of the procedures in the earlier APSP algorithms for non-negative edge weights in [HNS17,ARKP18]. We also present results for computing h-hop shortest paths and shortest paths from $k$ given sources. In other results, we present a randomized exact APSP algorithm for graphs with arbitrary edge weights that runs in $\tilde{O}(n^{4/3})$ rounds w.h.p. in n, which improves the previous best $\tilde{O}(n^{3/2})$ bound, which is deterministic. We also present an $\tilde{O}(n/ε^2)$-round deterministic $(1+ε)$ approximation algorithm for graphs with non-negative $poly(n)$ integer weights (including zero edge-weights), improving results in [Nanongkai14,LP15] that hold only for positive integer weights. △ Less

Submitted 18 October, 2018; originally announced October 2018.

Comments: arXiv admin note: text overlap with arXiv:1807.08824

arXiv:1807.08824 [pdf, ps, other]

A Deterministic Distributed Algorithm for Weighted All Pairs Shortest Paths Through Pipelining

Authors: Udit Agarwal, Vijaya Ramachandran

Abstract: We present a new pipelined approach to compute all pairs shortest paths (APSP) in a directed graph with nonnegative integer edge weights (including zero weights) in the CONGEST model in the distributed setting. Our deterministic distributed algorithm computes shortest paths of distance at most $Δ$ for all pairs of vertices in at most $2 n \sqrtΔ + 2n$ rounds, and more generally, it computes h-hop… ▽ More We present a new pipelined approach to compute all pairs shortest paths (APSP) in a directed graph with nonnegative integer edge weights (including zero weights) in the CONGEST model in the distributed setting. Our deterministic distributed algorithm computes shortest paths of distance at most $Δ$ for all pairs of vertices in at most $2 n \sqrtΔ + 2n$ rounds, and more generally, it computes h-hop shortest paths for k sources in $2\sqrt{nkh} + n + k$ rounds. The algorithm is simple, and it has some novel features and a nontrivial analysis.It uses only the directed edges in the graph for communication. This algorithm can be used as a base within asymptotically faster algorithms that match or improve on the current best deterministic bound of $\tilde{O}(n^{3/2})$ rounds for this problem when edge weights are $O(n)$ or shortest path distances are $\tilde{O}(n^{3/2})$. △ Less

Submitted 22 October, 2018; v1 submitted 23 July, 2018; originally announced July 2018.

Comments: Minor correction in Citations

arXiv:1804.05441 [pdf, ps, other]

A Deterministic Distributed Algorithm for Exact Weighted All-Pairs Shortest Paths in $\tilde{O}(n^{3/2})$ Rounds

Authors: Udit Agarwal, Vijaya Ramachandran, Valerie King, Matteo Pontecorvi

Abstract: We present a deterministic distributed algorithm to compute all-pairs shortest paths(APSP) in an edge-weighted directed or undirected graph. Our algorithm runs in $\tilde{O}(n^{3/2})$ rounds in the Congest model, where $n$ is the number of nodes in the graph. This is the first $o(n^2)$ rounds deterministic distributed algorithm for the weighted APSP problem. Our algorithm is fairly simple and inco… ▽ More We present a deterministic distributed algorithm to compute all-pairs shortest paths(APSP) in an edge-weighted directed or undirected graph. Our algorithm runs in $\tilde{O}(n^{3/2})$ rounds in the Congest model, where $n$ is the number of nodes in the graph. This is the first $o(n^2)$ rounds deterministic distributed algorithm for the weighted APSP problem. Our algorithm is fairly simple and incorporates a deterministic distributed algorithm we develop for computing a `blocker set' \cite{King99}, which has been used earlier in sequential dynamic computation of APSP. △ Less

Submitted 15 April, 2018; originally announced April 2018.

arXiv:1709.04768 [pdf, ps, other]

doi 10.1103/PhysRevE.96.033313

Renormalization Group theory outperforms other approaches in statistical comparison between upscaling techniques for porous media

Authors: Shravan Hanasoge, Umang Agarwal, Kunj Tandon, J. M. Vianney A. Koelman

Abstract: Determining the pressure differential required to achieve a desired flow rate in a porous medium requires solving Darcy's law, a Laplace-like equation, with a spatially varying tensor permeability. In various scenarios, the permeability coefficient is sampled at high spatial resolution, which makes solving Darcy's equation numerically prohibitively expensive. As a consequence, much effort has gone… ▽ More Determining the pressure differential required to achieve a desired flow rate in a porous medium requires solving Darcy's law, a Laplace-like equation, with a spatially varying tensor permeability. In various scenarios, the permeability coefficient is sampled at high spatial resolution, which makes solving Darcy's equation numerically prohibitively expensive. As a consequence, much effort has gone into creating upscaled or low-resolution effective models of the coefficient while ensuring that the estimated flow rate is well reproduced, bringing to fore the classic tradeoff between computational cost and numerical accuracy. Here we perform a statistical study to characterize the relative success of upscaling methods on a large sample of permeability coefficients that are above the percolation threshold. We introduce a new technique based on Mode-Elimination Renormalization-Group theory (MG) to build coarse-scale permeability coefficients. Comparing the results with coefficients upscaled using other methods, we find that MG is consistently more accurate, particularly so due to its ability to address the tensorial nature of the coefficients. MG places a low computational demand, in the manner that we have implemented it, and accurate flow-rate estimates are obtained when using MG-upscaled permeabilities that approach or are beyond the percolation threshold. △ Less

Submitted 12 September, 2017; originally announced September 2017.

Comments: 15 pages, 7 figures, Physical Review E

arXiv:1611.07008 [pdf, ps, other]

Fine-Grained Complexity and Conditional Hardness for Sparse Graphs

Authors: Udit Agarwal, Vijaya Ramachandran

Abstract: We consider the fine-grained complexity of sparse graph problems that currently have $\tilde{O}(mn)$ time algorithms, where m is the number of edges and n is the number of vertices in the input graph. This class includes several important path problems on both directed and undirected graphs, including APSP, MWC (minimum weight cycle), and Eccentricities, which is the problem of computing, for each… ▽ More We consider the fine-grained complexity of sparse graph problems that currently have $\tilde{O}(mn)$ time algorithms, where m is the number of edges and n is the number of vertices in the input graph. This class includes several important path problems on both directed and undirected graphs, including APSP, MWC (minimum weight cycle), and Eccentricities, which is the problem of computing, for each vertex in the graph, the length of a longest shortest path starting at that vertex. We introduce the notion of a sparse reduction which preserves the sparsity of graphs, and we present near linear-time sparse reductions between various pairs of graph problems in the $\tilde{O}(mn)$ class. Surprisingly, very few of the known nontrivial reductions between problems in the $\tilde{O}(mn)$ class are sparse reductions. In the directed case, our results give a partial order on a large collection of problems in the $\tilde{O}(mn)$ class (along with some equivalences). In the undirected case we give two nontrivial sparse reductions: from MWC to APSP, and from unweighted ANSC (all nodes shortest cycles) to APSP. The latter reduction also gives an improved algorithm for ANSC (for dense graphs). We propose the MWC Conjecture, a new conditional hardness conjecture that the weight of a minimum weight cycle in a directed graph cannot be computed in time polynomially smaller than mn. Our sparse reductions for directed path problems in the $\tilde{O}(mn)$ class establish that several problems in this class, including 2-SiSP (second simple shortest path), Radius, and Eccentricities, are MWCC hard. We also identify Eccentricities as a key problem in the $\tilde{O}(mn)$ class which is simultaneously MWCC-hard, SETH-hard and k-DSH-hard, where SETH is the Strong Exponential Time Hypothesis, and k-DSH is the hypothesis that a dominating set of size k cannot be computed in time polynomially smaller than n^k. △ Less

Submitted 18 October, 2017; v1 submitted 21 November, 2016; originally announced November 2016.

Comments: abstract updated; There is no update to the paper

arXiv:1512.02157 [pdf, ps, other]

Finding $k$ Simple Shortest Paths and Cycles

Authors: Udit Agarwal, Vijaya Ramachandran

Abstract: The problem of finding multiple simple shortest paths in a weighted directed graph $G=(V,E)$ has many applications, and is considerably more difficult than the corresponding problem when cycles are allowed in the paths. Even for a single source-sink pair, it is known that two simple shortest paths cannot be found in time polynomially smaller than $n^3$ (where $n=|V|$) unless the All-Pairs Shortest… ▽ More The problem of finding multiple simple shortest paths in a weighted directed graph $G=(V,E)$ has many applications, and is considerably more difficult than the corresponding problem when cycles are allowed in the paths. Even for a single source-sink pair, it is known that two simple shortest paths cannot be found in time polynomially smaller than $n^3$ (where $n=|V|$) unless the All-Pairs Shortest Paths problem can be solved in a similar time bound. The latter is a well-known open problem in algorithm design. We consider the all-pairs version of the problem, and we give a new algorithm to find $k$ simple shortest paths for all pairs of vertices. For $k=2$, our algorithm runs in $O(mn + n^2 \log n)$ time (where $m=|E|$), which is almost the same bound as for the single pair case, and for $k=3$ we improve earlier bounds. Our approach is based on forming suitable path extensions to find simple shortest paths; this method is different from the `detour finding' technique used in most of the prior work on simple shortest paths, replacement paths, and distance sensitivity oracles. Enumerating simple cycles is a well-studied classical problem. We present new algorithms for generating simple cycles and simple paths in $G$ in non-decreasing order of their weights; the algorithm for generating simple paths is much faster, and uses another variant of path extensions. We also give hardness results for sparse graphs, relative to the complexity of computing a minimum weight cycle in a graph, for several variants of problems related to finding $k$ simple paths and cycles. △ Less

Submitted 22 February, 2016; v1 submitted 7 December, 2015; originally announced December 2015.

Comments: The current version includes new results for undirected graphs. In Section 4, the notion of an (m,n) reduction is generalized to an f(m,n) reduction

arXiv:1309.2989 [pdf, other]

Phase behavior of binary mixtures of hard convex polyehdra

Authors: Mihir R. Khadilkar, Umang Agarwal, Fernando A. Escobedo

Abstract: Shape anisotropy of colloidal nanoparticles has emerged as an important design variable for engineering assemblies with targeted structure and properties. In particular, a number of polyhedral nanoparticles have been shown to exhibit a rich phase behavior [Agarwal et al., Nature Materials, 2011, 10, 230]. Since real synthesized particles have polydispersity not only in size but also in shape, we e… ▽ More Shape anisotropy of colloidal nanoparticles has emerged as an important design variable for engineering assemblies with targeted structure and properties. In particular, a number of polyhedral nanoparticles have been shown to exhibit a rich phase behavior [Agarwal et al., Nature Materials, 2011, 10, 230]. Since real synthesized particles have polydispersity not only in size but also in shape, we explore here the phase behavior of binary mixtures of hard convex polyhedra having similar sizes but different shapes. Choosing representative particle shapes from those readily synthesizable, we study in particular four mixtures: (i) cubes and spheres (with spheres providing a non-polyhedral reference case), (ii) cubes and truncated octahedra, (iii) cubes and cuboctahedra, and (iv) cuboctahedra and truncated octahedra. The phase behavior of such mixtures is dependent on the interplay of mixing and packing entropy, which can give rise to miscible or phase-separated states. The extent of mixing of two such particle types is expected to depend on the degree of shape similarity, relative sizes, composition, and compatibility of the crystal structures formed by the pure components. While expectedly the binary systems studied exhibit phase separation at high pressures due to the incompatible pure-component crystal structures, our study shows that the essential qualitative trends in miscibility and phase separation can be correlated to properties of the pure components, such as the relative values of the order-disorder transition pressure (ODP) of each component. Specifically, if for a mixture A+B we have that ODP_B <ODP_A and ΔODP = ODP_A - ODP_B, then at any particular pressure where phase separation occurs, the larger the ΔODP the lower the solubility of A in the B-rich ordered phase and the higher the solubility of B in the A-rich ordered phase. △ Less

Submitted 11 September, 2013; originally announced September 2013.

arXiv:1008.1919 [pdf, ps, other]

doi 10.1103/PhysRevLett.105.078301

Glassy Dislocation Dynamics in 2-D Colloidal Dimer Crystals

Authors: Sharon J. Gerbode, Umang Agarwal, Desmond C. Ong, Chekesha M. Liddell, Fernando Escobedo, Itai Cohen

Abstract: Although glassy relaxation is typically associated with disorder, here we report on a new type of glassy dynamics relating to dislocations within 2-D crystals of colloidal dimers. Previous studies have demonstrated that dislocation motion in dimer crystals is restricted by certain particle orientations. Here, we drag an optically trapped particle through such dimer crystals, creating dislocations.… ▽ More Although glassy relaxation is typically associated with disorder, here we report on a new type of glassy dynamics relating to dislocations within 2-D crystals of colloidal dimers. Previous studies have demonstrated that dislocation motion in dimer crystals is restricted by certain particle orientations. Here, we drag an optically trapped particle through such dimer crystals, creating dislocations. We find a two-stage relaxation response where initially dislocations glide until encountering particles that cage their motion. Subsequent relaxation occurs logarithmically slowly through a second process where dislocations hop between caged configurations. Finally, in simulations of sheared dimer crystals, the dislocation mean squared displacement displays a caging plateau typical of glassy dynamics. Together, these results reveal a novel glassy system within a colloidal crystal. △ Less

Submitted 11 August, 2010; originally announced August 2010.

Comments: 4 pages + 2 page supp. mat

Journal ref: Phys. Rev. Lett. 105, 078301 (2010)

Showing 1–22 of 22 results for author: Agarwal, U