-
LTLCodeGen: Code Generation of Syntactically Correct Temporal Logic for Robot Task Planning
Authors:
Behrad Rabiei,
Mahesh Kumar A. R.,
Zhirui Dai,
Surya L. S. R. Pilla,
Qiyue Dong,
Nikolay Atanasov
Abstract:
This paper focuses on planning robot navigation tasks from natural language specifications. We develop a modular approach, where a large language model (LLM) translates the natural language instructions into a linear temporal logic (LTL) formula with propositions defined by object classes in a semantic occupancy map. The LTL formula and the semantic occupancy map are provided to a motion planning algorithm to generate a collision-free robot path that satisfies the natural language instructions. Our main contribution is LTLCodeGen, a method to translate natural language to syntactically correct LTL using code generation. We demonstrate the complete task planning method in real-world experiments involving human speech to provide navigation instructions to a mobile robot. We also thoroughly evaluate our approach in simulated and real-world experiments in comparison to end-to-end LLM task planning and state-of-the-art LLM-to-LTL translation methods.
Submitted 10 March, 2025;
originally announced March 2025.
-
Physically Constrained 3D Diffusion for Inverse Design of Fiber-reinforced Polymer Composite Materials
Authors:
Pei Xu,
Yunpeng Wu,
Srikanth Pilla,
Gang Li,
Feng Luo
Abstract:
Designing fiber-reinforced polymer composites (FRPCs) with a tailored nonlinear stress-strain response can enable innovative applications across various industries. Currently, no efforts have achieved the inverse design of FRPCs that targets the entire stress-strain curve. Here, we develop PC3D_Diffusion, a 3D spatial diffusion model designed for the inverse design of FRPCs. We generate 1.35 million FRPCs and calculate their stress-strain curves for training. Although the vanilla PC3D_Diffusion can generate visually appealing results, fewer than 10% of the FRPCs generated by the vanilla model are collision-free, meaning that their fibers do not intersect one another. We then propose a loss-guided, learning-free approach to apply physical constraints during generation. As a result, PC3D_Diffusion can generate high-quality designs with tailored mechanical behaviors while guaranteeing that the physical constraints are satisfied. PC3D_Diffusion advances FRPC inverse design and may facilitate the inverse design of other 3D materials, offering potential applications in industries reliant on materials with custom mechanical properties.
Submitted 2 December, 2024;
originally announced December 2024.
-
A Foundation Model for Chemical Design and Property Prediction
Authors:
Feiyang Cai,
Katelin Hanna,
Tianyu Zhu,
Tzuen-Rong Tzeng,
Yongping Duan,
Ling Liu,
Srikanth Pilla,
Gang Li,
Feng Luo
Abstract:
Artificial intelligence (AI) has significantly advanced computational chemistry research in various tasks. However, traditional AI methods often rely on task-specific model designs and training, which constrain both the scalability of model size and generalization across different tasks. Here, we introduce ChemFM, a large foundation model specifically developed for chemicals. ChemFM comprises 3 billion parameters and is pre-trained on 178 million molecules using self-supervised causal language modeling to extract generalizable molecular representations. This model can be adapted to diverse downstream chemical applications using either full-parameter or parameter-efficient fine-tuning methods. ChemFM consistently outperforms state-of-the-art task-specific AI models across all tested tasks. Notably, it achieves up to 67.48% performance improvement across 34 property prediction benchmarks, up to 33.80% reduction in mean average deviation between conditioned and actual properties of generated molecules in conditional molecular generation tasks, and up to 3.7% top-1 accuracy improvement across 4 reaction prediction datasets. Moreover, ChemFM demonstrates its superior performance in predicting antibiotic activity and cytotoxicity, highlighting its potential to advance the discovery of novel antibiotics. We anticipate that ChemFM will significantly advance chemistry research by providing a foundation model capable of effectively generalizing across a broad range of tasks with minimal additional training.
Submitted 23 January, 2025; v1 submitted 28 October, 2024;
originally announced October 2024.
-
Explainable Deep Learning to Profile Mitochondrial Disease Using High Dimensional Protein Expression Data
Authors:
Atif Khan,
Conor Lawless,
Amy E Vincent,
Satish Pilla,
Sushanth Ramesh,
A. Stephen McGough
Abstract:
Mitochondrial diseases are currently untreatable due to our limited understanding of their pathology. We study the expression of various mitochondrial proteins in skeletal myofibres (SM) in order to discover processes involved in mitochondrial pathology using Imaging Mass Cytometry (IMC). IMC produces high dimensional multichannel pseudo-images representing spatial variation in the expression of a panel of proteins within a tissue, including subcellular variation. Statistical analysis of these images requires semi-automated annotation of thousands of SMs in IMC images of patient muscle biopsies. In this paper we investigate the use of deep learning (DL) on raw IMC data to analyse it without any manual pre-processing steps, statistical summaries or statistical models. For this we first train state-of-the-art computer vision DL models on all available image channels, both combined and individually. We observed better than expected accuracy for many of these models. We then apply state-of-the-art explainability techniques relevant to computer vision DL to find the basis of the predictions of these models. Some of the resulting visual explanation maps highlight features in the images that appear consistent with the latest hypotheses about mitochondrial disease progression within myofibres.
Submitted 31 October, 2022;
originally announced October 2022.
-
Model Building for Semiparametric Mixtures
Authors:
Ramani S. Pilla,
Francesco Bartolucci,
Bruce G. Lindsay
Abstract:
An important and yet difficult problem in fitting multivariate mixture models is determining the mixture complexity. We develop theory and a unified framework for finding the nonparametric maximum likelihood estimator of a multivariate mixing distribution and consequently estimating the mixture complexity. Multivariate mixtures provide a flexible approach to fitting high-dimensional data while offering data reduction through the number, location and shape of the component densities. The central principle of our method is to cast the mixture maximization problem in the concave optimization framework with finitely many linear inequality constraints and turn it into an unconstrained problem using a "penalty function". We establish the existence of parameter estimators and prove the convergence properties of the proposed algorithms. The role of a "sieve parameter" in reducing the dimensionality of mixture models is demonstrated. We derive analytical machinery for building a collection of semiparametric mixture models, including the multivariate case, via the sieve parameter. The performance of the methods is shown with applications to several data sets including the cdc15 cell-cycle yeast microarray data.
Submitted 3 June, 2006;
originally announced June 2006.
-
Enhancing the photomixing efficiency of optoelectronic devices in the terahertz regime
Authors:
Subrahmanyam Pilla
Abstract:
A method to reduce the transit time of the majority of carriers in photomixers and photodetectors to $< 1$ ps is proposed. Enhanced optical fields associated with surface plasmon polaritons, coupled with the velocity overshoot phenomenon, result in a net decrease in the transit time of carriers. As an example, model calculations demonstrating a $> 280\times$ improvement (or $\sim$2800 and 31.8 $\mu$W at 1 and 5 THz, respectively) in the THz power generation efficiency of a photomixer based on low-temperature-grown GaAs are presented. Due to minimal dependence on the carrier recombination time, it is anticipated that the proposed method paves the way for enhancing the speed and efficiency of photomixers and detectors covering UV to far infrared communications wavelengths (300 to 1600 nm).
Submitted 27 April, 2006;
originally announced April 2006.
-
Inference in Perturbation Models, Finite Mixtures and Scan Statistics: The Volume-of-Tube Formula
Authors:
Ramani S. Pilla,
Catherine Loader
Abstract:
This research creates a general class of "perturbation models" which are described by an underlying "null" model that accounts for most of the structure in data and a perturbation that accounts for possible small localized departures. The perturbation models encompass finite mixture models and spatial scan processes. In this article, (1) we propose a new test statistic to detect the presence of perturbation, including the case where the null model contains a set of nuisance parameters, and show that it is equivalent to the likelihood ratio test; (2) we establish that the asymptotic distribution of the test statistic is equivalent to the supremum of a Gaussian random field over a high-dimensional manifold (e.g., a curve or surface) with boundaries and singularities; (3) we derive a technique for approximating the quantiles of the test statistic using the Hotelling-Weyl-Naiman "volume-of-tube formula"; and (4) we solve the long-pending problem of testing for the order of a mixture model; in particular, we derive the asymptotic null distribution for a general family of mixture models including the multivariate mixtures. The inferential theory developed in this article is applicable to a class of non-regular statistical problems involving loss of identifiability or when some of the parameters are on the boundary of the parametric space.
Submitted 3 June, 2006; v1 submitted 20 November, 2005;
originally announced November 2005.
-
Inference Under Convex Cone Alternatives for Correlated Data
Authors:
Ramani S. Pilla
Abstract:
In this research, inferential theory for hypothesis testing under general convex cone alternatives for correlated data is developed. While there exists extensive theory for hypothesis testing under smooth cone alternatives with independent observations, extension to correlated data under general convex cone alternatives remains an open problem. This long-pending problem is addressed by (1) establishing that a "generalized quasi-score" statistic is asymptotically equivalent to the squared length of the projection of the standard Gaussian vector onto the convex cone and (2) showing that the asymptotic null distribution of the test statistic is a weighted chi-squared distribution, where the weights are "mixed volumes" of the convex cone and its polar cone. Explicit expressions for these weights are derived using the volume-of-tube formula around a convex manifold in the unit sphere. Furthermore, an asymptotic lower bound is constructed for the power of the generalized quasi-score test under a sequence of local alternatives in the convex cone. Applications to testing under order restricted alternatives for correlated data are illustrated.
Submitted 3 June, 2006; v1 submitted 25 June, 2005;
originally announced June 2005.
-
A New Technique for Finding Needles in Haystacks: A Geometric Approach to Distinguishing Between a New Source and Random Fluctuations
Authors:
Ramani S. Pilla,
Catherine Loader,
Cyrus Taylor
Abstract:
We propose a new test statistic based on a score process for determining the statistical significance of a putative signal that may be a small perturbation to a noisy experimental background. We derive the reference distribution for this score test statistic; it has an elegant geometrical interpretation as well as broad applicability. We illustrate the technique in the context of a model problem from high-energy particle physics. Monte Carlo experimental results confirm that the score test results in a significantly improved rate of signal detection.
Submitted 29 May, 2005;
originally announced May 2005.
-
On large-sample estimation and testing via quadratic inference functions for correlated data
Authors:
Ramani S. Pilla,
Catherine Loader
Abstract:
Hansen (1982) proposed a class of "generalized method of moments" (GMMs) for estimating a vector of regression parameters from a set of score functions. Hansen established that, under certain regularity conditions, the estimator based on the GMMs is consistent, asymptotically normal and asymptotically efficient. In the generalized estimating equation framework, extending the principle of the GMMs to implicitly estimate the underlying correlation structure leads to a "quadratic inference function" (QIF) for the analysis of correlated data. The main objectives of this research are to (1) formulate an appropriate estimated covariance matrix for the set of extended score functions defining the inference functions; (2) develop a unified large-sample theoretical framework for the QIF; (3) derive a generalization of the QIF test statistic for a general linear hypothesis problem involving correlated data while establishing the asymptotic distribution of the test statistic under the null and local alternative hypotheses; (4) propose an iteratively reweighted generalized least squares algorithm for inference in the QIF framework; and (5) investigate the effect of basis matrices, defining the set of extended score functions, on the size and power of the QIF test through Monte Carlo simulated experiments.
Submitted 7 January, 2006; v1 submitted 17 May, 2005;
originally announced May 2005.
-
Using Electrons on Liquid Helium for Quantum Computing
Authors:
A. J. Dahm,
J. M. Goodkind,
I. Karakurt,
S. Pilla
Abstract:
We describe a quantum computer based on electrons supported by a helium film and localized laterally by small electrodes just under the helium surface. Each qubit is made of combinations of the ground and first excited state of an electron trapped in the image potential well at the surface. Mechanisms for preparing the initial state of the qubit, operations with the qubits, and a proposed readout are described. This system is, in principle, capable of 100,000 operations in a decoherence time.
Submitted 5 November, 2001;
originally announced November 2001.
-
Electric field induced memory and aging effects in pure solid N_2
Authors:
S. Pilla,
J. A. Hamida,
K. A. Muttalib,
N. S. Sullivan
Abstract:
We report combined high sensitivity dielectric constant and heat capacity measurements of pure solid N_2 in the presence of a small external ac electric field in the audio frequency range. We have observed strong field induced aging and memory effects which show that field cooled samples may be prepared in a variety of metastable states leading to a free energy landscape with experimentally "tunable" barriers, and tunneling between these states may occur within laboratory time scales.
Submitted 17 March, 2000;
originally announced March 2000.
-
A modified dual-slope method for heat capacity measurements of condensable gases
Authors:
S. Pilla,
J. A. Hamida,
N. S. Sullivan
Abstract:
A high resolution non-adiabatic method for measuring the heat capacity ($C_v$) of bulk samples of condensable gases in the range 7.5-70 K is described. In this method $C_v$ is evaluated by directly comparing the heating and cooling rates of the sample temperature for two algebraically independent heat pulse sequences without explicit use of the thermal conductance between sample and thermal bath. A fully automated calorimeter for rapid measurement of $C_v$ of molecular solids utilizing this technique is presented.
Submitted 6 March, 2000;
originally announced March 2000.
-
Dielectric anomalies of solid CO and N_2 in the audio frequency range
Authors:
S. Pilla,
J. A. Hamida,
K. A. Muttalib,
N. S. Sullivan
Abstract:
We report the first audio frequency dielectric constant measurements of CO and N_2 in their solid phases. We have observed several new features, including (1) strong hysteresis effects above an onset temperature $T_{h} \simeq 42$ K in the β-phase of pure N_2, (2) absence of the expected short range antiferroelectric dipolar ordering in CO down to 4.2 K, and (3) anomalous temperature dependence of the dielectric constant in the α-phases of both CO and N_2. Quantum mechanical treatment of the molecular rotation explains some of the observed anomalies in the α-phase, but the strong hysteresis indicates that pure geometrical frustration plays a significant role in the β-phase.
Submitted 10 March, 1999;
originally announced March 1999.