-
A study and comparison of COordinate Rotation DIgital Computer (CORDIC) architectures
Authors:
Neha K Nawandar,
Vishal R Satpute
Abstract:
Most digital signal processing applications perform operations such as multiplication, addition, square-root calculation, and solving linear equations. Hardware implementations of these operations consume a lot of hardware resources, while software implementations consume a large amount of memory. Even when implemented in hardware, they do not provide high speed, and for this reason software implementations still dominate hardware ones. For realizing operations from basic to very complex ones with less hardware, a COordinate Rotation DIgital Computer (CORDIC) proves beneficial. It is capable of performing mathematical operations ranging from addition to highly complex functions using only an arithmetic unit and shifters. This paper gives a brief overview of various existing CORDIC architectures, their working principles, and application domains, together with a comparison of these architectures. Different designs are available depending on the target, e.g. high accuracy and precision, low area, low latency, hardware efficiency, low power, or reconfigurability, and can be chosen according to the application in which the architecture is to be employed.
Submitted 8 November, 2022;
originally announced November 2022.
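The shift-and-add principle mentioned in the abstract above can be illustrated with a minimal rotation-mode CORDIC sketch. This is a generic textbook formulation, not one of the architectures compared in the paper; the function name `cordic_sin_cos`, the 30-bit fixed-point format, and the iteration count are assumptions chosen for illustration.

```python
import math

FRAC_BITS = 30            # assumed fixed-point format: 30 fractional bits
ONE = 1 << FRAC_BITS


def to_fixed(value):
    """Convert a float to the assumed fixed-point integer format."""
    return int(round(value * ONE))


def cordic_sin_cos(theta, iterations=24):
    """Rotation-mode CORDIC for theta in roughly [-pi/2, pi/2].

    Returns (sin(theta), cos(theta)). The main loop uses only
    additions, subtractions, and right shifts.
    """
    # Elementary angles atan(2^-i); hardware would hold these in a small ROM.
    angles = [to_fixed(math.atan(2.0 ** -i)) for i in range(iterations)]

    # Each micro-rotation scales the vector by sqrt(1 + 2^-2i); starting
    # from the inverse of the total gain compensates for that scaling.
    gain = 1.0
    for i in range(iterations):
        gain /= math.sqrt(1.0 + 2.0 ** (-2 * i))

    x, y, z = to_fixed(gain), 0, to_fixed(theta)
    for i in range(iterations):
        if z >= 0:
            # Rotate counter-clockwise by atan(2^-i).
            x, y, z = x - (y >> i), y + (x >> i), z - angles[i]
        else:
            # Rotate clockwise by atan(2^-i).
            x, y, z = x + (y >> i), y - (x >> i), z + angles[i]
    return y / ONE, x / ONE


if __name__ == "__main__":
    s, c = cordic_sin_cos(0.5)
    print(s, math.sin(0.5))
    print(c, math.cos(0.5))
```

Each extra iteration adds roughly one bit of accuracy, so the iteration count trades latency against precision, which is one of the design axes the abstract lists.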
-
Transfer Learning with Kernel Methods
Authors:
Adityanarayanan Radhakrishnan,
Max Ruiz Luyten,
Neha Prasad,
Caroline Uhler
Abstract:
Transfer learning refers to the process of adapting a model trained on a source task to a target task. While kernel methods are conceptually and computationally simple machine learning models that are competitive on a variety of tasks, it has been unclear how to perform transfer learning for kernel methods. In this work, we propose a transfer learning framework for kernel methods by projecting and translating the source model to the target task. We demonstrate the effectiveness of our framework in applications to image classification and virtual drug screening. In particular, we show that transferring modern kernels trained on large-scale image datasets can result in substantial performance increase as compared to using the same kernel trained directly on the target task. In addition, we show that transfer-learned kernels allow a more accurate prediction of the effect of drugs on cancer cell lines. For both applications, we identify simple scaling laws that characterize the performance of transfer-learned kernels as a function of the number of target examples. We explain this phenomenon in a simplified linear setting, where we are able to derive the exact scaling laws. By providing a simple and effective transfer learning framework for kernel methods, our work enables kernel methods trained on large datasets to be easily adapted to a variety of downstream target tasks.
Submitted 31 October, 2022;
originally announced November 2022.
-
Timed Alignments with Mixed Moves
Authors:
Neha Rino,
Thomas Chatain
Abstract:
This paper studies conformance checking for timed models, that is, process models that consider both the sequence of events in a process and the timestamps at which each event is recorded. Time-aware process mining is a growing subfield of research, and as tools that seek to discover timing-related properties in processes develop, so does the need for conformance checking techniques that can tackle time constraints and provide insightful quality measures for time-aware process models. In particular, one of the most useful conformance artefacts is the alignment, that is, the minimal set of changes necessary to correct a new observation so that it conforms to a process model. This paper follows a previous one, in which we set up the problem of timed alignment. In the present paper, we solve the case where the metric used to compare timed processes allows mixed moves, i.e. an error on the timestamp of an event may or may not have propagated to its successors, and provide linear-time algorithms for distance computation and alignment on models with sequential causal processes.
Submitted 27 October, 2022;
originally announced October 2022.
-
Instruction Tuning for Few-Shot Aspect-Based Sentiment Analysis
Authors:
Siddharth Varia,
Shuai Wang,
Kishaloy Halder,
Robert Vacareanu,
Miguel Ballesteros,
Yassine Benajiba,
Neha Anna John,
Rishita Anubhai,
Smaranda Muresan,
Dan Roth
Abstract:
Aspect-based Sentiment Analysis (ABSA) is a fine-grained sentiment analysis task which involves four elements from user-generated texts: aspect term, aspect category, opinion term, and sentiment polarity. Most computational approaches focus on some of the ABSA sub-tasks such as tuple (aspect term, sentiment polarity) or triplet (aspect term, opinion term, sentiment polarity) extraction using either pipeline or joint modeling approaches. Recently, generative approaches have been proposed to extract all four elements as (one or more) quadruplets from text as a single task. In this work, we take a step further and propose a unified framework for solving ABSA and its associated sub-tasks, in order to improve performance in few-shot scenarios. To this end, we fine-tune a T5 model with instructional prompts in a multi-task learning fashion covering all the sub-tasks, as well as the entire quadruple prediction task. In experiments with multiple benchmark datasets, we show that the proposed multi-task prompting approach brings a performance boost (an absolute gain of 8.29 F1) in the few-shot learning setting.
Submitted 11 June, 2023; v1 submitted 12 October, 2022;
originally announced October 2022.
-
IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces
Authors:
Kelly Marchisio,
Neha Verma,
Kevin Duh,
Philipp Koehn
Abstract:
The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism." We address the root-cause of faulty cross-lingual mapping: that word embedding training resulted in the underlying spaces being non-isomorphic. We incorporate global measures of isomorphism directly into the Skip-gram loss function, successfully increasing the relative isomorphism of trained word embedding spaces and improving their ability to be mapped to a shared cross-lingual space. The result is improved bilingual lexicon induction in general data conditions, under domain mismatch, and with training algorithm dissimilarities. We release IsoVec at https://github.com/kellymarchisio/isovec.
Submitted 4 July, 2023; v1 submitted 10 October, 2022;
originally announced October 2022.
-
Gamma-hadron Separation in Imaging Atmospheric Cherenkov Telescopes using Quantum Classifiers
Authors:
Jashwanth S,
Sudeep Ghosh,
Neha Shah,
Kavitha Yogaraj,
Ankhi Roy
Abstract:
In this paper we introduce a novel method for gamma-hadron separation in Imaging Atmospheric Cherenkov Telescopes (IACTs) using Quantum Machine Learning (QML). IACTs capture images of Extensive Air Showers (EAS) produced by very-high-energy gamma rays. We use two QML algorithms, the Quantum Support Vector Classifier (QSVC) and the Variational Quantum Classifier (VQC), for binary classification of events into signal (gamma) and background (hadron) using the image parameters. The MAGIC Gamma Telescope dataset, which was generated with the Monte Carlo software CORSIKA, is used for this study. These quantum algorithms achieve performance comparable to standard multivariate classification techniques and can be used to solve a variety of real-world problems. The classification accuracy is improved by hyperparameter tuning. We propose a new architecture for using QSVC efficiently on large datasets and find that clustering enhances the overall performance.
Submitted 29 October, 2022; v1 submitted 7 October, 2022;
originally announced October 2022.
-
Certain Coefficient Problems of $\mathcal{S}_{e}^{*}$ and $\mathcal{C}_{e}$
Authors:
S. Sivaprasad Kumar,
Neha Verma
Abstract:
In this current study, we consider the classes $\mathcal{S}^{*}_{e}$ and $\mathcal{C}_e$ to obtain sharp bounds for the third Hankel determinant for functions within these classes. Additionally, we provide estimates for the sixth and seventh coefficients while establishing the fourth-order Hankel determinant as well.
Submitted 6 September, 2023; v1 submitted 4 October, 2022;
originally announced October 2022.
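For readers unfamiliar with the notation, the Hankel determinants referred to in this entry are usually defined as follows. This is the standard definition from the geometric function theory literature, not a formula quoted from the paper; it assumes the normalization $f(z)=z+\sum_{n\geq 2}a_n z^n$, so that $a_1=1$.

$$H_q(n)=\begin{vmatrix} a_n & a_{n+1} & \cdots & a_{n+q-1}\\ a_{n+1} & a_{n+2} & \cdots & a_{n+q}\\ \vdots & \vdots & \ddots & \vdots\\ a_{n+q-1} & a_{n+q} & \cdots & a_{n+2q-2} \end{vmatrix}, \qquad H_3(1)=a_3(a_2a_4-a_3^2)-a_4(a_4-a_2a_3)+a_5(a_3-a_2^2).$$

Under this definition, the fourth-order determinant $H_4(1)$ involves coefficients up to $a_7$, which is consistent with the abstract estimating the sixth and seventh coefficients.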
-
Bilinear matrix inequalities and polynomials in several freely noncommuting variables
Authors:
Sriram Balasubramanian,
Neha Hotwani,
Scott McCullough
Abstract:
Matrix-valued polynomials in any finite number of freely noncommuting variables that enjoy certain canonical partial convexity properties are characterized, via an algebraic certificate, in terms of Linear Matrix Inequalities and Bilinear Matrix Inequalities.
Submitted 28 February, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Collective Variables for Crystallization Simulations -- from Early Developments to Recent Advances
Authors:
Neha,
Vikas Tiwari,
Soumya Mondal,
Nisha Kumari,
Tarak Karmakar
Abstract:
Crystallization is one of the most important physicochemical processes, with relevance in materials science, biology, and the environment. Decades of experimental and theoretical efforts have been made to understand this fundamental symmetry-breaking transition. While experiments provide equilibrium structures and shapes of crystals, they are limited in their ability to unravel how molecules aggregate to form crystal nuclei that subsequently transform into bulk crystals. Computer simulations, mainly molecular dynamics (MD), can provide such microscopic details during the early stage of a crystallization event. Crystallization is a rare event that takes place on timescales much longer than a typical equilibrium MD simulation can sample. This inadequate sampling of the MD method can be circumvented by the use of enhanced sampling (ES) simulations. An ES method enhances the fluctuations of a system's slow degrees of freedom, called collective variables (CVs), by applying a bias potential, and thereby drives the system from one state to another within a short timescale. The most crucial part of an ES method is finding suitable CVs, which often requires intuition and several trial-and-error optimization steps. Over the years, a plethora of CVs have been developed and applied in the study of crystallization. In this review, we provide a brief overview of CVs that have been developed and used in ES simulations to study crystallization from melt or solution. These CVs can be categorized mainly into four types: (i) spherical particle-based, (ii) molecular template-based, (iii) physical property-based, and (iv) CVs obtained from dimensionality reduction techniques. We present the context-based evolution of CVs, discuss the current challenges, and propose future directions to further develop effective CVs for the study of crystallization of complex systems.
Submitted 28 September, 2022;
originally announced September 2022.
-
Automated detection of Alzheimer disease using MRI images and deep neural networks- A review
Authors:
Narotam Singh,
Patteshwari. D,
Neha Soni,
Amita Kapoor
Abstract:
Early detection of Alzheimer disease is crucial for deploying interventions and slowing disease progression. Many machine learning and deep learning algorithms have been explored in the past decade with the aim of building an automated detection system for Alzheimer disease. Advances in data augmentation techniques and deep learning architectures have opened up new frontiers in this field, and research is moving rapidly. Hence, the purpose of this survey is to provide an overview of recent research on deep learning models for Alzheimer disease diagnosis. In addition to categorizing the numerous data sources, neural network architectures, and commonly used assessment measures, we also classify implementation and reproducibility. Our objective is to assist interested researchers in keeping up with the newest developments and in reproducing earlier investigations as benchmarks. We also indicate future research directions for this topic.
Submitted 22 September, 2022;
originally announced September 2022.
-
Comparative Study of Q-Learning and NeuroEvolution of Augmenting Topologies for Self Driving Agents
Authors:
Arhum Ishtiaq,
Maheen Anees,
Sara Mahmood,
Neha Jafry
Abstract:
Autonomous vehicles have been of keen interest ever since the automation of various tasks began. Humans are prone to exhaustion and have slow response times on the road; on top of that, driving is already quite a dangerous task, with around 1.35 million road traffic deaths each year. Autonomous driving is expected to reduce the number of driving accidents around the world, which is why this problem has attracted researchers. Currently, self-driving vehicles use different algorithms for the various sub-problems involved in making the vehicle autonomous. We focus on reinforcement learning algorithms, specifically Q-learning, and on NeuroEvolution of Augmenting Topologies (NEAT), a combination of evolutionary algorithms and artificial neural networks, to train a model agent to drive along a given path. This paper draws a comparison between these two algorithms.
Submitted 19 September, 2022;
originally announced September 2022.
-
Interface-Assisted Room-Temperature Magnetoresistance in Cu-Phenalenyl-based Magnetic Tunnel Junctions
Authors:
Neha Jha,
Anand Paryar,
Tahereh Sadat Parvini,
Christian Denker,
Pavan K. Vardhanapu,
Gonela Vijaykumar,
Arne Ahrens,
Michael Seibt,
Jagadeesh S. Moodera,
Swadhin K. Mandal,
Markus Münzenberg
Abstract:
Delocalized carbon-based radical species with unpaired spin, such as the phenalenyl (PLY) radical, have opened avenues for developing multifunctional organic spintronic devices. Here we develop a novel technique based on a three-dimensional shadow mask and in-situ deposition to fabricate PLY-, Cu-PLY-, and Zn-PLY-based organic magnetic tunnel junctions (OMTJs) with an area of 3×8 μm² and improved morphology. The nonlinear and weakly temperature-dependent current-voltage (I-V) characteristics, in combination with the low organic barrier height, suggest tunneling as the dominant transport mechanism in the structurally and dimensionally optimized OMTJs. Cu-PLY-based OMTJs show a significant magnetoresistance of up to 14 percent at room temperature due to the formation of hybrid states at the metal-molecule interfaces, called the spinterface, which reveals the importance of spin-dependent interfacial modification in OMTJ design. In particular, Cu-PLY OMTJs show a stable voltage-driven resistive switching response that suggests their use as a new viable and scalable platform for building molecular-scale quantum memristors and processors.
Submitted 12 September, 2022;
originally announced September 2022.
-
Multi-signer Strong Designated Multi-verifier Signature Schemes based on Multiple Cryptographic Algorithms
Authors:
Neha Arora,
R. K. Sharma
Abstract:
A designated verifier signature scheme allows a signer to generate a signature that only the designated verifier can verify. This paper proposes multi-signer strong designated multi-verifier signature schemes based on multiple cryptographic algorithms and proves their security in the random oracle model.
Submitted 8 September, 2022;
originally announced September 2022.
-
Multisecret-sharing scheme with two-level security and its applications in Blockchain
Authors:
R. K. Sharma,
Ritumoni Sarma,
Neha Arora,
Vidya Sagar
Abstract:
A $(t,m)$-threshold secret-sharing and multisecret-sharing scheme based on Shamir's secret sharing scheme (SSS) is introduced, with two-level security provided by a one-way function. In addition, we give its application in a smart contract-enabled consortium blockchain network. The proposed scheme is thoroughly examined in terms of security and efficiency. Privacy, security, integrity, and scalability are also analyzed when applying it to the blockchain network.
Submitted 8 September, 2022;
originally announced September 2022.
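As background for the entry above, the following is a minimal sketch of plain Shamir secret sharing, the building block the abstract names. It does not implement the paper's two-level security or its multisecret extension. It assumes that $t$ is the reconstruction threshold and $m$ the number of participants; the function names `make_shares` and `recover_secret` and the choice of prime field are hypothetical.

```python
import secrets

# Assumed field: integers modulo the Mersenne prime 2^127 - 1.
PRIME = 2**127 - 1


def make_shares(secret, t, m, prime=PRIME):
    """Split `secret` into m shares so that any t of them recover it."""
    # Random polynomial of degree t-1 whose constant term is the secret.
    coeffs = [secret] + [secrets.randbelow(prime) for _ in range(t - 1)]
    shares = []
    for x in range(1, m + 1):
        # Evaluate the polynomial at x with Horner's rule.
        y = 0
        for c in reversed(coeffs):
            y = (y * x + c) % prime
        shares.append((x, y))
    return shares


def recover_secret(shares, prime=PRIME):
    """Lagrange interpolation at x = 0 over the prime field."""
    secret = 0
    for j, (xj, yj) in enumerate(shares):
        num, den = 1, 1
        for k, (xk, _) in enumerate(shares):
            if k != j:
                num = (num * (-xk)) % prime
                den = (den * (xj - xk)) % prime
        # Modular inverse of den via Fermat's little theorem.
        inv_den = pow(den, prime - 2, prime)
        secret = (secret + yj * num * inv_den) % prime
    return secret


if __name__ == "__main__":
    shares = make_shares(123456789, t=3, m=5)
    print(recover_secret(shares[:3]))
```

Any subset of at least $t$ distinct shares reconstructs the same secret, while fewer than $t$ shares reveal nothing about it in the information-theoretic sense.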
-
Coefficient problems for starlike functions associated with a petal shaped domain
Authors:
S. Sivaprasad Kumar,
Neha Verma
Abstract:
In the present investigation, we consider a subclass of starlike functions associated with a petal shaped domain, recently introduced and defined by $$\mathcal{S}^{*}_ρ:=\{f\in \mathcal{A}:zf'(z)/f(z) \prec 1+\sinh^{-1} z\}.$$ We establish certain coefficient related problems such as sharp first five coefficient bounds along with sharp second and third order Hankel determinants for $\mathcal{S}^{*}_ρ$. Also, sixth and seventh coefficient bounds are estimated to obtain the fourth Hankel determinant bound for the same class.
Submitted 31 August, 2022;
originally announced August 2022.
-
Stiefel-Whitney Classes of Representations of $\text{SL}(2,q)$
Authors:
Neha Malik,
Steven Spallone
Abstract:
We describe the Stiefel-Whitney classes (SWCs) of orthogonal representations $π$ of the finite special linear groups $G=\text{SL}(2,\mathbb F_q)$, in terms of character values of $π$. From this calculation, we can answer interesting questions about SWCs of $π$. For instance, we determine the subalgebra of $H^*(G,\mathbb Z/2\mathbb Z)$ generated by the SWCs of orthogonal $π$, and we also determine which $π$ have nontrivial mod $2$ Euler class.
Submitted 17 January, 2023; v1 submitted 11 August, 2022;
originally announced August 2022.
-
A Conjecture on $H_3(1)$ For Certain Starlike Functions
Authors:
Neha Verma,
S. Sivaprasad Kumar
Abstract:
We prove a conjecture concerning the third Hankel determinant, proposed in "Anal. Math. Phys., https://doi.org/10.1007/s13324-021-00483-7", which states that $|H_3(1)|\leq 1/9$ is sharp for the class $\mathcal{S}_{\wp}^{*}=\{zf'(z)/f(z) \prec \varphi(z):=1+ze^z\}$. In addition, we establish bounds for the sixth and seventh coefficients and for $|H_4(1)|$ for functions in $\mathcal{S}_{\wp}^{*}$. General bounds for two-fold and three-fold symmetric functions related to the Ma-Minda classes $\mathcal{S}^*(\varphi)$ of starlike functions are also obtained.
Submitted 4 August, 2022;
originally announced August 2022.
-
Interpretable Distribution Shift Detection using Optimal Transport
Authors:
Neha Hulkund,
Nicolo Fusi,
Jennifer Wortman Vaughan,
David Alvarez-Melis
Abstract:
We propose a method to identify and characterize distribution shifts in classification datasets based on optimal transport. It allows the user to identify the extent to which each class is affected by the shift, and retrieves corresponding pairs of samples to provide insights on its nature. We illustrate its use on synthetic and natural shift examples. While the results we present are preliminary, we hope that this inspires future work on interpretable methods for analyzing distribution shifts.
Submitted 4 August, 2022;
originally announced August 2022.
-
Uncorrelated Compensated Isocurvature Perturbations from kSZ Tomography
Authors:
Neha Anil Kumar,
Selim C. Hotinli,
Marc Kamionkowski
Abstract:
Compensated isocurvature perturbations (CIPs) are relative density perturbations in which a baryon-density fluctuation is accompanied by a dark matter density fluctuation such that the total-matter density is unperturbed. These fluctuations can be produced primordially if multiple fields are present during inflation, and therefore they can be used to differentiate between different models for the early Universe. Kinetic Sunyaev-Zeldovich (kSZ) tomography allows for the reconstruction of the radial-velocity field of matter as a function of redshift. This technique can be used to reconstruct the total-matter-overdensity field, independent of the galaxy-density field obtained from large-scale galaxy surveys. We leverage the ability to measure the galaxy- and matter-overdensity fields independently to construct a minimum-variance estimator for the primordial CIP amplitude, based on a mode-by-mode comparison of the two measurements. We forecast that a configuration corresponding to CMB-S4 and VRO will be able to detect (at $2σ$) a CIP amplitude $A$ (for a scale-invariant power spectrum) as small as $A\simeq 5\times 10^{-9}$. Similarly, a configuration corresponding to SO and DESI will be sensitive to a CIP amplitude $A\simeq 1\times 10^{-7}$. These values are to be compared to current constraints $A \leq {\cal O}(0.01)$.
Submitted 29 January, 2023; v1 submitted 4 August, 2022;
originally announced August 2022.
-
A Python-based Mixed Discrete-Continuous Simulation Framework for Digital Twins
Authors:
Neha Karanjkar,
Subodh M. Joshi
Abstract:
The use of Digital Twins is set to transform the manufacturing sector by aiding monitoring and real-time decision making. For several applications in this sector, the system to be modeled consists of a mix of discrete-event and continuous processes interacting with each other. Building simulation-based Digital Twins of such systems necessitates an open, flexible simulation framework which can support easy modeling and fast simulation of both continuous and discrete-event components, and their interactions. In this paper, we present an outline and key design aspects of a Python-based framework for performing mixed discrete-continuous simulations. The continuous processes in the system are assumed to be loosely coupled to other components via pre-defined events. For example, a continuous state variable crossing a threshold may trigger an external event. Similarly, external events may lead to a sudden change in the trajectory, state value or boundary conditions in a continuous process. We first present a systematic events-based interface using which such interactions can be modeled and simulated. We then discuss implementation details of the framework along with a detailed example. In our implementation, the advancement of time is controlled and performed using the event-stepped engine of SimPy (a popular discrete-event simulation library in Python). The continuous processes are modelled using existing frameworks with a Python wrapper providing the events interface. We discuss possible improvements to the time advancement scheme, a roadmap and use cases for the framework.
Submitted 31 July, 2022;
originally announced August 2022.
-
Sequential Models in the Synthetic Data Vault
Authors:
Kevin Zhang,
Neha Patki,
Kalyan Veeramachaneni
Abstract:
The goal of this paper is to describe a system for generating synthetic sequential data within the Synthetic Data Vault. To achieve this, we present the Sequential model currently in SDV, an end-to-end framework that builds a generative model for multi-sequence, real-world data. This includes a novel neural network-based machine learning model, the conditional probabilistic auto-regressive (CPAR) model. The overall system and the model are available in the open source Synthetic Data Vault (SDV) library (https://github.com/sdv-dev/SDV), along with a variety of other models for different synthetic data needs.
After building the Sequential SDV, we used it to generate synthetic data and compared its quality against an existing, non-sequential generative adversarial network based model called CTGAN. To compare the sequential synthetic data against its real counterpart, we invented a new metric called Multi-Sequence Aggregate Similarity (MSAS). We used it to conclude that our Sequential SDV model learns higher level patterns than non-sequential models without any trade-offs in synthetic data quality.
Submitted 28 July, 2022;
originally announced July 2022.
-
Visualization Design Practices in a Crisis: Behind the Scenes with COVID-19 Dashboard Creators
Authors:
Yixuan Zhang,
Yifan Sun,
Joseph D. Gaggiano,
Neha Kumar,
Clio Andris,
Andrea G. Parker
Abstract:
During the COVID-19 pandemic, a number of data visualizations were created to inform the public about the rapidly evolving crisis. Data dashboards, a form of information dissemination used during the pandemic, have facilitated this process by visualizing statistics regarding the number of COVID-19 cases over time. In this research, we conducted a qualitative interview study among dashboard creators from federal agencies, state health departments, mainstream news media outlets, and other organizations that created (often widely-used) COVID-19 dashboards to answer the following questions: how did visualization creators engage in COVID-19 dashboard design, and what tensions, conflicts, and challenges arose during this process? Our findings detail the trajectory of design practices -- from creation to expansion, maintenance, and termination -- that are shaped by the complex interplay between design goals, tools and technologies, labor, emerging crisis contexts, and public engagement. We particularly examined the tensions between designers and the general public involved in these processes. These conflicts, which often materialized due to a divergence between public demands and standing policies, centered around the type and amount of information to be visualized, how public perceptions shape and are shaped by visualization design, and the strategies utilized to deal with (potential) misinterpretations and misuse of visualizations. Our findings and lessons learned shed light on new ways of thinking in visualization design, focusing on the bundled activities that are invariably involved in human and nonhuman participation throughout the entire trajectory of design practice.
Submitted 26 July, 2022;
originally announced July 2022.
-
Training Large-Vocabulary Neural Language Models by Private Federated Learning for Resource-Constrained Devices
Authors:
Mingbin Xu,
Congzheng Song,
Ye Tian,
Neha Agrawal,
Filip Granqvist,
Rogier van Dalen,
Xiao Zhang,
Arturo Argueta,
Shiyi Han,
Yaqiao Deng,
Leo Liu,
Anmol Walia,
Alex Jin
Abstract:
Federated Learning (FL) is a technique to train models using data distributed across devices. Differential Privacy (DP) provides a formal privacy guarantee for sensitive data. Our goal is to train a large neural network language model (NNLM) on compute-constrained devices while preserving privacy using FL and DP. However, the DP-noise introduced to the model increases as the model size grows, which often prevents convergence. We propose Partial Embedding Updates (PEU), a novel technique to decrease noise by decreasing payload size. Furthermore, we adopt Low Rank Adaptation (LoRA) and Noise Contrastive Estimation (NCE) to reduce the memory demands of large models on compute-constrained devices. This combination of techniques makes it possible to train large-vocabulary language models while preserving accuracy and privacy.
Submitted 18 July, 2022;
originally announced July 2022.
-
Discriminative Kernel Convolution Network for Multi-Label Ophthalmic Disease Detection on Imbalanced Fundus Image Dataset
Authors:
Amit Bhati,
Neha Gour,
Pritee Khanna,
Aparajita Ojha
Abstract:
It is feasible to recognize the presence and seriousness of eye disease by investigating the progressions in retinal biological structure. Fundus examination is a diagnostic procedure to examine the biological structure and anomalies of the eye. Ophthalmic diseases like glaucoma, diabetic retinopathy, and cataract are the main causes of visual impairment around the world. Ocular Disease Intelligent Recognition (ODIR-5K) is a benchmark structured fundus image dataset utilized by researchers for multi-label multi-disease classification of fundus images. This work presents a discriminative kernel convolution network (DKCNet), which explores discriminative region-wise features without adding extra computational cost. DKCNet is composed of an attention block followed by a squeeze and excitation (SE) block. The attention block takes features from the backbone network and generates discriminative feature attention maps. The SE block takes the discriminative feature maps and improves channel interdependencies. Better performance of DKCNet is observed with an InceptionResnet backbone network for multi-label classification of ODIR-5K fundus images, with 96.08 AUC, 94.28 F1-score and 0.81 kappa score. The proposed method splits the common target label for an eye pair based on the diagnostic keyword. Based on these labels, oversampling and undersampling are performed to resolve class imbalance. To check the bias of the proposed model towards the training data, the model trained on the ODIR dataset is tested on three publicly available benchmark datasets. It is also found to perform well on completely unseen fundus images.
Submitted 16 July, 2022;
originally announced July 2022.
-
Investigation of Rocket Effect in Bright-Rimmed Clouds using Gaia EDR3
Authors:
Piyali Saha,
Maheswar G.,
D. K. Ojha,
Tapas Baug,
Sharma Neha
Abstract:
Bright-rimmed clouds (BRCs) are excellent laboratories to explore the radiation-driven implosion mode of star formation because they show evidence of triggered star formation. In our previous study, BRC 18 was found to accelerate away from the direction of the ionizing H II region because of the well-known "Rocket Effect". Based on the assumption that BRC 18 and the candidate young stellar objects (YSOs) are kinematically coupled, and using the latest Gaia EDR3 measurements, we found that the relative proper motions of the candidate YSOs exhibit a tendency of moving away from the ionizing source. Using BRC 18 as a prototype, we extended our analysis to 21 more BRCs, a majority of which showed a similar trend. For most of the BRCs, the median angle of the relative proper motion of the candidate YSOs is similar to the angle of the on-sky direction from the ionizing source to the central IRAS source of the BRC. Based on Pearson's and Spearman's correlation coefficients, we found a strong correlation between these two angles, which is further supported by the Kolmogorov-Smirnov (K-S) test on them. The strong correlation between these two angles supports the "Rocket Effect" in the BRCs on the plane-of-sky.
Submitted 11 July, 2022;
originally announced July 2022.
-
Predicting Out-of-Domain Generalization with Neighborhood Invariance
Authors:
Nathan Ng,
Neha Hulkund,
Kyunghyun Cho,
Marzyeh Ghassemi
Abstract:
Developing and deploying machine learning models safely depends on the ability to characterize and compare their abilities to generalize to new environments. Although recent work has proposed a variety of methods that can directly predict or theoretically bound the generalization capacity of a model, they rely on strong assumptions such as matching train/test distributions and access to model gradients. In order to characterize generalization when these assumptions are not satisfied, we propose neighborhood invariance, a measure of a classifier's output invariance in a local transformation neighborhood. Specifically, we sample a set of transformations and given an input test point, calculate the invariance as the largest fraction of transformed points classified into the same class. Crucially, our measure is simple to calculate, does not depend on the test point's true label, makes no assumptions about the data distribution or model, and can be applied even in out-of-domain (OOD) settings where existing methods cannot, requiring only selecting a set of appropriate data transformations. In experiments on robustness benchmarks in image classification, sentiment analysis, and natural language inference, we demonstrate a strong and robust correlation between our neighborhood invariance measure and actual OOD generalization on over 4,600 models evaluated on over 100 unique train/test domain pairs.
Submitted 17 July, 2023; v1 submitted 5 July, 2022;
originally announced July 2022.
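The neighborhood invariance measure in the abstract above (the largest fraction of transformed copies of a test point that fall into the same predicted class) can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function name, the toy threshold classifier, and the additive transformations are all assumptions, and the transformations are taken as an already-sampled list.

```python
from collections import Counter

def neighborhood_invariance(classify, x, transforms):
    """Largest fraction of transformed points assigned to one class.

    classify:   callable mapping an input to a predicted class label
    x:          a single test input (its true label is never needed)
    transforms: already-sampled list of callables, each mapping an
                input to a transformed input
    """
    predictions = [classify(t(x)) for t in transforms]
    # Count of the most frequent predicted class, as a fraction of all predictions.
    top_count = Counter(predictions).most_common(1)[0][1]
    return top_count / len(predictions)

# Toy usage: a threshold "classifier" on a scalar and small additive shifts.
toy_classifier = lambda v: v > 0
toy_transforms = [
    lambda v: v + 0.1,
    lambda v: v - 0.1,
    lambda v: v,
    lambda v: v + 0.2,
]
score = neighborhood_invariance(toy_classifier, 0.05, toy_transforms)
print(score)
```

In the toy call, three of the four shifted points stay positive, so the score is 0.75. A point far from the decision boundary would score 1.0.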
-
Timed Alignments
Authors:
Thomas Chatain,
Neha Rino
Abstract:
The subject of this paper is to study conformance checking for timed models, that is, process models that consider both the sequence of events in a process as well as the timestamps at which each event is recorded. Time-aware process mining is a growing subfield of research, and as tools that seek to discover timing related properties in processes develop, so does the need for conformance checking techniques that can tackle time constraints and provide insightful quality measures for time-aware process models. In particular, one of the most useful conformance artefacts is the alignment, that is, finding the minimal changes necessary to correct a new observation to conform to a process model. In this paper, we set our problem of timed alignment and solve two cases each corresponding to a different metric over time processes. For the first, we have an algorithm whose time complexity is linear both in the size of the observed trace and the process model, while for the second we have a quadratic time algorithm for linear process models.
Submitted 5 July, 2022;
originally announced July 2022.
-
Literature on Hand GESTURE Recognition using Graph based methods
Authors:
Neha Baranwal,
Varun Sharma
Abstract:
Skeleton-based recognition systems are gaining popularity, and machine learning models that focus on points or joints in a skeleton have proved to be computationally effective and applicable in many areas, such as robotics. Points are easy to track while preserving spatial and temporal information, which plays an important role in abstracting the required information, so classification becomes an easy task. In this paper, we aim to study these points using a cloud mechanism, where we define a cloud as a collection of points. However, when we add temporal information, it may not be possible to retrieve the coordinates of a point in each frame; hence, instead of focusing on a single point, we can use its k-neighbors to retrieve the state of the point under discussion. Our focus is to gather such information using weight sharing while making sure that, when we retrieve information from neighbors, we do not carry noise with it. An LSTM has the capability of long-term modelling and can carry both temporal and spatial information. In this article, we summarise graph-based gesture recognition methods.
Submitted 1 July, 2022;
originally announced July 2022.
-
Ensembling over Classifiers: a Bias-Variance Perspective
Authors:
Neha Gupta,
Jamie Smith,
Ben Adlam,
Zelda Mariet
Abstract:
Ensembles are a straightforward, remarkably effective method for improving the accuracy, calibration, and robustness of models on classification tasks; yet, the reasons that underlie their success remain an active area of research. We build upon the extension to the bias-variance decomposition by Pfau (2013) in order to gain crucial insights into the behavior of ensembles of classifiers. Introducing a dual reparameterization of the bias-variance tradeoff, we first derive generalized laws of total expectation and variance for nonsymmetric losses typical of classification tasks. Comparing conditional and bootstrap bias/variance estimates, we then show that conditional estimates necessarily incur an irreducible error. Next, we show that ensembling in dual space reduces the variance and leaves the bias unchanged, whereas standard ensembling can arbitrarily affect the bias. Empirically, standard ensembling reduces the bias, leading us to hypothesize that ensembles of classifiers may perform well in part because of this unexpected reduction. We conclude by an empirical analysis of recent deep learning methods that ensemble over hyperparameters, revealing that these techniques indeed favor bias reduction. This suggests that, contrary to classical wisdom, targeting bias reduction may be a promising direction for classifier ensembles.
Submitted 21 June, 2022;
originally announced June 2022.
-
Haptic Shared Control Improves Neural Efficiency During Myoelectric Prosthesis Use
Authors:
Neha Thomas,
Alexandra J. Miller,
Hasan Ayaz,
Jeremy D. Brown
Abstract:
Clinical myoelectric prostheses lack the sensory feedback and sufficient dexterity required to complete activities of daily living efficiently and accurately. Providing haptic feedback of relevant environmental cues to the user or imbuing the prosthesis with autonomous control authority have been separately shown to improve prosthesis utility. Few studies, however, have investigated the effect of combining these two approaches in a shared control paradigm, and none have evaluated such an approach from the perspective of neural efficiency (the relationship between task performance and mental effort measured directly from the brain). In this work, we analyzed the neural efficiency of 30 non-amputee participants in a grasp-and-lift task of a brittle object. Here, a myoelectric prosthesis featuring vibrotactile feedback of grip force and autonomous control of grasping was compared with a standard myoelectric prosthesis with and without vibrotactile feedback. As a measure of mental effort, we captured the prefrontal cortex activity changes using functional near infrared spectroscopy during the experiment. Results showed that only the haptic shared control system enabled users to achieve high neural efficiency, and that vibrotactile feedback was important for grasping with the appropriate grip force. These results indicate that the haptic shared control system synergistically combines the benefits of haptic feedback and autonomous controllers, and is well-poised to inform such hybrid advancements in myoelectric prosthesis technology.
Submitted 27 May, 2022;
originally announced May 2022.
-
The Utility of Synthetic Reflexes and Haptic Feedback for Upper-Limb Prostheses in a Dexterous Task Without Direct Vision
Authors:
Neha Thomas,
Farimah Fazlollahi,
Katherine J. Kuchenbecker,
Jeremy D. Brown
Abstract:
Individuals who use myoelectric upper-limb prostheses often rely heavily on vision to complete their daily activities. They thus struggle in situations where vision is overloaded, such as multitasking, or unavailable, such as poor lighting conditions. Non-amputees can easily accomplish such tasks due to tactile reflexes and haptic sensation guiding their upper-limb motor coordination. Based on these principles, we developed and tested two novel prosthesis systems that incorporate autonomous controllers and provide the user with touch-location feedback through either vibration or distributed pressure. These capabilities were made possible by installing a custom contact-location sensor on the fingers of a commercial prosthetic hand, along with a custom pressure sensor on the thumb. We compared the performance of the two systems against a standard myoelectric prosthesis and a myoelectric prosthesis with only autonomous controllers in a difficult reach-to-pick-and-place task conducted without direct vision. Results from 40 non-amputee participants in this between-subjects study indicated that vibrotactile feedback combined with synthetic reflexes proved significantly more advantageous than the standard prosthesis in several of the task milestones. In addition, vibrotactile feedback and synthetic reflexes improved grasp placement compared to only synthetic reflexes or pressure feedback combined with synthetic reflexes. These results indicate that both autonomous controllers and haptic feedback facilitate success in dexterous tasks without vision, and that the type of haptic display matters.
Submitted 27 May, 2022;
originally announced May 2022.
-
Partial-input baselines show that NLI models can ignore context, but they don't
Authors:
Neha Srikanth,
Rachel Rudinger
Abstract:
When strong partial-input baselines reveal artifacts in crowdsourced NLI datasets, the performance of full-input models trained on such datasets is often dismissed as reliance on spurious correlations. We investigate whether state-of-the-art NLI models are capable of overriding default inferences made by a partial-input baseline. We introduce an evaluation set of 600 examples consisting of perturbed premises to examine a RoBERTa model's sensitivity to edited contexts. Our results indicate that NLI models are still capable of learning to condition on context--a necessary component of inferential reasoning--despite being trained on artifact-ridden datasets.
Submitted 24 May, 2022;
originally announced May 2022.
-
Consensus Capacity of Noisy Broadcast Channels
Authors:
Neha Sangwan,
Varun Narayanan,
Vinod M. Prabhakaran
Abstract:
We study communication with consensus over a broadcast channel: the receivers reliably decode the sender's message when the sender is honest, and their decoder outputs agree even if the sender acts maliciously. We characterize the broadcast channels which permit this Byzantine consensus and determine their capacity. We show that communication with consensus is possible only when the broadcast channel has embedded in it a natural "common channel" whose output both receivers can unambiguously determine from their own channel outputs. Interestingly, in general, the consensus capacity may be larger than the point-to-point capacity of the common channel, i.e., while decoding, the receivers may make use of parts of their output signals on which they may not have consensus, provided there are some parts (namely, the common channel output) on which they can agree.
Submitted 26 March, 2025; v1 submitted 12 May, 2022;
originally announced May 2022.
-
Berkovich-Uncu type Partition Inequalities Concerning Impermissible Sets and Perfect Power Frequencies
Authors:
Damanvir Singh Binner,
Neha Gupta,
Manoj Upreti
Abstract:
Recently, Rattan and the first author (Ann. Comb. 25 (2021) 697-728) proved a conjectured inequality of Berkovich and Uncu (Ann. Comb. 23 (2019) 263-284) concerning partitions with an impermissible part. In this article, we generalize this inequality by considering t impermissible parts. We compare these with partitions in which certain parts appear with a frequency that is a perfect t^{th} power. Our inequalities hold after a certain bound, which for given t is a polynomial in s, a major improvement over the previously known bound in the case t=1. To prove these inequalities, our methods involve constructing injective maps between the relevant sets of partitions. The construction of these maps crucially involves concepts from analysis and calculus, such as explicit maps used to prove countability of N^t and Jensen's inequality for convex functions, merged with techniques from number theory such as Frobenius numbers, congruence classes, binary numbers and quadratic residues. We also show a connection of our results to colored partitions. Finally, we pose an open problem which seems to be related to power residues and the almost universality of diagonal ternary quadratic forms.
Submitted 30 November, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.
-
Primordial trispectrum from kSZ tomography
Authors:
Neha Anil Kumar,
Gabriela Sato-Polito,
Marc Kamionkowski,
Selim C. Hotinli
Abstract:
The kinetic Sunyaev Zel'dovich effect is a secondary CMB temperature anisotropy that provides a powerful probe of the radial-velocity field of matter distributed across the Universe. This velocity field is reconstructed by combining high-resolution CMB measurements with galaxy survey data, and it provides an unbiased tracer of matter perturbations in the linear regime. In this paper, we show how this measurement can be used to probe primordial non-Gaussianity of the local type, particularly focusing on the trispectrum amplitude $τ_{\rm NL}$, as may arise in a simple two-field inflation model that we provide by way of illustration. Cross-correlating the velocity-field-derived matter distribution with the biased large-scale galaxy density field allows one to measure the scale-dependent bias factor with sample variance cancellation. We forecast that a configuration corresponding to CMB-S4 and VRO results in a sensitivity of $σ_{f_{\rm NL}} \approx 0.59$ and $σ_{τ_{\rm NL}} \approx 1.5$. These forecasts predict improvement factors of 10 and 195 for $σ_{f_{\rm NL}}$ and $σ_{τ_{\rm NL}}$, respectively, over the sensitivity using VRO data alone, without internal sample variance cancellation. Similarly, for a configuration corresponding to DESI and SO, we forecast a sensitivity of $σ_{f_{\rm NL}} \approx 3.1$ and $σ_{τ_{\rm NL}} \approx 69$, with improvement factors of 2 and 5, respectively, over the use of the DESI data-set in isolation. We find that a high galaxy number density and large survey volume considerably improve our ability to probe the amplitude of the primordial trispectrum for the multi-field model considered.
Submitted 30 August, 2022; v1 submitted 6 May, 2022;
originally announced May 2022.
-
A Novel Scalable Apache Spark Based Feature Extraction Approaches for Huge Protein Sequence and their Clustering Performance Analysis
Authors:
Preeti Jha,
Aruna Tiwari,
Neha Bharill,
Milind Ratnaparkhe,
Om Prakash Patel,
Nilagiri Harshith,
Mukkamalla Mounika,
Neha Nagendra
Abstract:
Genome sequencing projects are rapidly increasing the number of high-dimensional protein sequence datasets. Clustering a high-dimensional protein sequence dataset using traditional machine learning approaches poses many challenges. Many different feature extraction methods exist and are widely used. However, extracting features from millions of protein sequences becomes impractical because they are not scalable with current algorithms. Therefore, there is a need for an efficient feature extraction approach that extracts significant features. We have proposed two scalable feature extraction approaches for extracting features from huge protein sequences using Apache Spark, which are termed 60d-SPF (60-dimensional Scalable Protein Feature) and 6d-SCPSF (6-dimensional Scalable Co-occurrence-based Probability-Specific Feature). The proposed 60d-SPF and 6d-SCPSF approaches capture the statistical properties of amino acids to create a fixed-length numeric feature vector that represents each protein sequence in terms of 60-dimensional and 6-dimensional features, respectively. The preprocessed huge protein sequences are used as an input in two clustering algorithms, i.e., Scalable Random Sampling with Iterative Optimization Fuzzy c-Means (SRSIO-FCM) and Scalable Literal Fuzzy C-Means (SLFCM) for clustering. We have conducted extensive experiments on various soybean protein datasets to demonstrate the effectiveness of the proposed feature extraction methods, 60d-SPF, 6d-SCPSF, and existing feature extraction methods on SRSIO-FCM and SLFCM clustering algorithms. The reported results in terms of the Silhouette index and the Davies-Bouldin index show that the proposed 60d-SPF extraction method on SRSIO-FCM and SLFCM clustering algorithms achieves significantly better results than the proposed 6d-SCPSF and existing feature extraction approaches.
Submitted 21 April, 2022;
originally announced April 2022.
-
Spatially-Preserving Flattening for Location-Aware Classification of Findings in Chest X-Rays
Authors:
Neha Srivathsa,
Razi Mahmood,
Tanveer Syeda-Mahmood
Abstract:
Chest X-rays have become the focus of vigorous deep learning research in recent years due to the availability of large labeled datasets. While classification of anomalous findings is now possible, ensuring that they are correctly localized still remains challenging, as this requires recognition of anomalies within anatomical regions. Existing deep learning networks for fine-grained anomaly classification learn location-specific findings using architectures where the location and spatial contiguity information is lost during the flattening step before classification. In this paper, we present a new spatially preserving deep learning network that preserves location and shape information through auto-encoding of feature maps during flattening. The feature maps, auto-encoder and classifier are then trained in an end-to-end fashion to enable location aware classification of findings in chest X-rays. Results are shown on a large multi-hospital chest X-ray dataset indicating a significant improvement in the quality of finding classification over state-of-the-art methods.
Submitted 19 April, 2022;
originally announced April 2022.
-
Zero-shot Entity and Tweet Characterization with Designed Conditional Prompts and Contexts
Authors:
Sharath Srivatsa,
Tushar Mohan,
Kumari Neha,
Nishchay Malakar,
Ponnurangam Kumaraguru,
Srinath Srinivasa
Abstract:
Online news and social media have been the de facto mediums to disseminate information globally from the beginning of the last decade. However, bias in content and purpose of intentions are not regulated, and managing bias is the responsibility of content consumers. In this regard, understanding the stances and biases of news sources towards specific entities becomes important. To address this problem, we use pretrained language models, which have been shown to bring about good results with no task-specific training or few-shot training. In this work, we approach the problem of characterizing Named Entities and Tweets as an open-ended text classification and open-ended fact probing problem. We evaluate the zero-shot language model capabilities of Generative Pretrained Transformer 2 (GPT-2) to characterize Entities and Tweets subjectively with human psychology-inspired and logical conditional prefixes and contexts. First, we fine-tune the GPT-2 model on a sufficiently large news corpus and evaluate subjective characterization of popular entities in the corpus by priming with prefixes. Second, we fine-tune GPT-2 with a Tweets corpus from a few popular hashtags and evaluate characterizing tweets by priming the language model with prefixes, questions, and contextual synopsis prompts. Entity characterization results were positive across measures and human evaluation.
Submitted 18 April, 2022;
originally announced April 2022.
-
Maximal density and the kappa values for the families $\{a,a+1,2a+1,n\}$ and $\{a,a+1,2a+1,3a+1,n\}$
Authors:
Ram Krishna Pandey,
Neha Rai
Abstract:
Let $M$ be a set of positive integers. We study the maximal density $μ(M)$ of the sets of nonnegative integers $S$ whose elements do not differ by an element in $M$. In 1973, Cantor and Gordon established a formula for $μ(M)$ for $|M|\leq 2$. Since then, many researchers have worked on the problem and found several partial results in the case $|M|\geq 3$, including some results in the case where $M$ is an infinite set. In this paper, we study the maximal density problem for the families $M=\{a,a+1,2a+1,n\}$ and $M=\{a,a+1,2a+1,3a+1,n\}$, where $a$ and $n$ are positive integers. In most of the cases, we find bounds for the parameter \textit{kappa}, denoted by $κ(M)$, which serves as a lower bound for $μ(M)$. The parameter $κ(M)$ is already important because of its rich connection with problems such as the "lonely runner conjecture" in Diophantine approximations and coloring parameters such as "circular coloring" and "fractional coloring" in graph theory.
Submitted 10 April, 2022;
originally announced April 2022.
-
Packaging, containerization, and virtualization of computational omics methods: Advances, challenges, and opportunities
Authors:
Mohammed Alser,
Sharon Waymost,
Ram Ayyala,
Brendan Lawlor,
Richard J. Abdill,
Neha Rajkumar,
Nathan LaPierre,
Jaqueline Brito,
Andre M. Ribeiro-dos-Santos,
Can Firtina,
Nour Almadhoun,
Varuni Sarwal,
Eleazar Eskin,
Qiyang Hu,
Derek Strong,
Byoung-Do Kim,
Malak S. Abedalthagafi,
Onur Mutlu,
Serghei Mangul
Abstract:
Omics software tools have reshaped the landscape of modern biology and become an essential component of biomedical research. The increasing dependence of biomedical scientists on these powerful tools creates a need for easier installation and greater usability. Packaging, virtualization, and containerization are different approaches to satisfy this need by wrapping omics tools in additional software that makes the omics tools easier to install and use. Here, we systematically review practices across prominent packaging, virtualization, and containerization platforms. We outline the challenges, advantages, and limitations of each approach and some of the most widely used platforms from the perspectives of users, software developers, and system administrators. We also propose principles to make packaging, virtualization, and containerization of omics software more sustainable and robust to increase the reproducibility of biomedical and life science research.
Submitted 30 March, 2022;
originally announced March 2022.
-
Two Approaches to Survival Analysis of Open Source Python Projects
Authors:
Derek Robinson,
Keanelek Enns,
Neha Koulecar,
Manish Sihag
Abstract:
A recent study applied frequentist survival analysis methods to a subset of the Software Heritage Graph and determined which attributes of an OSS project contribute to its health. This paper serves as an exact replication of that study. In addition, Bayesian survival analysis methods were applied to the same dataset, and an additional project attribute was studied to serve as a conceptual replication. Both analyses focus on the effects of certain attributes on the survival of open-source software projects as measured by their revision activity. Methods such as the Kaplan-Meier estimator, Cox Proportional-Hazards model, and the visualization of posterior survival functions were used for each of the project attributes. The results show that projects which publish major releases, have repositories on multiple hosting services, possess a large team of developers, and make frequent revisions have a higher likelihood of survival in the long run. The findings were similar to the original study; however, a deeper look revealed quantitative inconsistencies.
Submitted 15 March, 2022;
originally announced March 2022.
-
Measuring Self-Supervised Representation Quality for Downstream Classification using Discriminative Features
Authors:
Neha Kalibhat,
Kanika Narang,
Hamed Firooz,
Maziar Sanjabi,
Soheil Feizi
Abstract:
Self-supervised learning (SSL) has shown impressive results in downstream classification tasks. However, there is limited work on understanding the failure modes of SSL models and interpreting their learned representations. In this paper, we study the representation space of state-of-the-art self-supervised models including SimCLR, SwaV, MoCo, BYOL, DINO, SimSiam, VICReg and Barlow Twins. Without the use of class label information, we discover discriminative features that correspond to unique physical attributes in images, present mostly in correctly-classified representations. Using these features, we can compress the representation space by up to 40% without significantly affecting linear classification performance. We then propose Self-Supervised Representation Quality Score (or Q-Score), an unsupervised score that can reliably predict if a given sample is likely to be mis-classified during linear evaluation, achieving AUPRC of 91.45 on ImageNet-100 and 78.78 on ImageNet-1K. Q-Score can also be used as a regularization term on pre-trained encoders to remedy low-quality representations. Fine-tuning with Q-Score regularization can boost the linear probing accuracy of SSL models by up to 5.8% on ImageNet-100 and 3.7% on ImageNet-1K compared to their baselines. Finally, using gradient heatmaps and Salient ImageNet masks, we define a metric to quantify the interpretability of each representation. We show that discriminative features are strongly correlated to core attributes and that enhancing these features through Q-Score regularization makes SSL representations more interpretable.
Submitted 12 December, 2023; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Stiefel-Whitney Classes of Representations of Some Finite Groups of Lie Type
Authors:
Neha Malik,
Steven Spallone
Abstract:
In this note we present the Stiefel-Whitney classes (SWCs) for orthogonal representations of several finite groups of Lie type, namely for $G=\text{SL}(2,q),$ $\text{SL}(3,q),$ $\text{Sp}(4,q)$, and $\text{Sp}(6,q)$, with $q$ odd. We also describe the SWCs for $G=\text{SL}(2,q)$ when $q$ is even.
Submitted 15 February, 2022;
originally announced February 2022.
-
On Real-time Image Reconstruction with Neural Networks for MRI-guided Radiotherapy
Authors:
David E. J. Waddington,
Nicholas Hindley,
Neha Koonjoo,
Christopher Chiu,
Tess Reynolds,
Paul Z. Y. Liu,
Bo Zhu,
Danyal Bhutto,
Chiara Paganelli,
Paul J. Keall,
Matthew S. Rosen
Abstract:
MRI-guidance techniques that dynamically adapt radiation beams to follow tumor motion in real-time will lead to more accurate cancer treatments and reduced collateral healthy tissue damage. The gold standard for reconstruction of undersampled MR data is compressed sensing (CS), which is computationally slow and limits the rate at which images can be available for real-time adaptation. Here, we demonstrate the use of automated transform by manifold approximation (AUTOMAP), a generalized framework that maps raw MR signal to the target image domain, to rapidly reconstruct images from undersampled radial k-space data. The AUTOMAP neural network was trained to reconstruct images from a golden-angle radial acquisition, a benchmark for motion-sensitive imaging, on lung cancer patient data and generic images from ImageNet. Model training was subsequently augmented with motion-encoded k-space data derived from videos in the YouTube-8M dataset to encourage motion-robust reconstruction. We find that AUTOMAP-reconstructed radial k-space has equivalent accuracy to CS but with much shorter processing times after initial fine-tuning on retrospectively acquired lung cancer patient data. Validation of motion-trained models with a virtual dynamic lung tumor phantom showed that the generalized motion properties learned from YouTube lead to improved target tracking accuracy. Our work shows that AUTOMAP can achieve real-time, accurate reconstruction of radial data. These findings imply that neural-network-based reconstruction is potentially superior to existing approaches for real-time image guidance applications.
Submitted 18 May, 2022; v1 submitted 9 February, 2022;
originally announced February 2022.
-
The crowding effect on the melting of short DNA: Comparison with experiments
Authors:
Neha Mathur,
Amar Singh,
Navin Singh
Abstract:
We study the effect of crowders on the melting profile of homogeneous and heterogeneous DNA molecules. We determine the melting profile of short DNA molecules and compare our findings with the experiments. We consider some random distribution of crowders along the chain, and by finding the best match with the experiments, we attempt to identify the location of crowders in the experimental findings of Ghosh \cite{Ghosh_PNAS_2020}. We also study the melting of homogeneous DNA molecules of different lengths (25, 50, 75) in the presence of only one crowder in the chain. By varying the location of the crowder from one end to the other, we find that the melting temperature is susceptible to the location of the crowder at the ends. At the same time, there is minimal effect on the melting temperature due to the location of the crowder. {\it In vivo}, the strength of crowders may vary along the chain. We study the melting of a long heterogeneous chain in the presence of five crowders of different strengths. We find that there is a significant variation in the melting process of DNA in the presence of crowders of variable strength.
Submitted 9 February, 2022;
originally announced February 2022.
-
Understanding the bias-variance tradeoff of Bregman divergences
Authors:
Ben Adlam,
Neha Gupta,
Zelda Mariet,
Jamie Smith
Abstract:
This paper builds upon the work of Pfau (2013), which generalized the bias-variance tradeoff to any Bregman divergence loss function. Pfau (2013) showed that for Bregman divergences, the bias and variances are defined with respect to a central label, defined as the mean of the label variable, and a central prediction, of a more complex form. We show that, similarly to the label, the central prediction can be interpreted as the mean of a random variable, where the mean operates in a dual space defined by the loss function itself. Viewing the bias-variance tradeoff through operations taken in dual space, we subsequently derive several results of interest. In particular, (a) the variance terms satisfy a generalized law of total variance; (b) if a source of randomness cannot be controlled, its contribution to the bias and variance has a closed form; (c) there exist natural ensembling operations in the label and prediction spaces which reduce the variance and do not affect the bias.
Submitted 9 February, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Structural Analysis of DNA molecule in a confined shell
Authors:
Arghya Maity,
Neha Mathur,
Petra Imhof,
Navin Singh
Abstract:
Recent advances in operating and manipulating DNA have provided unique experimental possibilities in many fields of DNA research, especially in gene therapy. Researchers have deployed many techniques, experimental and theoretical, to study the changes in DNA structure due to external perturbation. It is crucial to understand the structural and dynamical changes in DNA molecules in a confined state in order to understand and control the self-assembly of DNA confined in a chamber or nano-channel for various applications. In the current manuscript, which extends our previous research, we study the effect of confinement on the thermal stability and the structural properties of duplex DNA. For our study we have considered a 1 BNA chain that is confined in a cylindrical geometry. The objective of this manuscript is to determine how the geometry of the confinement affects the opening and other structural parameters of the DNA molecule. We have used a statistical model (the PBD model) and molecular dynamics simulations for our purpose.
Submitted 4 February, 2022;
originally announced February 2022.
-
Fusions of the generalized Hamming scheme on a strongly-regular graph
Authors:
Allen Herman,
Neha Joshi,
Karen Meagher
Abstract:
In this paper we show that for any fusion $\mathcal{B}$ of an association scheme $\mathcal{A}$, the generalized Hamming scheme $H(n,\mathcal{B})$ is a nontrivial fusion of $H(n,\mathcal{A})$. We analyze the case where $\mathcal{A}$ is the association scheme on a strongly-regular graph, and determine the parameters of all strongly-regular graphs for which the generalized Hamming scheme, $H(2,\mathcal{A})$, has extra fusions, in addition to the one arising from the trivial fusion of $\mathcal{A}$.
Submitted 4 August, 2022; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health
Authors:
Kai Wang,
Shresth Verma,
Aditya Mate,
Sanket Shah,
Aparna Taneja,
Neha Madhiwalla,
Aparna Hegde,
Milind Tambe
Abstract:
This paper studies restless multi-armed bandit (RMAB) problems with unknown arm transition dynamics but with known correlated arm features. The goal is to learn a model to predict transition dynamics given features, where the Whittle index policy solves the RMAB problems using predicted transitions. However, prior works often learn the model by maximizing the predictive accuracy instead of final RMAB solution quality, causing a mismatch between training and evaluation objectives. To address this shortcoming, we propose a novel approach for decision-focused learning in RMAB that directly trains the predictive model to maximize the Whittle index solution quality. We present three key contributions: (i) we establish differentiability of the Whittle index policy to support decision-focused learning; (ii) we significantly improve the scalability of decision-focused learning approaches in sequential problems, specifically RMAB problems; (iii) we apply our algorithm to a previously collected dataset of maternal and child health to demonstrate its performance. Indeed, our algorithm is the first for decision-focused learning in RMAB that scales to real-world problem sizes.
Submitted 13 August, 2023; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Research on Wearable Technologies for Learning: A Systematic Review
Authors:
Sharon Lynn Chu,
Brittany M. Garcia,
Neha Rani
Abstract:
A good amount of research has explored the use of wearables for educational or learning purposes. We have now reached a point where much literature can be found on that topic, but few attempts have been made to make sense of that literature from a holistic perspective. This paper presents a systematic review of the literature on wearables for learning. Literature was sourced from conferences and journals pertaining to technology and education, and through an ad hoc search. Our review focuses on identifying the ways that wearables have been used to support learning and provides perspectives on that issue from a historical dimension, and with regard to the types of wearables used, the populations targeted, and the settings addressed. Seven different ways in which wearables have been used to support learning were identified. We propose a framework identifying five main components that have been addressed in existing research on how wearables can support learning and present our interpretations of unaddressed research directions based on our review results.
Submitted 27 January, 2022;
originally announced January 2022.