-
EchoSim4D: A Proof-of-Concept Gamified XR Echocardiography Training Simulator for Neonates using 4D Ultrasound Volume
Authors:
Deepthy Rose Jose,
Venkataseshan Sundaram,
M Manivannan
Abstract:
Neonatal echocardiography is vital for early detection of heart anomalies in newborns, enabling timely, non-invasive interventions. 4D ultrasound, which adds the dimension of time to 3D imaging, enhances diagnostic capabilities by visualizing real-time heart dynamics. However, training for 4D neonatal echocardiography is limited by the lack of simulators that support 4D ultrasound volume visualization within gamified environments. This paper introduces EchoSim4D, an XR-based simulator leveraging a novel pipeline for visualizing 4D volume data in Unity, incorporating real-time volume reconstruction and a preloaded version optimized for low-end systems. EchoSim4D integrates a sensor-equipped manikin and a custom 3D-printed transducer with a 6-DOF sensor, replicating the precise probe maneuvers necessary for neonatal echocardiography. In a validation study with postgraduate medical students (0-5 years of experience), supervised by a domain expert, EchoSim4D demonstrated high visual fidelity and training efficacy. Findings suggest that 4D visualization techniques hold significant potential for advancing medical training in neonatal echocardiography.
Submitted 9 December, 2024;
originally announced December 2024.
-
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Authors:
Woojeong Jin,
Subhabrata Mukherjee,
Yu Cheng,
Yelong Shen,
Weizhu Chen,
Ahmed Hassan Awadallah,
Damien Jose,
Xiang Ren
Abstract:
Generalization to unseen tasks is an important ability for few-shot learners to achieve better zero-/few-shot performance on diverse tasks. However, such generalization to vision-language tasks including grounding and generation tasks has been under-explored; existing few-shot VL models struggle to handle tasks that involve object grounding and multiple images such as visual commonsense reasoning or NLVR2. In this paper, we introduce GRILL, GRounded vIsion Language aLigning, a novel VL model that can be generalized to diverse tasks including visual question answering, captioning, and grounding tasks with no or very few training instances. Specifically, GRILL learns object grounding and localization by exploiting object-text alignments, which enables it to transfer to grounding tasks in a zero-/few-shot fashion. We evaluate our model on various zero-/few-shot VL tasks and show that it consistently surpasses the state-of-the-art few-shot methods.
Submitted 23 May, 2023;
originally announced May 2023.
-
Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners
Authors:
Shashank Gupta,
Subhabrata Mukherjee,
Krishan Subudhi,
Eduardo Gonzalez,
Damien Jose,
Ahmed H. Awadallah,
Jianfeng Gao
Abstract:
Traditional multi-task learning (MTL) methods rely on dense networks that share the same set of weights across several different tasks. This often creates interference, where two or more tasks compete to pull model parameters in different directions. In this work, we study whether sparsely activated Mixture-of-Experts (MoE) improve multi-task learning by specializing some weights for learning shared representations and using the others for learning task-specific information. To this end, we devise task-aware gating functions to route examples from different tasks to specialized experts which share subsets of network weights conditioned on the task. This results in a sparsely activated multi-task model with a large number of parameters, but with the same computational cost as that of a dense model. We demonstrate that such sparse networks improve multi-task learning along three key dimensions: (i) transfer to low-resource tasks from related tasks in the training mixture; (ii) sample-efficient generalization to tasks not seen during training by making use of task-aware routing from seen related tasks; (iii) robustness to the addition of unrelated tasks by avoiding catastrophic forgetting of existing tasks.
Submitted 15 April, 2022;
originally announced April 2022.
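The task-aware routing idea described in the abstract above can be illustrated with a toy sketch. This is not the authors' implementation: the class `TaskAwareMoE`, its parameters, the top-1 routing rule, and the per-dimension "experts" are all hypothetical choices made for illustration, and the sketch does not model the weight sharing between experts that the abstract mentions. It only shows how a gate conditioned on a task id can select a single expert per example, so that per-example compute stays at the cost of one expert.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class TaskAwareMoE:
    """Toy top-1 mixture-of-experts layer with a task-conditioned gate (illustrative only)."""

    def __init__(self, num_tasks, num_experts, dim, seed=0):
        rng = random.Random(seed)
        # One gating vector per (task, expert) pair, so routing depends on the task id.
        self.gate = [[[rng.gauss(0, 1) for _ in range(dim)]
                      for _ in range(num_experts)]
                     for _ in range(num_tasks)]
        # Each "expert" is a hypothetical per-dimension scaling standing in for a feed-forward block.
        self.experts = [[rng.gauss(1, 0.1) for _ in range(dim)]
                        for _ in range(num_experts)]

    def route(self, task_id, x):
        """Return (index of the selected expert, its gate probability)."""
        scores = [sum(w * v for w, v in zip(g, x)) for g in self.gate[task_id]]
        probs = softmax(scores)
        best = max(range(len(probs)), key=probs.__getitem__)
        return best, probs[best]

    def forward(self, task_id, x):
        expert_id, weight = self.route(task_id, x)
        # Only the selected expert is evaluated; the others are skipped entirely.
        y = [weight * s * v for s, v in zip(self.experts[expert_id], x)]
        return y, expert_id


if __name__ == "__main__":
    layer = TaskAwareMoE(num_tasks=2, num_experts=4, dim=3)
    for task in (0, 1):
        _, expert = layer.forward(task, [1.0, 2.0, 3.0])
        print(f"task {task} routed to expert {expert}")
```

Because the gate parameters are indexed by task, the same input can be sent to different experts under different task ids, which is the property the abstract attributes to task-aware gating.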
-
Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations
Authors:
Ji Xin,
Chenyan Xiong,
Ashwin Srinivasan,
Ankita Sharma,
Damien Jose,
Paul N. Bennett
Abstract:
Dense retrieval (DR) methods conduct text retrieval by first encoding texts in the embedding space and then matching them by nearest neighbor search. This requires strong locality properties from the representation space, i.e., the close allocation of each small group of relevant texts, which are hard to generalize to domains without sufficient training data. In this paper, we aim to improve the generalization ability of DR models from source training domains with rich supervision signals to target domains without any relevance labels, in the zero-shot setting. To achieve that, we propose Momentum adversarial Domain Invariant Representation learning (MoDIR), which introduces a momentum method in the DR training process to train a domain classifier distinguishing source versus target, and then adversarially updates the DR encoder to learn domain invariant representations. Our experiments show that MoDIR robustly outperforms its baselines on 10+ ranking datasets from the BEIR benchmark in the zero-shot setup, with more than 10% relative gains on datasets with enough sensitivity for DR models' evaluation. Source code of this paper will be released.
Submitted 14 October, 2021;
originally announced October 2021.
-
Differential Transmission Schemes for Generalized Spatial Modulation
Authors:
Deepak Jose,
Sameer S. M
Abstract:
Differential modulation schemes are highly relevant for receivers with power and processing limitations, as these schemes dispense with the need for knowledge of channel coefficients for symbol detection. Spatial modulation (SM) is a scheme used in multi-antenna transmission scenarios where the data is transmitted in the amplitude, phase and spatial domains through selected antennas. In the coherent domain, generalized SM (GSM) employs multiple antennas in combination during every time slot to enhance the spectral efficiency (SE). In this paper, we propose two differential schemes which activate two or more antennas at a time to transmit the modulated symbol. These schemes achieve higher SE using fewer antennas and lower-order modulation schemes, instead of the larger number of antennas required for conventional SM schemes based on differential modulation. Simulation studies reveal that the proposed schemes have better bit error rate performance than traditional differential SM schemes. We also derive the analytical union bound for the proposed schemes, which is satisfied from medium to high signal-to-noise ratio (SNR) ranges.
Submitted 7 October, 2021; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Few-Shot Named Entity Recognition: A Comprehensive Study
Authors:
Jiaxin Huang,
Chunyuan Li,
Krishan Subudhi,
Damien Jose,
Shobana Balakrishnan,
Weizhu Chen,
Baolin Peng,
Jianfeng Gao,
Jiawei Han
Abstract:
This paper presents a comprehensive study to efficiently build named entity recognition (NER) systems when only a small amount of in-domain labeled data is available. Based upon recent Transformer-based self-supervised pre-trained language models (PLMs), we investigate three orthogonal schemes to improve the model generalization ability for few-shot settings: (1) meta-learning to construct prototypes for different entity types, (2) supervised pre-training on noisy web data to extract entity-related generic representations and (3) self-training to leverage unlabeled in-domain data. Different combinations of these schemes are also considered. We perform extensive empirical comparisons on 10 public NER datasets with various proportions of labeled data, suggesting useful insights for future research. Our experiments show that (i) in the few-shot learning setting, the proposed NER schemes significantly improve or outperform the commonly used baseline, a PLM-based linear classifier fine-tuned on domain labels; (ii) we create new state-of-the-art results on both few-shot and training-free settings compared with existing methods. We will release our code and pre-trained models for reproducible research.
Submitted 29 December, 2020;
originally announced December 2020.
-
Optimizing Query Evaluations using Reinforcement Learning for Web Search
Authors:
Corby Rosset,
Damien Jose,
Gargi Ghosh,
Bhaskar Mitra,
Saurabh Tiwary
Abstract:
In web search, typically a candidate generation step selects a small set of documents (from collections containing as many as billions of web pages) that are subsequently ranked and pruned before being presented to the user. In Bing, the candidate generation involves scanning the index using statically designed match plans that prescribe sequences of different match criteria and stopping conditions. In this work, we pose match planning as a reinforcement learning task and observe up to 20% reduction in index blocks accessed, with small or no degradation in the quality of the candidate sets.
Submitted 18 August, 2018; v1 submitted 12 April, 2018;
originally announced April 2018.
-
Off-policy evaluation for slate recommendation
Authors:
Adith Swaminathan,
Akshay Krishnamurthy,
Alekh Agarwal,
Miroslav Dudík,
John Langford,
Damien Jose,
Imed Zitouni
Abstract:
This paper studies the evaluation of policies that recommend an ordered set of items (e.g., a ranking) based on some context, a common scenario in web search, ads, and recommendation. We build on techniques from combinatorial bandits to introduce a new practical estimator that uses logged data to estimate a policy's performance. A thorough empirical evaluation on real-world data reveals that our estimator is accurate in a variety of settings, including as a subroutine in a learning-to-rank task, where it achieves competitive performance. We derive conditions under which our estimator is unbiased (these conditions are weaker than prior heuristics for slate evaluation) and experimentally demonstrate a smaller bias than parametric approaches, even when these conditions are violated. Finally, our theory and experiments also show exponential savings in the amount of required data compared with general unbiased estimators.
Submitted 6 November, 2017; v1 submitted 16 May, 2016;
originally announced May 2016.
-
Toward irreversibility with a finite bath of oscillators
Authors:
Artur Nogueira de São José,
Patrícia Mascarenhas Dias,
Arthur Rodrigo Bosco de Magalhães,
José Geraldo Peixoto de Faria
Abstract:
We investigate the routes by which a bath composed of a finite number of oscillators at zero temperature approaches the induction of dissipation when it nears the usual limit of dense spectrum spread in an infinite interval. It is shown that, when this limit is taken, different distributions of environment frequencies can lead to the same irreversible evolution. However, when we move away from it, the dynamics departs from irreversibility in qualitatively different manners.
Submitted 9 August, 2012;
originally announced August 2012.
-
Engineering superpositions of displaced number states of a trapped ion
Authors:
Marcelo A. Marchiolli,
Wagner D. Jose
Abstract:
We present a protocol that permits the generation of a subtle superposition with 2^(l+1) displaced number states on a circle in phase space as target state for the center-of-mass motion of a trapped ion. Through a sequence of 'l' cycles involving the application of laser pulses and no-fluorescence measurements, explicit expressions for the total duration of laser pulses employed in the sequence and the probability of getting the ion in the upper electronic state during the 'l' cycles are obtained and analyzed in detail. Furthermore, assuming that the effective relaxation process of a trapped ion can be described in the framework of the standard master equation for the damped harmonic oscillator, we investigate the degradation of the quantum interference effects inherent to superpositions via the Wigner function.
Submitted 5 April, 2004;
originally announced April 2004.
-
Generating Fock states and two-Fock states superposition from circular states, in a trapped ion
Authors:
Salomon S. Mizrahi,
Wagner D. Jose
Abstract:
We propose three schemes to engineer 2^M and M+1 circular states for the motion of the center of mass of a trapped ion, M being the number of laser pulses. Since the ion is subjected to several laser pulses, we analyze the necessary duration of each one for generating the circular states, and from these, the Fock states and superposition of two-Fock states. We also calculate the probability for obtaining the required states.
Submitted 25 October, 2002;
originally announced October 2002.
-
Engineering arbitrary motional ionic state through realistic intensity-fluctuating laser pulses
Authors:
R. M. Serra,
P. B. Ramos,
N. G. de Almeida,
W. D. Jose,
M. H. Y. Moussa
Abstract:
We present a reliable scheme for engineering arbitrary motional ionic states through an adaptation of the projection synthesis technique for trapped-ion phenomena. Starting from a prepared coherent motional state, the Wigner function of the desired state is thus sculpted from a Gaussian distribution. The engineering process has also been developed to take into account the errors arising from intensity fluctuations in the exciting-laser pulses required for manipulating the electronic and vibrational states of the trapped ion. To this end, a recently developed phenomenological-operator approach that allows for the influence of noise will be applied. This approach furnishes a straightforward technique to estimate the fidelity of the prepared state in the presence of errors, precluding the usual extensive ab initio calculations. The results obtained here by the phenomenological approach, to account for the effects of noise in our engineering scheme, can be directly applied to any other process involving trapped-ion phenomena.
Submitted 19 April, 2001;
originally announced April 2001.