-
HEAR: Holistic Evaluation of Audio Representations
Authors:
Joseph Turian,
Jordie Shier,
Humair Raj Khan,
Bhiksha Raj,
Björn W. Schuller,
Christian J. Steinmetz,
Colin Malloy,
George Tzanetakis,
Gissel Velarde,
Kirk McNally,
Max Henry,
Nicolas Pinto,
Camille Noufi,
Christian Clough,
Dorien Herremans,
Eduardo Fonseca,
Jesse Engel,
Justin Salamon,
Philippe Esling,
Pranay Manocha,
Shinji Watanabe,
Zeyu Jin,
Yonatan Bisk
Abstract:
What audio embedding approach generalizes best to a wide range of downstream tasks across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. HEAR evaluates audio representations using a benchmark suite across a variety of domains, in…
▽ More
What audio embedding approach generalizes best to a wide range of downstream tasks across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. HEAR evaluates audio representations using a benchmark suite across a variety of domains, including speech, environmental sound, and music. HEAR was launched as a NeurIPS 2021 shared challenge. In the spirit of shared exchange, each participant submitted an audio embedding model following a common API that is general-purpose, open-source, and freely available to use. Twenty-nine models by thirteen external teams were evaluated on nineteen diverse downstream tasks derived from sixteen datasets. Open evaluation code, submitted models and datasets are key contributions, enabling comprehensive and reproducible evaluation, as well as previously impossible longitudinal studies. It still remains an open question whether one single general-purpose audio representation can perform as holistically as the human ear.
△ Less
Submitted 29 May, 2022; v1 submitted 6 March, 2022;
originally announced March 2022.
-
Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation
Authors:
Humair Raj Khan,
Deepak Gupta,
Asif Ekbal
Abstract:
Pre-trained language-vision models have shown remarkable performance on the visual question answering (VQA) task. However, most pre-trained models are trained by only considering monolingual learning, especially the resource-rich language like English. Training such models for multilingual setups demand high computing resources and multilingual language-vision dataset which hinders their applicati…
▽ More
Pre-trained language-vision models have shown remarkable performance on the visual question answering (VQA) task. However, most pre-trained models are trained by only considering monolingual learning, especially the resource-rich language like English. Training such models for multilingual setups demand high computing resources and multilingual language-vision dataset which hinders their application in practice. To alleviate these challenges, we propose a knowledge distillation approach to extend an English language-vision model (teacher) into an equally effective multilingual and code-mixed model (student). Unlike the existing knowledge distillation methods, which only use the output from the last layer of the teacher network for distillation, our student model learns and imitates the teacher from multiple intermediate layers (language and vision encoders) with appropriately designed distillation objectives for incremental knowledge extraction. We also create the large-scale multilingual and code-mixed VQA dataset in eleven different language setups considering the multiple Indian and European languages. Experimental results and in-depth analysis show the effectiveness of the proposed VQA model over the pre-trained language-vision models on eleven diverse language setups.
△ Less
Submitted 9 September, 2021;
originally announced September 2021.
-
Analysis of Unobserved Heterogeneity via Accelerated Failure Time Models Under Bayesian and Classical Approaches
Authors:
Shaila Sharmin,
Md Hasinur Rahaman Khan
Abstract:
This paper deals with unobserved heterogeneity in the survival dataset through Accelerated Failure Time (AFT) models under both frameworks--Bayesian and classical. The Bayesian approach of dealing with unobserved heterogeneity has recently been discussed in Vallejos and Steel (2017), where mixture models are used to diminish the effect that anomalous observations or some kinds of covariates which…
▽ More
This paper deals with unobserved heterogeneity in the survival dataset through Accelerated Failure Time (AFT) models under both frameworks--Bayesian and classical. The Bayesian approach of dealing with unobserved heterogeneity has recently been discussed in Vallejos and Steel (2017), where mixture models are used to diminish the effect that anomalous observations or some kinds of covariates which are not included in the survival models. The frailty models also deal with this kind of unobserved variability under classical framework and have been used by practitioners as alternative to Bayesian. We discussed both approaches of dealing with unobserved heterogeneity with their pros and cons when a family of rate mixtures of Weibul distributions and a set of random effect distributions were used under Bayesian and classical approaches respectively. We investigated how much the classical estimates differ with the Bayesian estimates, although the paradigm of estimation methods are different. Two real data examples--a bone marrow transplants data and a kidney infection data have been used to illustrate the performances of the methods. In both situations, it is observed that the use of an Inverse-Gaussian mixture distribution outperforms the other possibilities. It is also noticed that the estimates of the frailty models are generally somewhat underestimated by comparing with the estimates of their counterpart.
△ Less
Submitted 8 September, 2017;
originally announced September 2017.
-
Stability Selection for Lasso, Ridge and Elastic Net Implemented with AFT Models
Authors:
Md Hasinur Rahaman Khan,
Anamika Bhadra,
Tamanna Howlader
Abstract:
The instability in the selection of models is a major concern with data sets containing a large number of covariates. We focus on stability selection which is used as a technique to improve variable selection performance for a range of selection methods, based on aggregating the results of applying a selection procedure to sub-samples of the data where the observations are subject to right censori…
▽ More
The instability in the selection of models is a major concern with data sets containing a large number of covariates. We focus on stability selection which is used as a technique to improve variable selection performance for a range of selection methods, based on aggregating the results of applying a selection procedure to sub-samples of the data where the observations are subject to right censoring. The accelerated failure time (AFT) models have proved useful in many contexts including the heavy censoring (as for example in cancer survival) and the high dimensionality (as for example in micro-array data). We implement the stability selection approach using three variable selection techniques--Lasso, ridge regression, and elastic net applied to censored data using AFT models. We compare the performances of these regularized techniques with and without stability selection approaches with simulation studies and a breast cancer data analysis. The results suggest that stability selection gives always stable scenario about the selection of variables and that as the dimension of data increases the performance of methods with stability selection also improves compared to methods without stability selection irrespective of the collinearity between the covariates.
△ Less
Submitted 25 April, 2016;
originally announced April 2016.
-
Improved Likelihood Estimation for the Generalized Extreme Value and the Inverse Gaussian Lifetime Distributions
Authors:
Md. Mazharul Islam,
Md Hasinur Rahaman Khan
Abstract:
In presence of nuisance parameters, profile likelihood inference is often unreliable and biased, particularly in small sample scenario. Over past decades several adjustments have been proposed to modify profile likelihood function in literature including a modified profile likelihood estimation technique introduced in Barndorff--Nielsen. In this study, adjustment of profile likelihood function of…
▽ More
In presence of nuisance parameters, profile likelihood inference is often unreliable and biased, particularly in small sample scenario. Over past decades several adjustments have been proposed to modify profile likelihood function in literature including a modified profile likelihood estimation technique introduced in Barndorff--Nielsen. In this study, adjustment of profile likelihood function of parameter of interest in presence of nuisance parameter is investigated. We particularly focuss to extend the Barndorff--Nielsen's technique on Inverse Gaussian distribution for estimating its dispersion parameter and on generalized extreme value (GEV) distribution for estimating its shape parameter. The accelerated failure time models are used for lifetimes having GEV distribution and the Inverse Gaussian distribution is used for lifetime distribution. Monte-Carlo simulation studies are conducted to demonstrate the performances of both approaches. Simulation results suggest the superiority of the modified profile likelihood estimates over the profile likelihood estimates for the parameters of interest. Particularly, it is found that the modifications can improve the overall performance of the estimators through reducing their biases and standard errors.
△ Less
Submitted 28 March, 2016;
originally announced March 2016.
-
The Physics of the B Factories
Authors:
A. J. Bevan,
B. Golob,
Th. Mannel,
S. Prell,
B. D. Yabsley,
K. Abe,
H. Aihara,
F. Anulli,
N. Arnaud,
T. Aushev,
M. Beneke,
J. Beringer,
F. Bianchi,
I. I. Bigi,
M. Bona,
N. Brambilla,
J. B rodzicka,
P. Chang,
M. J. Charles,
C. H. Cheng,
H. -Y. Cheng,
R. Chistov,
P. Colangelo,
J. P. Coleman,
A. Drutskoy
, et al. (2009 additional authors not shown)
Abstract:
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary…
▽ More
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary version of the Physics of the B Factories book. This uses the notation alpha, beta, gamma for the angles of the Unitarity Triangle. The nominal version uses the notation phi_1, phi_2 and phi_3. Please cite this work as Eur. Phys. J. C74 (2014) 3026.
△ Less
Submitted 31 October, 2015; v1 submitted 24 June, 2014;
originally announced June 2014.
-
Robust Bias Estimation for Kaplan-Meier Survival Estimator with Jackknifing
Authors:
Md Hasinur Rahaman Khan,
J. Ewart H. Shaw
Abstract:
For studying or reducing the bias of functionals of the Kaplan-Meier survival estimator, the jackknifing approach of Stute and Wang (1994) is natural. We have studied the behavior of the jackknife estimate of bias under different configurations of the censoring level, sample size, and the censoring and survival time distributions. The empirical research reveals some new findings about robust calcu…
▽ More
For studying or reducing the bias of functionals of the Kaplan-Meier survival estimator, the jackknifing approach of Stute and Wang (1994) is natural. We have studied the behavior of the jackknife estimate of bias under different configurations of the censoring level, sample size, and the censoring and survival time distributions. The empirical research reveals some new findings about robust calculation of the bias, particularly for higher censoring levels. We have extended their jackknifing approach to cover the case where the largest observation is censored, using the imputation methods for the largest observations proposed in Khan and Shaw (2013b). This modification to the existing formula reduces the number of conditions for creating jackknife bias estimates to one from the original two, and also avoids the problem that the Kaplan--Meier estimator can be badly underestimated by the existing jackknife formula.
△ Less
Submitted 14 December, 2013;
originally announced December 2013.
-
On Dealing with Censored Largest Observations under Weighted Least Squares
Authors:
Md Hasinur Rahaman Khan,
J. Ewart H. Shaw
Abstract:
When observations are subject to right censoring, weighted least squares with appropriate weights (to adjust for censoring) is sometimes used for parameter estimation. With Stute's weighted least squares method, when the largest observation is censored ($Y_{(n)}^+$), it is natural to apply the redistribution to the right algorithm of Efron (1967). However, Efron's redistribution algorithm can lead…
▽ More
When observations are subject to right censoring, weighted least squares with appropriate weights (to adjust for censoring) is sometimes used for parameter estimation. With Stute's weighted least squares method, when the largest observation is censored ($Y_{(n)}^+$), it is natural to apply the redistribution to the right algorithm of Efron (1967). However, Efron's redistribution algorithm can lead to bias and inefficiency in estimation. This study explains the issues clearly and proposes some alternative ways of treating $Y_{(n)}^+$. The first four proposed approaches are based on the well known Buckley--James (1979) method of imputation with the Efron's tail correction and the last approach is indirectly based on a general mean imputation technique in literature. All the new schemes use penalized weighted least squares optimized by quadratic programming implemented with the accelerated failure time models. Furthermore, two novel additional imputation approaches are proposed to impute the tail tied censored observations that are often found in survival analysis with heavy censoring. Several simulation studies and real data analysis demonstrated that the proposed approaches generally outperform Efron's redistribution approach and lead to considerably smaller mean squared error and bias estimates.
△ Less
Submitted 7 November, 2014; v1 submitted 9 December, 2013;
originally announced December 2013.
-
Variable Selection for Survival Data with A Class of Adaptive Elastic Net Techniques
Authors:
Md Hasinur Rahaman Khan,
J. Ewart H. Shaw
Abstract:
The accelerated failure time (AFT) models have proved useful in many contexts, though heavy censoring (as for example in cancer survival) and high dimensionality (as for example in microarray data) cause difficulties for model fitting and model selection. We propose new approaches to variable selection for censored data, based on AFT models optimized using regularized weighted least squares. The r…
▽ More
The accelerated failure time (AFT) models have proved useful in many contexts, though heavy censoring (as for example in cancer survival) and high dimensionality (as for example in microarray data) cause difficulties for model fitting and model selection. We propose new approaches to variable selection for censored data, based on AFT models optimized using regularized weighted least squares. The regularized technique uses a mixture of L1 and L2 norm penalties under two proposed elastic net type approaches. One is the the adaptive elastic net and the other is weighted elastic net. The approaches extend the original approaches proposed by Ghosh (2007), and Hong and Zhang (2010) respectively. We also extend the two proposed approaches by adding censoring observations as constraints into their model optimization frameworks. The approaches are evaluated on microarray and by simulation. We compare the performance of these approaches with six other variable selection techniques--three are generally used for censored data and the other three are correlation-based greedy methods used for high-dimensional data.
△ Less
Submitted 7 December, 2013;
originally announced December 2013.
-
The monitoring system for the aerogel Cherenkov counter of the BELLE detector
Authors:
M. H. R. Khan,
A. Murakami,
T. Sumiyoshi,
T. Kuniya,
I. Adachi,
R. Enomoto,
H. Hattori,
T. Iijima,
K. Kaneda,
R. Kawabata,
T. Ooba,
R. Suda,
K. Suzuki,
M. Watanabe
Abstract:
We report on a design and performances of a monitoring system developed for the aerogel Cherenkov counters (ACC) of the BELLE detector. The system consists of blue LEDs, a diffuser box, and optical distributors which distribute the LED light to the ACC modules. The employed LED (NSPB series) has been observed to have high reliability on the long term stability and the temprature dependence. The…
▽ More
We report on a design and performances of a monitoring system developed for the aerogel Cherenkov counters (ACC) of the BELLE detector. The system consists of blue LEDs, a diffuser box, and optical distributors which distribute the LED light to the ACC modules. The employed LED (NSPB series) has been observed to have high reliability on the long term stability and the temprature dependence. The diffuser box is employed to reduce the intrinsic non-uniformity of the LED light intensity. The overall performances of the present monitoring system on uniformity and intensity of the light output have been found to satisfy all the requirements for the monitoring.
△ Less
Submitted 27 March, 1998;
originally announced March 1998.