Skip to main content

Showing 1–6 of 6 results for author: Claesen, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:1504.07389  [pdf, other

    stat.ML cs.IR

    Building Classifiers to Predict the Start of Glucose-Lowering Pharmacotherapy Using Belgian Health Expenditure Data

    Authors: Marc Claesen, Frank De Smet, Pieter Gillard, Chantal Mathieu, Bart De Moor

    Abstract: Early diagnosis is important for type 2 diabetes (T2D) to improve patient prognosis, prevent complications and reduce long-term treatment costs. We present a novel risk profiling approach based exclusively on health expenditure data that is available to Belgian mutual health insurers. We used expenditure data related to drug purchases and medical provisions to construct models that predict whether… ▽ More

    Submitted 28 April, 2015; originally announced April 2015.

    Comments: 23 pages, 5 figures, submitted to JMLR special issue on Learning from Electronic Health Data

    ACM Class: I.5.4; J.3

  2. arXiv:1504.06837  [pdf, other

    stat.ML cs.IR cs.LG

    Assessing binary classifiers using only positive and unlabeled data

    Authors: Marc Claesen, Jesse Davis, Frank De Smet, Bart De Moor

    Abstract: Assessing the performance of a learned model is a crucial part of machine learning. However, in some domains only positive and unlabeled examples are available, which prohibits the use of most standard evaluation metrics. We propose an approach to estimate any metric based on contingency tables, including ROC and PR curves, using only positive and unlabeled data. Estimating these performance metri… ▽ More

    Submitted 30 December, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: 14 pages, 8 figures

    ACM Class: I.5.2

  3. arXiv:1502.02127  [pdf, ps, other

    cs.LG stat.ML

    Hyperparameter Search in Machine Learning

    Authors: Marc Claesen, Bart De Moor

    Abstract: We introduce the hyperparameter search problem in the field of machine learning and discuss its main challenges from an optimization perspective. Machine learning methods attempt to build models that capture some element of interest based on given data. Most common learning algorithms feature a set of hyperparameters that must be determined before training commences. The choice of hyperparameters… ▽ More

    Submitted 6 April, 2015; v1 submitted 7 February, 2015; originally announced February 2015.

    Comments: 5 pages, accepted for MIC 2015: The XI Metaheuristics International Conference in Agadir, Morocco

    ACM Class: G.1.6; I.2.6; I.2.8; I.5

  4. arXiv:1403.0745  [pdf, ps, other

    stat.ML cs.LG

    EnsembleSVM: A Library for Ensemble Learning Using Support Vector Machines

    Authors: Marc Claesen, Frank De Smet, Johan Suykens, Bart De Moor

    Abstract: EnsembleSVM is a free software package containing efficient routines to perform ensemble learning with support vector machine (SVM) base models. It currently offers ensemble methods based on binary SVM models. Our implementation avoids duplicate storage and evaluation of support vectors which are shared between constituent models. Experimental results show that using ensemble approaches can drasti… ▽ More

    Submitted 4 March, 2014; originally announced March 2014.

    Comments: 5 pages, 1 table

    ACM Class: G.3; I.2.6; I.5.1

    Journal ref: Journal of Machine Learning Research. 15 (2014) 141-145

  5. arXiv:1403.0736  [pdf, ps, other

    stat.ML cs.LG

    Fast Prediction with SVM Models Containing RBF Kernels

    Authors: Marc Claesen, Frank De Smet, Johan A. K. Suykens, Bart De Moor

    Abstract: We present an approximation scheme for support vector machine models that use an RBF kernel. A second-order Maclaurin series approximation is used for exponentials of inner products between support vectors and test instances. The approximation is applicable to all kernel methods featuring sums of kernel evaluations and makes no assumptions regarding data normalization. The prediction speed of appr… ▽ More

    Submitted 3 October, 2014; v1 submitted 4 March, 2014; originally announced March 2014.

    Comments: 9 pages, 1 figure, 3 tables

    ACM Class: G.3; I.2.6; I.5.1

  6. A Robust Ensemble Approach to Learn From Positive and Unlabeled Data Using SVM Base Models

    Authors: Marc Claesen, Frank De Smet, Johan A. K. Suykens, Bart De Moor

    Abstract: We present a novel approach to learn binary classifiers when only positive and unlabeled instances are available (PU learning). This problem is routinely cast as a supervised task with label noise in the negative set. We use an ensemble of SVM models trained on bootstrap resamples of the training data for increased robustness against label noise. The approach can be considered in a bagging framewo… ▽ More

    Submitted 21 October, 2014; v1 submitted 13 February, 2014; originally announced February 2014.

    Comments: 34 pages, 6 figures, 6 tables. Accepted for publication in Neurocomputing: Special Issue on Advances in Learning with Label Noise

    ACM Class: G.3; I.2.6; I.5.1