Skip to main content

Showing 1–6 of 6 results for author: Villa-Vialaneix, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:1606.00614  [pdf, other

    math.ST stat.ME

    Interpretable sparse SIR for functional data

    Authors: Victor Picheny, Rémi Servien, Nathalie Villa-Vialaneix

    Abstract: This work focuses on the issue of variable selection in functional regression. Unlike most work in this framework, our approach does not select isolated points in the definition domain of the predictors, nor does it rely on the expansion of the predictors in a given functional basis. It provides an approach to select full intervals made of consecutive points. This feature improves the interpretabi… ▽ More

    Submitted 2 March, 2018; v1 submitted 2 June, 2016; originally announced June 2016.

  2. arXiv:1511.08327  [pdf, other

    stat.ML cs.LG math.ST

    Random Forests for Big Data

    Authors: Robin Genuer, Jean-Michel Poggi, Christine Tuleau-Malot, Nathalie Villa-Vialaneix

    Abstract: Big Data is one of the major challenges of statistical science and has numerous consequences from algorithmic and theoretical viewpoints. Big Data always involve massive data but they also often include online data and data heterogeneity. Recently some statistical methods have been adapted to process Big Data, like linear regression models, clustering methods and bootstrapping schemes. Based on d… ▽ More

    Submitted 22 March, 2017; v1 submitted 26 November, 2015; originally announced November 2015.

  3. arXiv:1405.6676  [pdf, other

    stat.OT cs.LG math.ST

    Statistique et Big Data Analytics; Volumétrie, L'Attaque des Clones

    Authors: Philippe Besse, Nathalie Villa-Vialaneix

    Abstract: This article assumes acquired the skills and expertise of a statistician in unsupervised (NMF, k-means, SVD) and supervised learning (regression, CART, random forest). What skills and knowledge do a statistician must acquire to reach the "Volume" scale of big data? After a quick overview of the different strategies available and especially of those imposed by Hadoop, the algorithms of some availab… ▽ More

    Submitted 5 October, 2014; v1 submitted 26 May, 2014; originally announced May 2014.

    Comments: in French

  4. arXiv:1212.6316  [pdf, other

    stat.ML cs.LG

    On-line relational SOM for dissimilarity data

    Authors: Madalina Olteanu, Nathalie Villa-Vialaneix, Marie Cottrell

    Abstract: In some applications and in order to address real world situations better, data may be more complex than simple vectors. In some examples, they can be known through their pairwise dissimilarities only. Several variants of the Self Organizing Map algorithm were introduced to generalize the original algorithm to this framework. Whereas median SOM is based on a rough representation of the prototypes,… ▽ More

    Submitted 27 December, 2012; originally announced December 2012.

    Comments: WSOM 2012, Santiago : Chile (2012)

  5. arXiv:1210.6511  [pdf, other

    cs.NE cs.LG stat.ML

    Neural Networks for Complex Data

    Authors: Marie Cottrell, Madalina Olteanu, Fabrice Rossi, Joseph Rynkiewicz, Nathalie Villa-Vialaneix

    Abstract: Artificial neural networks are simple and efficient machine learning tools. Defined originally in the traditional setting of simple vector data, neural network models have evolved to address more and more difficulties of complex real world problems, ranging from time evolving data to sophisticated data structures such as graphs and functions. This paper summarizes advances on those themes from the… ▽ More

    Submitted 24 October, 2012; originally announced October 2012.

    Journal ref: Künstliche Intelligenz 26, 4 (2012) 373-380

  6. Optimizing an Organized Modularity Measure for Topographic Graph Clustering: a Deterministic Annealing Approach

    Authors: Fabrice Rossi, Nathalie Villa-Vialaneix

    Abstract: This paper proposes an organized generalization of Newman and Girvan's modularity measure for graph clustering. Optimized via a deterministic annealing scheme, this measure produces topologically ordered graph clusterings that lead to faithful and readable graph representations based on clustering induced graphs. Topographic graph clustering provides an alternative to more classical solutions in w… ▽ More

    Submitted 7 September, 2010; originally announced September 2010.

    Journal ref: Neurocomputing, 73(7--9):1142--1163, March 2010