Showing 1–10 of 10 results for author: Dam, H

Searching in archive stat.
  1. arXiv:2402.10291

    cs.LG stat.ML

    An Evaluation of Real-time Adaptive Sampling Change Point Detection Algorithm using KCUSUM

    Authors: Vijayalakshmi Saravanan, Perry Siehien, Shinjae Yoo, Hubertus Van Dam, Thomas Flynn, Christopher Kelly, Khaled Z Ibrahim

    Abstract: Detecting abrupt changes in real-time data streams from scientific simulations presents a challenging task, demanding the deployment of accurate and efficient algorithms. Identifying change points in a live data stream involves continuous scrutiny of incoming observations for deviations in their statistical characteristics, particularly in high-volume data scenarios. Maintaining a balance between su…

    Submitted 4 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 16 pages. arXiv admin note: text overlap with arXiv:1903.01661

    MSC Class: CCS

  2. Indoor environment data time-series reconstruction using autoencoder neural networks

    Authors: Antonio Liguori, Romana Markovic, Thi Thu Ha Dam, Jérôme Frisch, Christoph van Treeck, Francesco Causone

    Abstract: As the number of installed meters in buildings increases, there is a growing number of data time-series that could be used to develop data-driven models to support and optimize building operation. However, building data sets are often characterized by errors and missing values, which are considered, by the recent research, among the main limiting factors on the performance of the proposed models.…

    Submitted 21 January, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted in Building and Environment

    Journal ref: Building and Environment 191 (2021) 107623

  3. arXiv:2008.08818

    stat.ML cond-mat.mtrl-sci cs.LG

    Ensemble learning reveals dissimilarity between rare-earth transition metal binary alloys with respect to the Curie temperature

    Authors: Duong-Nguyen Nguyen, Tien-Lam Pham, Viet-Cuong Nguyen, Hiori Kino, Takashi Miyake, Hieu-Chi Dam

    Abstract: We propose a data-driven method to extract dissimilarity between materials, with respect to a given target physical property. The technique is based on an ensemble method with Kernel ridge regression as the predicting model; multiple random subset sampling of the materials is done to generate prediction models and the corresponding contributions of the reference training materials in detail. The d…

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: 10 pages, 3 figures

  4. arXiv:2006.02431

    q-bio.BM cs.LG q-bio.QM stat.ML

    Targeting SARS-CoV-2 with AI- and HPC-enabled Lead Generation: A First Data Release

    Authors: Yadu Babuji, Ben Blaiszik, Tom Brettin, Kyle Chard, Ryan Chard, Austin Clyde, Ian Foster, Zhi Hong, Shantenu Jha, Zhuozhao Li, Xuefeng Liu, Arvind Ramanathan, Yi Ren, Nicholaus Saint, Marcus Schwarting, Rick Stevens, Hubertus van Dam, Rick Wagner

    Abstract: Researchers across the globe are seeking to rapidly repurpose existing drugs or discover new drugs to counter the novel coronavirus disease (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). One promising approach is to train machine learning (ML) and artificial intelligence (AI) tools to screen large numbers of small molecules. As a contribution to that effort,…

    Submitted 27 May, 2020; originally announced June 2020.

    Comments: 11 pages, 5 figures

  5. arXiv:2005.08482

    stat.ML cs.LG

    Variational Hyper-Encoding Networks

    Authors: Phuoc Nguyen, Truyen Tran, Sunil Gupta, Santu Rana, Hieu-Chi Dam, Svetha Venkatesh

    Abstract: We propose a framework called HyperVAE for encoding distributions of distributions. When a target distribution is modeled by a VAE, its neural network parameters θ are drawn from a distribution p(θ) which is modeled by a hyper-level VAE. We propose a variational inference using Gaussian mixture models to implicitly encode the parameters θ into a low dimensional Gaussian distribution. Given a target d…

    Submitted 12 May, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Accepted ECML-2021

  6. arXiv:1903.10867

    cs.LG stat.ML

    Measuring the Similarity between Materials with an Emphasis on the Materials Distinctiveness

    Authors: Tran-Thai Dang, Tien-Lam Pham, Hiori Kino, Takashi Miyake, Hieu-Chi Dam

    Abstract: In this study, we establish a basis for selecting similarity measures when applying machine learning techniques to solve materials science problems. This selection is considered with an emphasis on the distinctiveness between materials that reflect their nature well. We perform a case study with a dataset of rare-earth transition metal crystalline compounds represented using the Orbital Field Matr…

    Submitted 23 March, 2019; originally announced March 2019.

  7. arXiv:1708.04357

    cs.LG cs.AI stat.ML

    Graph Classification via Deep Learning with Virtual Nodes

    Authors: Trang Pham, Truyen Tran, Hoa Dam, Svetha Venkatesh

    Abstract: Learning representation for graph classification turns a variable-size graph into a fixed-size vector (or matrix). Such a representation works nicely with algebraic manipulations. Here we introduce a simple method to augment an attributed graph with a virtual node that is bidirectionally connected to all existing nodes. The virtual node represents the latent aspects of the graph, which are not imm…

    Submitted 14 August, 2017; originally announced August 2017.

  8. arXiv:1609.00489

    cs.SE cs.LG stat.ML

    A deep learning model for estimating story points

    Authors: Morakot Choetkiertikul, Hoa Khanh Dam, Truyen Tran, Trang Pham, Aditya Ghose, Tim Menzies

    Abstract: Although there has been substantial research in software analytics for effort estimation in traditional software projects, little work has been done for estimation in agile projects, especially estimating user stories or issues. Story points are the most common unit of measure used for estimating the effort involved in implementing a user story or resolving an issue. In this paper, we offer for th…

    Submitted 6 September, 2016; v1 submitted 2 September, 2016; originally announced September 2016.

    Comments: Submitted to ICSE'17

  9. arXiv:1608.02715

    cs.SE stat.ML

    A deep language model for software code

    Authors: Hoa Khanh Dam, Truyen Tran, Trang Pham

    Abstract: Existing language models such as n-grams for software code often fail to capture a long context where dependent code elements scatter far apart. In this paper, we propose a novel approach to build a language model for software code to address this particular issue. Our language model, partly inspired by human memory, is built upon the powerful deep learning-based Long Short Term Memory architectur…

    Submitted 9 August, 2016; originally announced August 2016.

  10. arXiv:1608.00092

    cs.SE stat.ML

    DeepSoft: A vision for a deep model of software

    Authors: Hoa Khanh Dam, Truyen Tran, John Grundy, Aditya Ghose

    Abstract: Although software analytics has experienced rapid growth as a research area, it has not yet reached its full potential for wide industrial adoption. Most of the existing work in software analytics still relies heavily on costly manual feature engineering processes, and they mainly address the traditional classification problems, as opposed to predicting future events. We present a vision for \emph…

    Submitted 30 July, 2016; originally announced August 2016.

    Comments: FSE 2016