Showing 1–14 of 14 results for author: Shan, S

Searching in archive stat.
  1. arXiv:2412.16691  [pdf, other]

    cs.LG cs.CY stat.ME stat.ML

    From Correlation to Causation: Understanding Climate Change through Causal Analysis and LLM Interpretations

    Authors: Shan Shan

    Abstract: This research presents a three-step causal inference framework that integrates correlation analysis, machine learning-based causality discovery, and LLM-driven interpretations to identify socioeconomic factors influencing carbon emissions and contributing to climate change. The approach begins with identifying correlations, progresses to causal analysis, and enhances decision making through LLM-ge…

    Submitted 21 December, 2024; originally announced December 2024.

  2. arXiv:2408.12888  [pdf, other]

    cs.LG stat.ML

    Accelerated Markov Chain Monte Carlo Using Adaptive Weighting Scheme

    Authors: Yanbo Wang, Wenyu Chen, Shimin Shan

    Abstract: Gibbs sampling is one of the most commonly used Markov Chain Monte Carlo (MCMC) algorithms due to its simplicity and efficiency. It cycles through the latent variables, sampling each one from its distribution conditional on the current values of all the other variables. Conventional Gibbs sampling is based on the systematic scan (with a deterministic order of variables). In contrast, in recent yea…

    Submitted 23 August, 2024; originally announced August 2024.

  3. Identification of socioeconomic factors influencing global food price security using machine learning

    Authors: Shan Shan

    Abstract: Global concern over food prices and security has been exacerbated by the impacts of armed conflicts such as the Russia Ukraine War, pandemic diseases, and climate change. Traditionally, analyzing global food prices and their associations with socioeconomic factors has relied on static linear regression models. However, the complexity of socioeconomic factors and their implications extend beyond si…

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2306.11157  [pdf, other]

    stat.ML cs.LG stat.AP

    Human Limits in Machine Learning: Prediction of Plant Phenotypes Using Soil Microbiome Data

    Authors: Rosa Aghdam, Xudong Tang, Shan Shan, Richard Lankau, Claudia Solís-Lemus

    Abstract: The preservation of soil health is a critical challenge in the 21st century due to its significant impact on agriculture, human health, and biodiversity. We provide the first deep investigation of the predictive potential of machine learning models to understand the connections between soil and biological phenotypes. We investigate an integrative framework performing accurate machine learning-base…

    Submitted 16 February, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  5. arXiv:2203.02867  [pdf, other]

    stat.ML cs.LG math.NA

    Diffusion Maps: Using the Semigroup Property for Parameter Tuning

    Authors: Shan Shan, Ingrid Daubechies

    Abstract: Diffusion maps (DM) constitute a classic dimension reduction technique, for data lying on or close to a (relatively) low-dimensional manifold embedded in a much larger dimensional space. The DM procedure consists in constructing a spectral parametrization for the manifold from simulated random walks or diffusion paths on the data set. However, DM is hard to tune in practice. In particular, the tas…

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: 14 pages, 12 figures

  6. arXiv:2107.04855  [pdf, ps, other]

    cs.LG stat.ML

    Kernel Mean Estimation by Marginalized Corrupted Distributions

    Authors: Xiaobo Xia, Shuo Shan, Mingming Gong, Nannan Wang, Fei Gao, Haikun Wei, Tongliang Liu

    Abstract: Estimating the kernel mean in a reproducing kernel Hilbert space is a critical component in many kernel learning algorithms. Given a finite sample, the standard estimate of the target kernel mean is the empirical average. Previous works have shown that better estimators can be constructed by shrinkage methods. In this work, we propose to corrupt data examples with noise from known distributions an…

    Submitted 10 July, 2021; originally announced July 2021.

  7. arXiv:2104.12476  [pdf, other]

    cs.CV stat.ML

    EigenGAN: Layer-Wise Eigen-Learning for GANs

    Authors: Zhenliang He, Meina Kan, Shiguang Shan

    Abstract: Recent studies on Generative Adversarial Network (GAN) reveal that different layers of a generative CNN hold different semantics of the synthesized images. However, few GAN models have explicit dimensions to control the semantic attributes represented in a specific layer. This paper proposes EigenGAN which is able to unsupervisedly mine interpretable and controllable dimensions from different gene…

    Submitted 9 August, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: ICCV 2021. Code: https://github.com/LynnHo/EigenGAN-Tensorflow

  8. arXiv:2008.02676  [pdf, other]

    cs.LG stat.ML

    Exchangeable Neural ODE for Set Modeling

    Authors: Yang Li, Haidong Yi, Christopher M. Bender, Siyuan Shan, Junier B. Oliva

    Abstract: Reasoning over an instance composed of a set of vectors, like a point cloud, requires that one accounts for intra-set dependent features among elements. However, since such instances are unordered, the elements' features should remain unchanged when the input's order is permuted. This property, permutation equivariance, is a challenging constraint for most neural architectures. While recent work h…

    Submitted 6 August, 2020; originally announced August 2020.

  9. arXiv:2002.08327  [pdf, ps, other]

    cs.CR cs.CV cs.LG stat.ML

    Fawkes: Protecting Privacy against Unauthorized Deep Learning Models

    Authors: Shawn Shan, Emily Wenger, Jiayun Zhang, Huiying Li, Haitao Zheng, Ben Y. Zhao

    Abstract: Today's proliferation of powerful facial recognition systems poses a real threat to personal privacy. As Clearview.ai demonstrated, anyone can canvas the Internet for data and train highly accurate facial recognition models of individuals without their knowledge. We need tools to protect ourselves from potential misuses of unauthorized facial recognition systems. Unfortunately, no practical or eff…

    Submitted 22 June, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Journal ref: USENIX Security Symposium 2020

  10. arXiv:1910.01226  [pdf, ps, other]

    cs.CR cs.LG stat.ML

    Piracy Resistant Watermarks for Deep Neural Networks

    Authors: Huiying Li, Emily Wenger, Shawn Shan, Ben Y. Zhao, Haitao Zheng

    Abstract: As companies continue to invest heavily in larger, more accurate and more robust deep learning models, they are exploring approaches to monetize their models while protecting their intellectual property. Model licensing is promising, but requires a robust tool for owners to claim ownership of models, i.e. a watermark. Unfortunately, current designs have not been able to address piracy attacks, whe…

    Submitted 2 December, 2020; v1 submitted 2 October, 2019; originally announced October 2019.

    Comments: 18 pages

  11. arXiv:1909.09140  [pdf, other]

    cs.LG stat.ML

    Meta-Neighborhoods

    Authors: Siyuan Shan, Yang Li, Junier Oliva

    Abstract: Making an adaptive prediction based on one's input is an important ability for general artificial intelligence. In this work, we step forward in this direction and propose a semi-parametric method, Meta-Neighborhoods, where predictions are made adaptively to the neighborhood of the input. We show that Meta-Neighborhoods is a generalization of $k$-nearest-neighbors. Due to the simpler manifold stru…

    Submitted 13 October, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2020

  12. arXiv:1904.08554  [pdf, ps, other]

    cs.LG cs.CR stat.ML

    Gotta Catch 'Em All: Using Honeypots to Catch Adversarial Attacks on Neural Networks

    Authors: Shawn Shan, Emily Wenger, Bolun Wang, Bo Li, Haitao Zheng, Ben Y. Zhao

    Abstract: Deep neural networks (DNN) are known to be vulnerable to adversarial attacks. Numerous efforts either try to patch weaknesses in trained models, or try to make it difficult or costly to compute adversarial examples that exploit them. In our work, we explore a new "honeypot" approach to protect DNN models. We intentionally inject trapdoors, honeypot weaknesses in the classification manifold that at…

    Submitted 28 September, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

    Journal ref: Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security

  13. arXiv:1711.10678  [pdf, other]

    cs.CV stat.ML

    AttGAN: Facial Attribute Editing by Only Changing What You Want

    Authors: Zhenliang He, Wangmeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen

    Abstract: Facial attribute editing aims to manipulate single or multiple attributes of a face image, i.e., to generate a new face with desired attributes while preserving other details. Recently, generative adversarial net (GAN) and encoder-decoder architecture are usually incorporated to handle this task with promising results. Based on the encoder-decoder architecture, facial attribute editing is achieved…

    Submitted 25 July, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: Submitted to IEEE Transactions on Image Processing, Code: https://github.com/LynnHo/AttGAN-Tensorflow

  14. arXiv:1705.08516  [pdf, other]

    stat.AP

    An Open-Data Analysis of Heterogeneities in Lung Cancer Premature Mortality Rate and Associated Factors among Toronto Neighborhoods

    Authors: Zhanwei Du, Jiming Liu, Songwei Shan

    Abstract: In public health, various data are rigorously collected and published with open access. These data reflect the environmental and non-environmental characteristics of heterogeneous neighborhoods in cities. In the present study, we aimed to study the relations between these data and disease risks in heterogeneous neighborhoods. A flexible framework was developed to determine the key factors correlat…

    Submitted 15 May, 2017; originally announced May 2017.