Skip to main content

Showing 1–13 of 13 results for author: Miao, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.10838  [pdf, other

    cond-mat.mtrl-sci cs.AI physics.app-ph

    Deep Learning Models for Colloidal Nanocrystal Synthesis

    Authors: Kai Gu, Yingping Liang, Jiaming Su, Peihan Sun, Jia Peng, Naihua Miao, Zhimei Sun, Ying Fu, Haizheng Zhong, Jun Zhang

    Abstract: Colloidal synthesis of nanocrystals usually includes complex chemical reactions and multi-step crystallization processes. Despite the great success in the past 30 years, it remains challenging to clarify the correlations between synthetic parameters of chemical reaction and physical properties of nanocrystals. Here, we developed a deep learning-based nanocrystal synthesis model that correlates syn… ▽ More

    Submitted 14 December, 2024; originally announced December 2024.

  2. arXiv:2310.12357  [pdf, other

    cs.SE cs.CR

    Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

    Authors: Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun

    Abstract: Large language models (LLMs) have demonstrated significant potential in the realm of natural language understanding and programming code processing tasks. Their capacity to comprehend and generate human-like code has spurred research into harnessing LLMs for code analysis purposes. However, the existing body of literature falls short in delivering a systematic evaluation and assessment of LLMs' ef… ▽ More

    Submitted 5 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted by Usenix Security 2024

  3. arXiv:2308.00436  [pdf, other

    cs.AI cs.CL cs.LG

    SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

    Authors: Ning Miao, Yee Whye Teh, Tom Rainforth

    Abstract: The recent progress in large language models (LLMs), especially the invention of chain-of-thought prompting, has made it possible to automatically answer questions by stepwise reasoning. However, when faced with more complicated problems that require non-linear thinking, even the strongest LLMs make mistakes. To address this, we explore whether LLMs are able to recognize errors in their own step-b… ▽ More

    Submitted 5 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  4. Gotcha! I Know What You are Doing on the FPGA Cloud: Fingerprinting Co-Located Cloud FPGA Accelerators via Measuring Communication Links

    Authors: Chongzhou Fang, Ning Miao, Han Wang, Jiacheng Zhou, Tyler Sheaves, John M. Emmert, Avesta Sasan, Houman Homayoun

    Abstract: In recent decades, due to the emerging requirements of computation acceleration, cloud FPGAs have become popular in public clouds. Major cloud service providers, e.g. AWS and Microsoft Azure have provided FPGA computing resources in their infrastructure and have enabled users to design and deploy their own accelerators on these FPGAs. Multi-tenancy FPGAs, where multiple users can share the same FP… ▽ More

    Submitted 7 July, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: To be published in ACM CCS 2023

  5. arXiv:2304.01990  [pdf, other

    cs.CR cs.LG eess.SP

    Side Channel-Assisted Inference Leakage from Machine Learning-based ECG Classification

    Authors: Jialin Liu, Ning Miao, Chongzhou Fang, Houman Homayoun, Han Wang

    Abstract: The Electrocardiogram (ECG) measures the electrical cardiac activity generated by the heart to detect abnormal heartbeat and heart attack. However, the irregular occurrence of the abnormalities demands continuous monitoring of heartbeats. Machine learning techniques are leveraged to automate the task to reduce labor work needed during monitoring. In recent years, many companies have launched produ… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  6. arXiv:2206.00051  [pdf, other

    cs.LG

    Learning Instance-Specific Augmentations by Capturing Local Invariances

    Authors: Ning Miao, Tom Rainforth, Emile Mathieu, Yann Dubois, Yee Whye Teh, Adam Foster, Hyunjik Kim

    Abstract: We introduce InstaAug, a method for automatically learning input-specific augmentations from data. Previous methods for learning augmentations have typically assumed independence between the original input and the transformation applied to that input. This can be highly restrictive, as the invariances we hope our augmentation will capture are themselves often highly input dependent. InstaAug inste… ▽ More

    Submitted 30 May, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

  7. arXiv:2106.13746  [pdf, other

    stat.ML cs.LG

    On Incorporating Inductive Biases into VAEs

    Authors: Ning Miao, Emile Mathieu, N. Siddharth, Yee Whye Teh, Tom Rainforth

    Abstract: We explain why directly changing the prior can be a surprisingly ineffective mechanism for incorporating inductive biases into VAEs, and introduce a simple and effective alternative approach: Intermediary Latent Space VAEs(InteL-VAEs). InteL-VAEs use an intermediary set of latent variables to control the stochasticity of the encoding process, before mapping these in turn to the latent representati… ▽ More

    Submitted 14 February, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

  8. arXiv:2007.06174  [pdf, other

    cs.CL

    Generating Fluent Adversarial Examples for Natural Languages

    Authors: Huangzhao Zhang, Hao Zhou, Ning Miao, Lei Li

    Abstract: Efficiently building an adversarial attacker for natural language processing (NLP) tasks is a real challenge. Firstly, as the sentence space is discrete, it is difficult to make small perturbations along the direction of gradients. Secondly, the fluency of the generated examples cannot be guaranteed. In this paper, we propose MHA, which addresses both problems by performing Metropolis-Hastings sam… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted by ACL 2019

  9. arXiv:2007.06162  [pdf, other

    cs.CL

    Do You Have the Right Scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods

    Authors: Ning Miao, Yuxuan Song, Hao Zhou, Lei Li

    Abstract: It has been a common approach to pre-train a language model on a large corpus and fine-tune it on task-specific data. In practice, we observe that fine-tuning a pre-trained model on a small dataset may lead to over- and/or under-estimation problem. In this paper, we propose MC-Tailor, a novel method to alleviate the above issue in text generation tasks by truncating and transferring the probabilit… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted by ACL 2020

  10. arXiv:2007.06018  [pdf, other

    stat.ML cs.CL cs.LG

    Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation

    Authors: Yuxuan Song, Ning Miao, Hao Zhou, Lantao Yu, Mingxuan Wang, Lei Li

    Abstract: Auto-regressive sequence generative models trained by Maximum Likelihood Estimation suffer the exposure bias problem in practical finite sample scenarios. The crux is that the number of training samples for Maximum Likelihood Estimation is usually limited and the input data distributions are different at training and inference stages. Many method shave been proposed to solve the above problem (Yu… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics 2020

  11. arXiv:1911.00274  [pdf, other

    cs.CL cs.LG

    Kernelized Bayesian Softmax for Text Generation

    Authors: Ning Miao, Hao Zhou, Chengqi Zhao, Wenxian Shi, Lei Li

    Abstract: Neural models for text generation require a softmax layer with proper token embeddings during the decoding phase. Most existing approaches adopt single point embedding for each token. However, a word may have multiple senses according to different context, some of which might be distinct. In this paper, we propose KerBS, a novel approach for learning better embeddings for text generation. KerBS em… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  12. arXiv:1906.06719  [pdf, other

    cs.LG cs.CL stat.ML

    Dispersed Exponential Family Mixture VAEs for Interpretable Text Generation

    Authors: Wenxian Shi, Hao Zhou, Ning Miao, Lei Li

    Abstract: Deep generative models are commonly used for generating images and text. Interpretability of these models is one important pursuit, other than the generation quality. Variational auto-encoder (VAE) with Gaussian distribution as prior has been successfully applied in text generation, but it is hard to interpret the meaning of the latent variable. To enhance the controllability and interpretability,… ▽ More

    Submitted 21 August, 2020; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: Camera ready version for ICML 2020

  13. arXiv:1811.10996  [pdf, other

    cs.CL cs.AI cs.LG math.ST stat.ML

    CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling

    Authors: Ning Miao, Hao Zhou, Lili Mou, Rui Yan, Lei Li

    Abstract: In real-world applications of natural language generation, there are often constraints on the target sentences in addition to fluency and naturalness requirements. Existing language generation techniques are usually based on recurrent neural networks (RNNs). However, it is non-trivial to impose constraints on RNNs while maintaining generation quality, since RNNs generate sentences sequentially (or… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: AAAI19