Skip to main content

Showing 1–6 of 6 results for author: Ouyang, A

.
  1. arXiv:2505.15329  [pdf, ps, other

    cs.LG

    Fourier-Invertible Neural Encoder (FINE) for Homogeneous Flows

    Authors: Anqiao Ouyang, Hongyi Ke, Qi Wang

    Abstract: Invertible neural architectures have recently attracted attention for their compactness, interpretability, and information-preserving properties. In this work, we propose the Fourier-Invertible Neural Encoder (FINE), which combines invertible monotonic activation functions with reversible filter structures, and could be extended using Invertible ResNets. This architecture is examined in learning l… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2502.10517  [pdf, other

    cs.LG cs.AI cs.PF cs.SE

    KernelBench: Can LLMs Write Efficient GPU Kernels?

    Authors: Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher RĂ©, Azalia Mirhoseini

    Abstract: Efficient GPU kernels are crucial for building performant machine learning architectures, but writing them is a time-consuming challenge that requires significant expertise; therefore, we explore using language models (LMs) to automate kernel generation. We introduce KernelBench, an open-source framework for evaluating LMs' ability to write fast and correct kernels on a suite of 250 carefully sele… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

  3. arXiv:2305.13903  [pdf, other

    cs.CL cs.CV

    Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought

    Authors: Vaishnavi Himakunthala, Andy Ouyang, Daniel Rose, Ryan He, Alex Mei, Yujie Lu, Chinmay Sonar, Michael Saxon, William Yang Wang

    Abstract: Despite exciting recent results showing vision-language systems' capacity to reason about images using natural language, their capacity for video reasoning remains under-explored. We motivate framing video reasoning as the sequential understanding of a small number of keyframes, thereby leveraging the power and robustness of vision-language while alleviating the computational complexities of proce… ▽ More

    Submitted 9 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  4. arXiv:2305.02317  [pdf, other

    cs.CL cs.CV

    Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings

    Authors: Daniel Rose, Vaishnavi Himakunthala, Andy Ouyang, Ryan He, Alex Mei, Yujie Lu, Michael Saxon, Chinmay Sonar, Diba Mirza, William Yang Wang

    Abstract: Recent advances in large language models elicit reasoning in a chain-of-thought that allows models to decompose problems in a human-like fashion. Though this paradigm improves multi-step reasoning ability in language models, it is limited by being unimodal and applied mainly to question-answering tasks. We claim that incorporating visual augmentation into reasoning is essential, especially for com… ▽ More

    Submitted 22 January, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

  5. arXiv:2110.08450  [pdf, other

    cs.LG cs.AI cs.PF

    Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining

    Authors: Tim Kaler, Nickolas Stathas, Anne Ouyang, Alexandros-Stavros Iliopoulos, Tao B. Schardl, Charles E. Leiserson, Jie Chen

    Abstract: Improving the training and inference performance of graph neural networks (GNNs) is faced with a challenge uncommon in general neural networks: creating mini-batches requires a lot of computation and data movement due to the exponential growth of multi-hop graph neighborhoods along network layers. Such a unique challenge gives rise to a diverse set of system design choices. We argue in favor of pe… ▽ More

    Submitted 16 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: MLSys 2022. Code is available at https://github.com/MITIBMxGraph/SALIENT

  6. arXiv:2008.02413  [pdf, other

    physics.soc-ph cs.CY

    Impact of COVID-19 on Public Transit Accessibility and Ridership

    Authors: Michael Wilbur, Afiya Ayman, Anna Ouyang, Vincent Poon, Riyan Kabir, Abhiram Vadali, Philip Pugliese, Daniel Freudberg, Aron Laszka, Abhishek Dubey

    Abstract: Public transit is central to cultivating equitable communities. Meanwhile, the novel coronavirus disease COVID-19 and associated social restrictions has radically transformed ridership behavior in urban areas. Perhaps the most concerning aspect of the COVID-19 pandemic is that low-income and historically marginalized groups are not only the most susceptible to economic shifts but are also most rel… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.