Skip to main content

Showing 1–5 of 5 results for author: Canny, J F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.06253  [pdf, other

    cs.CV cs.CL

    What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description Models and Metrics

    Authors: David M. Chan, Austin Myers, Sudheendra Vijayanarasimhan, David A. Ross, Bryan Seybold, John F. Canny

    Abstract: While there have been significant gains in the field of automated video description, the generalization performance of automated description models to novel domains remains a major barrier to using these systems in the real world. Most visual description methods are known to capture and exploit patterns in the training data leading to evaluation metric increases, but what are those patterns? In th… ▽ More

    Submitted 12 January, 2023; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: The 1st Workshop on Vision Datasets Understanding, IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

  2. arXiv:2110.07775  [pdf, other

    cs.HC cs.AI cs.LG

    Creating User Interface Mock-ups from High-Level Text Descriptions with Deep-Learning Models

    Authors: Forrest Huang, Gang Li, Xin Zhou, John F. Canny, Yang Li

    Abstract: The design process of user interfaces (UIs) often begins with articulating high-level design goals. Translating these high-level design goals into concrete design mock-ups, however, requires extensive effort and UI design expertise. To facilitate this process for app designers and developers, we introduce three deep-learning techniques to create low-fidelity UI mock-ups from a natural language phr… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  3. arXiv:1904.04399  [pdf, other

    cs.HC cs.LG

    Sketchforme: Composing Sketched Scenes from Text Descriptions for Interactive Applications

    Authors: Forrest Huang, John F. Canny

    Abstract: Sketching and natural languages are effective communication media for interactive applications. We introduce Sketchforme, the first neural-network-based system that can generate sketches based on text descriptions specified by users. Sketchforme is capable of gaining high-level and low-level understanding of multi-object sketched scenes without being trained on sketched scene datasets annotated wi… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 10 pages

    ACM Class: I.4.9

  4. arXiv:1812.04604  [pdf, other

    cs.CV cs.AI

    Diagnostic Visualization for Deep Neural Networks Using Stochastic Gradient Langevin Dynamics

    Authors: Biye Jiang, David M. Chan, Tianhao Zhang, John F. Canny

    Abstract: The internal states of most deep neural networks are difficult to interpret, which makes diagnosis and debugging during training challenging. Activation maximization methods are widely used, but lead to multiple optima and are hard to interpret (appear noise-like) for complex neurons. Image-based methods use maximally-activating image regions which are easier to interpret, but do not provide pixel… ▽ More

    Submitted 11 December, 2018; originally announced December 2018.

  5. arXiv:1807.11824  [pdf, other

    cs.LG cs.PF stat.ML

    t-SNE-CUDA: GPU-Accelerated t-SNE and its Applications to Modern Data

    Authors: David M. Chan, Roshan Rao, Forrest Huang, John F. Canny

    Abstract: Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples. Existing visualization methods which employ dimensionality reduction to two or three dimensions are often inefficient and/or ineffective for these datasets. This paper introduces t-SNE-CUDA, a GPU-accelerated implementation of t-distributed Symmetric… ▽ More

    Submitted 31 July, 2018; originally announced July 2018.

    Comments: To appear in HPML 2018 High Performance Machine Learning Workshop (Accepted, 2018)