Skip to main content

Showing 1–50 of 130 results for author: Owens, A

.
  1. arXiv:2506.03148  [pdf, ps, other

    cs.CV

    Self-Supervised Spatial Correspondence Across Modalities

    Authors: Ayush Shrivastava, Andrew Owens

    Abstract: We present a method for finding cross-modal space-time correspondences. Given two images from different visual modalities, such as an RGB image and a depth map, our model identifies which pairs of pixels correspond to the same physical points in the scene. To solve this problem, we extend the contrastive random walk framework to simultaneously learn cycle-consistent feature representations for bot… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: CVPR 2025. Project link: https://www.ayshrv.com/cmrw . Code: https://github.com/ayshrv/cmrw

  2. arXiv:2503.11769  [pdf, other

    astro-ph.GA

    The Chicago Carnegie Hubble Program: Improving the Calibration of SNe Ia with JWST Measurements of the Tip of the Red Giant Branch

    Authors: Taylor J. Hoyt, In Sung Jang, Wendy L. Freedman, Barry F. Madore, Kayla A. Owens, Abigail J. Lee

    Abstract: We present distances to ten supernova (SN) host galaxies determined via the red giant branch tip (TRGB) using JWST/NIRCAM and the F115W, F356W, and F444W bandpasses. Our analysis, including photometric catalog cleaning, adoption of disk light profiles, TRGB color slope estimation, and a novel technique for identifying the infrared TRGB, was conducted blinded. The new F115W TRGB distances agree wel… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 56 pages; 25 figures; 8 tables

  3. arXiv:2503.10329  [pdf, other

    physics.atom-ph

    A Comparison of Calcium Sources for Ion-Trap Loading via Laser Ablation

    Authors: Daisy R H Smith, Silpa Muralidharan, Roland Hablutzel, Georgina Croft, Klara Theophilo, Alexander Owens, Yashna N D Lekhai, Scott J Thomas, Cameron Deans

    Abstract: Trapped-ion technology is a leading approach for scalable quantum computing. A key element of ion trapping is reliable loading of atomic sources into the trap. While thermal atomic ovens have traditionally been used for this purpose, laser ablation has emerged as a viable alternative in recent years, offering the advantages of faster and more localized loading with lower heat dissipation. Calcium… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 10 pages, 7 figures

  4. arXiv:2502.18705  [pdf, other

    cs.HC

    Understanding Children's Avatar Making in Social Online Games

    Authors: Yue Fu, Samuel Schwamm, Amanda Baughan, Nicole M Powell, Zoe Kronberg, Alicia Owens, Emily Renee Izenman, Dania Alsabeh, Elizabeth Hunt, Michael Rich, David Bickham, Jenny Radesky, Alexis Hiniker

    Abstract: Social online games like Minecraft and Roblox have become increasingly integral to children's daily lives. Our study explores how children aged 8 to 13 create and customize avatars in these virtual environments. Through semi-structured interviews and gameplay observations with 48 participants, we investigate the motivations behind children's avatar-making. Our findings show that children's avatar… ▽ More

    Submitted 11 March, 2025; v1 submitted 25 February, 2025; originally announced February 2025.

  5. arXiv:2501.12390  [pdf, other

    cs.CV

    GPS as a Control Signal for Image Generation

    Authors: Chao Feng, Ziyang Chen, Aleksander Holynski, Alexei A. Efros, Andrew Owens

    Abstract: We show that the GPS tags contained in photo metadata provide a useful control signal for image generation. We train GPS-to-image models and use them for tasks that require a fine-grained understanding of how images vary within a city. In particular, we train a diffusion model to generate images conditioned on both GPS and text. The learned model generates images that capture the distinctive appea… ▽ More

    Submitted 22 January, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: Project page: https://cfeng16.github.io/gps-gen/

  6. arXiv:2412.02700  [pdf, other

    cs.CV

    Motion Prompting: Controlling Video Generation with Motion Trajectories

    Authors: Daniel Geng, Charles Herrmann, Junhwa Hur, Forrester Cole, Serena Zhang, Tobias Pfaff, Tatiana Lopez-Guevara, Carl Doersch, Yusuf Aytar, Michael Rubinstein, Chen Sun, Oliver Wang, Andrew Owens, Deqing Sun

    Abstract: Motion control is crucial for generating expressive and compelling video content; however, most existing video generation models rely mainly on text prompts for control, which struggle to capture the nuances of dynamic actions and temporal compositions. To this end, we train a video generation model conditioned on spatio-temporally sparse or dense motion trajectories. In contrast to prior motion c… ▽ More

    Submitted 27 March, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: CVPR 2025 camera ready. Project page: https://motion-prompting.github.io/

  7. arXiv:2411.17698  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Video-Guided Foley Sound Generation with Multimodal Controls

    Authors: Ziyang Chen, Prem Seetharaman, Bryan Russell, Oriol Nieto, David Bourgin, Andrew Owens, Justin Salamon

    Abstract: Generating sound effects for videos often requires creating artistic sound effects that diverge significantly from real-life sources and flexible control in the sound design. To address this problem, we introduce MultiFoley, a model designed for video-guided sound generation that supports multimodal conditioning through text, audio, and video. Given a silent video and a text prompt, MultiFoley all… ▽ More

    Submitted 17 March, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: Accepted at CVPR 2025. Project site: https://ificl.github.io/MultiFoley/

  8. arXiv:2411.04125  [pdf, other

    cs.CV

    Community Forensics: Using Thousands of Generators to Train Fake Image Detectors

    Authors: Jeongsoo Park, Andrew Owens

    Abstract: One of the key challenges of detecting AI-generated images is spotting images that have been created by previously unseen generative models. We argue that the limited diversity of the training data is a major obstacle to addressing this problem, and we propose a new dataset that is significantly larger and more diverse than prior work. As part of creating this dataset, we systematically download t… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: 15 pages

  9. arXiv:2410.16575  [pdf, other

    astro-ph.SR physics.atom-ph physics.chem-ph

    ExoMol line lists -- LXV. Mid-Infrared rovibronic spectroscopy of isotopologues of NiH

    Authors: Kirill Batrakov, Sergei N. Yurchenko, Alec Owens, Jonathan Tennyson, Alexander Mitrushchenkov, Amanda J. Ross, Patrick Crozet, Asen Pashov

    Abstract: New line lists for four isotopologues of nickel monohydride, $^{58}$NiH, $^{60}$NiH, $^{62}$NiH, and $^{58}$NiD are presented covering the wavenumber range $<10000$ cm$^{-1}$ ($λ> 1$ $μ$m), $J$ up to 37.5 for transitions within and between the three lowest-lying electronic states, ${X}\,^{2}Δ$, ${W}\,^{2}Π$, and ${V}\,^{2}Σ^{+}$. The line lists are applicable for temperatures up to 5000 K. The lin… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  10. arXiv:2410.11834  [pdf, other

    cs.RO

    Contrastive Touch-to-Touch Pretraining

    Authors: Samanta Rodriguez, Yiming Dou, William van den Bogert, Miquel Oller, Kevin So, Andrew Owens, Nima Fazeli

    Abstract: Today's tactile sensors have a variety of different designs, making it challenging to develop general-purpose methods for processing touch signals. In this paper, we learn a unified representation that captures the shared information between different tactile sensors. Unlike current approaches that focus on reconstruction or task-specific supervision, we leverage contrastive learning to integrate… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

  11. arXiv:2410.04295  [pdf, other

    astro-ph.EP astro-ph.GA astro-ph.IM

    ExoMol line lists -- LX. Molecular line list for the ammonia isotopologue $^{15}$NH$_3$

    Authors: Sergei N. Yurchenko, Charles A. Bowesman, Ryan P. Brady, Elizabeth R. Guest, Kyriaki Kefala, Georgi B. Mitev, Alec Owens, Armando N. Perri, Marco Pezzella, Oleksiy Smola, Andrei Sokolov, Jingxin Zhang, Jonathan Tennyson

    Abstract: A theoretical line list for $^{15}$NH$_3$ CoYuTe-15 is presented based on the empirical potential energy and ab initio dipole moments surfaces developed and used for the production of the ExoMol line list CoYuTe for $^{14}$NH$_3$. The ro-vibrational energy levels and wavefunctions are computed using the variational program TROVE. The line list ranges up to 10000 cm$^{-1}$ ($λ\geq 1$ $μ$m) and cont… ▽ More

    Submitted 9 October, 2024; v1 submitted 5 October, 2024; originally announced October 2024.

    Journal ref: MNRAS, 533, 3442-3456 (2020)

  12. arXiv:2409.16288  [pdf, other

    cs.CV

    Self-Supervised Any-Point Tracking by Contrastive Random Walks

    Authors: Ayush Shrivastava, Andrew Owens

    Abstract: We present a simple, self-supervised approach to the Tracking Any Point (TAP) problem. We train a global matching transformer to find cycle consistent tracks through video via contrastive random walks, using the transformer's attention-based global matching to define the transition matrices for a random walk on a space-time graph. The ability to perform "all pairs" comparisons between points allow… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: ECCV 2024. Project link: https://ayshrv.com/gmrw . Code: https://github.com/ayshrv/gmrw/

  13. arXiv:2409.14592  [pdf, other

    cs.RO

    Tactile Functasets: Neural Implicit Representations of Tactile Datasets

    Authors: Sikai Li, Samanta Rodriguez, Yiming Dou, Andrew Owens, Nima Fazeli

    Abstract: Modern incarnations of tactile sensors produce high-dimensional raw sensory feedback such as images, making it challenging to efficiently store, process, and generalize across sensors. To address these concerns, we introduce a novel implicit function representation for tactile sensor feedback. Rather than directly using raw tactile images, we propose neural implicit functions trained to reconstruc… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  14. arXiv:2409.14340  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Self-Supervised Audio-Visual Soundscape Stylization

    Authors: Tingle Li, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Anumanchipalli

    Abstract: Speech sounds convey a great deal of information about the scenes, resulting in a variety of effects ranging from reverberation to additional ambient sounds. In this paper, we manipulate input speech to sound as though it was recorded within a different scene, given an audio-visual conditional example recorded from that scene. Our model learns through self-supervision, taking advantage of the fact… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: ECCV 2024

  15. arXiv:2409.08269  [pdf, other

    cs.RO

    Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation

    Authors: Samanta Rodriguez, Yiming Dou, Miquel Oller, Andrew Owens, Nima Fazeli

    Abstract: Today's touch sensors come in many shapes and sizes. This has made it challenging to develop general-purpose touch processing methods since models are generally tied to one specific sensor design. We address this problem by performing cross-modal prediction between touch sensors: given the tactile signal from one sensor, we use a generative model to estimate how the same physical contact would be… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  16. arXiv:2408.06153  [pdf, other

    astro-ph.CO

    Status Report on the Chicago-Carnegie Hubble Program (CCHP): Measurement of the Hubble Constant Using the Hubble and James Webb Space Telescopes

    Authors: Wendy L. Freedman, Barry F. Madore, In Sung Jang, Taylor J. Hoyt, Abigail J. Lee, Kayla A. Owens

    Abstract: We present the latest results from the Chicago-Carnegie Hubble Program (\cchp) to measure the Hubble constant, using data from the James Webb Space Telescope (JWST). The overall program aims to calibrate three independent methods: (1) Tip of the Red Giant Branch (TRGB) stars, (2) JAGB (J-Region Asymptotic Giant Branch) stars, and (3) Cepheids. To date, our program includes 10 nearby galaxies, host… ▽ More

    Submitted 17 March, 2025; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 70 pages, 21 figures. Major updates from V1 include HST plus JWST calibration of the TRGB increasing the number of calibrators from 10 to 24, and improving the statistical precision in H0. Minor change in fonts from V2

  17. arXiv:2408.03474  [pdf, other

    astro-ph.GA astro-ph.CO

    The Chicago-Carnegie Hubble Program: The JWST J-region Asymptotic Giant Branch (JAGB) Extragalactic Distance Scale

    Authors: Abigail J. Lee, Wendy L. Freedman, Barry F. Madore, In Sung Jang, Kayla A. Owens, Taylor J. Hoyt

    Abstract: The J-region asymptotic giant branch (JAGB) method is a new standard candle based on the constant luminosities of carbon-rich asymptotic giant branch stars in the J band. The JAGB method is independent of the Cepheid and TRGB distance indicators. Therefore, we can leverage it to both cross-check Cepheid and TRGB distances for systematic errors and use it to measure an independent local Hubble cons… ▽ More

    Submitted 26 March, 2025; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 25 pages, 10 figures, 5 tables, accepted to ApJ

  18. arXiv:2407.07309  [pdf, other

    astro-ph.GA astro-ph.CO

    Coordinated JWST Imaging of Three Distance Indicators in a SN Host Galaxy and an Estimate of the TRGB Color Dependence

    Authors: Taylor J. Hoyt, In Sung Jang, Wendy L. Freedman, Barry F. Madore, Abigail J. Lee, Kayla A. Owens

    Abstract: Boasting a 6.5m mirror in space, JWST can increase by several times the number of supernovae (SNe) to which a redshift-independent distance has been measured with a precision distance indicator (e.g., TRGB or Cepheids); the limited number of such SN calibrators currently dominates the uncertainty budget in distance ladder Hubble constant (H0) experiments. JWST/NIRCAM imaging of the Virgo Cluster g… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Revised version after submission to AAS journals; 20 pages, 12 figures; Fig. 1 compressed to reduce file size

  19. arXiv:2406.06347  [pdf, other

    astro-ph.GA

    The 2024 release of the ExoMol database: molecular line lists for exoplanet and other hot atmospheres

    Authors: Jonathan Tennyson, Sergei N. Yurchenko, Jingxin Zhang, Charles A. Bowesman, Ryan P. Brady, Jeanna Buldyreva, Katy L. Chubb, Robert R. Gamache, Maire N. Gorman, Elizabeth R. Guest, Christian Hill, Kyriaki Kefala, A. E. Lynas-Gray, Thomas M. Mellor, Laura K. McKemmish, Georgi B. Mitev, Irina I. Mizus, Alec Owens, Zhijian Peng, Armando N. Perri, Marco Pezzella, Oleg L. Polyansky, Qianwei Qu, Mikhail Semenov, Oleksiy Smola , et al. (5 additional authors not shown)

    Abstract: The ExoMol database (www.exomol.com) provides molecular data for spectroscopic studies of hot atmospheres. These data are widely used to model atmospheres of exoplanets, cool stars and other astronomical objects, as well as a variety of terrestrial applications. The 2024 data release reports the current status of the database which contains recommended line lists for 91 molecules and 224 isotopolo… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Report number: JQSRT in press 2024

  20. arXiv:2405.12221  [pdf, other

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Images that Sound: Composing Images and Sounds on a Single Canvas

    Authors: Ziyang Chen, Daniel Geng, Andrew Owens

    Abstract: Spectrograms are 2D representations of sound that look very different from the images found in our visual world. And natural images, when played as spectrograms, make unnatural sounds. In this paper, we show that it is possible to synthesize spectrograms that simultaneously look like natural images and sound like natural audio. We call these visual spectrograms images that sound. Our approach is s… ▽ More

    Submitted 4 February, 2025; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted to NeurIPS 2024. Project site: https://ificl.github.io/images-that-sound/

  21. arXiv:2405.08815  [pdf, other

    cs.CV

    Efficient Vision-Language Pre-training by Cluster Masking

    Authors: Zihao Wei, Zixuan Pan, Andrew Owens

    Abstract: We propose a simple strategy for masking image patches during visual-language contrastive learning that improves the quality of the learned representations and the training speed. During each iteration of training, we randomly mask clusters of visually similar image patches, as measured by their raw pixel intensities. This provides an extra learning signal, beyond the contrastive training itself,… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, Project page: https://zxp46.github.io/cluster-masking/ , Code: https://github.com/Zi-hao-Wei/Efficient-Vision-Language-Pre-training-by-Cluster-Masking

  22. arXiv:2405.04534  [pdf, other

    cs.CV

    Tactile-Augmented Radiance Fields

    Authors: Yiming Dou, Fengyu Yang, Yi Liu, Antonio Loquercio, Andrew Owens

    Abstract: We present a scene representation, which we call a tactile-augmented radiance field (TaRF), that brings vision and touch into a shared 3D space. This representation can be used to estimate the visual and tactile signals for a given 3D position within a scene. We capture a scene's TaRF from a collection of photos and sparsely sampled touch probes. Our approach makes use of two insights: (i) common… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, Project page: https://dou-yiming.github.io/TaRF, Code: https://github.com/Dou-Yiming/TaRF/

  23. arXiv:2404.11615  [pdf, other

    cs.CV

    Factorized Diffusion: Perceptual Illusions by Noise Decomposition

    Authors: Daniel Geng, Inbum Park, Andrew Owens

    Abstract: Given a factorization of an image into a sum of linear components, we present a zero-shot method to control each individual component through diffusion model sampling. For example, we can decompose an image into low and high spatial frequencies and condition these components on different text prompts. This produces hybrid images, which change appearance depending on viewing distance. By decomposin… ▽ More

    Submitted 10 January, 2025; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: ECCV 2024 camera ready version + more readable size

  24. arXiv:2403.18821  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark

    Authors: Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard

    Abstract: We present a new dataset called Real Acoustic Fields (RAF) that captures real acoustic room data from multiple modalities. The dataset includes high-quality and densely captured room impulse response data paired with multi-view images, and precise 6DoF pose tracking data for sound emitters and listeners in the rooms. We used this dataset to evaluate existing methods for novel-view acoustic synthes… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024. Project site: https://facebookresearch.github.io/real-acoustic-fields/

  25. arXiv:2402.18794  [pdf, other

    astro-ph.GA

    Resolved Near-infrared Stellar Photometry from the Magellan Telescope for 13 Nearby Galaxies: JAGB Method Distances

    Authors: Abigail J. Lee, Andrew J. Monson, Wendy L. Freedman, Barry F. Madore, Kayla A. Owens, Rachael L. Beaton, Coral Espinoza, Tongtian Ren, Yi Ren

    Abstract: We present near-infrared JHK photometry for the resolved stellar populations in 13 nearby galaxies: NGC 6822, IC 1613, NGC 3109, Sextans B, Sextans A, NGC 300, NGC 55, NGC 7793, NGC 247, NGC 5253, Cen A, NGC 1313, and M83, acquired from the 6.5m Baade-Magellan telescope. We measure distances to each galaxy using the J-region asymptotic giant branch (JAGB) method, a new standard candle that leverag… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 31 pages, 11 figures, 6 tables, accepted to ApJ. Photometry catalogs for 13 galaxies available at https://zenodo.org/records/10606945

  26. arXiv:2401.18085  [pdf, other

    cs.CV

    Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators

    Authors: Daniel Geng, Andrew Owens

    Abstract: Diffusion models are capable of generating impressive images conditioned on text descriptions, and extensions of these models allow users to edit images at a relatively coarse scale. However, the ability to precisely edit the layout, position, pose, and shape of objects in images with diffusion models is still difficult. To this end, we propose motion guidance, a zero-shot technique that allows a… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  27. arXiv:2401.18084  [pdf, other

    cs.CV cs.RO

    Binding Touch to Everything: Learning Unified Multimodal Tactile Representations

    Authors: Fengyu Yang, Chao Feng, Ziyang Chen, Hyoungseob Park, Daniel Wang, Yiming Dou, Ziyao Zeng, Xien Chen, Rit Gangopadhyay, Andrew Owens, Alex Wong

    Abstract: The ability to associate touch with other modalities has huge implications for humans and computational systems. However, multimodal learning with touch remains challenging due to the expensive data collection process and non-standardized sensor outputs. We introduce UniTouch, a unified tactile model for vision-based touch sensors connected to multiple modalities, including vision, language, and s… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  28. arXiv:2312.02282  [pdf, other

    astro-ph.GA astro-ph.CO

    First JWST Observations of JAGB Stars in the SN Ia Host Galaxies: NGC 7250, NGC 4536, NGC 3972

    Authors: Abigail J. Lee, Wendy L. Freedman, In Sung Jang, Barry F. Madore, Kayla A. Owens

    Abstract: The J-region Asymptotic Giant Branch (JAGB) method is a standard candle that leverages the constant luminosities of color-selected, carbon-rich AGB stars, measured in the near infrared at 1.2 microns. The Chicago-Carnegie Hubble Program (CCHP) has obtained JWST imaging of the SN Ia host galaxies NGC 7250, NGC 4536, and NGC 3972. With these observations, the JAGB method can be studied for the first… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 13 pages, 7 figures, accepted to ApJ

  29. arXiv:2311.17919  [pdf, other

    cs.CV

    Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models

    Authors: Daniel Geng, Inbum Park, Andrew Owens

    Abstract: We address the problem of synthesizing multi-view optical illusions: images that change appearance upon a transformation, such as a flip or rotation. We propose a simple, zero-shot method for obtaining these illusions from off-the-shelf text-to-image diffusion models. During the reverse diffusion process, we estimate the noise from different views of a noisy image, and then combine these noise est… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 camera ready

  30. arXiv:2311.17056  [pdf, other

    cs.CV

    Self-Supervised Motion Magnification by Backpropagating Through Optical Flow

    Authors: Zhaoying Pan, Daniel Geng, Andrew Owens

    Abstract: This paper presents a simple, self-supervised method for magnifying subtle motions in video: given an input video and a magnification factor, we manipulate the video such that its new optical flow is scaled by the desired amount. To train our model, we propose a loss function that estimates the optical flow of the generated video and penalizes how far if deviates from the given magnification facto… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

  31. arXiv:2310.15238  [pdf, other

    astro-ph.IM physics.comp-ph

    Hyperbolic Conduction: A Fast, Physical Conduction Model Implemented in Smoothed Particle Hydrodynamics

    Authors: N. A. Owens, J. Wadsley

    Abstract: We present the first implementation of hyperbolic thermal conduction in smoothed particle hydrodynamics (SPH). Hyperbolic conduction is a physically-motivated alternative to traditional, parabolic conduction. It incorporates a relaxation time, which ensures that heat propagates no faster than a physical signal speed. This allows for larger, Courant like, time steps for explicit schemes. Numerical… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  32. arXiv:2309.15117  [pdf, other

    cs.CV

    Generating Visual Scenes from Touch

    Authors: Fengyu Yang, Jiacheng Zhang, Andrew Owens

    Abstract: An emerging line of work has sought to generate plausible imagery from touch. Existing approaches, however, tackle only narrow aspects of the visuo-tactile synthesis problem, and lag significantly behind the quality of cross-modal synthesis methods in other domains. We draw on recent advances in latent diffusion to create a model for synthesizing images from tactile signals (and vice versa) and ap… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: ICCV 2023; Project site: https://fredfyyang.github.io/vision-from-touch/

  33. arXiv:2308.03941  [pdf, other

    astro-ph.SR astro-ph.EP astro-ph.GA astro-ph.IM

    ExoMol line lists -- LI. Molecular line list for lithium hydroxide (LiOH)

    Authors: Alec Owens, Sam O. M. Wright, Yakiv Pavlenko, Alexander Mitrushchenkov, Jacek Koput, Sergei N. Yurchenko, Jonathan Tennyson

    Abstract: A new molecular line list for lithium hydroxide ($^{7}$Li$^{16}$O$^{1}$H) covering wavelengths $λ> 1 μ$m (the 0-10000 cm$^{-1}$ range) is presented. The OYT7 line list contains over 331 million transitions between rotation-vibration energy levels with total angular momentum up to $J=95$ and is applicable for temperatures up to $T\approx 3500$ K. Line list calculations are based on a previously pub… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  34. arXiv:2305.06195  [pdf, other

    astro-ph.SR astro-ph.IM

    Quantifying Uncertainties on the Tip of the Red Giant Branch Method

    Authors: Barry F. Madore, Wendy L. Freedman Kayla A. Owens, In Sung Jang

    Abstract: We present an extensive grid of numerical simulations quantifying the uncertainties in measurements of the Tip of the Red Giant Branch (TRGB). These simulations incorporate a luminosity function composed of 2 magnitudes of red giant branch (RGB) stars leading up to the tip, with asymptotic giant branch (AGB) stars contributing exclusively to the luminosity function for at least a magnitude above t… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Accepte to the Astronomical Journal

  35. arXiv:2304.08490  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Conditional Generation of Audio from Video via Foley Analogies

    Authors: Yuexi Du, Ziyang Chen, Justin Salamon, Bryan Russell, Andrew Owens

    Abstract: The sound effects that designers add to videos are designed to convey a particular artistic effect and, thus, may be quite different from a scene's true sound. Inspired by the challenges of creating a soundtrack for a video that differs from its true sound, but that nonetheless matches the actions occurring on screen, we propose the problem of conditional Foley. We present the following contributi… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  36. The James Webb Space Telescope Mission

    Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg , et al. (983 additional authors not shown)

    Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures

  37. arXiv:2303.17490  [pdf, other

    cs.CV cs.MM cs.SD eess.AS eess.IV

    Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment

    Authors: Kim Sung-Bin, Arda Senocak, Hyunwoo Ha, Andrew Owens, Tae-Hyun Oh

    Abstract: How does audio describe the world around us? In this paper, we propose a method for generating an image of a scene from sound. Our method addresses the challenges of dealing with the large gaps that often exist between sight and sound. We design a model that works by scheduling the learning procedure of each model component to associate audio-visual modalities despite their information gaps. The k… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  38. arXiv:2303.11989  [pdf, other

    cs.CV

    Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models

    Authors: Lukas Höllein, Ang Cao, Andrew Owens, Justin Johnson, Matthias Nießner

    Abstract: We present Text2Room, a method for generating room-scale textured 3D meshes from a given text prompt as input. To this end, we leverage pre-trained 2D text-to-image models to synthesize a sequence of images from different poses. In order to lift these outputs into a consistent 3D scene representation, we combine monocular depth estimation with a text-conditioned inpainting model. The core idea of… ▽ More

    Submitted 10 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV 2023 (Oral) video: https://youtu.be/fjRnFL91EZc project page: https://lukashoel.github.io/text-to-room/ code: https://github.com/lukasHoel/text2room

  39. arXiv:2303.11329  [pdf, other

    cs.CV cs.SD eess.AS

    Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation

    Authors: Ziyang Chen, Shengyi Qian, Andrew Owens

    Abstract: The images and sounds that we perceive undergo subtle but geometrically consistent changes as we rotate our heads. In this paper, we use these cues to solve a problem we call Sound Localization from Motion (SLfM): jointly estimating camera rotation and localizing sound sources. We learn to solve these tasks solely through self-supervision. A visual model predicts camera rotation from a pair of ima… ▽ More

    Submitted 21 August, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: ICCV 2023. Project site: https://ificl.github.io/SLfM/

  40. arXiv:2301.04647  [pdf, other

    cs.CV cs.CL

    EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata

    Authors: Chenhao Zheng, Ayush Shrivastava, Andrew Owens

    Abstract: We learn a visual representation that captures information about the camera that recorded a given photo. To do this, we train a multimodal embedding between image patches and the EXIF metadata that cameras automatically insert into image files. Our model represents this metadata by simply converting it to text and then processing it with a transformer. The features that we learn significantly outp… ▽ More

    Submitted 17 June, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: CVPR 2023 (Highlight). Project link: http://hellomuffin.github.io/exif-as-language

  41. arXiv:2301.01767  [pdf, other

    cs.CV

    Self-Supervised Video Forensics by Audio-Visual Anomaly Detection

    Authors: Chao Feng, Ziyang Chen, Andrew Owens

    Abstract: Manipulated videos often contain subtle inconsistencies between their visual and audio signals. We propose a video forensics method, based on anomaly detection, that can identify these inconsistencies, and that can be trained solely using real, unlabeled data. We train an autoregressive model to generate sequences of audio-visual features, using feature sets that capture the temporal synchronizati… ▽ More

    Submitted 27 March, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: CVPR 2023

  42. arXiv:2211.15058  [pdf, other

    cs.CV

    Mix and Localize: Localizing Sound Sources in Mixtures

    Authors: Xixi Hu, Ziyang Chen, Andrew Owens

    Abstract: We present a method for simultaneously localizing multiple sound sources within a visual scene. This task requires a model to both group a sound mixture into individual sources, and to associate them with a visual signal. Our method jointly solves both tasks at once, using a formulation inspired by the contrastive random walk of Jabri et al. We create a graph in which images and separated sounds c… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: CVPR 2022

  43. arXiv:2211.12498  [pdf, other

    cs.CV

    Touch and Go: Learning from Human-Collected Vision and Touch

    Authors: Fengyu Yang, Chenyang Ma, Jiacheng Zhang, Jing Zhu, Wenzhen Yuan, Andrew Owens

    Abstract: The ability to associate touch with sight is essential for tasks that require physically interacting with objects in the world. We propose a dataset with paired visual and tactile data called Touch and Go, in which human data collectors probe objects in natural environments using tactile sensors, while simultaneously recording egocentric video. In contrast to previous efforts, which have largely b… ▽ More

    Submitted 29 November, 2022; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted by NeurIPS 2022 Track of Datasets and Benchmarks

  44. arXiv:2211.07728  [pdf, other

    astro-ph.EP astro-ph.SR

    Hazy with a chance of star spots: constraining the atmosphere of the young planet, K2-33b

    Authors: Pa Chia Thao, Andrew W. Mann, Peter Gao, Dylan A. Owens, Andrew Vanderburg, Elisabeth R. Newton, Yao Tang, Matthew J. Fields, Trevor J. David, Jonathan M. Irwin, Tim-Oliver Husser, David Charbonneau, Sarah Ballard

    Abstract: Although all-sky surveys have led to the discovery of dozens of young planets, little is known about their atmospheres. Here, we present multi-wavelength transit data for the super Neptune-sized exoplanet, K2-33b -- the youngest (~10 Myr) transiting exoplanet to-date. We combined photometric observations of K2-33 covering a total of 33 transits spanning >2 years, taken from K2, MEarth, Hubble, and… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted to AJ. 26 pages, 14 figures, 6 tables

  45. arXiv:2210.12480  [pdf, other

    astro-ph.EP astro-ph.SR physics.atom-ph

    ExoMol line lists -- XLVII. Rovibronic molecular line list of the calcium monohydroxide radical (CaOH)

    Authors: Alec Owens, Alexander Mitrushchenkov, Sergei N. Yurchenko, Jonathan Tennyson

    Abstract: Any future detection of the calcium monohydroxide radical (CaOH) in stellar and exoplanetary atmospheres will rely on accurate molecular opacity data. Here, we present the first comprehensive molecular line list of CaOH covering the \A--\X\ rotation-vibration-electronic and \X--\X\ rotation-vibration bands. The newly computed OYT6 line list contains over 24.2 billion transitions between 3.2 millio… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Journal ref: Mon. Not. R. astr. Soc., 516, 3995-4002 (2022)

  46. arXiv:2210.12474  [pdf, other

    astro-ph.EP astro-ph.GA astro-ph.SR physics.atom-ph

    ExoMol line lists -- XLV. Rovibronic molecular line lists of calcium monohydride (CaH) and magnesium monohydride (MgH)

    Authors: Alec Owens, Sophie Dooley, Luke McLaughlin, Brandon Tan, Guanming Zhang, Sergei N. Yurchenko, Jonathan Tennyson

    Abstract: New molecular line lists for calcium monohydride ($^{40}$Ca$^{1}$H) and magnesium monohydride ($^{24}$Mg$^{1}$H) and its minor isotopologues ($^{25}$Mg$^{1}$H and $^{26}$Mg$^{1}$H) are presented. The rotation-vibration-electronic (rovibronic) line lists, named \texttt{XAB}, consider transitions involving the \X, \A, and \BBp\ electronic states in the 0--30\,000~cm$^{-1}$ region (wavelengths… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Journal ref: Mon. Not. R. astr. Soc., 511, 5448-5461 (2022)

  47. arXiv:2206.08383  [pdf, other

    astro-ph.EP astro-ph.SR

    Transit Hunt for Young and Maturing Exoplanets (THYME) VIII: a Pleiades-age association harboring two transiting planetary systems from Kepler

    Authors: Madyson G. Barber, Andrew W. Mann, Jonathan L. Bush, Benjamin M. Tofflemire, Adam L. Kraus, Daniel M. Krolikowski, Andrew Vanderburg, Matthew J. Fields, Elisabeth R. Newton, Dylan A. Owens, Pa Chia Thao

    Abstract: Young planets provide a window into the early stages and evolution of planetary systems. Ideal planets for such research are in coeval associations, where the parent population can precisely determine their ages. We describe a young association (MELANGE-3) in the Kepler field, which harbors two transiting planetary systems (Kepler-1928 and Kepler-970). We identify MELANGE-3 by searching for kinema… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: accepted for publication in AJ

  48. arXiv:2205.11323  [pdf, other

    astro-ph.GA astro-ph.CO

    The Astrophysical Distance Scale: V. A 2% Distance to the Local Group Spiral M33 via the JAGB Method, Tip of the Red Giant Branch, and Leavitt Law

    Authors: Abigail J. Lee, Laurie Rousseau-Nepton, Wendy L. Freedman, Barry F. Madore, Maria-Rosa L. Cioni, Taylor J. Hoyt, In Sung Jang, Atefeh Javadi, Kayla A. Owens

    Abstract: The J-region asymptotic giant branch (JAGB) method is a new standard candle that is based on the stable intrinsic J-band magnitude of color-selected carbon stars, and has a precision comparable to other primary distance indicators such as Cepheids and the TRGB. We further test the accuracy of the JAGB method in the Local Group Galaxy M33. M33's moderate inclination, low metallicity, and nearby pro… ▽ More

    Submitted 23 June, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: 23 pages, 14 figures, accepted to the ApJ. v2 is exactly the same as v1 except for a fixed minor typo found while looking at the proofs

  49. arXiv:2205.05072  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    Learning Visual Styles from Audio-Visual Associations

    Authors: Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao

    Abstract: From the patter of rain to the crunch of snow, the sounds we hear often convey the visual textures that appear within a scene. In this paper, we present a method for learning visual styles from unlabeled audio-visual data. Our model learns to manipulate the texture of a scene to match a sound, a problem we term audio-driven image stylization. Given a dataset of paired audio-visual data, we learn t… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  50. arXiv:2204.12489  [pdf, other

    cs.CV cs.SD eess.AS

    Sound Localization by Self-Supervised Time Delay Estimation

    Authors: Ziyang Chen, David F. Fouhey, Andrew Owens

    Abstract: Sounds reach one microphone in a stereo pair sooner than the other, resulting in an interaural time delay that conveys their directions. Estimating a sound's time delay requires finding correspondences between the signals recorded by each microphone. We propose to learn these correspondences through self-supervision, drawing on recent techniques from visual tracking. We adapt the contrastive rando… ▽ More

    Submitted 28 January, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: ECCV 2022