Skip to main content

Showing 1–20 of 20 results for author: Kalantari, N K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05367  [pdf, other

    cs.CV

    Text2Stereo: Repurposing Stable Diffusion for Stereo Generation with Consistency Rewards

    Authors: Aakash Garg, Libing Zeng, Andrii Tsarov, Nima Khademi Kalantari

    Abstract: In this paper, we propose a novel diffusion-based approach to generate stereo images given a text prompt. Since stereo image datasets with large baselines are scarce, training a diffusion model from scratch is not feasible. Therefore, we propose leveraging the strong priors learned by Stable Diffusion and fine-tuning it on stereo image datasets to adapt it to the task of stereo generation. To impr… ▽ More

    Submitted 27 May, 2025; originally announced June 2025.

  2. arXiv:2506.00839  [pdf, ps, other

    cs.GR cs.CV cs.LG

    Neural Path Guiding with Distribution Factorization

    Authors: Pedro Figueiredo, Qihao He, Nima Khademi Kalantari

    Abstract: In this paper, we present a neural path guiding method to aid with Monte Carlo (MC) integration in rendering. Existing neural methods utilize distribution representations that are either fast or expressive, but not both. We propose a simple, but effective, representation that is sufficiently expressive and reasonably fast. Specifically, we break down the 2D distribution over the directional domain… ▽ More

    Submitted 4 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

    Comments: 11 pages, 11 figures. Accepted to EGSR 2025

  3. Neural Importance Sampling of Many Lights

    Authors: Pedro Figueiredo, Qihao He, Steve Bako, Nima Khademi Kalantari

    Abstract: We propose a neural approach for estimating spatially varying light selection distributions to improve importance sampling in Monte Carlo rendering, particularly for complex scenes with many light sources. Our method uses a neural network to predict the light selection distribution at each shading point based on local information, trained by minimizing the KL-divergence between the learned and tar… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 11 pages, 11 figures. Accepted for publication in SIGGRAPH Conference Papers '25; to be presented at SIGGRAPH 2025

  4. arXiv:2503.10860  [pdf, other

    cs.CV cs.GR

    RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors

    Authors: Avinash Paliwal, Xilong Zhou, Wei Ye, Jinhui Xiong, Rakesh Ranjan, Nima Khademi Kalantari

    Abstract: In this paper, we propose RI3D, a novel 3DGS-based approach that harnesses the power of diffusion models to reconstruct high-quality novel views given a sparse set of input images. Our key contribution is separating the view synthesis process into two tasks of reconstructing visible regions and hallucinating missing regions, and introducing two personalized diffusion models, each tailored to one o… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project page: https://people.engr.tamu.edu/nimak/Papers/RI3D, Code: https://github.com/avinashpaliwal/RI3D

  5. arXiv:2412.04827  [pdf, other

    cs.CV cs.GR

    PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion

    Authors: Avinash Paliwal, Xilong Zhou, Andrii Tsarov, Nima Khademi Kalantari

    Abstract: In this paper, we present PanoDreamer, a novel method for producing a coherent 360° 3D scene from a single input image. Unlike existing methods that generate the scene sequentially, we frame the problem as single-image panorama and depth estimation. Once the coherent panoramic image and its corresponding depth are obtained, the scene can be reconstructed by inpainting the small occluded regions an… ▽ More

    Submitted 11 March, 2025; v1 submitted 6 December, 2024; originally announced December 2024.

    Comments: Project page: https://people.engr.tamu.edu/nimak/Papers/PanoDreamer, Code: https://github.com/avinashpaliwal/PanoDreamer

  6. arXiv:2411.12002  [pdf, other

    cs.CV

    Analyzing and Improving the Skin Tone Consistency and Bias in Implicit 3D Relightable Face Generators

    Authors: Libing Zeng, Nima Khademi Kalantari

    Abstract: With the advances in generative adversarial networks (GANs) and neural rendering, 3D relightable face generation has received significant attention. Among the existing methods, a particularly successful technique uses an implicit lighting representation and generates relit images through the product of synthesized albedo and light-dependent shading images. While this approach produces high-quality… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 10 pages, 10 figures, 5 tables, WACV 2025

  7. CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians

    Authors: Avinash Paliwal, Wei Ye, Jinhui Xiong, Dmytro Kotovenko, Rakesh Ranjan, Vikas Chandra, Nima Khademi Kalantari

    Abstract: The field of 3D reconstruction from images has rapidly evolved in the past few years, first with the introduction of Neural Radiance Field (NeRF) and more recently with 3D Gaussian Splatting (3DGS). The latter provides a significant edge over NeRF in terms of the training and inference speed, as well as the reconstruction quality. Although 3DGS works well for dense input images, the unstructured p… ▽ More

    Submitted 7 December, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: ECCV2024, Project page: https://people.engr.tamu.edu/nimak/Papers/CoherentGS, Code: https://github.com/avinashpaliwal/CoherentGS

  8. arXiv:2309.10689  [pdf, other

    cs.CV cs.GR

    ReShader: View-Dependent Highlights for Single Image View-Synthesis

    Authors: Avinash Paliwal, Brandon Nguyen, Andrii Tsarov, Nima Khademi Kalantari

    Abstract: In recent years, novel view synthesis from a single image has seen significant progress thanks to the rapid advancements in 3D scene representation and image inpainting techniques. While the current approaches are able to synthesize geometrically consistent novel views, they often do not handle the view-dependent effects properly. Specifically, the highlights in their synthesized images usually ap… ▽ More

    Submitted 3 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH Asia 2023. Project page at https://people.engr.tamu.edu/nimak/Papers/SIGAsia2023_Reshader/index.html and video at https://www.youtube.com/watch?v=XW-tl48D3Ok

    Journal ref: ACM Transactions on Graphics (ToG) 42,6 (2023) 1-9

  9. arXiv:2305.12296  [pdf, other

    cs.CV cs.AI cs.GR

    PhotoMat: A Material Generator Learned from Single Flash Photos

    Authors: Xilong Zhou, Miloš Hašan, Valentin Deschaintre, Paul Guerrero, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Nima Khademi Kalantari

    Abstract: Authoring high-quality digital materials is key to realism in 3D rendering. Previous generative models for materials have been trained exclusively on synthetic data; such data is limited in availability and has a visual gap to real materials. We circumvent this limitation by proposing PhotoMat: the first material generator trained exclusively on real photos of material samples captured using a cel… ▽ More

    Submitted 23 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Journal ref: Siggraph 2023

  10. arXiv:2303.17181  [pdf, other

    cs.CV cs.GR

    Implicit View-Time Interpolation of Stereo Videos using Multi-Plane Disparities and Non-Uniform Coordinates

    Authors: Avinash Paliwal, Andrii Tsarov, Nima Khademi Kalantari

    Abstract: In this paper, we propose an approach for view-time interpolation of stereo videos. Specifically, we build upon X-Fields that approximates an interpolatable mapping between the input coordinates and 2D RGB images using a convolutional decoder. Our main contribution is to analyze and identify the sources of the problems with using X-Fields in our application and propose novel techniques to overcome… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023. Project page at https://people.engr.tamu.edu/nimak/Papers/CVPR23StereoVideo/index.html and video at https://www.youtube.com/watch?v=XJa_bf8OCrc

  11. arXiv:2211.07377  [pdf, other

    cs.LG

    Physics-Guided, Physics-Informed, and Physics-Encoded Neural Networks in Scientific Computing

    Authors: Salah A Faroughi, Nikhil Pawar, Celio Fernandes, Maziar Raissi, Subasish Das, Nima K. Kalantari, Seyed Kourosh Mahjour

    Abstract: Recent breakthroughs in computing power have made it feasible to use machine learning and deep learning to advance scientific computing in many fields, including fluid mechanics, solid mechanics, materials science, etc. Neural networks, in particular, play a central role in this hybridization. Due to their intrinsic architecture, conventional neural networks cannot be successfully trained and scop… ▽ More

    Submitted 4 February, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  12. arXiv:2209.13284  [pdf, other

    cs.CV cs.LG

    Frame Interpolation for Dynamic Scenes with Implicit Flow Encoding

    Authors: Pedro Figueirêdo, Avinash Paliwal, Nima Khademi Kalantari

    Abstract: In this paper, we propose an algorithm to interpolate between a pair of images of a dynamic scene. While in the past years significant progress in frame interpolation has been made, current approaches are not able to handle images with brightness and illumination changes, which are common even when the images are captured shortly apart. We propose to address this problem by taking advantage of the… ▽ More

    Submitted 16 November, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted to WACV 2023. Project website: https://people.engr.tamu.edu/nimak/Papers/WACV2023_Interp . Code: https://github.com/pedrovfigueiredo/frameintIFE . YouTube: https://youtu.be/Re_c-CBlSfI

  13. Differentiable Simulation of Inertial Musculotendons

    Authors: Ying Wang, Jasper Verheul, Sang-Hoon Yeo, Nima Khademi Kalantari, Shinjiro Sueda

    Abstract: We propose a simple and practical approach for incorporating the effects of muscle inertia, which has been ignored by previous musculoskeletal simulators in both graphics and biomechanics. We approximate the inertia of the muscle by assuming that muscle mass is distributed along the centerline of the muscle. We express the motion of the musculotendons in terms of the motion of the skeletal joints… ▽ More

    Submitted 22 September, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Journal ref: ACM Transactions on Graphics (SIGGRAPH Asia), 41 (6) 272:1-272:11, 2022

  14. arXiv:2103.02861  [pdf, other

    cs.CV eess.IV

    Multi-Stage Raw Video Denoising with Adversarial Loss and Gradient Mask

    Authors: Avinash Paliwal, Libing Zeng, Nima Khademi Kalantari

    Abstract: In this paper, we propose a learning-based approach for denoising raw videos captured under low lighting conditions. We propose to do this by first explicitly aligning the neighboring frames to the current frame using a convolutional neural network (CNN). We then fuse the registered frames using another CNN to obtain the final denoised frame. To avoid directly aligning the temporally distant frame… ▽ More

    Submitted 16 April, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted to ICCP 2021. Project page containing code and video at https://people.engr.tamu.edu/nimak/Papers/ICCP2021_denoising

  15. Single Image HDR Reconstruction Using a CNN with Masked Features and Perceptual Loss

    Authors: Marcel Santana Santos, Tsang Ing Ren, Nima Khademi Kalantari

    Abstract: Digital cameras can only capture a limited range of real-world scenes' luminance, producing images with saturated pixels. Existing single image high dynamic range (HDR) reconstruction methods attempt to expand the range of luminance, but are not able to hallucinate plausible textures, producing results with artifacts in the saturated areas. In this paper, we present a novel learning-based approach… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: 10 pages, 13 figures, to be published in ACM SIGGRAPH 2020. For project page see http://faculty.cs.tamu.edu/nimak/Papers/SIGGRAPH2020_HDR/

  16. Deep Slow Motion Video Reconstruction with Hybrid Imaging System

    Authors: Avinash Paliwal, Nima Khademi Kalantari

    Abstract: Slow motion videos are becoming increasingly popular, but capturing high-resolution videos at extremely high frame rates requires professional high-speed cameras. To mitigate this problem, current techniques increase the frame rate of standard videos through frame interpolation by assuming linear object motion which is not valid in challenging cases. In this paper, we address this problem using tw… ▽ More

    Submitted 21 April, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: IEEE TPAMI and ICCP 2020. Project page containing code and video at http://faculty.cs.tamu.edu/nimak/Papers/ICCP2020_Slomo

  17. arXiv:1905.00889  [pdf, other

    cs.CV cs.GR

    Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines

    Authors: Ben Mildenhall, Pratul P. Srinivasan, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari, Ravi Ramamoorthi, Ren Ng, Abhishek Kar

    Abstract: We present a practical and robust deep learning solution for capturing and rendering novel views of complex real world scenes for virtual exploration. Previous approaches either require intractably dense view sampling or provide little to no guidance for how users should sample views of a scene to reliably render high-quality novel views. Instead, we propose an algorithm for view synthesis from an… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: SIGGRAPH 2019. Project page with video and code: http://people.eecs.berkeley.edu/~bmild/llff/

  18. arXiv:1807.11226  [pdf, other

    cs.CV

    Deep Hybrid Real and Synthetic Training for Intrinsic Decomposition

    Authors: Sai Bi, Nima Khademi Kalantari, Ravi Ramamoorthi

    Abstract: Intrinsic image decomposition is the process of separating the reflectance and shading layers of an image, which is a challenging and underdetermined problem. In this paper, we propose to systematically address this problem using a deep convolutional neural network (CNN). Although deep learning (DL) has been recently used to handle this application, the current DL methods train the network only on… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

    Comments: Accepted to EGSR 2018

  19. Light Field Video Capture Using a Learning-Based Hybrid Imaging System

    Authors: Ting-Chun Wang, Jun-Yan Zhu, Nima Khademi Kalantari, Alexei A. Efros, Ravi Ramamoorthi

    Abstract: Light field cameras have many advantages over traditional cameras, as they allow the user to change various camera settings after capture. However, capturing light fields requires a huge bandwidth to record the data: a modern light field camera can only take three images per second. This prevents current consumer light field cameras from capturing light field videos. Temporal interpolation at such… ▽ More

    Submitted 8 May, 2017; originally announced May 2017.

    Comments: ACM Transactions on Graphics (Proceedings of SIGGRAPH 2017)

  20. Learning-Based View Synthesis for Light Field Cameras

    Authors: Nima Khademi Kalantari, Ting-Chun Wang, Ravi Ramamoorthi

    Abstract: With the introduction of consumer light field cameras, light field imaging has recently become widespread. However, there is an inherent trade-off between the angular and spatial resolution, and thus, these cameras often sparsely sample in either spatial or angular domain. In this paper, we use machine learning to mitigate this trade-off. Specifically, we propose a novel learning-based approach to… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

    Comments: in ACM Transactions on Graphics 2016

    ACM Class: I.4.1