Showing 1–15 of 15 results for author: Tomasi, C

  1. arXiv:2505.18342  [pdf, ps, other]

    cs.CV cs.LG

    Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance

    Authors: Jack Goffinet, Youngjo Min, Carlo Tomasi, David E. Carlson

    Abstract: Accurate and scalable quantification of animal pose and appearance is crucial for studying behavior. Current 3D pose estimation techniques, such as keypoint- and mesh-based techniques, often face challenges including limited representational detail, labor-intensive annotation requirements, and expensive per-frame optimization. These limitations hinder the study of subtle movements and can make lar…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 19 pages, 13 figures

  2. arXiv:2410.03588  [pdf, other]

    cs.LG

    Training Over a Distribution of Hyperparameters for Enhanced Performance and Adaptability on Imbalanced Classification

    Authors: Kelsey Lieberman, Swarna Kamlam Ravindran, Shuai Yuan, Carlo Tomasi

    Abstract: Although binary classification is a well-studied problem, training reliable classifiers under severe class imbalance remains a challenge. Recent techniques mitigate the ill effects of imbalance on training by modifying the loss functions or optimization methods. We observe that different hyperparameter values on these loss functions perform better at different recall values. We propose to exploit…

    Submitted 4 October, 2024; originally announced October 2024.

  3. arXiv:2402.05400  [pdf, other]

    cs.LG cs.CV

    Optimizing for ROC Curves on Class-Imbalanced Data by Training over a Family of Loss Functions

    Authors: Kelsey Lieberman, Shuai Yuan, Swarna Kamlam Ravindran, Carlo Tomasi

    Abstract: Although binary classification is a well-studied problem in computer vision, training reliable classifiers under severe class imbalance remains a challenging problem. Recent work has proposed techniques that mitigate the effects of training under imbalance by modifying the loss functions or optimization methods. While this work has led to significant improvements in the overall accuracy in the mul…

    Submitted 4 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2311.16508  [pdf, other]

    cs.CV

    RandMSAugment: A Mixed-Sample Augmentation for Limited-Data Scenarios

    Authors: Swarna Kamlam Ravindran, Carlo Tomasi

    Abstract: The high costs of annotating large datasets suggest a need for effectively training CNNs with limited data, and data augmentation is a promising direction. We study foundational augmentation techniques, including Mixed Sample Data Augmentations (MSDAs) and a no-parameter variant of RandAugment termed Preset-RandAugment, in the fully supervised scenario. We observe that Preset-RandAugment excels i…

    Submitted 25 November, 2023; originally announced November 2023.

  5. arXiv:2310.04712  [pdf, other]

    cs.CV cs.RO

    UFD-PRiME: Unsupervised Joint Learning of Optical Flow and Stereo Depth through Pixel-Level Rigid Motion Estimation

    Authors: Shuai Yuan, Carlo Tomasi

    Abstract: Both optical flow and stereo disparities are image matches and can therefore benefit from joint training. Depth and 3D motion provide geometric rather than photometric information and can further improve optical flow. Accordingly, we design a first network that estimates flow and disparity jointly and is trained without supervision. A second network, trained with optical flow from the first as pse…

    Submitted 7 October, 2023; originally announced October 2023.

  6. arXiv:2303.06209  [pdf, other]

    cs.CV cs.RO

    SemARFlow: Injecting Semantics into Unsupervised Optical Flow Estimation for Autonomous Driving

    Authors: Shuai Yuan, Shuzhi Yu, Hannah Kim, Carlo Tomasi

    Abstract: Unsupervised optical flow estimation is especially hard near occlusions and motion boundaries and in low-texture regions. We show that additional information such as semantics and domain knowledge can help better constrain this problem. We introduce SemARFlow, an unsupervised optical flow network designed for autonomous driving data that takes estimated semantic segmentation masks as additional in…

    Submitted 8 August, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by ICCV-2023; Code is available at https://github.com/duke-vision/semantic-unsup-flow-release

  7. arXiv:2208.02305  [pdf, other]

    cs.CV

    Unsupervised Flow Refinement near Motion Boundaries

    Authors: Shuzhi Yu, Hannah Halin Kim, Shuai Yuan, Carlo Tomasi

    Abstract: Unsupervised optical flow estimators based on deep learning have attracted increasing attention due to the cost and difficulty of annotating for ground truth. Although performance measured by average End-Point Error (EPE) has improved over the years, flow estimates are still poorer along motion boundaries (MBs), where the flow is not smooth, as is typically assumed, and where features computed by…

    Submitted 3 August, 2022; originally announced August 2022.

  8. arXiv:2207.04132  [pdf, other]

    cs.CV

    Cross-Attention Transformer for Video Interpolation

    Authors: Hannah Halin Kim, Shuzhi Yu, Shuai Yuan, Carlo Tomasi

    Abstract: We propose TAIN (Transformers and Attention for video INterpolation), a residual neural network for video interpolation, which aims to interpolate an intermediate frame given two consecutive image frames around it. We first present a novel vision transformer module, named Cross Similarity (CS), to globally aggregate input image features with similar appearance as those of the predicted interpolate…

    Submitted 1 December, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

  9. arXiv:2203.05053  [pdf, other]

    cs.CV

    Optical Flow Training under Limited Label Budget via Active Learning

    Authors: Shuai Yuan, Xian Sun, Hannah Kim, Shuzhi Yu, Carlo Tomasi

    Abstract: Supervised training of optical flow predictors generally yields better accuracy than unsupervised training. However, the improved performance comes at an often high annotation cost. Semi-supervised training trades off accuracy against annotation cost. We use a simple yet effective semi-supervised training method to show that even a small fraction of labels can improve flow accuracy by a significan…

    Submitted 28 September, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022

  10. arXiv:2111.01261  [pdf, other]

    cs.CV

    Joint Detection of Motion Boundaries and Occlusions

    Authors: Hannah Halin Kim, Shuzhi Yu, Carlo Tomasi

    Abstract: We propose MONet, a convolutional neural network that jointly detects motion boundaries (MBs) and occlusion regions (Occs) in video both forward and backward in time. Detection is difficult because optical flow is discontinuous along MBs and undefined in Occs, while many flow estimators assume smoothness and a flow defined everywhere. To reason in the two time directions simultaneously, we direct-…

    Submitted 1 November, 2021; originally announced November 2021.

    Journal ref: The British Machine Vision Conference (BMVC), 2021

  11. arXiv:2109.05873  [pdf, other]

    math.NA cs.LG

    Construction of Grid Operators for Multilevel Solvers: a Neural Network Approach

    Authors: Claudio Tomasi, Rolf Krause

    Abstract: In this paper, we investigate the combination of multigrid methods and neural networks, starting from a Finite Element discretization of an elliptic PDE. Multigrid methods use interpolation operators to transfer information between different levels of approximation. These operators are crucial for fast convergence of multigrid, but they are generally unknown. We propose Deep Neural Network models…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: To appear in Springer Journal: "The 26th International Domain Decomposition Conference (DD26)"

  12. arXiv:1905.10944  [pdf, other]

    cs.LG cs.CV stat.ML

    Identity Connections in Residual Nets Improve Noise Stability

    Authors: Shuzhi Yu, Carlo Tomasi

    Abstract: Residual Neural Networks (ResNets) achieve state-of-the-art performance in many computer vision problems. Compared to plain networks without residual connections (PlnNets), ResNets train faster, generalize better, and suffer less from the so-called degradation problem. We introduce simplified (but still nonlinear) versions of ResNets and PlnNets for which these discrepancies still hold, although t…

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: ICML 2019 Workshop on Understanding and Improving Generalization in Deep Learning, additional analysis on a property called Dominant Gradient Flow of Residual Nets in Appendix D

  13. arXiv:1803.10859  [pdf, other]

    cs.CV

    Features for Multi-Target Multi-Camera Tracking and Re-Identification

    Authors: Ergys Ristani, Carlo Tomasi

    Abstract: Multi-Target Multi-Camera Tracking (MTMCT) tracks many people through video taken from several cameras. Person Re-Identification (Re-ID) retrieves from a gallery images of people similar to a person query image. We learn good features for both MTMCT and Re-ID with a convolutional neural network. Our contributions include an adaptive weighted triplet loss for training and a new technique for hard-i…

    Submitted 28 March, 2018; originally announced March 2018.

    Comments: Accepted as spotlight at CVPR 2018

  14. arXiv:1609.01775  [pdf, other]

    cs.CV

    Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking

    Authors: Ergys Ristani, Francesco Solera, Roger S. Zou, Rita Cucchiara, Carlo Tomasi

    Abstract: To help accelerate progress in multi-target, multi-camera tracking systems, we present (i) a new pair of precision-recall measures of performance that treats errors of all types uniformly and emphasizes correct identification over sources of error; (ii) the largest fully-annotated and calibrated data set to date with more than 2 million frames of 1080p, 60fps video taken by 8 cameras observing mor…

    Submitted 19 September, 2016; v1 submitted 6 September, 2016; originally announced September 2016.

    Comments: ECCV 2016 Workshop on Benchmarking Multi-Target Tracking

  15. arXiv:cs/0606098  [pdf, ps, other]

    cs.GR cs.CG

    Outlier Robust ICP for Minimizing Fractional RMSD

    Authors: Jeff M. Phillips, Ran Liu, Carlo Tomasi

    Abstract: We describe a variation of the iterative closest point (ICP) algorithm for aligning two point sets under a set of transformations. Our algorithm is superior to previous algorithms because (1) in determining the optimal alignment, it identifies and discards likely outliers in a statistically robust manner, and (2) it is guaranteed to converge to a locally optimal solution. To this end, we formali…

    Submitted 22 June, 2006; originally announced June 2006.

    Comments: 22 pages, 7 Figures, 9 Tables

    Report number: Duke University Technical Report: CS-2006-05