Showing 1–31 of 31 results for author: Baltrusaitis, T

Searching in archive cs.
  1. arXiv:2505.20582  [pdf, ps, other]

    cs.CV

    Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting

    Authors: Yizhou Zhao, Chunjiang Liu, Haoyu Chen, Bhiksha Raj, Min Xu, Tadas Baltrusaitis, Mitch Rundle, HsiangTao Wu, Kamran Ghasedi

    Abstract: Face reenactment and portrait relighting are essential tasks in portrait editing, yet they are typically addressed independently, without much synergy. Most face reenactment methods prioritize motion control and multiview consistency, while portrait relighting focuses on adjusting shading effects. To take advantage of both geometric consistency and illumination awareness, we introduce Total-Editin…

    Submitted 26 May, 2025; originally announced May 2025.

  2. arXiv:2502.05505  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model

    Authors: Zinan Lin, Tadas Baltrusaitis, Wenyu Wang, Sergey Yekhanin

    Abstract: Differentially private (DP) synthetic data, which closely resembles the original private data while maintaining strong privacy guarantees, has become a key tool for unlocking the value of private data without compromising privacy. Recently, Private Evolution (PE) has emerged as a promising method for generating DP synthetic data. Unlike other training-based approaches, PE only requires access to i…

    Submitted 20 May, 2025; v1 submitted 8 February, 2025; originally announced February 2025.

    Comments: Published in: (1) ICLR 2025 Workshop on Data Problems, (2) ICLR 2025 Workshop on Synthetic Data

  3. arXiv:2412.07739  [pdf, other]

    cs.CV cs.AI cs.GR

    GASP: Gaussian Avatars with Synthetic Priors

    Authors: Jack Saunders, Charlie Hewitt, Yanan Jian, Marek Kowalski, Tadas Baltrusaitis, Yiye Chen, Darren Cosker, Virginia Estellers, Nicholas Gyde, Vinay P. Namboodiri, Benjamin E Lundell

    Abstract: Gaussian Splatting has changed the game for real-time photo-realistic rendering. One of the most popular applications of Gaussian Splatting is to create animatable avatars, known as Gaussian Avatars. Recent works have pushed the boundaries of quality and rendering efficiency but suffer from two main limitations. Either they require expensive multi-camera rigs to produce avatars with free-view rend…

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: Project page: https://microsoft.github.io/GASP/

  4. arXiv:2410.20552  [pdf, other]

    cs.CV cs.AI

    SympCam: Remote Optical Measurement of Sympathetic Arousal

    Authors: Björn Braun, Daniel McDuff, Tadas Baltrusaitis, Paul Streli, Max Moebus, Christian Holz

    Abstract: Recent work has shown that a person's sympathetic arousal can be estimated from facial videos alone using basic signal processing. This opens up new possibilities in the field of telehealth and stress management, providing a non-invasive method to measure stress only using a regular RGB camera. In this paper, we present SympCam, a new 3D convolutional architecture tailored to the task of remote sy…

    Submitted 27 October, 2024; originally announced October 2024.

    Comments: Accepted for publication at the IEEE-EMBS International Conference on Biomedical and Health Informatics

  5. Eyelid Fold Consistency in Facial Modeling

    Authors: Lohit Petikam, Charlie Hewitt, Fatemeh Saleh, Tadas Baltrušaitis

    Abstract: Eyelid shape is integral to identity and likeness in human facial modeling. Human eyelids are diverse in appearance with varied skin fold and epicanthal fold morphology between individuals. Existing parametric face models express eyelid shape variation to an extent, but do not preserve sufficient likeness across a diverse range of individuals. We propose a new definition of eyelid fold consistency…

    Submitted 17 October, 2024; originally announced October 2024.

  6. Hairmony: Fairness-aware hairstyle classification

    Authors: Givi Meishvili, James Clemoes, Charlie Hewitt, Zafiirah Hosenie, Xian Xiao, Martin de La Gorce, Tibor Takacs, Tadas Baltrusaitis, Antonio Criminisi, Chyna McRae, Nina Jablonski, Marta Wilczkowiak

    Abstract: We present a method for prediction of a person's hairstyle from a single image. Despite growing use cases in user digitization and enrollment for virtual experiences, available methods are limited, particularly in the range of hairstyles they can capture. Human hair is extremely diverse and lacks any universally accepted description or categorization, making this a challenging task. Most current m…

    Submitted 15 October, 2024; originally announced October 2024.

  7. arXiv:2410.11520  [pdf, other]

    cs.CV cs.GR

    Look Ma, no markers: holistic performance capture without the hassle

    Authors: Charlie Hewitt, Fatemeh Saleh, Sadegh Aliakbarian, Lohit Petikam, Shideh Rezaeifar, Louis Florentin, Zafiirah Hosenie, Thomas J Cashman, Julien Valentin, Darren Cosker, Tadas Baltrusaitis

    Abstract: We tackle the problem of highly-accurate, holistic performance capture for the face, body and hands simultaneously. Motion-capture technologies used in film and game production typically focus only on face, body or hand capture independently, involve complex and expensive hardware and a high degree of manual intervention from skilled operators. While machine-learning-based approaches exist to over…

    Submitted 15 October, 2024; originally announced October 2024.

  8. arXiv:2401.14785  [pdf, other]

    cs.CV

    SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras

    Authors: Hanz Cuevas-Velasquez, Charlie Hewitt, Sadegh Aliakbarian, Tadas Baltrušaitis

    Abstract: Our work addresses the problem of egocentric human pose estimation from downwards-facing cameras on head-mounted devices (HMD). This presents a challenging scenario, as parts of the body often fall outside of the image or are occluded. Previous solutions minimize this problem by using fish-eye camera lenses to capture a wider view, but these can present hardware design issues. They also predict 2D…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted in 3DV 2024

  9. arXiv:2311.06930  [pdf, other]

    cs.CV

    Video-based sympathetic arousal assessment via peripheral blood flow estimation

    Authors: Bjoern Braun, Daniel McDuff, Tadas Baltrusaitis, Christian Holz

    Abstract: Electrodermal activity (EDA) is considered a standard marker of sympathetic activity. However, traditional EDA measurement requires electrodes in steady contact with the skin. Can sympathetic arousal be measured using only an optical sensor, such as an RGB camera? This paper presents a novel approach to infer sympathetic arousal by measuring the peripheral blood flow on the face or hand optically.…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted and to be published at Biomedical Optics Express

  10. arXiv:2303.11225  [pdf, other]

    cs.CV cs.GR

    HiFace: High-Fidelity 3D Face Reconstruction by Learning Static and Dynamic Details

    Authors: Zenghao Chai, Tianke Zhang, Tianyu He, Xu Tan, Tadas Baltrušaitis, HsiangTao Wu, Runnan Li, Sheng Zhao, Chun Yuan, Jiang Bian

    Abstract: 3D Morphable Models (3DMMs) demonstrate great potential for reconstructing faithful and animatable 3D facial surfaces from a single image. The facial surface is influenced by the coarse shape, as well as the static detail (e.g., person-specific appearance) and dynamic detail (e.g., expression-driven wrinkles). Previous work struggles to decouple the static and dynamic details through image-level s…

    Submitted 23 August, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV 2023, camera-ready version; Project page: https://project-hiface.github.io/

  11. arXiv:2301.01161  [pdf, other]

    cs.CV cs.GR

    Procedural Humans for Computer Vision

    Authors: Charlie Hewitt, Tadas Baltrušaitis, Erroll Wood, Lohit Petikam, Louis Florentin, Hanz Cuevas Velasquez

    Abstract: Recent work has shown the benefits of synthetic data for use in computer vision, with applications ranging from autonomous driving to face landmark detection and reconstruction. There are a number of benefits of using synthetic data from privacy preservation and bias elimination to quality and feasibility of annotation. Generating human-centered synthetic data is a particular challenge in terms of…

    Submitted 3 January, 2023; originally announced January 2023.

  12. arXiv:2212.06135  [pdf, other]

    cs.CV

    Rodin: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion

    Authors: Tengfei Wang, Bo Zhang, Ting Zhang, Shuyang Gu, Jianmin Bao, Tadas Baltrusaitis, Jingjing Shen, Dong Chen, Fang Wen, Qifeng Chen, Baining Guo

    Abstract: This paper presents a 3D generative model that uses diffusion models to automatically generate 3D digital avatars represented as neural radiance fields. A significant challenge in generating such avatars is that the memory and processing costs in 3D are prohibitive for producing the rich details required for high-quality avatars. To tackle this problem we propose the roll-out diffusion network (Ro…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Project Webpage: https://3d-avatar-diffusion.microsoft.com/

  13. arXiv:2210.11594  [pdf, other]

    cs.CV

    Photo-realistic 360 Head Avatars in the Wild

    Authors: Stanislaw Szymanowicz, Virginia Estellers, Tadas Baltrusaitis, Matthew Johnson

    Abstract: Delivering immersive, 3D experiences for human communication requires a method to obtain 360 degree photo-realistic avatars of humans. To make these experiences accessible to all, only commodity hardware, like mobile phone cameras, should be necessary to capture the data needed for avatar creation. For avatars to be rendered realistically from any viewpoint, we require training images and camera p…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Workshop on Computer Vision for Metaverse

  14. arXiv:2210.03529  [pdf, other]

    cs.CV cs.GR

    Mesh-Tension Driven Expression-Based Wrinkles for Synthetic Faces

    Authors: Chirag Raman, Charlie Hewitt, Erroll Wood, Tadas Baltrusaitis

    Abstract: Recent advances in synthesizing realistic faces have shown that synthetic training data can replace real data for various face-related computer vision tasks. A question arises: how important is realism? Is the pursuit of photorealism excessive? In this work, we show otherwise. We boost the realism of our synthetic faces by introducing dynamic skin wrinkles in response to facial expressions and obs…

    Submitted 11 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: In Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  15. arXiv:2210.02579  [pdf, other]

    cs.CV

    DigiFace-1M: 1 Million Digital Face Images for Face Recognition

    Authors: Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien Valentin, Roberto Cipolla, Jingjing Shen

    Abstract: State-of-the-art face recognition models show impressive accuracy, achieving over 99.8% on Labeled Faces in the Wild (LFW) dataset. Such models are trained on large-scale datasets that contain millions of real human face images collected from the internet. Web-crawled face images are severely biased (in terms of race, lighting, make-up, etc) and often contain label noise. More importantly, the fac…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: WACV 2023

  16. arXiv:2206.04197  [pdf, other]

    cs.CV cs.AI

    SCAMPS: Synthetics for Camera Measurement of Physiological Signals

    Authors: Daniel McDuff, Miah Wander, Xin Liu, Brian L. Hill, Javier Hernandez, Jonathan Lester, Tadas Baltrusaitis

    Abstract: The use of cameras and computational algorithms for noninvasive, low-cost and scalable measurement of physiological (e.g., cardiac and pulmonary) vital signs is very attractive. However, diverse data representing a range of environments, body motions, illumination conditions and physiological states is laborious, time consuming and expensive to obtain. Synthetic data have proven a valuable tool in…

    Submitted 8 June, 2022; originally announced June 2022.

  17. arXiv:2204.02776  [pdf, other]

    cs.CV

    3D face reconstruction with dense landmarks

    Authors: Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljevic, Daniel Wilde, Stephan Garbin, Chirag Raman, Jamie Shotton, Toby Sharp, Ivan Stojiljkovic, Tom Cashman, Julien Valentin

    Abstract: Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering. Can we keep things simple by just using more landmarks? In answer, we present the first m…

    Submitted 20 July, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: ECCV 2022

  18. arXiv:2110.04902  [pdf, other]

    cs.CV

    Synthetic Data for Multi-Parameter Camera-Based Physiological Sensing

    Authors: Daniel McDuff, Xin Liu, Javier Hernandez, Erroll Wood, Tadas Baltrusaitis

    Abstract: Synthetic data is a powerful tool in training data hungry deep learning algorithms. However, to date, camera-based physiological sensing has not taken full advantage of these techniques. In this work, we leverage a high-fidelity synthetics pipeline for generating videos of faces with faithful blood flow and breathing patterns. We present systematic experiments showing how physiologically-grounded…

    Submitted 10 October, 2021; originally announced October 2021.

  19. arXiv:2109.15102  [pdf, other]

    cs.CV

    Fake It Till You Make It: Face analysis in the wild using synthetic data alone

    Authors: Erroll Wood, Tadas Baltrušaitis, Charlie Hewitt, Sebastian Dziadzio, Matthew Johnson, Virginia Estellers, Thomas J. Cashman, Jamie Shotton

    Abstract: We demonstrate that it is possible to perform face-related computer vision in the wild using synthetic data alone. The community has long enjoyed the benefits of synthesizing training data with graphics, but the domain gap between real and synthetic data has remained a problem, especially for human faces. Researchers have tried to bridge this gap with data mixing, domain adaptation, and domain-adv…

    Submitted 5 October, 2021; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: ICCV 2021. Amended acknowledgements

  20. arXiv:2010.12949  [pdf, other]

    cs.CV cs.AI cs.LG

    Advancing Non-Contact Vital Sign Measurement using Synthetic Avatars

    Authors: Daniel McDuff, Javier Hernandez, Erroll Wood, Xin Liu, Tadas Baltrusaitis

    Abstract: Non-contact physiological measurement has the potential to provide low-cost, non-invasive health monitoring. However, machine vision approaches are often limited by the availability and diversity of annotated video datasets resulting in poor generalization to complex real-life conditions. To address these challenges, this work proposes the use of synthetic avatars that display facial blood flow ch…

    Submitted 24 October, 2020; originally announced October 2020.

  21. arXiv:2007.08364  [pdf, other]

    cs.CV cs.LG

    A high fidelity synthetic face framework for computer vision

    Authors: Tadas Baltrusaitis, Erroll Wood, Virginia Estellers, Charlie Hewitt, Sebastian Dziadzio, Marek Kowalski, Matthew Johnson, Thomas J. Cashman, Jamie Shotton

    Abstract: Analysis of faces is one of the core applications of computer vision, with tasks ranging from landmark alignment, head pose estimation, expression recognition, and face recognition among others. However, building reliable methods requires time-consuming data collection and often even more time-consuming manual annotation, which can be unreliable. In our work we propose synthesizing such facial dat…

    Submitted 16 July, 2020; originally announced July 2020.

  22. arXiv:2005.02671  [pdf, other]

    cs.CV cs.LG

    CONFIG: Controllable Neural Face Image Generation

    Authors: Marek Kowalski, Stephan J. Garbin, Virginia Estellers, Tadas Baltrušaitis, Matthew Johnson, Jamie Shotton

    Abstract: Our ability to sample realistic natural images, particularly faces, has advanced by leaps and bounds in recent years, yet our ability to exert fine-tuned control over the generative process has lagged behind. If this new technology is to find practical uses, we need to achieve a level of control over generative networks which, without sacrificing realism, is on par with that seen in computer graph…

    Submitted 19 October, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: includes supplementary materials

  23. arXiv:1802.00924  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Multimodal Sentiment Analysis with Word-Level Fusion and Reinforcement Learning

    Authors: Minghai Chen, Sen Wang, Paul Pu Liang, Tadas Baltrušaitis, Amir Zadeh, Louis-Philippe Morency

    Abstract: With the increasing popularity of video sharing websites such as YouTube and Facebook, multimodal sentiment analysis has received increasing attention from the scientific community. Contrary to previous works in multimodal sentiment analysis which focus on holistic information in speech segments such as bag of words representations and average facial expression intensity, we develop a novel deep a…

    Submitted 3 February, 2018; originally announced February 2018.

    Comments: ICMI 2017 Oral Presentation, Honorable Mention Award

  24. arXiv:1711.08690  [pdf, other]

    cs.CV cs.MM

    Attended End-to-end Architecture for Age Estimation from Facial Expression Videos

    Authors: Wenjie Pei, Hamdi Dibeklioğlu, Tadas Baltrušaitis, David M. J. Tax

    Abstract: The main challenges of age estimation from facial expression videos lie not only in the modeling of the static facial appearance, but also in the capturing of the temporal facial dynamics. Traditional techniques to this problem focus on constructing handcrafted features to explore the discriminative information contained in facial appearance and dynamics separately. This relies on sophisticated fe…

    Submitted 30 November, 2019; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Accepted by Transactions on Image Processing (TIP)

  25. arXiv:1708.00370  [pdf, other]

    cs.CV

    Hand2Face: Automatic Synthesis and Recognition of Hand Over Face Occlusions

    Authors: Behnaz Nojavanasghari, Charles. E. Hughes, Tadas Baltrusaitis, Louis-philippe Morency

    Abstract: A person's face discloses important information about their affective state. Although there has been extensive research on recognition of facial expressions, the performance of existing approaches is challenged by facial occlusions. Facial occlusions are often treated as noise and discarded in recognition of affective states. However, hand over face occlusions can provide additional information fo…

    Submitted 16 August, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: Accepted to International Conference on Affective Computing and Intelligent Interaction (ACII), 2017

  26. arXiv:1706.07867  [pdf, other]

    cs.LG

    Preserving Intermediate Objectives: One Simple Trick to Improve Learning for Hierarchical Models

    Authors: Abhilasha Ravichander, Shruti Rijhwani, Rajat Kulshreshtha, Chirag Nagpal, Tadas Baltrušaitis, Louis-Philippe Morency

    Abstract: Hierarchical models are utilized in a wide variety of problems which are characterized by task hierarchies, where predictions on smaller subtasks are useful for trying to predict a final task. Typically, neural networks are first trained for the subtasks, and the predictions of these networks are subsequently used as additional features when training a model and doing inference for a final task. I…

    Submitted 23 June, 2017; originally announced June 2017.

  27. arXiv:1705.09406  [pdf, other]

    cs.LG

    Multimodal Machine Learning: A Survey and Taxonomy

    Authors: Tadas Baltrušaitis, Chaitanya Ahuja, Louis-Philippe Morency

    Abstract: Our experience of the world is multimodal - we see objects, hear sounds, feel texture, smell odors, and taste flavors. Modality refers to the way in which something happens or is experienced and a research problem is characterized as multimodal when it includes multiple such modalities. In order for Artificial Intelligence to make progress in understanding the world around us, it needs to be able…

    Submitted 1 August, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  28. arXiv:1704.08763  [pdf, other]

    cs.CV

    GazeDirector: Fully Articulated Eye Gaze Redirection in Video

    Authors: Erroll Wood, Tadas Baltrusaitis, Louis-Philippe Morency, Peter Robinson, Andreas Bulling

    Abstract: We present GazeDirector, a new approach for eye gaze redirection that uses model-fitting. Our method first tracks the eyes by fitting a multi-part eye region model to video frames using analysis-by-synthesis, thereby recovering eye region shape, texture, pose, and gaze simultaneously. It then redirects gaze by 1) warping the eyelids from the original image using a model-derived flow field, and 2)…

    Submitted 27 April, 2017; originally announced April 2017.

  29. arXiv:1612.00385  [pdf, other]

    cs.CV cs.CL

    Temporal Attention-Gated Model for Robust Sequence Classification

    Authors: Wenjie Pei, Tadas Baltrušaitis, David M. J. Tax, Louis-Philippe Morency

    Abstract: Typical techniques for sequence classification are designed for well-segmented sequences which have been edited to remove noisy or irrelevant parts. Therefore, such methods cannot be easily applied on noisy sequences expected in real-world applications. In this paper, we present the Temporal Attention-Gated Model (TAGM) which integrates ideas from attention models and gated recurrent networks to b…

    Submitted 15 April, 2017; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: Accepted by CVPR 2017

  30. arXiv:1611.08657  [pdf, other]

    cs.CV cs.AI

    Convolutional Experts Constrained Local Model for Facial Landmark Detection

    Authors: Amir Zadeh, Tadas Baltrušaitis, Louis-Philippe Morency

    Abstract: Constrained Local Models (CLMs) are a well-established family of methods for facial landmark detection. However, they have recently fallen out of favor to cascaded regression-based approaches. This is in part due to the inability of existing CLM local detectors to model the very complex individual landmark appearance that is affected by expression, illumination, facial hair, makeup, and accessorie…

    Submitted 26 July, 2017; v1 submitted 25 November, 2016; originally announced November 2016.

    Comments: Accepted at CVPR-W 2017

  31. arXiv:1505.05916  [pdf, other]

    cs.CV

    Rendering of Eyes for Eye-Shape Registration and Gaze Estimation

    Authors: Erroll Wood, Tadas Baltrusaitis, Xucong Zhang, Yusuke Sugano, Peter Robinson, Andreas Bulling

    Abstract: Images of the eye are key in several computer vision problems, such as shape registration and gaze estimation. Recent large-scale supervised methods for these problems require time-consuming data collection and manual annotation, which can be unreliable. We propose synthesizing perfectly labelled photo-realistic training data in a fraction of the time. We used computer graphics techniques to build…

    Submitted 21 May, 2015; originally announced May 2015.