Skip to main content

Showing 1–45 of 45 results for author: Huber, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.21910  [pdf, ps, other

    cs.CL

    AutoMixer: Checkpoint Artifacts as Automatic Data Mixers

    Authors: Ernie Chang, Yang Li, Patrick Huber, David Kant, Yangyang Shi, Vikas Chandra

    Abstract: In language model training, it is desirable to equip models with capabilities from various tasks. However, it is not clear how to directly obtain the right data mixtures for these capabilities as the relationship between data and tasks is difficult to be modeled. In this work, we observe that checkpoint models exhibit emerging capabilities at different points in the training trajectory. Often, the… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Accepted at ACL 2025

  2. arXiv:2506.02472  [pdf, ps, other

    cs.CV

    HRTR: A Single-stage Transformer for Fine-grained Sub-second Action Segmentation in Stroke Rehabilitation

    Authors: Halil Ismail Helvaci, Justin Philip Huber, Jihye Bae, Sen-ching Samson Cheung

    Abstract: Stroke rehabilitation often demands precise tracking of patient movements to monitor progress, with complexities of rehabilitation exercises presenting two critical challenges: fine-grained and sub-second (under one-second) action detection. In this work, we propose the High Resolution Temporal Transformer (HRTR), to time-localize and classify high-resolution (fine-grained), sub-second actions in… ▽ More

    Submitted 11 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  3. arXiv:2503.00245  [pdf, other

    cs.LG cs.CL

    CoSMoEs: Compact Sparse Mixture of Experts

    Authors: Patrick Huber, Akshat Shrivastava, Ernie Chang, Chinnadhurai Sankar, Ahmed Aly, Adithya Sagar

    Abstract: Sparse Mixture of Expert (MoE) models are popular foundational architectures at large scale, however, under-explored at smaller sizes. Here, we show how to enable Compact Sparse Mixture of Experts (CoSMoEs) for on-device inference. Specifically, we tackle the three main on-device dimensions: Quality, Memory and Latency. Along the quality axis, we show that in a fair evaluation (removing confoundin… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: 11 pages, 8 figures

  4. arXiv:2410.03083  [pdf, other

    cs.CL cs.AI

    Scaling Parameter-Constrained Language Models with Quality Data

    Authors: Ernie Chang, Matteo Paltenghi, Yang Li, Pin-Jie Lin, Changsheng Zhao, Patrick Huber, Zechun Liu, Rastislav Rabatin, Yangyang Shi, Vikas Chandra

    Abstract: Scaling laws in language modeling traditionally quantify training loss as a function of dataset size and model parameters, providing compute-optimal estimates but often neglecting the impact of data quality on model generalization. In this paper, we extend the conventional understanding of scaling law by offering a microscopic view of data quality within the original formulation -- effective train… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024 Industry Track, 18 pages, 9 figures, 4 tables

  5. arXiv:2408.11219  [pdf, other

    cs.CL cs.AI

    CoDi: Conversational Distillation for Grounded Question Answering

    Authors: Patrick Huber, Arash Einolghozati, Rylan Conway, Kanika Narang, Matt Smith, Waqar Nayyar, Adithya Sagar, Ahmed Aly, Akshat Shrivastava

    Abstract: Distilling conversational skills into Small Language Models (SLMs) with approximately 1 billion parameters presents significant challenges. Firstly, SLMs have limited capacity in their model parameters to learn extensive knowledge compared to larger models. Secondly, high-quality conversational datasets are often scarce, small, and domain-specific. Addressing these challenges, we introduce a novel… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 13 pages

  6. arXiv:2402.18113  [pdf, other

    cs.CL cs.AI

    Small But Funny: A Feedback-Driven Approach to Humor Distillation

    Authors: Sahithya Ravi, Patrick Huber, Akshat Shrivastava, Aditya Sagar, Ahmed Aly, Vered Shwartz, Arash Einolghozati

    Abstract: The emergence of Large Language Models (LLMs) has brought to light promising language generation capabilities, particularly in performing tasks like complex reasoning and creative writing. Consequently, distillation through imitation of teacher responses has emerged as a popular technique to transfer knowledge from LLMs to more accessible, Small Language Models (SLMs). While this works well for si… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2402.10466  [pdf, other

    cs.CL cs.AI

    Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

    Authors: Zekun Li, Zhiyu Zoey Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook

    Abstract: Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness in task-oriented dialogues (TOD), which requires not only response generation but also effective dialogue state tracking (DST) within specific tasks and domains, remains less satisfying. In this work, we… ▽ More

    Submitted 30 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 Main. Code available at: https://github.com/facebookresearch/FnCTOD

  8. arXiv:2310.13248  [pdf, other

    cs.LG cs.AI cs.CY cs.SI

    FLEE-GNN: A Federated Learning System for Edge-Enhanced Graph Neural Network in Analyzing Geospatial Resilience of Multicommodity Food Flows

    Authors: Yuxiao Qu, Jinmeng Rao, Song Gao, Qianheng Zhang, Wei-Lun Chao, Yu Su, Michelle Miller, Alfonso Morales, Patrick Huber

    Abstract: Understanding and measuring the resilience of food supply networks is a global imperative to tackle increasing food insecurity. However, the complexity of these networks, with their multidimensional interactions and decisions, presents significant challenges. This paper proposes FLEE-GNN, a novel Federated Learning System for Edge-Enhanced Graph Neural Network, designed to overcome these challenge… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 10 pages, 5 figures

    ACM Class: I.2

    Journal ref: ACM SIGSPATIAL GeoAI 2023

  9. arXiv:2309.10880  [pdf

    cs.CL cs.AI cs.CY cs.IR

    Classifying Organizations for Food System Ontologies using Natural Language Processing

    Authors: Tianyu Jiang, Sonia Vinogradova, Nathan Stringham, E. Louise Earl, Allan D. Hollander, Patrick R. Huber, Ellen Riloff, R. Sandra Schillo, Giorgio A. Ubbiali, Matthew Lange

    Abstract: Our research explores the use of natural language processing (NLP) methods to automatically classify entities for the purpose of knowledge graph population and integration with food system ontologies. We have created NLP models that can automatically classify organizations with respect to categories associated with environmental issues as well as Standard Industrial Classification (SIC) codes, whi… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Presented at IFOW 2023 Integrated Food Ontology Workshop at the Formal Ontology in Information Systems Conference (FOIS) 2023 in Sherbrooke, Quebec, Canada July 17-20th, 2023

    ACM Class: H.3.1; I.2.7; J.3; J.4; K.4.3

  10. arXiv:2307.13639  [pdf, other

    cs.CV

    Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Reconstruction

    Authors: Will Rowan, Patrik Huber, Nick Pears, Andrew Keeling

    Abstract: Accurate 3D face reconstruction from 2D images is an enabling technology with applications in healthcare, security, and creative industries. However, current state-of-the-art methods either rely on supervised training with very limited 3D data or self-supervised training with 2D image data. To bridge this gap, we present a method to generate a large-scale synthesised dataset of 250K photorealistic… ▽ More

    Submitted 8 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  11. arXiv:2304.07522  [pdf, other

    cs.CV

    ID2image: Leakage of non-ID information into face descriptors and inversion from descriptors to images

    Authors: Mingrui Li, William A. P. Smith, Patrik Huber

    Abstract: Embedding a face image to a descriptor vector using a deep CNN is a widely used technique in face recognition. Via several possible training strategies, such embeddings are supposed to capture only identity information. Information about the environment (such as background and lighting) or changeable aspects of the face (such as pose, expression, presence of glasses, hat etc.) should be discarded… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: SCIA 2023

  12. arXiv:2304.04640  [pdf, other

    cs.AI

    NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems

    Authors: Jason Yik, Korneel Van den Berghe, Douwe den Blanken, Younes Bouhadjar, Maxime Fabre, Paul Hueber, Weijie Ke, Mina A Khoei, Denis Kleyko, Noah Pacik-Nelson, Alessandro Pierro, Philipp Stratmann, Pao-Sheng Vincent Sun, Guangzhi Tang, Shenqi Wang, Biyan Zhou, Soikat Hasan Ahmed, George Vathakkattil Joseph, Benedetto Leto, Aurora Micheli, Anurag Kumar Mishra, Gregor Lenz, Tao Sun, Zergham Ahmed, Mahmoud Akl , et al. (75 additional authors not shown)

    Abstract: Neuromorphic computing shows promise for advancing computing efficiency and capabilities of AI applications using brain-inspired principles. However, the neuromorphic research field currently lacks standardized benchmarks, making it difficult to accurately measure technological advancements, compare performance with conventional methods, and identify promising future research directions. Prior neu… ▽ More

    Submitted 14 January, 2025; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: To appear in Nature Neuromorphic Hardware and Computing collection

  13. arXiv:2303.02688  [pdf, other

    cs.CV

    Text2Face: A Multi-Modal 3D Face Model

    Authors: Will Rowan, Patrik Huber, Nick Pears, Andrew Keeling

    Abstract: We present the first 3D morphable modelling approach, whereby 3D face shape can be directly and completely defined using a textual prompt. Building on work in multi-modal learning, we extend the FLAME head model to a common image-and-text latent space. This allows for direct 3D Morphable Model (3DMM) parameter generation and therefore shape manipulation from textual descriptions. Our method, Text2… ▽ More

    Submitted 8 March, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: Fixed formatting and a typo

  14. arXiv:2302.05895  [pdf, other

    cs.CL

    Discourse Structure Extraction from Pre-Trained and Fine-Tuned Language Models in Dialogues

    Authors: Chuyuan Li, Patrick Huber, Wen Xiao, Maxime Amblard, Chloé Braud, Giuseppe Carenini

    Abstract: Discourse processing suffers from data sparsity, especially for dialogues. As a result, we explore approaches to build discourse structures for dialogues, based on attention matrices from Pre-trained Language Models (PLMs). We investigate multiple tasks for fine-tuning and show that the dialogue-tailored Sentence Ordering task performs best. To locate and exploit discourse information in PLMs, we… ▽ More

    Submitted 25 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Journal ref: Findings of the Association for Computational Linguistics: EACL 2023 (2023) 2562--2579

  15. arXiv:2212.06038  [pdf, other

    cs.CL

    Large Discourse Treebanks from Scalable Distant Supervision

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: Discourse parsing is an essential upstream task in Natural Language Processing with strong implications for many real-world applications. Despite its widely recognized role, most recent discourse parsers (and consequently downstream tasks) still rely on small-scale human-annotated discourse treebanks, trying to infer general-purpose discourse structures from very limited data in a few narrow domai… ▽ More

    Submitted 17 October, 2022; originally announced December 2022.

    Comments: Extended Abstract. Non Archival. 2 pages

    Journal ref: CODI 2020

  16. arXiv:2210.09565  [pdf, other

    cs.CL

    Towards Domain-Independent Supervised Discourse Parsing Through Gradient Boosting

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: Discourse analysis and discourse parsing have shown great impact on many important problems in the field of Natural Language Processing (NLP). Given the direct impact of discourse annotations on model performance and interpretability, robustly extracting discourse structures from arbitrary documents is a key task to further improve computational models in NLP. To this end, we present a new, superv… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Extended Abstract. Non Archival. 3 pages

    Journal ref: CODI 2022

  17. arXiv:2210.09559  [pdf, other

    cs.CL

    Unsupervised Inference of Data-Driven Discourse Structures using a Tree Auto-Encoder

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: With a growing need for robust and general discourse structures in many downstream tasks and real-world applications, the current lack of high-quality, high-quantity discourse trees poses a severe shortcoming. In order the alleviate this limitation, we propose a new strategy to generate tree structures in a task-agnostic, unsupervised fashion by extending a latent tree induction framework with an… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Extended Abstract. Non-Archival. 2 pages

    Journal ref: CODI 2020

  18. arXiv:2210.01548  [pdf, other

    cs.CV

    Neural Implicit Surface Reconstruction from Noisy Camera Observations

    Authors: Sarthak Gupta, Patrik Huber

    Abstract: Representing 3D objects and scenes with neural radiance fields has become very popular over the last years. Recently, surface-based representations have been proposed, that allow to reconstruct 3D objects from simple photographs. However, most current techniques require an accurate camera calibration, i.e. camera parameters corresponding to each image, which is often a difficult task to do in real… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: 4 pages - 2 for paper, 2 for supplementary

  19. arXiv:2209.08626  [pdf, other

    cs.CL

    Improving Topic Segmentation by Injecting Discourse Dependencies

    Authors: Linzi Xing, Patrick Huber, Giuseppe Carenini

    Abstract: Recent neural supervised topic segmentation models achieve distinguished superior effectiveness over unsupervised methods, with the availability of large-scale training corpora sampled from Wikipedia. These models may, however, suffer from limited robustness and transferability caused by exploiting simple linguistic cues for prediction, but overlooking more important inter-sentential topical consi… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: Accepted to the 3rd Workshop on Computational Approaches to Discourse (CODI-2022) at COLING 2022

  20. arXiv:2207.06793  [pdf, other

    cs.CV cs.GR

    Neural apparent BRDF fields for multiview photometric stereo

    Authors: Meghna Asthana, William A. P. Smith, Patrik Huber

    Abstract: We propose to tackle the multiview photometric stereo problem using an extension of Neural Radiance Fields (NeRFs), conditioned on light source direction. The geometric part of our neural representation predicts surface normal direction, allowing us to reason about local surface reflectance. The appearance part of our neural representation is decomposed into a neural bidirectional reflectance func… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: 9 pages, 6 figures, 1 table

  21. arXiv:2204.04289  [pdf, other

    cs.CL cs.AI

    Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: With a growing number of BERTology work analyzing different components of pre-trained language models, we extend this line of research through an in-depth analysis of discourse information in pre-trained and fine-tuned language models. We move beyond prior work along three dimensions: First, we describe a novel approach to infer discourse structures from arbitrarily long documents. Second, we prop… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 9 pages

    Journal ref: In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)

  22. arXiv:2112.06196  [pdf, other

    cs.CL cs.AI

    Predicting Above-Sentence Discourse Structure using Distant Supervision from Topic Segmentation

    Authors: Patrick Huber, Linzi Xing, Giuseppe Carenini

    Abstract: RST-style discourse parsing plays a vital role in many NLP tasks, revealing the underlying semantic/pragmatic structure of potentially complex and diverse documents. Despite its importance, one of the most prevailing limitations in modern day discourse parsing is the lack of large-scale datasets. To overcome the data sparsity issue, distantly supervised approaches from tasks like sentiment analysi… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  23. arXiv:2110.07731  [pdf, other

    cs.CL cs.LG

    CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training

    Authors: Patrick Huber, Armen Aghajanyan, Barlas Oğuz, Dmytro Okhonko, Wen-tau Yih, Sonal Gupta, Xilun Chen

    Abstract: With the rise of large-scale pre-trained language models, open-domain question-answering (ODQA) has become an important research topic in NLP. Based on the popular pre-training fine-tuning approach, we posit that an additional in-domain pre-training stage using a large-scale, natural, and diverse question-answering (QA) dataset can be beneficial for ODQA. Consequently, we propose a novel QA datase… ▽ More

    Submitted 2 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 9 pages, Findings of NAACL 2022

  24. arXiv:2106.02658  [pdf, other

    cs.CL cs.AI cs.LG

    W-RST: Towards a Weighted RST-style Discourse Framework

    Authors: Patrick Huber, Wen Xiao, Giuseppe Carenini

    Abstract: Aiming for a better integration of data-driven and linguistically-inspired approaches, we explore whether RST Nuclearity, assigning a binary assessment of importance between text segments, can be replaced by automatically generated, real-valued scores, in what we call a Weighted-RST framework. In particular, we find that weighted discourse trees from auxiliary tasks can benefit key NLP downstream… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 9 pages, Accepted at ACL 2021

  25. arXiv:2104.07058  [pdf, other

    cs.CL

    Predicting Discourse Trees from Transformer-based Neural Summarizers

    Authors: Wen Xiao, Patrick Huber, Giuseppe Carenini

    Abstract: Previous work indicates that discourse information benefits summarization. In this paper, we explore whether this synergy between discourse and summarization is bidirectional, by inferring document-level discourse trees from pre-trained neural summarizers. In particular, we generate unlabeled RST-style discourse trees from the self-attention matrices of the transformer model. Experiments across mo… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 14 pages, accepted by NAACL 2021

  26. arXiv:2012.09446  [pdf, other

    cs.CL cs.AI

    Unsupervised Learning of Discourse Structures using a Tree Autoencoder

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: Discourse information, as postulated by popular discourse theories, such as RST and PDTB, has been shown to improve an increasing number of downstream NLP tasks, showing positive effects and synergies of discourse with important real-world applications. While methods for incorporating discourse become more and more sophisticated, the growing need for robust and general discourse structures has not… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: Accepted to AAAI 2021, 7 pages

  27. arXiv:2012.02144  [pdf, other

    cs.CL

    Do We Really Need That Many Parameters In Transformer For Extractive Summarization? Discourse Can Help !

    Authors: Wen Xiao, Patrick Huber, Giuseppe Carenini

    Abstract: The multi-head self-attention of popular transformer models is widely used within Natural Language Processing (NLP), including for the task of extractive summarization. With the goal of analyzing and pruning the parameter-heavy self-attention mechanism, there are multiple approaches proposing more parameter-light self-attention alternatives. In this paper, we present a novel parameter-lean self-at… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: In the Proceeding of 1st Workshop on Computational Approaches to Discourse (CODI) at EMNLP 2020. 11 pages

  28. arXiv:2011.03203  [pdf, other

    cs.CL

    Unleashing the Power of Neural Discourse Parsers -- A Context and Structure Aware Approach Using Large Scale Pretraining

    Authors: Grigorii Guz, Patrick Huber, Giuseppe Carenini

    Abstract: RST-based discourse parsing is an important NLP task with numerous downstream applications, such as summarization, machine translation and opinion mining. In this paper, we demonstrate a simple, yet highly accurate discourse parser, incorporating recent contextual language models. Our parser establishes the new state-of-the-art (SOTA) performance for predicting structure and nuclearity on two key… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: 10 pages, 1 figure, COLING 2020

  29. arXiv:2011.03021  [pdf, other

    cs.CL

    From Sentiment Annotations to Sentiment Prediction through Discourse Augmentation

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: Sentiment analysis, especially for long documents, plausibly requires methods capturing complex linguistics structures. To accommodate this, we propose a novel framework to exploit task-related discourse for the task of sentiment analysis. More specifically, we are combining the large-scale, sentiment-dependent MEGA-DT treebank with a novel neural architecture for sentiment prediction, based on a… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: In Proceedings of the 28 International Conference on Computational Linguistics (COLING). 10 pages

  30. arXiv:2011.03017  [pdf, other

    cs.CL

    MEGA RST Discourse Treebanks with Structure and Nuclearity from Scalable Distant Sentiment Supervision

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: The lack of large and diverse discourse treebanks hinders the application of data-driven approaches, such as deep-learning, to RST-style discourse parsing. In this work, we present a novel scalable methodology to automatically generate discourse treebanks using distant supervision from sentiment-annotated datasets, creating and publishing MEGA-DT, a new large-scale discourse-annotated corpus. Our… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 9 pages

  31. arXiv:2007.10312  [pdf, other

    cs.DC cond-mat.mtrl-sci

    Workflows in AiiDA: Engineering a high-throughput, event-based engine for robust and modular computational workflows

    Authors: Martin Uhrin, Sebastiaan P. Huber, Jusong Yu, Nicola Marzari, Giovanni Pizzi

    Abstract: Over the last two decades, the field of computational science has seen a dramatic shift towards incorporating high-throughput computation and big-data analysis as fundamental pillars of the scientific discovery process. This has necessitated the development of tools and techniques to deal with the generation, storage and processing of large amounts of data. In this work we present an in-depth look… ▽ More

    Submitted 21 July, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Journal ref: Computational Materials Science 187, 110086 (2021)

  32. kiwiPy: Robust, high-volume, messaging for big-data and computational science workflows

    Authors: Martin Uhrin, Sebastiaan P. Huber

    Abstract: In this work we present kiwiPy, a Python library designed to support robust message based communication for high-throughput, big-data, applications while being general enough to be useful wherever high-volumes of messages need to be communicated in a predictable manner. KiwiPy relies on the RabbitMQ protocol, an industry standard message broker, while providing a simple and intuitive interface tha… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Journal ref: Journal of Open Source Software 5 2351 (2020)

  33. arXiv:2003.12476  [pdf, other

    cs.DC cond-mat.mtrl-sci

    AiiDA 1.0, a scalable computational infrastructure for automated reproducible workflows and data provenance

    Authors: Sebastiaan. P. Huber, Spyros Zoupanos, Martin Uhrin, Leopold Talirz, Leonid Kahle, Rico Häuselmann, Dominik Gresch, Tiziano Müller, Aliaksandr V. Yakutovich, Casper W. Andersen, Francisco F. Ramirez, Carl S. Adorf, Fernando Gargiulo, Snehal Kumbhar, Elsa Passaro, Conrad Johnston, Andrius Merkys, Andrea Cepellotti, Nicolas Mounet, Nicola Marzari, Boris Kozinsky, Giovanni Pizzi

    Abstract: The ever-growing availability of computing power and the sustained development of advanced computational methods have contributed much to recent scientific progress. These developments present new challenges driven by the sheer amount of calculations and data to manage. Next-generation exascale supercomputers will harden these challenges, such that automated and scalable solutions become crucial.… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Journal ref: Scientific Data 7, 300 (2020)

  34. arXiv:1912.04333  [pdf, other

    cs.CV physics.plasm-ph

    3D Particle Positions from Computer Stereo Vision in PK-4

    Authors: Daniel P. Mohr, Peter Huber, Mierk Schwabe, Christina A. Knapek

    Abstract: Complex plasmas consist of microparticles embedded in a low-temperature plasma containing ions, electrons and neutral particles. The microparticles form a dynamical system that can be used to study a multitude of effects on the level of the constituent particles. The microparticles are usually illuminated with a sheet of laser light, and the scattered light can be observed with digital cameras. So… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

  35. arXiv:1910.14176  [pdf, other

    cs.CL

    Predicting Discourse Structure using Distant Supervision from Sentiment

    Authors: Patrick Huber, Giuseppe Carenini

    Abstract: Discourse parsing could not yet take full advantage of the neural NLP revolution, mostly due to the lack of annotated datasets. We propose a novel approach that uses distant supervision on an auxiliary task (sentiment classification), to generate abundant data for RST-style discourse structure prediction. Our approach combines a neural variant of multiple-instance learning, using document-level su… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted to EMNLP 2019, 9 pages

  36. arXiv:1807.11582  [pdf, other

    cs.CL cs.LG stat.ML

    A Hierarchical Approach to Neural Context-Aware Modeling

    Authors: Patrick Huber, Jan Niehues, Alex Waibel

    Abstract: We present a new recurrent neural network topology to enhance state-of-the-art machine learning systems by incorporating a broader context. Our approach overcomes recent limitations with extended narratives through a multi-layered computational approach to generate an abstract context representation. Therefore, the developed system captures the narrative on word-level, sentence-level, and context-… ▽ More

    Submitted 6 August, 2018; v1 submitted 27 July, 2018; originally announced July 2018.

    Comments: 8 pages, 2 figures, 1 table

  37. arXiv:1803.08983  [pdf, ps, other

    cs.CL cs.AI

    Automated Evaluation of Out-of-Context Errors

    Authors: Patrick Huber, Jan Niehues, Alex Waibel

    Abstract: We present a new approach to evaluate computational models for the task of text understanding by the means of out-of-context error detection. Through the novel design of our automated modification process, existing large-scale data sources can be adopted for a vast number of text understanding tasks. The data is thereby altered on a semantic level, allowing models to be tested against a challengin… ▽ More

    Submitted 23 March, 2018; originally announced March 2018.

    Comments: LREC 2018, 5 pages, Out-of-Context Error Recognition, Automatic Evaluation Dataset, Text Understanding, TEDTalk

  38. arXiv:1803.05536  [pdf, other

    cs.CV

    Evaluation of Dense 3D Reconstruction from 2D Face Images in the Wild

    Authors: Zhen-Hua Feng, Patrik Huber, Josef Kittler, Peter JB Hancock, Xiao-Jun Wu, Qijun Zhao, Paul Koppen, Matthias Rätsch

    Abstract: This paper investigates the evaluation of dense 3D face reconstruction from a single 2D image in the wild. To this end, we organise a competition that provides a new benchmark dataset that contains 2000 2D facial images of 135 subjects as well as their 3D ground truth face scans. In contrast to previous competitions or challenges, the aim of this new benchmark dataset is to evaluate the accuracy o… ▽ More

    Submitted 20 April, 2018; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 8 pages

  39. arXiv:1711.06753  [pdf, other

    cs.CV

    Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

    Authors: Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu

    Abstract: We present a new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs). We first compare and analyse different loss functions including L2, L1 and smooth L1. The analysis of these loss functions suggests that, for the training of a CNN-based localisation model, more attention should be paid to small and medium range errors. To this end,… ▽ More

    Submitted 23 October, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: 11 pages, 6 figures, 6 tables

  40. 3D Morphable Models as Spatial Transformer Networks

    Authors: Anil Bas, Patrik Huber, William A. P. Smith, Muhammad Awais, Josef Kittler

    Abstract: In this paper, we show how a 3D Morphable Model (i.e. a statistical model of the 3D shape of a class of objects such as faces) can be used to spatially transform input data as a module (a 3DMM-STN) within a convolutional neural network. This is an extension of the original spatial transformer network in that we are able to interpret and normalise 3D pose changes and self-occlusions. The trained lo… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: Accepted to ICCV 2017 2nd Workshop on Geometry Meets Deep Learning

    MSC Class: 68T45 ACM Class: I.4.8; I.2.10

  41. arXiv:1705.02402  [pdf, other

    cs.CV

    Face Detection, Bounding Box Aggregation and Pose Estimation for Robust Facial Landmark Localisation in the Wild

    Authors: Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu

    Abstract: We present a framework for robust face detection and landmark localisation of faces in the wild, which has been evaluated as part of `the 2nd Facial Landmark Localisation Competition'. The framework has four stages: face detection, bounding box aggregation, pose estimation and landmark localisation. To achieve a high detection rate, we use two publicly available CNN-based face detectors and two pr… ▽ More

    Submitted 1 June, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

  42. arXiv:1611.05396  [pdf, other

    cs.CV

    Dynamic Attention-controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-set Sample Weighting

    Authors: Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu

    Abstract: We present a new Cascaded Shape Regression (CSR) architecture, namely Dynamic Attention-Controlled CSR (DAC-CSR), for robust facial landmark detection on unconstrained faces. Our DAC-CSR divides facial landmark detection into three cascaded sub-tasks: face bounding box refinement, general CSR and attention-controlled CSR. The first two stages refine initial face bounding boxes and output intermedi… ▽ More

    Submitted 4 April, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  43. arXiv:1606.00474  [pdf, other

    cs.CV cs.HC cs.RO

    A 3D Face Modelling Approach for Pose-Invariant Face Recognition in a Human-Robot Environment

    Authors: Michael Grupp, Philipp Kopp, Patrik Huber, Matthias Rätsch

    Abstract: Face analysis techniques have become a crucial component of human-machine interaction in the fields of assistive and humanoid robotics. However, the variations in head-pose that arise naturally in these environments are still a great challenge. In this paper, we present a real-time capable 3D face modelling framework for 2D in-the-wild images that is applicable for robotics. The fitting of the 3D… ▽ More

    Submitted 1 June, 2016; originally announced June 2016.

    MSC Class: 68T45; 68T40; 68T10 ACM Class: I.2.9; I.2.10; I.5.4

  44. 3D Face Tracking and Texture Fusion in the Wild

    Authors: Patrik Huber, Philipp Kopp, Matthias Rätsch, William Christmas, Josef Kittler

    Abstract: We present a fully automatic approach to real-time 3D face reconstruction from monocular in-the-wild videos. With the use of a cascaded-regressor based face tracking and a 3D Morphable Face Model shape fitting, we obtain a semi-dense 3D face shape. We further use the texture information from multiple frames to build a holistic 3D face representation from the video frames. Our system is able to cap… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    MSC Class: 68T45 ACM Class: I.4.8; I.4.9; I.2.10

    Journal ref: IEEE Signal Processing Letters (Volume: 24, Issue: 4, April 2017)

  45. Fitting 3D Morphable Models using Local Features

    Authors: Patrik Huber, Zhen-Hua Feng, William Christmas, Josef Kittler, Matthias Rätsch

    Abstract: In this paper, we propose a novel fitting method that uses local image features to fit a 3D Morphable Model to 2D images. To overcome the obstacle of optimising a cost function that contains a non-differentiable feature extraction operator, we use a learning-based cascaded regression method that learns the gradient direction from data. The method allows to simultaneously solve for shape and pose p… ▽ More

    Submitted 8 March, 2015; originally announced March 2015.

    Comments: Submitted to ICIP 2015; 4 pages, 4 figures

    MSC Class: 68T45 ACM Class: I.4.8; I.2.10

    Journal ref: Proceedings of the IEEE International Conference on Image Processing (ICIP) 2015, pages 1195-1199