-
Trend Filtered Mixture of Experts for Automated Gating of High-Frequency Flow Cytometry Data
Authors:
Sangwon Hyun,
Tim Coleman,
Francois Ribalet,
Jacob Bien
Abstract:
Ocean microbes are critical to both ocean ecosystems and the global climate. Flow cytometry, which measures cell optical properties in fluid samples, is routinely used in oceanographic research. Despite decades of accumulated data, identifying key microbial populations (a process known as "gating") remains a significant analytical challenge. To address this, we focus on gating multidimensional, high-frequency flow cytometry data collected continuously on board oceanographic research vessels, capturing time- and space-wise variations in the dynamic ocean. Our paper proposes a novel mixture-of-experts model in which both the gating function and the experts are given by trend filtering. The model leverages two key assumptions: (1) Each snapshot of flow cytometry data is a mixture of multivariate Gaussians and (2) the parameters of these Gaussians vary smoothly over time. Our method uses regularization and a constraint to ensure smoothness and that cluster means match biologically distinct microbe types. We demonstrate, using flow cytometry data from the North Pacific Ocean, that our proposed model accurately matches human-annotated gating and corrects significant errors.
Submitted 16 April, 2025;
originally announced April 2025.
-
Analyzing the Training Dynamics of Image Restoration Transformers: A Revisit to Layer Normalization
Authors:
MinKyu Lee,
Sangeek Hyun,
Woojin Jun,
Hyunjun Kim,
Jiwoo Chung,
Jae-Pil Heo
Abstract:
This work investigates the internal training dynamics of image restoration (IR) Transformers and uncovers a critical yet overlooked issue: conventional LayerNorm leads to feature magnitude divergence, up to the million scale, and collapses channel-wise entropy. We analyze this phenomenon from the perspective of networks attempting to bypass constraints imposed by conventional LayerNorm due to conflicts with the requirements of IR tasks. Accordingly, we address two misalignments between LayerNorm and IR tasks, and later show that addressing these mismatches leads to both stabilized training dynamics and improved IR performance. Specifically, conventional LayerNorm works in a per-token manner, disrupting the spatial correlations between tokens that are essential in IR tasks. Also, it employs an input-independent normalization that restricts the flexibility of feature scales, which is required to preserve input-specific statistics. Together, these mismatches significantly hinder the ability of IR Transformers to accurately preserve low-level features throughout the network. To this end, we introduce Image Restoration Transformer Tailored Layer Normalization (i-LN), a surprisingly simple drop-in replacement for conventional LayerNorm. We propose to normalize features in a holistic manner across the entire spatio-channel dimension, preserving spatial relationships among individual tokens. Additionally, we introduce an input-adaptive rescaling strategy that maintains the feature range flexibility required by individual inputs. Together, these modifications effectively contribute to preserving low-level feature statistics of inputs throughout IR Transformers. Experimental results verify that this combined strategy enhances both the stability and performance of IR Transformers across various IR tasks.
Submitted 25 June, 2025; v1 submitted 9 April, 2025;
originally announced April 2025.
-
DDPT: Diffusion-Driven Prompt Tuning for Large Language Model Code Generation
Authors:
Jinyang Li,
Sangwon Hyun,
M. Ali Babar
Abstract:
Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation. However, the quality of the generated code is heavily dependent on the structure and composition of the prompts used. Crafting high-quality prompts is a challenging task that requires significant knowledge and skill in prompt engineering. To advance automation support for prompt engineering in LLM-based code generation, we propose a novel solution, Diffusion-Driven Prompt Tuning (DDPT), that learns how to generate optimal prompt embeddings from Gaussian noise to automate the prompt engineering for code generation. We evaluate the feasibility of diffusion-based optimization and abstract the optimal prompt embedding as a directional vector toward the optimal embedding. We use the code generation loss given by the LLMs to help the diffusion model capture the distribution of optimal prompt embeddings during training. The trained diffusion model can build a path from the noise distribution to the optimal distribution at the sampling phase. The evaluation results demonstrate that DDPT helps improve prompt optimization for code generation.
Submitted 6 April, 2025;
originally announced April 2025.
-
Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation
Authors:
Jiwoo Chung,
Sangeek Hyun,
Hyunjun Kim,
Eunseo Koh,
MinKyu Lee,
Jae-Pil Heo
Abstract:
Recent advances in text-to-image generative models have enabled numerous practical applications, including subject-driven generation, which fine-tunes pretrained models to capture subject semantics from only a few examples. While diffusion-based models produce high-quality images, their extensive denoising steps result in significant computational overhead, limiting real-world applicability. Visual autoregressive (VAR) models, which predict next-scale tokens rather than spatially adjacent ones, offer significantly faster inference suitable for practical deployment. In this paper, we propose the first VAR-based approach for subject-driven generation. However, naïvely fine-tuning VAR leads to computational overhead, language drift, and reduced diversity. To address these challenges, we introduce selective layer tuning to reduce complexity and prior distillation to mitigate language drift. Additionally, we found that the early stages have a greater influence on the generation of the subject than the later stages, which merely synthesize local details. Based on this finding, we propose scale-wise weighted tuning, which prioritizes coarser resolutions so that the model focuses on subject-relevant information instead of local details. Extensive experiments validate that our method significantly outperforms diffusion-based baselines across various metrics and demonstrate its practical usage.
Submitted 3 April, 2025;
originally announced April 2025.
-
Black Hole Chemistry Knows Extra Dimensions
Authors:
Kyung Kiu Kim,
Jeongwon Ho,
Seungjoon Hyun,
Taehyeon Song
Abstract:
In this note, we study an extra dimension effect on the black hole chemistry in the 8-dimensional Einstein-Yang-Mills-Maxwell theory. The base spacetime contains 4-dimensional compact manifolds and an instanton on top of those. We demonstrate how the extra dimensions affect the phase transition and viable black hole sizes in the 4-dimensional Einstein frame through the black hole chemistry. We focus on asymptotically anti-de Sitter spacetimes for the effective 4-dimensional model obtained by a dimensional reduction. The extra-dimension size determines thermodynamic pressure, and the thermodynamic volume is roughly the horizon size of black holes. Thus, the extra dimension and black hole sizes are related as a conjugate pair of thermodynamic variables.
Submitted 5 June, 2025; v1 submitted 18 February, 2025;
originally announced February 2025.
-
Diverse Inference and Verification for Advanced Reasoning
Authors:
Iddo Drori,
Gaston Longhitano,
Mao Mao,
Seunghwan Hyun,
Yuke Zhang,
Sungjun Park,
Zachary Meeks,
Xin-Yu Zhang,
Ben Segev,
Howard Yong,
Nakul Verma,
Avi Shporer,
Alon Amit,
Madeleine Udell
Abstract:
Reasoning LLMs such as OpenAI o1, o3 and DeepSeek R1 have made significant progress in mathematics and coding, yet still find advanced tasks challenging, such as International Mathematical Olympiad (IMO) combinatorics problems, Abstraction and Reasoning Corpus (ARC) puzzles, and Humanity's Last Exam (HLE) questions. We use a diverse inference approach that combines multiple models and methods at test time. We find that verifying mathematics and code problems, and rejection sampling on other problems, are simple and effective. We automatically verify correctness of solutions to IMO problems by Lean, and ARC puzzles by code, and find that best-of-N effectively answers HLE questions. Our approach increases answer accuracy on IMO combinatorics problems from 33.3% to 77.8%, accuracy on HLE questions from 8% to 37%, and solves 80% of ARC puzzles that 948 humans could not and 26.5% of ARC puzzles that o3 high compute does not. Test-time simulations, reinforcement learning, and meta-learning with inference feedback improve generalization by adapting agent graph representations and varying prompts, code, and datasets. Our approach is reliable, robust, and scalable, and in the spirit of reproducible research, we will make it publicly available upon publication.
Submitted 14 February, 2025;
originally announced February 2025.
-
Sunrise III: Overview of Observatory and Instruments
Authors:
Andreas Korpi-Lagg,
Achim Gandorfer,
Sami K. Solanki,
Jose Carlos del Toro Iniesta,
Yukio Katsukawa,
Pietro Bernasconi,
Thomas Berkefeld,
Alex Feller,
Tino L. Riethmüller,
Alberto Álvarez-Herrero,
Masahito Kubo,
Valentín Martínez Pillet,
H. N. Smitha,
David Orozco Suárez,
Bianca Grauf,
Michael Carpenter,
Alexander Bell,
María-Teresa Álvarez-Alonso,
Daniel Álvarez García,
Beatriz Aparicio del Moral,
Daniel Ayoub,
Francisco Javier Bailén,
Eduardo Bailón Martínez,
Maria Balaguer Jiménez,
Peter Barthol
, et al. (95 additional authors not shown)
Abstract:
In July 2024, Sunrise completed its third successful science flight. The Sunrise III observatory had been upgraded significantly after the two previous successful flights in 2009 and 2013. Three completely new instruments focus on the small-scale physical processes and their complex interaction from the deepest observable layers in the photosphere up to chromospheric heights. Previously poorly explored spectral regions and lines are exploited to paint a three-dimensional picture of the solar atmosphere with unprecedented completeness and level of detail. The full polarimetric information is captured by all three instruments to reveal the interaction between the magnetic fields and the hydrodynamic processes. Two slit-based spectropolarimeters, the Sunrise UV Spectropolarimeter and Imager (SUSI) and the Sunrise Chromospheric Infrared spectro-Polarimeter (SCIP), focus on the near-ultraviolet and the near-infrared regions respectively, and the imaging spectropolarimeter Tunable Magnetograph (TuMag) simultaneously obtains maps of the full field-of-view of $46 \times 46$ Mm$^2$ in the photosphere and the chromosphere in the visible. The instruments are operated in an orchestrated mode, benefiting from a new Image Stabilization and Light Distribution unit (ISLiD), with the Correlating Wavefront Sensor (CWS) providing the autofocus control and an image stability with a root-mean-square value smaller than 0.005''. A new gondola was constructed to significantly improve the telescope pointing stability, required to achieve uninterrupted observations over many hours. Sunrise III was launched successfully on July 10, 2024, from the Esrange Space Center near Kiruna (Sweden). It reached the landing site between the Mackenzie River and the Great Bear Lake in Canada after a flight duration of 6.5 days. In this paper, we give an overview of the Sunrise III observatory and its instruments.
Submitted 30 May, 2025; v1 submitted 10 February, 2025;
originally announced February 2025.
-
Auto-Encoded Supervision for Perceptual Image Super-Resolution
Authors:
MinKyu Lee,
Sangeek Hyun,
Woojin Jun,
Jae-Pil Heo
Abstract:
This work tackles the fidelity objective in perceptual super-resolution (SR). Specifically, we address the shortcomings of the pixel-level $L_\text{p}$ loss ($L_\text{pix}$) in the GAN-based SR framework. Since $L_\text{pix}$ is known to have a trade-off relationship with perceptual quality, prior methods often multiply it by a small scale factor or utilize low-pass filters. However, this work shows that these circumventions fail to address the fundamental factor that induces blurring. Accordingly, we focus on two points: 1) precisely discriminating the subcomponent of $L_\text{pix}$ that contributes to blurring, and 2) only guiding based on the factor that is free from this trade-off relationship. We show that they can be achieved in a surprisingly simple manner, with an Auto-Encoder (AE) pretrained with $L_\text{pix}$. Accordingly, we propose the Auto-Encoded Supervision for Optimal Penalization loss ($L_\text{AESOP}$), a novel loss function that measures distance in the AE space, instead of the raw pixel space. Note that the AE space indicates the space after the decoder, not the bottleneck. By simply substituting $L_\text{pix}$ with $L_\text{AESOP}$, we can provide effective reconstruction guidance without compromising perceptual quality. Designed for simplicity, our method enables easy integration into existing SR frameworks. Experimental results verify that AESOP can lead to favorable results in the perceptual SR task.
Submitted 11 April, 2025; v1 submitted 28 November, 2024;
originally announced December 2024.
-
First-principles calculation of the entropy of liquids with a case study on sodium
Authors:
Koun Shirai,
Hiroyoshi Momida,
Kazunori Sato,
Sangil Hyun
Abstract:
Despite increasing demands for the thermodynamic data of liquids in a wide range of science and engineering fields, there is still a considerable lack of reliable data over a wide range of temperature ($T$) and pressure conditions. The most significant obstacle is that there is no practical method to calculate the entropy ($S$) of liquids. This problem can be solved using the thermodynamic definition of entropy, i.e., $S = \int C d\ln T$, where $C$ is specific heat. The specific heat is calculated as the derivative of the internal energy $U$ with respect to $T$. Both quantities, i.e., $U$ and $T$, are well defined in molecular dynamics (MD) simulations based on density functional theory. The reliability of the present method is entirely dependent on the accuracy of the specific heat of the liquid, for which there is no standard model. The problem with liquids is that there are no eigenstates, on which the standard procedures are based. The relationship between $U$ and $T$ is affected by the energy relaxation processes, the effect of which appears in the $T$ dependence of the specific heat of liquids. This motivates us to conduct MD simulations by isolating the system from an external heat bath. In this paper, by applying this method to liquid sodium, it is demonstrated that the experimental $T$ dependence of the isochoric specific heat is reproduced well without any empirical parameter. On this basis, the entropy of liquid Na is obtained in good agreement with experimental values.
Submitted 16 November, 2024;
originally announced November 2024.
-
A Generalized LLM-Augmented BIM Framework: Application to a Speech-to-BIM system
Authors:
Ghang Lee,
Suhyung Jang,
Seokho Hyun
Abstract:
Performing building information modeling (BIM) tasks is a complex process that imposes a steep learning curve and a heavy cognitive load due to the necessity of remembering sequences of numerous commands. With the rapid advancement of large language models (LLMs), it is foreseeable that BIM tasks, including querying and managing BIM data, 4D and 5D BIM, design compliance checking, or authoring a design, using written or spoken natural language (i.e., text-to-BIM or speech-to-BIM), will soon supplant traditional graphical user interfaces. This paper proposes a generalized LLM-augmented BIM framework to expedite the development of LLM-enhanced BIM applications by providing a step-by-step development process. The proposed framework consists of six steps: interpret-fill-match-structure-execute-check. The paper demonstrates the applicability of the proposed framework through implementing a speech-to-BIM application, NADIA-S (Natural-language-based Architectural Detailing through Interaction with Artificial Intelligence via Speech), using exterior wall detailing as an example.
Submitted 26 September, 2024;
originally announced September 2024.
-
Overestimation of melting temperatures calculated by first-principles molecular dynamics simulations
Authors:
Koun Shirai,
Hiroyoshi Momida,
Kazunori Sato,
Sangil Hyun
Abstract:
Although the melting temperature, $T_{m}$, of a solid can be calculated based on first-principles molecular dynamics (FP-MD) simulations, systematic assessments of the accuracy of the resulting values have not yet been reported. FP-MD simulations require significant computational resources and hence an examination of the effect of cell size on convergence is difficult. In addition, calculation of the energy of a liquid is not a trivial problem because of energy dissipation effects. The present work attempts to resolve these problems, and thus allow the accuracy of $T_{m}$ values obtained from FP-MD simulations to be assessed for typical semiconductors, metals, and oxides. With the exception of Si, the $T_{m}$ value was overestimated in all cases. This overestimation can be reduced by increasing the cell size, although the convergence is slow unless the potential is very shallow. For oxides, this overestimation may not be removed by increasing the cell size. The LDA/GGA error of overbinding affects the melting enthalpy and thereby $T_{m}$. In order to fully capture the energy dissipation nature of liquids, adiabatic MD simulations are required, and such simulations have been performed in the present study.
Submitted 8 December, 2024; v1 submitted 2 September, 2024;
originally announced September 2024.
-
GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats
Authors:
Sangeek Hyun,
Jae-Pil Heo
Abstract:
Most advances in 3D Generative Adversarial Networks (3D GANs) largely depend on ray casting-based volume rendering, which incurs demanding rendering costs. One promising alternative is rasterization-based 3D Gaussian Splatting (3D-GS), providing a much faster rendering speed and explicit 3D representation. In this paper, we exploit Gaussians as a 3D representation for 3D GANs by leveraging their efficient and explicit characteristics. However, in an adversarial framework, we observe that a naïve generator architecture suffers from training instability and lacks the capability to adjust the scale of Gaussians. This leads to model divergence and visual artifacts due to the absence of proper guidance for initialized positions of Gaussians and densification to manage their scales adaptively. To address these issues, we introduce a generator architecture with a hierarchical multi-scale Gaussian representation that effectively regularizes the position and scale of generated Gaussians. Specifically, we design a hierarchy of Gaussians where finer-level Gaussians are parameterized by their coarser-level counterparts; the positions of finer-level Gaussians are located near their coarser-level counterparts, and the scale monotonically decreases as the level becomes finer, modeling both coarse and fine details of the 3D scene. Experimental results demonstrate that our method achieves a significantly faster rendering speed (×100) compared to state-of-the-art 3D consistent GANs with comparable 3D generation capability. Project page: https://hse1032.github.io/gsgan.
Submitted 14 November, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
-
Diversity-aware Channel Pruning for StyleGAN Compression
Authors:
Jiwoo Chung,
Sangeek Hyun,
Sang-Heon Shim,
Jae-Pil Heo
Abstract:
StyleGAN has shown remarkable performance in unconditional image generation. However, its high computational cost poses a significant challenge for practical applications. Although recent efforts have been made to compress StyleGAN while preserving its performance, existing compressed models still lag behind the original model, particularly in terms of sample diversity. To overcome this, we propose a novel channel pruning method that leverages the varying sensitivities of channels to latent vectors, which is a key factor in sample diversity. Specifically, by assessing channel importance based on their sensitivities to latent vector perturbations, our method enhances the diversity of samples in the compressed model. Since our method solely focuses on the channel pruning stage, it has complementary benefits with prior training schemes without additional training cost. Extensive experiments demonstrate that our method significantly enhances sample diversity across various datasets. Moreover, in terms of FID scores, our method not only surpasses the state of the art by a large margin but also achieves comparable scores with only half the training iterations.
Submitted 20 March, 2024;
originally announced March 2024.
-
Task-Disruptive Background Suppression for Few-Shot Segmentation
Authors:
Suho Park,
SuBeen Lee,
Sangeek Hyun,
Hyun Seok Seong,
Jae-Pil Heo
Abstract:
Few-shot segmentation aims to accurately segment novel target objects within query images using only a limited number of annotated support images. Recent works exploit the support background as well as its foreground to precisely compute the dense correlations between query and support. However, they overlook the characteristics of the background, which generally contains various types of objects. In this paper, we highlight this characteristic of the background, which can cause problematic cases as follows: (1) when the query and support backgrounds are dissimilar and (2) when objects in the support background are similar to the target object in the query. Without any consideration of the above cases, adopting the entire support background leads to a misprediction of the query foreground as background. To address this issue, we propose Task-disruptive Background Suppression (TBS), a module to suppress those disruptive support background features based on two spatial-wise scores: query-relevant and target-relevant scores. The former aims to mitigate the impact of unshared features solely existing in the support background, while the latter aims to reduce the influence of target-similar support background features. Based on these two scores, we define a query background relevant score that captures the similarity between the backgrounds of the query and the support, and utilize it to scale support background features to adaptively restrict the impact of disruptive support backgrounds. Our proposed method achieves state-of-the-art performance on the PASCAL-5$^i$ and COCO-20$^i$ datasets on 1-shot segmentation. Our official code is available at github.com/SuhoPark0706/TBSNet.
Submitted 26 December, 2023;
originally announced December 2023.
-
Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
Authors:
Jiwoo Chung,
Sangeek Hyun,
Jae-Pil Heo
Abstract:
Despite the impressive generative capabilities of diffusion models, existing diffusion model-based style transfer methods either require inference-stage optimization (e.g. fine-tuning or textual inversion of style), which is time-consuming, or fail to leverage the generative ability of large-scale diffusion models. To address these issues, we introduce a novel artistic style transfer method based on a pre-trained large-scale diffusion model without any optimization. Specifically, we manipulate the features of self-attention layers in the way the cross-attention mechanism works: in the generation process, we substitute the key and value of the content with those of the style image. This approach provides several desirable characteristics for style transfer including 1) preservation of content by transferring similar styles into similar image patches and 2) transfer of style based on similarity of local texture (e.g. edge) between content and style images. Furthermore, we introduce query preservation and attention temperature scaling to mitigate the issue of disruption of original content, and initial latent Adaptive Instance Normalization (AdaIN) to deal with disharmonious color (failure to transfer the colors of style). Our experimental results demonstrate that our proposed method surpasses state-of-the-art methods in both conventional and diffusion-based style transfer baselines.
Submitted 20 March, 2024; v1 submitted 11 December, 2023;
originally announced December 2023.
-
METAL: Metamorphic Testing Framework for Analyzing Large-Language Model Qualities
Authors:
Sangwon Hyun,
Mingyu Guo,
M. Ali Babar
Abstract:
Large-Language Models (LLMs) have shifted the paradigm of natural language data processing. However, their black-boxed and probabilistic characteristics can lead to potential risks in the quality of outputs in diverse LLM applications. Recent studies have tested Quality Attributes (QAs), such as robustness or fairness, of LLMs by generating adversarial input texts. However, existing studies have limited their coverage of QAs and tasks in LLMs and are difficult to extend. Additionally, these studies have only used one evaluation metric, Attack Success Rate (ASR), to assess the effectiveness of their approaches. We propose a MEtamorphic Testing for Analyzing LLMs (METAL) framework to address these issues by applying Metamorphic Testing (MT) techniques. This approach facilitates the systematic testing of LLM qualities by defining Metamorphic Relations (MRs), which serve as modularized evaluation metrics. The METAL framework can automatically generate hundreds of MRs from templates that cover various QAs and tasks. In addition, we introduced novel metrics that integrate the ASR method into the semantic qualities of text to assess the effectiveness of MRs accurately. Through the experiments conducted with three prominent LLMs, we have confirmed that the METAL framework effectively evaluates essential QAs on primary LLM tasks and reveals the quality risks in LLMs. Moreover, the newly proposed metrics can guide the optimal MRs for testing each task and suggest the most effective method for generating MRs.
Submitted 10 December, 2023;
originally announced December 2023.
-
Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding
Authors:
WonJun Moon,
Sangeek Hyun,
SuBeen Lee,
Jae-Pil Heo
Abstract:
Temporal grounding aims to identify specific moments or highlights from a video corresponding to textual descriptions. Typical approaches in temporal grounding treat all video clips equally during the encoding process regardless of their semantic relevance to the text query. Therefore, we propose Correlation-Guided DEtection TRansformer (CG-DETR), which explores how to provide clues for query-associated video clips within the cross-modal attention. First, we design an adaptive cross-attention with dummy tokens. Dummy tokens conditioned by text query take portions of the attention weights, preventing irrelevant video clips from being represented by the text query. Yet, not all words equally inherit the text query's correlation to video clips. Thus, we further guide the cross-attention map by inferring the fine-grained correlation between video clips and words. We enable this by learning a joint embedding space for high-level concepts, i.e., moment and sentence level, and inferring the clip-word correlation. Lastly, we exploit the moment-specific characteristics and combine them with the context of each video to form a moment-adaptive saliency detector. By exploiting the degrees of text engagement in each video clip, it precisely measures the highlightness of each clip. CG-DETR achieves state-of-the-art results on various benchmarks for temporal grounding. Code is available at https://github.com/wjun0830/CGDETR.
Submitted 3 July, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Contact holes in vertical electrode structures analyzed by voltage contrast-SEM and conducting AFM
Authors:
Minsun Gu,
Moon Seop Hyun,
Moonsup Han,
Gyungtae Kim,
Young Jun Chang
Abstract:
Soaring demand for multi-stacked memory devices calls for the urgent development of backside contact electrode technologies, such as high aspect ratio etching, metallization, and inspection methods. In particular, the complex metal contact process should be monitored at each manufacturing step to filter out defective samples and maintain a high production yield. Among the inspection methods for detecting electrical connections are voltage contrast (VC)-SEM and conducting AFM (C-AFM). In this report, we investigated the two inspection methods for testing designed samples with different contact hole states. The VC-SEM data shows the contrast variation at the contact holes, from which one may discern the contact status with an optimum voltage. The C-AFM results clearly demonstrate a finite electrical current in the connected contact and a negligible current in the disconnected one. Finally, we discuss insights into using the two methods for analyzing contact hole technologies with high aspect ratios.
Submitted 22 October, 2023;
originally announced October 2023.
-
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
Authors:
WonJun Moon,
Sangeek Hyun,
SangUk Park,
Dongchan Park,
Jae-Pil Heo
Abstract:
Recently, video moment retrieval and highlight detection (MR/HD) have been in the spotlight as the demand for video understanding has drastically increased. The key objective of MR/HD is to localize the moment and estimate clip-wise accordance level, i.e., saliency score, to the given text query. Although the recent transformer-based models brought some advances, we found that these methods do not fully exploit the information of a given query. For example, the relevance between text query and video contents is sometimes neglected when predicting the moment and its saliency. To tackle this issue, we introduce Query-Dependent DETR (QD-DETR), a detection transformer tailored for MR/HD. As we observe the insignificant role of a given query in transformer architectures, our encoding module starts with cross-attention layers to explicitly inject the context of text query into video representation. Then, to enhance the model's capability of exploiting the query information, we manipulate the video-query pairs to produce irrelevant pairs. Such negative (irrelevant) video-query pairs are trained to yield low saliency scores, which, in turn, encourages the model to estimate precise accordance between query-video pairs. Lastly, we present an input-adaptive saliency predictor which adaptively defines the criterion of saliency scores for the given video-query pairs. Our extensive studies verify the importance of building the query-dependent representation for MR/HD. Specifically, QD-DETR outperforms state-of-the-art methods on QVHighlights, TVSum, and Charades-STA datasets. Code is available at github.com/wjun0830/QD-DETR.
Submitted 24 March, 2023;
originally announced March 2023.
-
Nonminimally Assisted Inflation: A General Analysis
Authors:
Sang Chul Hyun,
Jinsu Kim,
Tatsuki Kodama,
Seong Chan Park,
Tomo Takahashi
Abstract:
The effects of a scalar field, known as the "assistant field," which nonminimally couples to gravity, on single-field inflationary models are studied. The analysis provides analytical expressions for inflationary observables such as the spectral index ($n_s$), the tensor-to-scalar ratio ($r$), and the local-type nonlinearity parameter ($f_{\rm NL}^{(\rm local)}$). The presence of the assistant field leads to a lowering of $n_s$ and $r$ in most of the parameter space, compared to the original predictions. In some cases, $n_s$ may increase due to the assistant field. This revives compatibility between ruled-out single-field models and the latest observations by Planck-BICEP/Keck. The results are demonstrated using three example models: loop inflation, power-law inflation, and hybrid inflation.
Submitted 5 May, 2023; v1 submitted 12 February, 2023;
originally announced February 2023.
-
Non-minimally assisted chaotic inflation
Authors:
Sang Chul Hyun,
Jinsu Kim,
Seong Chan Park,
Tomo Takahashi
Abstract:
Conventional wisdom says that a chaotic inflation model with a power-law potential is ruled out by the recent Planck-BICEP/Keck results. We find, however, that the model can be assisted by a non-minimally coupled scalar field and still provides a successful inflation. Considering a power-law chaotic inflation model of the type $V\sim \varphi^n$ with $n=\{2, 4/3, 1, 2/3, 1/3\}$, we show that $n=1/3$ ($n=\{2/3, 1/3\}$) may be revived with the help of the quadratic (quartic) non-minimal coupling of the assistant field to gravity.
Submitted 2 June, 2022; v1 submitted 17 March, 2022;
originally announced March 2022.
-
Joint Vehicle Tracking and RSU Selection for V2I Communications with Extended Kalman Filter
Authors:
Jiho Song,
Seong-Hwan Hyun,
Jong-Ho Lee,
Jeongsik Choi,
Seong-Cheol Kim
Abstract:
We develop joint vehicle tracking and road side unit (RSU) selection algorithms suitable for vehicle-to-infrastructure (V2I) communications. We first design an analytical framework for evaluating vehicle tracking systems based on the extended Kalman filter. A simple, yet effective, metric that quantifies the vehicle tracking performance is derived in terms of the angular derivative of a dominant spatial frequency. Second, an RSU selection algorithm is proposed to select a proper RSU that enhances the vehicle tracking performance. A joint vehicle tracking algorithm is also developed to maximize the tracking performance by considering sounding samples at multiple RSUs while minimizing the amount of sample exchange. The numerical results verify that the proposed vehicle tracking algorithms give better performance than conventional signal-to-noise ratio-based tracking systems.
Submitted 10 February, 2022; v1 submitted 1 January, 2022;
originally announced January 2022.
-
Ocean Mover's Distance: Using Optimal Transport for Analyzing Oceanographic Data
Authors:
Sangwon Hyun,
Aditya Mishra,
Christopher L. Follett,
Bror Jonsson,
Gemma Kulk,
Gael Forget,
Marie-Fanny Racault,
Thomas Jackson,
Stephanie Dutkiewicz,
Christian L. Müller,
Jacob Bien
Abstract:
Remote sensing observations from satellites and global biogeochemical models have combined to revolutionize the study of ocean biogeochemical cycling, but comparing the two data streams to each other and across time remains challenging due to the strong spatial-temporal structuring of the ocean. Here, we show that the Wasserstein distance provides a powerful metric for harnessing these structured datasets for better marine ecosystem and climate predictions. Wasserstein distance complements commonly used point-wise difference methods such as the root mean squared error, by quantifying differences in terms of spatial displacement in addition to magnitude. As a test case we consider Chlorophyll (a key indicator of phytoplankton biomass) in the North-East Pacific Ocean, obtained from model simulations, in situ measurements, and satellite observations. We focus on two main applications: 1) Comparing model predictions with satellite observations, and 2) temporal evolution of Chlorophyll both seasonally and over longer time frames. Wasserstein distance successfully isolates temporal and depth variability and quantifies shifts in biogeochemical province boundaries. It also exposes relevant temporal trends in satellite Chlorophyll consistent with climate change predictions. Our study shows that optimal transport vectors underlying Wasserstein distance provide a novel visualization tool for testing models and better understanding temporal dynamics in the ocean.
Submitted 4 November, 2022; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Regular Path Query Evaluation Sharing a Reduced Transitive Closure Based on Graph Reduction
Authors:
Inju Na,
Ilyeop Yi,
Kyu-Young Whang,
Yang-Sae Moon,
Soon J. Hyun
Abstract:
Regular path queries (RPQs) find pairs of vertices of paths satisfying given regular expressions on an edge-labeled, directed multigraph. When evaluating an RPQ, the evaluation of a Kleene closure (i.e., Kleene plus or Kleene star) is very expensive. Furthermore, when multiple RPQs include a Kleene closure as a common sub-query, repeated evaluations of the common sub-query cause serious performance degradation. In this paper, we present a novel concept of RPQ-based graph reduction, which significantly simplifies the original graph through edge-level and vertex-level reductions. Interestingly, RPQ-based graph reduction can replace the evaluation of the Kleene closure on the large original graph with that of the transitive closure on the small reduced graph. We then propose a reduced transitive closure (RTC) as a lightweight structure for efficiently sharing the result of a Kleene closure. We also present an RPQ evaluation algorithm, RTCSharing, which treats each clause in the disjunctive normal form of the given RPQ as a batch unit. If the batch units include a Kleene closure as a common sub-query, we share the lightweight RTC instead of the heavyweight result of the Kleene closure. RPQ-based graph reduction further enables us to formally represent the result of an RPQ including a Kleene closure as a relational algebra expression including the RTC. Through the formal expression, we optimize the evaluation of the batch unit by eliminating useless and redundant operations of the previous method. Experiments show that RTCSharing improves the performance significantly by up to 73.86 times compared with existing methods in terms of query response time.
Submitted 25 March, 2022; v1 submitted 12 November, 2021;
originally announced November 2021.
-
Festina-Lente Bound on Higgs Vacuum Structure and Inflation
Authors:
Sung Mook Lee,
Dhong Yeon Cheong,
Sang Chul Hyun,
Seong Chan Park,
Min-Seok Seo
Abstract:
The recently suggested Festina-Lente (FL) bound provides a lower bound on the masses of ${\rm U(1)}$ charged particles in terms of the positive vacuum energy. Since the charged particle masses in the Standard Model (SM) are generated by the Higgs mechanism, the FL bound provides a testbed of consistent Higgs potentials in the current dark energy-dominated universe as well as during inflation. We study the implications of the FL bound on the UV behavior of the Higgs potential for a minuscule vacuum energy, as in the current universe. We also present values of the Hubble parameter and the Higgs vacuum expectation value allowed by the FL bound during inflation, which implies that the Higgs cannot stay at the electroweak scale during this epoch.
Submitted 16 February, 2022; v1 submitted 7 November, 2021;
originally announced November 2021.
-
Adaptive Beam Design for V2I Communications using Vehicle Tracking with Extended Kalman Filter
Authors:
Seong-Hwan Hyun,
Jiho Song,
Keunwoo Kim,
Jong-Ho Lee,
Seong-Cheol Kim
Abstract:
The vehicle-to-everything communication system is a strong candidate for improving the driving experience and automotive safety by linking vehicles to wireless networks. To take advantage of the full benefits of vehicle connectivity, it is essential to ensure a stable network connection between the roadside unit (RSU) and fast-moving vehicles. Based on the extended Kalman filter (EKF), we develop a vehicle tracking algorithm to enable reliable radio connections. For the vehicle tracking algorithm, we focus on estimating the rapid changes in the beam direction of a high-mobility vehicle while reducing the feedback overhead. Furthermore, we design a beamforming codebook that considers the road layout and RSU. By leveraging the proposed beamforming codebook, vehicles on the road can expect a service quality similar to that of conventional cellular services. Finally, a beamformer selection algorithm is developed to secure sufficient gain for the system's link budget. Numerical results verify that the EKF-based vehicle tracking algorithm and the proposed beamforming structure are more suitable for vehicle-to-infrastructure networks compared to existing schemes.
Submitted 10 November, 2021; v1 submitted 5 August, 2021;
originally announced August 2021.
-
Efficient Exact k-Flexible Aggregate Nearest Neighbor Search in Road Networks Using the M-tree
Authors:
Moonyoung Chung,
Soon J. Hyun,
Woong-Kee Loh
Abstract:
This study proposes an efficient exact k-flexible aggregate nearest neighbor (k-FANN) search algorithm in road networks using the M-tree. The state-of-the-art IER-kNN algorithm used the R-tree and pruned off unnecessary nodes based on the Euclidean coordinates of objects in road networks. However, IER-kNN made many unnecessary accesses to index nodes since the Euclidean distances between objects are significantly different from the actual shortest-path distances between them. In contrast, our algorithm proposed in this study can greatly reduce unnecessary accesses to index nodes compared with IER-kNN since the M-tree is constructed based on the actual shortest-path distances between objects. To the best of our knowledge, our algorithm is the first exact FANN algorithm that uses the M-tree. We prove that our algorithm does not cause any false drop. In conducting a series of experiments using various real road network datasets, our algorithm consistently outperformed IER-kNN by up to 6.92 times.
Submitted 13 September, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Algorithms for Linearly Recurrent Sequences of Truncated Polynomials
Authors:
Seung Gyu Hyun,
Vincent Neiger,
Éric Schost
Abstract:
Linear recurrent sequences are those whose elements are defined as linear combinations of preceding elements, and finding recurrence relations is a fundamental problem in computer algebra. In this paper, we focus on sequences whose elements are vectors over the ring $\mathbb{A} = \mathbb{K}[x]/(x^d)$ of truncated polynomials. Finding the ideal of their recurrence relations has applications such as the computation of minimal polynomials and determinants of sparse matrices over $\mathbb{A}$. We present three methods for finding this ideal: a Berlekamp-Massey-like approach due to Kurakin, one which computes the kernel of some block-Hankel matrix over $\mathbb{A}$ via a minimal approximant basis, and one based on bivariate Padé approximation. We propose complexity improvements for the first two methods, respectively by avoiding the computation of redundant relations and by exploiting the Hankel structure to compress the approximation problem. Then we confirm these improvements empirically through a C++ implementation, and we discuss the above-mentioned applications.
Submitted 8 June, 2021; v1 submitted 6 February, 2021;
originally announced February 2021.
-
Participation in TREC 2020 COVID Track Using Continuous Active Learning
Authors:
Xue Jun Wang,
Maura R. Grossman,
Seung Gyu Hyun
Abstract:
We describe our participation in all five rounds of the TREC 2020 COVID Track (TREC-COVID). The goal of TREC-COVID is to contribute to the response to the COVID-19 pandemic by identifying answers to many pressing questions and building infrastructure to improve search systems [8]. All five rounds of this Track challenged participants to perform a classic ad-hoc search task on the new data collection CORD-19. Our solution addressed this challenge by applying the Continuous Active Learning model (CAL) and its variations. Our results showed us to be amongst the top scoring manual runs and we remained competitive within all categories of submissions.
Submitted 2 November, 2020;
originally announced November 2020.
-
Test-Cost Sensitive Methods for Identifying Nearby Points
Authors:
Seung Gyu Hyun,
Christopher Leung
Abstract:
Real-world applications that involve missing values are often constrained by the cost to obtain data. Test-cost sensitive, or costly feature, methods additionally consider the cost of acquiring features. Such methods have been extensively studied in the problem of classification. In this paper, we study a related problem of test-cost sensitive methods to identify nearby points from a large set, given a new point with some unknown feature values. We present two models, one based on a tree and another based on Deep Reinforcement Learning. In our simulations, we show that the models outperform random agents on a set of five real-world data sets.
Submitted 4 October, 2020;
originally announced October 2020.
-
Modeling Cell Populations Measured By Flow Cytometry With Covariates Using Sparse Mixture of Regressions
Authors:
Sangwon Hyun,
Mattias Rolf Cape,
Francois Ribalet,
Jacob Bien
Abstract:
The ocean is filled with microscopic microalgae called phytoplankton, which together are responsible for as much photosynthesis as all plants on land combined. Our ability to predict their response to the warming ocean relies on understanding how the dynamics of phytoplankton populations is influenced by changes in environmental conditions. One powerful technique to study the dynamics of phytoplankton is flow cytometry, which measures the optical properties of thousands of individual cells per second. Today, oceanographers are able to collect flow cytometry data in real-time onboard a moving ship, providing them with fine-scale resolution of the distribution of phytoplankton across thousands of kilometers. One of the current challenges is to understand how these small and large scale variations relate to environmental conditions, such as nutrient availability, temperature, light and ocean currents. In this paper, we propose a novel sparse mixture of multivariate regressions model to estimate the time-varying phytoplankton subpopulations while simultaneously identifying the specific environmental covariates that are predictive of the observed changes to these subpopulations. We demonstrate the usefulness and interpretability of the approach using both synthetic data and real observations collected on an oceanographic cruise conducted in the north-east Pacific in the spring of 2017.
Submitted 3 August, 2022; v1 submitted 25 August, 2020;
originally announced August 2020.
-
Thermodynamics of Inhomogeneously Mass-deformed ABJM Model and Pressure Anisotropy
Authors:
Seungjoon Hyun,
Byoungjoon Ahn,
Kyung Kiu Kim,
O-Kab Kwon,
Sang-A Park
Abstract:
In this paper we study the thermodynamics of black branes with a modulated complex scalar in the context of bulk and boundary theories. The modulation induces inhomogeneity to the dual field theory, anisotropic pressure, and brane charge to the bulk geometry. The first law of thermodynamics and the Smarr relation are obtained using the off-shell ADT and the reduced action formalisms. We discuss the prescription for the mass of black branes, which relies on relevant and marginal deformations in the dual field theory. One of the cases is the gravity dual to an ABJM model with a sinusoidal mass function depending on a spatial coordinate. This is the first study of the deformed ABJM model at finite temperature including bulk thermodynamics.
Submitted 2 December, 2019;
originally announced December 2019.
-
AdS Q-Soliton and Inhomogeneously mass-deformed ABJM Model
Authors:
Byoungjoon Ahn,
Seungjoon Hyun,
Kyung Kiu Kim,
O-Kab Kwon,
Sang-A Park
Abstract:
We study dual geometries to a deformed ABJM model with spatially dependent source functions at finite temperature. These source functions are proportional to the mass function $m(x)= m_0 \sin k x$ and its derivative $m'(x)$. As dual geometries, we find hairy black branes and AdS solitons corresponding to the deconfinement phase and the confining phase of the dual field theory, respectively. It turns out that the hairy AdS solitons have lower free energy than the black branes when the Hawking temperature is smaller than the confining scale. Therefore the dual system undergoes a first-order phase transition. Even though our study is limited to the so-called Q-lattice ansatz, the solution space contains a set of solutions dual to a supersymmetric mass deformation. As a physical quantity to probe the confining phase, we investigate the holographic entanglement entropy and discuss its behavior in terms of the modulation effect.
Submitted 4 March, 2020; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Charged AdS black holes in Gauss-Bonnet gravity and nonlinear electrodynamics
Authors:
Seungjoon Hyun,
Cao H. Nam
Abstract:
New five-dimensional charged AdS black hole solutions are found in Einstein-Gauss-Bonnet gravity and the nonlinear electrodynamics. These solutions include regular black holes as well as extremal black holes. The first law of the black hole thermodynamics is confirmed in the extended phase space where the cosmological constant is treated as the pressure. The first and second order phase transitions are investigated by observing the behavior of the heat capacity at constant pressure and the Gibbs free energy. In addition, the equation of state for the black holes and their $P-V$ criticality are studied. Finally, the critical exponents are found to be the same as those of the Van der Waals fluid.
Submitted 25 August, 2019;
originally announced August 2019.
-
Change of basis for m-primary ideals in one and two variables
Authors:
Seung Gyu Hyun,
Stephen Melczer,
Éric Schost,
Catherine St-Pierre
Abstract:
Following recent work by van der Hoeven and Lecerf (ISSAC 2017), we discuss the complexity of linear mappings, called untangling and tangling by those authors, that arise in the context of computations with univariate polynomials. We give a slightly faster tangling algorithm and discuss new applications of these techniques. We show how to extend these ideas to bivariate settings, and use them to give bounds on the arithmetic complexity of certain algebras.
Submitted 11 May, 2019;
originally announced May 2019.
-
Implementations of efficient univariate polynomial matrix algorithms and application to bivariate resultants
Authors:
Seung Gyu Hyun,
Vincent Neiger,
Éric Schost
Abstract:
Complexity bounds for many problems on matrices with univariate polynomial entries have been improved in the last few years. Still, for most related algorithms, efficient implementations are not available, which leaves open the question of the practical impact of these algorithms, e.g. on applications such as decoding some error-correcting codes and solving polynomial systems or structured linear systems. In this paper, we discuss implementation aspects for most fundamental operations: multiplication, truncated inversion, approximants, interpolants, kernels, linear system solving, determinant, and basis reduction. We focus on prime fields with a word-size modulus, relying on Shoup's C++ library NTL. Combining these new tools to implement variants of Villard's algorithm for the resultant of generic bivariate polynomials (ISSAC 2018), we get better performance than the state of the art for large parameters.
Submitted 10 May, 2019;
originally announced May 2019.
-
Post-Selection Inference for Changepoint Detection Algorithms with Application to Copy Number Variation Data
Authors:
Sangwon Hyun,
Kevin Lin,
Max G'Sell,
Ryan J. Tibshirani
Abstract:
Changepoint detection methods are used in many areas of science and engineering, e.g., in the analysis of copy number variation data, to detect abnormalities in copy numbers along the genome. Despite the broad array of available tools, methodology for quantifying our uncertainty in the strength (or presence) of given changepoints, post-detection, is lacking. Post-selection inference offers a framework to fill this gap, but the most straightforward application of these methods results in low-powered tests and leaves open several important questions about practical usability. In this work, we carefully tailor post-selection inference methods towards changepoint detection, focusing as our main scientific application on copy number variation data. As for changepoint algorithms, we study binary segmentation, and two of its most popular variants, wild and circular, and the fused lasso. We implement some of the latest developments in post-selection inference theory: we use auxiliary randomization to improve power, which requires implementations of MCMC algorithms (importance sampling and hit-and-run sampling) to carry out our tests. We also provide recommendations for improving practical usability, detailed simulations, and an example analysis on array comparative genomic hybridization (CGH) data.
Submitted 10 December, 2018;
originally announced December 2018.
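As a rough illustration of the binary segmentation algorithm named in the abstract above, the following is a minimal sketch of the textbook greedy procedure with a CUSUM contrast. It is not the authors' implementation and performs no post-selection inference; the function names `cusum` and `binary_segmentation` and the fixed `num_steps` stopping rule are assumptions made for this example.

```python
from math import sqrt


def cusum(y, s, e, b):
    """CUSUM contrast for splitting the segment y[s:e] after index b.

    Requires s <= b < e - 1, so both sides of the split are non-empty.
    """
    n = e - s
    n_left = b - s + 1
    n_right = n - n_left
    mean_left = sum(y[s:b + 1]) / n_left
    mean_right = sum(y[b + 1:e]) / n_right
    return sqrt(n_left * n_right / n) * (mean_right - mean_left)


def binary_segmentation(y, num_steps):
    """Greedy binary segmentation with a fixed number of steps.

    Returns a sorted list of changepoints; a changepoint b means the
    estimated mean changes between y[b] and y[b + 1].
    """
    segments = [(0, len(y))]  # half-open index ranges still eligible for splitting
    changepoints = []
    for _ in range(num_steps):
        best = None  # (absolute CUSUM, split index, segment)
        for (s, e) in segments:
            for b in range(s, e - 1):
                stat = abs(cusum(y, s, e, b))
                if best is None or stat > best[0]:
                    best = (stat, b, (s, e))
        if best is None:  # every remaining segment has length 1
            break
        _, b, (s, e) = best
        changepoints.append(b)
        segments.remove((s, e))
        segments.extend([(s, b + 1), (b + 1, e)])
    return sorted(changepoints)
```

For example, `binary_segmentation([0.0] * 5 + [3.0] * 5 + [0.0] * 5, 2)` recovers the two mean shifts of a noise-free step signal. The wild and circular variants mentioned in the abstract change how candidate intervals and contrasts are chosen and are not covered by this sketch.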
-
Unified low-energy effective Hamiltonian and the band topology of $p$-block square-net layer derivatives
Authors:
S. I. Hyun,
Inho Lee,
Geunsik Lee,
J. H. Shim
Abstract:
In recent years, low-dimensional materials with tetragonal $P4/nmm$ (orthorhombic $Pnma$) space group having square-net (chain-like) substructure of $p$-block elements have been studied extensively. By using a first-principles calculation and a two-sites $\otimes$ two-orbitals tight-binding model, we construct the unified low-energy effective Hamiltonian and the $\mathbb{Z}_{2}$ topological phase diagram for such materials with different filling factors. Near the chemical potential, we show that the staggered arrangement of ions at the 2c (4c) site yields a virtual hopping that has the same form as the second nearest-neighbor hopping between the square-net (chain-like) ions. We show that this hybridization and the low symmetry of the chain-like structure protect the quantum spin Hall insulator phase. Finally, the second-order spin-orbit coupling on top of the atomic spin-orbit coupling is considered to clarify the origin of the non-zero Berry phase signals reported in recent quantum oscillation experiments.
Submitted 9 September, 2018;
originally announced September 2018.
-
A fast algorithm for solving linearly recurrent sequences
Authors:
Seung Gyu Hyun,
Stephen Melczer,
Catherine St-Pierre
Abstract:
We present an algorithm which computes the $D^{th}$ term of a sequence satisfying a linear recurrence relation of order $d$ over a field $K$ in $O( \mathsf{M}(\bar d)\log(D) + \mathsf{M}(d)\log(d))$ operations in $K$, where $\bar d \leq d$ is the degree of the squarefree part of the annihilating polynomial of the recurrence and $\mathsf{M}$ is the cost of polynomial multiplication in $K$. This is a refinement of the previously optimal result of $O( \mathsf{M}(d)\log(D) )$ operations, due to Fiduccia.
Submitted 9 June, 2018;
originally announced June 2018.
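To make the baseline in the abstract above concrete, here is a minimal sketch of Fiduccia's approach: compute $x^D$ modulo the characteristic polynomial by repeated squaring, then combine the remainder's coefficients with the initial terms. It illustrates only the earlier $O(\mathsf{M}(d)\log(D))$ idea, not the refined algorithm of the paper. It works over the integers rather than a general field $K$, uses schoolbook multiplication (so $\mathsf{M}(d)$ is quadratic here), and the names `mul_mod` and `nth_term` as well as the convention $a_{n+d} = c_0 a_n + \dots + c_{d-1} a_{n+d-1}$ are assumptions made for this example.

```python
def mul_mod(p, q, coeffs):
    """Multiply polynomials p and q (coefficient lists of length d, lowest
    degree first) and reduce modulo x^d - coeffs[d-1] x^(d-1) - ... - coeffs[0].
    """
    d = len(coeffs)
    prod = [0] * (2 * d - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            prod[i + j] += pi * qj
    # Eliminate x^k for k >= d, highest degree first, using
    # x^d = coeffs[0] + coeffs[1] x + ... + coeffs[d-1] x^(d-1).
    for k in range(2 * d - 2, d - 1, -1):
        top = prod[k]
        prod[k] = 0
        for i in range(d):
            prod[k - d + i] += top * coeffs[i]
    return prod[:d]


def nth_term(coeffs, initial, D):
    """Return a_D for a_{n+d} = sum_i coeffs[i] * a_{n+i}, given a_0..a_{d-1}."""
    d = len(coeffs)
    if D < d:
        return initial[D]
    result = [1] + [0] * (d - 1)  # the constant polynomial 1
    # The polynomial x, already reduced modulo the characteristic polynomial.
    base = [coeffs[0]] if d == 1 else [0, 1] + [0] * (d - 2)
    e = D
    while e > 0:  # square-and-multiply on the exponent D
        if e & 1:
            result = mul_mod(result, base, coeffs)
        base = mul_mod(base, base, coeffs)
        e >>= 1
    # If x^D mod P = sum_i r_i x^i, then a_D = sum_i r_i a_i.
    return sum(r * a for r, a in zip(result, initial))
```

With `coeffs = [1, 1]` and `initial = [0, 1]` this is the Fibonacci recurrence, and `nth_term([1, 1], [0, 1], 10)` returns 55. Only about $\log_2 D$ modular multiplications are needed, which is where the $\log(D)$ factor in the stated bound comes from.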
-
Block-Krylov techniques in the context of sparse-FGLM algorithms
Authors:
Seung Gyu Hyun,
Vincent Neiger,
Hamid Rahkooy,
Eric Schost
Abstract:
Consider a zero-dimensional ideal $I$ in $\mathbb{K}[X_1,\dots,X_n]$. Inspired by Faugère and Mou's Sparse FGLM algorithm, we use Krylov sequences based on multiplication matrices of $I$ in order to compute a description of its zero set by means of univariate polynomials.
Steel recently showed how to use Coppersmith's block-Wiedemann algorithm in this context; he describes an algorithm that can be easily parallelized, but only computes parts of the output in this manner. Using generating series expressions going back to work of Bostan, Salvy, and Schost, we show how to compute the entire output for a small overhead, without making any assumption on the ideal $I$ other than it having dimension zero. We then propose a refinement of this idea that partially avoids the introduction of a generic linear form. We comment on experimental results obtained by an implementation based on the C++ libraries Eigen, LinBox and NTL.
Submitted 15 January, 2019; v1 submitted 12 December, 2017;
originally announced December 2017.
-
AdS from Entanglement Entropy
Authors:
Seungjoon Hyun,
Sang-A Park
Abstract:
We show that anti-de Sitter (AdS) space naturally emerges from conformal field theory (CFT). The behavior of the leading divergent term in the entanglement entropy implies the underlying AdS geometry. The coefficient of the leading divergent term is related to the radius of the AdS space. All of this is fully confirmed for two-dimensional CFTs. We also comment on higher-dimensional CFTs.
Submitted 21 September, 2017;
originally announced September 2017.
-
Frustration-driven C4 symmetric orders in a hetero-structured iron-based superconductor
Authors:
Jong Mok Ok,
S. -H. Baek,
C. Hoch,
R. K. Kremer,
S. Y. Park,
Sungdae Ji,
B. Buechner,
J. -H. Park,
S. I. Hyun,
J. H. Shim,
Yunkyu Bang,
E. G. Moon,
I. I. Mazin,
Jun Sung Kim
Abstract:
A subtle balance between competing interactions in strongly correlated systems can be easily tipped by additional interfacial interactions in a heterostructure. This often induces exotic phases with unprecedented properties, as recently exemplified by high-Tc superconductivity in FeSe monolayer on the nonmagnetic SrTiO3. When the proximity-coupled layer is magnetically active, even richer phase diagrams are expected in iron-based superconductors (FeSCs), which, however, have not been explored due to the lack of a proper material system. One promising candidate is Sr2VO3FeAs, a naturally-assembled heterostructure of a FeSC and a Mott-insulating vanadium oxide. Here, using high-quality single crystals and high-accuracy 75As and 51V nuclear magnetic resonance (NMR) measurements, we show that a novel electronic phase emerges in the FeAs layer below T0 ~ 155 K without either static magnetism or a crystal symmetry change, which has never been observed in other FeSCs. We find that frustration of the otherwise dominant Fe stripe and V Neel fluctuations via interfacial coupling induces a charge/orbital order with C4-symmetry in the FeAs layers, while suppressing the Neel antiferromagnetism in the SrVO3 layers. These findings demonstrate that the magnetic proximity coupling is effective in stabilizing a hidden order in FeSCs and, more generally, in strongly correlated heterostructures.
Submitted 25 June, 2017;
originally announced June 2017.
-
Thermodynamic Volume and the Extended Smarr Relation
Authors:
Seungjoon Hyun,
Jaehoon Jeong,
Sang-A Park,
Sang-Heon Yi
Abstract:
We continue to explore the scaling transformation in the reduced action formalism of gravity models. As an extension of our construction, we consider the extended forms of the Smarr relation for various black holes, adopting the cosmological constant as the bulk pressure, as in some of the literature on black holes. First, by using the quasi-local formalism for charges, we show that, in a general theory of gravity, the volume in black hole thermodynamics can be defined as the thermodynamic conjugate variable to the bulk pressure in such a way that the first law can be extended consistently. This so-called thermodynamic volume can be expressed explicitly in terms of the metric and field variables. Then, by using the scaling transformation allowed in the reduced action formulation, we obtain the extended Smarr relation involving the bulk pressure and the thermodynamic volume. In our approach, we do not resort to Euler's homogeneous scaling of charges while incorporating the would-be hairy contribution without any difficulty.
Submitted 2 March, 2017; v1 submitted 21 February, 2017;
originally announced February 2017.
-
Revisit to Thermodynamic Relations in the AdS/CMT Models
Authors:
Seungjoon Hyun,
Sang-A Park,
Sang-Heon Yi
Abstract:
Motivated by the recent unified approach to the Smarr-like relation of AdS planar black holes in conjunction with the quasi-local formalism on conserved charges, we revisit the quantum statistical and thermodynamic relations of hairy AdS planar black holes. By extending the previous results, we identify the hairy contribution in the bulk and show that the holographic computation can be improved so that it is consistent with the bulk computation. We argue that the first law can be retained in its universal form while the relation between the on-shell renormalized Euclidean action and its free energy interpretation in gravity may be deformed to contain the hairy contribution in hairy AdS black holes.
Submitted 3 March, 2017; v1 submitted 14 September, 2016;
originally announced September 2016.
-
Exact Post-Selection Inference for Changepoint Detection and Other Generalized Lasso Problems
Authors:
Sangwon Hyun,
Max G'Sell,
Ryan J. Tibshirani
Abstract:
We study tools for inference conditioned on model selection events that are defined by the generalized lasso regularization path. The generalized lasso estimate is given by the solution of a penalized least squares regression problem, where the penalty is the l1 norm of a matrix D times the coefficient vector. The generalized lasso path collects these estimates for a range of penalty parameter (λ) values. Leveraging a sequential characterization of this path from Tibshirani & Taylor (2011), and recent advances in post-selection inference from Lee et al. (2016), Tibshirani et al. (2016), we develop exact hypothesis tests and confidence intervals for linear contrasts of the underlying mean vector, conditioned on any model selection event along the generalized lasso path (assuming Gaussian errors in the observations). By inspecting specific choices of D, we obtain post-selection tests and confidence intervals for specific cases of generalized lasso estimates, such as the fused lasso, trend filtering, and the graph fused lasso. In the fused lasso case, the underlying coordinates of the mean are assigned a linear ordering, and our framework allows us to test selectively chosen breakpoints or changepoints in these mean coordinates. This is an interesting and well-studied problem with broad applications; our framework applied to trend filtering and the graph fused lasso serves several applications as well. Aside from the development of selective inference tools, we describe several practical aspects of our methods such as valid post-processing of generalized estimates before performing inference in order to improve power, and problem-specific visualization aids that may be given to data analysts to help them choose the linear contrasts to be tested. Many examples, both from simulated and real data sources, are presented to examine the empirical properties of our inference methods.
Submitted 11 June, 2016;
originally announced June 2016.
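The penalized least squares problem described in words in the abstract above can be written out as follows. This is the standard form of the generalized lasso; the symbols $y$ (response), $X$ (design matrix), $\beta$ (coefficient vector), and $n$ (number of coordinates) are notation assumed here, since the abstract names only $D$ and $\lambda$. The second display shows the first-difference choice of $D$ that is commonly used to obtain the one-dimensional fused lasso.

```latex
\hat{\beta}_{\lambda}
  \;=\; \operatorname*{arg\,min}_{\beta}\;
  \frac{1}{2}\,\lVert y - X\beta \rVert_2^2
  \;+\; \lambda\,\lVert D\beta \rVert_1 ,
\qquad
D_{\text{fused}} \;=\;
\begin{pmatrix}
-1 & 1 & & \\
 & \ddots & \ddots & \\
 & & -1 & 1
\end{pmatrix}
\in \mathbb{R}^{(n-1)\times n}.
```

With this $D$, the penalty is $\lambda \sum_{i=1}^{n-1} \lvert \beta_{i+1} - \beta_i \rvert$, so nonzero entries of $D\hat{\beta}_{\lambda}$ mark the estimated changepoints.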
-
Canonical energy and hairy AdS black holes
Authors:
Seungjoon Hyun,
Sang-A Park,
Sang-Heon Yi
Abstract:
We propose a modified version of the canonical energy, which was introduced originally by Hollands and Wald. Our construction depends only on the Euler-Lagrange expression of the system and thus is independent of the ambiguity in the Lagrangian. After some comments on our construction, we briefly mention its relevance to the boundary information metric in the context of the AdS/CFT correspondence. We also study the stability of three-dimensional hairy extremal black holes by using our construction.
Submitted 17 August, 2016; v1 submitted 8 March, 2016;
originally announced March 2016.
-
Holography without counter terms
Authors:
Byoungjoon Ahn,
Seungjoon Hyun,
Kyung Kiu Kim,
Sang-A Park,
Sang-Heon Yi
Abstract:
By considering the behavior of the reduced action under the scaling transformation, we present a unified derivation of the Smarr-like relation for asymptotically anti-de-Sitter planar black holes. This novel Smarr-like relation leads to useful information in condensed matter systems through the AdS/CMT correspondence. By using our results, we provide an efficient way to obtain the holographically renormalized on-shell action without information on the explicit forms of counter terms. We find complete consistency of our results with those in various models discussed in the recent literature and obtain new implications.
Submitted 20 July, 2016; v1 submitted 31 December, 2015;
originally announced December 2015.
-
Scaling symmetry and scalar hairy rotating AdS_3 black holes
Authors:
Byoungjoon Ahn,
Seungjoon Hyun,
Sang-A Park,
Sang-Heon Yi
Abstract:
By using the scaling symmetry in the reduced action formalism, we derive a novel Smarr relation which holds even for hairy rotating AdS_3 black holes. Then, using the Smarr relation, we argue that the hairy rotating AdS_3 black holes are thermodynamically stable compared to the non-hairy ones.
Submitted 31 December, 2015; v1 submitted 26 August, 2015;
originally announced August 2015.
-
Scaling symmetry and scalar hairy Lifshitz black holes
Authors:
Seungjoon Hyun,
Jaehoon Jeong,
Sang-A Park,
Sang-Heon Yi
Abstract:
By utilizing the scaling symmetry of the reduced action for planar black holes, we obtain the corresponding conserved charge. We use the conserved charge to find the generalized Smarr relation of static hairy planar black holes in various dimensions. Our results not only reproduce the relation in various known cases but also give a new relation for Lifshitz planar black holes with scalar hair.
Submitted 14 October, 2015; v1 submitted 13 July, 2015;
originally announced July 2015.
-
Flexible Modeling of Epidemics with an Empirical Bayes Framework
Authors:
Logan C. Brooks,
David C. Farrow,
Sangwon Hyun,
Ryan J. Tibshirani,
Roni Rosenfeld
Abstract:
Seasonal influenza epidemics cause consistent, considerable, widespread loss annually in terms of economic burden, morbidity, and mortality. With access to accurate and reliable forecasts of a current or upcoming influenza epidemic's behavior, policy makers can design and implement more effective countermeasures. We developed a framework for in-season forecasts of epidemics using a semiparametric Empirical Bayes framework, and applied it to predict the weekly percentage of outpatient doctor visits for influenza-like illness, as well as the season onset, duration, peak time, and peak height, with and without additional data from Google Flu Trends, as part of the CDC's 2013--2014 "Predict the Influenza Season Challenge". Previous work on epidemic modeling has focused on developing mechanistic models of disease behavior and applying time series tools to explain historical data. However, these models may not accurately capture the range of possible behaviors that we may see in the future. Our approach instead produces possibilities for the epidemic curve of the season of interest using modified versions of data from previous seasons, allowing for reasonable variations in the timing, pace, and intensity of the seasonal epidemics, as well as noise in observations. Since the framework does not make strict domain-specific assumptions, it can easily be applied to other diseases as well. Another important advantage of this method is that it produces a complete posterior distribution for any desired forecasting target, rather than mere point predictions. We report prospective influenza-like-illness forecasts that were made for the 2013--2014 U.S. influenza season, and compare the framework's cross-validated prediction error on historical data to that of a variety of simpler baseline predictors.
Submitted 27 October, 2014;
originally announced October 2014.