-
A tutorial on simulating nonlinear behaviors of flexible structures with the discrete differential geometry (DDG) method
Authors:
Weicheng Huang,
Zhuonan Hao,
Jiahao Li,
Dezhong Tong,
Kexin Guo,
Yingchao Zhang,
Huajian Gao,
K. Jimmy Hsia,
Mingchao Liu
Abstract:
Flexible elastic structures, such as beams, rods, ribbons, plates, and shells, exhibit complex nonlinear dynamical behaviors that are central to a wide range of engineering and scientific applications, including soft robotics, deployable structures, and biomedical devices. While various numerical methods have been developed to simulate these behaviors, many conventional approaches struggle to simu…
▽ More
Flexible elastic structures, such as beams, rods, ribbons, plates, and shells, exhibit complex nonlinear dynamical behaviors that are central to a wide range of engineering and scientific applications, including soft robotics, deployable structures, and biomedical devices. While various numerical methods have been developed to simulate these behaviors, many conventional approaches struggle to simultaneously capture geometric and material nonlinearities, as well as nonlinear external interactions, particularly in highly deformable and dynamically evolving systems. The Discrete Differential Geometry (DDG) method has emerged as a robust and efficient numerical framework that intrinsically preserves geometric properties, accommodates material nonlinearity, and accurately models interactions with external environments and fields. By directly discretizing geometric and mechanical quantities, DDG provides an accurate, stable, and efficient approach to modeling flexible structures, addressing key limitations of traditional numerical methods. This tutorial provides a systematic introduction to the DDG method for simulating nonlinear behaviors in flexible structures. It covers DDG theory, simulation frameworks, and MATLAB implementation, with examples spanning dynamic systems, geometric and material nonlinearities, and external interactions like magnetics and fluids, culminating in practical insights and future directions. By offering a comprehensive and practical guide, together with open-source MATLAB code, this tutorial aims to facilitate the broader adoption of DDG-based numerical tools among researchers and engineers in computational mechanics, applied mathematics, and structural design.
△ Less
Submitted 15 April, 2025;
originally announced April 2025.
-
TCFG: Tangential Damping Classifier-free Guidance
Authors:
Mingi Kwon,
Shin seong Kim,
Jaeseok Jeong. Yi Ting Hsiao,
Youngjung Uh
Abstract:
Diffusion models have achieved remarkable success in text-to-image synthesis, largely attributed to the use of classifier-free guidance (CFG), which enables high-quality, condition-aligned image generation. CFG combines the conditional score (e.g., text-conditioned) with the unconditional score to control the output. However, the unconditional score is in charge of estimating the transition betwee…
▽ More
Diffusion models have achieved remarkable success in text-to-image synthesis, largely attributed to the use of classifier-free guidance (CFG), which enables high-quality, condition-aligned image generation. CFG combines the conditional score (e.g., text-conditioned) with the unconditional score to control the output. However, the unconditional score is in charge of estimating the transition between manifolds of adjacent timesteps from $x_t$ to $x_{t-1}$, which may inadvertently interfere with the trajectory toward the specific condition. In this work, we introduce a novel approach that leverages a geometric perspective on the unconditional score to enhance CFG performance when conditional scores are available. Specifically, we propose a method that filters the singular vectors of both conditional and unconditional scores using singular value decomposition. This filtering process aligns the unconditional score with the conditional score, thereby refining the sampling trajectory to stay closer to the manifold. Our approach improves image quality with negligible additional computation. We provide deeper insights into the score function behavior in diffusion models and present a practical technique for achieving more accurate and contextually coherent image synthesis.
△ Less
Submitted 23 March, 2025;
originally announced March 2025.
-
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
Authors:
Chenyang Zhao,
Kun Wang,
Janet H. Hsiao,
Antoni B. Chan
Abstract:
Significant progress has been achieved on the improvement and downstream usages of the Contrastive Language-Image Pre-training (CLIP) vision-language model, while less attention is paid to the interpretation of CLIP. We propose a Gradient-based visual and textual Explanation method for CLIP (Grad-ECLIP), which interprets the matching result of CLIP for specific input image-text pair. By decomposin…
▽ More
Significant progress has been achieved on the improvement and downstream usages of the Contrastive Language-Image Pre-training (CLIP) vision-language model, while less attention is paid to the interpretation of CLIP. We propose a Gradient-based visual and textual Explanation method for CLIP (Grad-ECLIP), which interprets the matching result of CLIP for specific input image-text pair. By decomposing the architecture of the encoder and discovering the relationship between the matching similarity and intermediate spatial features, Grad-ECLIP produces effective heat maps that show the influence of image regions or words on the CLIP results. Different from the previous Transformer interpretation methods that focus on the utilization of self-attention maps, which are typically extremely sparse in CLIP, we produce high-quality visual explanations by applying channel and spatial weights on token features. Qualitative and quantitative evaluations verify the effectiveness and superiority of Grad-ECLIP compared with the state-of-the-art methods. Furthermore, a series of analysis are conducted based on our visual and textual explanation results, from which we explore the working mechanism of image-text matching, the strengths and limitations in attribution identification of CLIP, and the relationship between the concreteness/abstractness of a word and its usage in CLIP. Finally, based on the ability of explanation map that indicates text-specific saliency region of input image, we also propose an application with Grad-ECLIP, which is adopted to boost the fine-grained alignment in the CLIP fine-tuning. The code of Grad-ECLIP is available here: https://github.com/Cyang-Zhao/Grad-Eclip.
△ Less
Submitted 25 February, 2025;
originally announced February 2025.
-
Rubber-to-glass adhesion between a rigid sphere and a shape memory polymer substrate of finite thickness
Authors:
Changhong Linghu,
Wentao Mao,
Haoyu Jiang,
Huajian Gao,
K. Jimmy Hsia
Abstract:
Shape memory polymers (SMPs) are emerging as innovative smart adhesive materials with broad application potential. Compared to conventional elastomeric adhesives, SMP adhesives are distinguished by the so-called rubber-to-glass (R2G) adhesion, which involves contact in the rubbery state followed by detachment in the glassy state. This process, through a shape-locking effect, enhances adhesion stre…
▽ More
Shape memory polymers (SMPs) are emerging as innovative smart adhesive materials with broad application potential. Compared to conventional elastomeric adhesives, SMP adhesives are distinguished by the so-called rubber-to-glass (R2G) adhesion, which involves contact in the rubbery state followed by detachment in the glassy state. This process, through a shape-locking effect, enhances adhesion strength by more than an order of magnitude compared to conventional adhesive contact. Here, we investigate the fundamental problem of a rigid sphere undergoing R2G adhesion with an SMP substrate of finite thickness through experiments, finite element (FE) simulations, and theoretical modeling. It is demonstrated that during press-in, the contact problem can be modeled as a rigid oblate spheroid contacting an infinite substrate, while the pull-off process can be described by a modified ball-and-socket model. These equivalent models yield practically useful analytical solutions for the contact radius during press-in and the R2G adhesion force during pull-off. A critical thickness-to-contact-radius ratio of around 5 is identified, below which the thickness effect becomes significant. These insights provide valuable guidance for the design and application of SMP-based smart adhesives.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
Localized tension-induced giant folding in unstructured elastic sheets
Authors:
Kexin Guo,
Marc Suñé,
Kwok Ming Li,
K. Jimmy Hsia,
Mingchao Liu,
Dominic Vella
Abstract:
Buckling in compression is the archetype of elastic instability: when compressed along its longest dimension, a thin structure such as a playing card will buckle out-of-plane accommodating the imposed compression without a significant change of length. However, recent studies have demonstrated that tension applied to sheets with microscopic structure leads to out-of-plane deformation in applicatio…
▽ More
Buckling in compression is the archetype of elastic instability: when compressed along its longest dimension, a thin structure such as a playing card will buckle out-of-plane accommodating the imposed compression without a significant change of length. However, recent studies have demonstrated that tension applied to sheets with microscopic structure leads to out-of-plane deformation in applications from `groovy metasheets' for multi-stable morphing to kirigami grippers. Here, we demonstrate that this counter-intuitive behavior -- a large transverse folding induced by a relatively small imposed longitudinal tension -- occurs also in unstructured sheets of isotropic material. The key to this behavior is that a localized uniaxial tension induces giant folding; we refer to this as `localized TUG folding' to reflect the importance of localized tension and its mode of actuation. We show that localized TUG folding occurs because of an efficient transfer of applied tensile load into compression -- a geometric consequence of a localized applied tension. We determine scaling results for the folding angle as a function of applied strain in agreement with both experiments and simulations. The generic nature of localized TUG folding suggests that it might be utilized in a broader range of materials and structures than previously realized.
△ Less
Submitted 3 April, 2025; v1 submitted 26 August, 2024;
originally announced August 2024.
-
Weakly-supervised Medical Image Segmentation with Gaze Annotations
Authors:
Yuan Zhong,
Chenhui Tang,
Yumeng Yang,
Ruoxi Qi,
Kang Zhou,
Yuqi Gong,
Pheng Ann Heng,
Janet H. Hsiao,
Qi Dou
Abstract:
Eye gaze that reveals human observational patterns has increasingly been incorporated into solutions for vision tasks. Despite recent explorations on leveraging gaze to aid deep networks, few studies exploit gaze as an efficient annotation approach for medical image segmentation which typically entails heavy annotating costs. In this paper, we propose to collect dense weak supervision for medical…
▽ More
Eye gaze that reveals human observational patterns has increasingly been incorporated into solutions for vision tasks. Despite recent explorations on leveraging gaze to aid deep networks, few studies exploit gaze as an efficient annotation approach for medical image segmentation which typically entails heavy annotating costs. In this paper, we propose to collect dense weak supervision for medical image segmentation with a gaze annotation scheme. To train with gaze, we propose a multi-level framework that trains multiple networks from discriminative human attention, simulated with a set of pseudo-masks derived by applying hierarchical thresholds on gaze heatmaps. Furthermore, to mitigate gaze noise, a cross-level consistency is exploited to regularize overfitting noisy labels, steering models toward clean patterns learned by peer networks. The proposed method is validated on two public medical datasets of polyp and prostate segmentation tasks. We contribute a high-quality gaze dataset entitled GazeMedSeg as an extension to the popular medical segmentation datasets. To the best of our knowledge, this is the first gaze dataset for medical image segmentation. Our experiments demonstrate that gaze annotation outperforms previous label-efficient annotation schemes in terms of both performance and annotation time. Our collected gaze data and code are available at: https://github.com/med-air/GazeMedSeg.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Simplified discrete model for axisymmetric dielectric elastomer membranes with robotic applications
Authors:
Zhaowei Liu,
Mingchao Liu,
K. Jimmy Hsia,
Xiaonan Huang,
Weicheng Huang
Abstract:
Soft robots utilizing inflatable dielectric membranes can realize intricate functionalities through the application of non-mechanical fields. However, given the current limitations in simulations, including low computational efficiency and difficulty in dealing with complex external interactions, the design and control of such soft robots often require trial and error. Thus, a novel one-dimensiona…
▽ More
Soft robots utilizing inflatable dielectric membranes can realize intricate functionalities through the application of non-mechanical fields. However, given the current limitations in simulations, including low computational efficiency and difficulty in dealing with complex external interactions, the design and control of such soft robots often require trial and error. Thus, a novel one-dimensional (1D) discrete differential geometry (DDG)-based numerical model is developed for analyzing the highly nonlinear mechanics in axisymmetric inflatable dielectric membranes. The model captures the intricate dynamics of these membranes under both inflationary pressure and electrical stimulation. Comprehensive validations using hyperelastic benchmarks demonstrate the model's accuracy and reliability. Additionally, the focus on the electro-mechanical coupling elucidates critical insights into the membrane's behavior under varying internal pressures and electrical loads. The research further translates these findings into innovative soft robotic applications, including a spherical soft actuator, a soft circular fluid pump, and a soft toroidal gripper, where the snap-through of electroelastic membrane plays a crucial role. Our analyses reveal that the functional ranges of soft robots are amplified by the snap-through of an electroelastic membrane upon electrical stimuli. This study underscores the potential of DDG-based simulations to advance the understanding of the nonlinear mechanics of electroelastic membranes and guide the design of electroelastic actuators in soft robotics applications.
△ Less
Submitted 23 April, 2024;
originally announced May 2024.
-
Observation of strain-rate softening behavior in jammed granular media
Authors:
Mingchao Liu,
Weining Mao,
Yiqiu Zhao,
Qin Xu,
Yixiang Gan,
Yifan Wang,
K Jimmy Hsia
Abstract:
The strain-rate sensitivity of confined granular materials has been widely explored, with most findings exhibiting rate-strengthening behaviors. This study, however, reveals a distinct rate-softening behavior across a certain strain rate range based on triaxial tests on particle clusters of various materials with different surface properties, particle sizes, shapes, and stiffness. This softening e…
▽ More
The strain-rate sensitivity of confined granular materials has been widely explored, with most findings exhibiting rate-strengthening behaviors. This study, however, reveals a distinct rate-softening behavior across a certain strain rate range based on triaxial tests on particle clusters of various materials with different surface properties, particle sizes, shapes, and stiffness. This softening effect is especially pronounced in the case of common rice particles. By examining the behavior of rice particles under different confining pressure and surface conditions, and directly measuring the frictional coefficient across various loading rates, we find that the reduction in surface frictional coefficient with the increasing strain rate predominantly contributes to this rate-softening behavior. This conclusion is validated by results from Finite Element Method (FEM) simulations. Additionally, we identify confining pressure as a critical factor regulating the normal stress between particles, and thereby enhancing frictional behavior. Rheometer tests reveal that the shear modulus exhibits a similar rate-softening trend. This study of rate-softening behavior in granular materials enhances our understanding of the mechanisms during their deformation under confining pressure. It also suggests that local inter-particle tribology significantly impacts overall granular behavior.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems
Authors:
Jennifer Hsia,
Afreen Shaikh,
Zhiruo Wang,
Graham Neubig
Abstract:
Retrieval-augmented generation (RAG) can significantly improve the performance of language models (LMs) by providing additional context for tasks such as document-based question answering (DBQA). However, the effectiveness of RAG is highly dependent on its configuration. To systematically find the optimal configuration, we introduce RAGGED, a framework for analyzing RAG configurations across vario…
▽ More
Retrieval-augmented generation (RAG) can significantly improve the performance of language models (LMs) by providing additional context for tasks such as document-based question answering (DBQA). However, the effectiveness of RAG is highly dependent on its configuration. To systematically find the optimal configuration, we introduce RAGGED, a framework for analyzing RAG configurations across various DBQA tasks. Using the framework, we discover distinct LM behaviors in response to varying context quantities, context qualities, and retrievers. For instance, while some models are robust to noisy contexts, monotonically performing better with more contexts, others are more noise-sensitive and can effectively use only a few contexts before declining in performance. This framework also provides a deeper analysis of these differences by evaluating the LMs' sensitivity to signal and noise under specific context quality conditions. Using RAGGED, researchers and practitioners can derive actionable insights about how to optimally configure their RAG systems for their specific question-answering tasks.
△ Less
Submitted 12 August, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Discrete differential geometry-based model for nonlinear analysis of axisymmetric shells
Authors:
Weicheng Huang,
Tianzhen Liu,
Zhaowei Liu,
Peifei Xu,
Mingchao Liu,
Yuzhen Chen,
K. Jimmy Hsia
Abstract:
In this paper, we propose a novel one-dimensional (1D) discrete differential geometry (DDG)-based numerical method for geometrically nonlinear mechanics analysis (e.g., buckling and snapping) of axisymmetric shell structures. Our numerical model leverages differential geometry principles to accurately capture the complex nonlinear deformation patterns exhibited by axisymmetric shells. By discretiz…
▽ More
In this paper, we propose a novel one-dimensional (1D) discrete differential geometry (DDG)-based numerical method for geometrically nonlinear mechanics analysis (e.g., buckling and snapping) of axisymmetric shell structures. Our numerical model leverages differential geometry principles to accurately capture the complex nonlinear deformation patterns exhibited by axisymmetric shells. By discretizing the axisymmetric shell into interconnected 1D elements along the meridional direction, the in-plane stretching and out-of-bending potentials are formulated based on the geometric principles of 1D nodes and edges under the Kirchhoff-Love hypothesis, and elastic force vector and associated Hession matrix required by equations of motion are later derived based on symbolic calculation. Through extensive validation with available theoretical solutions and finite element method (FEM) simulations in literature, our model demonstrates high accuracy in predicting the nonlinear behavior of axisymmetric shells. Importantly, compared to the classical theoretical model and three-dimensional (3D) FEM simulation, our model is highly computationally efficient, making it suitable for large-scale real-time simulations of nonlinear problems of shell structures such as instability and snap-through phenomena. Moreover, our framework can easily incorporate complex loading conditions, e.g., boundary nonlinear contact and multi-physics actuation, which play an essential role in the use of engineering applications, such as soft robots and flexible devices. This study demonstrates that the simplicity and effectiveness of the 1D discrete differential geometry-based approach render it a powerful tool for engineers and researchers interested in nonlinear mechanics analysis of axisymmetric shells, with potential applications in various engineering fields.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Exploiting dynamic bifurcation in elastic ribbons for mode skipping and selection
Authors:
Weicheng Huang,
Tian Yu,
Dominic Vella,
K. Jimmy Hsia,
Mingchao Liu
Abstract:
In this paper, we systematically study the dynamic snap-through behavior of a pre-deformed elastic ribbon by combining theoretical analysis, discrete numerical simulations, and experiments. By rotating one of its clamped ends with controlled angular speed, we observe two snap-through transition paths among the multiple stable configurations of a ribbon in three-dimensional (3D) space, which is dif…
▽ More
In this paper, we systematically study the dynamic snap-through behavior of a pre-deformed elastic ribbon by combining theoretical analysis, discrete numerical simulations, and experiments. By rotating one of its clamped ends with controlled angular speed, we observe two snap-through transition paths among the multiple stable configurations of a ribbon in three-dimensional (3D) space, which is different from the classical snap-through of a two-dimensional (2D) bistable beam. Our theoretical model for the static bifurcation analysis is derived based on the Kirchhoff equations, and dynamical numerical simulations are conducted using the Discrete Elastic Rods (DER) algorithm. The planar beam model is also employed for the asymptotic analysis of dynamic snap-through behaviors. The results show that, since the snap-through processes of both planar beams and 3D ribbons are governed by the saddle-node bifurcation, the same scaling law for the delay applies. We further demonstrate that, in elastic ribbons, by controlling the rotating velocity at the end, distinct snap-through pathways can be realized by selectively skipping specific modes, moreover, particular final modes can be strategically achieved. Through a parametric study using numerical simulations, we construct general phase diagrams for both mode skipping and selection of snapping ribbons. The work serves as a benchmark for future investigations on dynamic snap-through of thin elastic structures and provides guidelines for the novel design of intelligent mechanical systems.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
On Cohen--Macaulay modules over the Weyl algebra
Authors:
Kuei-Nuan Lin,
Jen-Chieh Hsiao
Abstract:
We propose a definition of Cohen--Macaulay modules over the Weyl algebra $D$ and give a sufficient condition for a GKZ $A$-hypergeometric $D$-module to be Cohen--Macaulay.
We propose a definition of Cohen--Macaulay modules over the Weyl algebra $D$ and give a sufficient condition for a GKZ $A$-hypergeometric $D$-module to be Cohen--Macaulay.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Goodhart's Law Applies to NLP's Explanation Benchmarks
Authors:
Jennifer Hsia,
Danish Pruthi,
Aarti Singh,
Zachary C. Lipton
Abstract:
Despite the rising popularity of saliency-based explanations, the research community remains at an impasse, facing doubts concerning their purpose, efficacy, and tendency to contradict each other. Seeking to unite the community's efforts around common goals, several recent works have proposed evaluation metrics. In this paper, we critically examine two sets of metrics: the ERASER metrics (comprehe…
▽ More
Despite the rising popularity of saliency-based explanations, the research community remains at an impasse, facing doubts concerning their purpose, efficacy, and tendency to contradict each other. Seeking to unite the community's efforts around common goals, several recent works have proposed evaluation metrics. In this paper, we critically examine two sets of metrics: the ERASER metrics (comprehensiveness and sufficiency) and the EVAL-X metrics, focusing our inquiry on natural language processing. First, we show that we can inflate a model's comprehensiveness and sufficiency scores dramatically without altering its predictions or explanations on in-distribution test inputs. Our strategy exploits the tendency for extracted explanations and their complements to be "out-of-support" relative to each other and in-distribution inputs. Next, we demonstrate that the EVAL-X metrics can be inflated arbitrarily by a simple method that encodes the label, even though EVAL-X is precisely motivated to address such exploits. Our results raise doubts about the ability of current metrics to guide explainability research, underscoring the need for a broader reassessment of what precisely these metrics are intended to capture.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models
Authors:
Guoyang Liu,
Jindi Zhang,
Antoni B. Chan,
Janet H. Hsiao
Abstract:
We examined whether embedding human attention knowledge into saliency-based explainable AI (XAI) methods for computer vision models could enhance their plausibility and faithfulness. We first developed new gradient-based XAI methods for object detection models to generate object-specific explanations by extending the current methods for image classification models. Interestingly, while these gradi…
▽ More
We examined whether embedding human attention knowledge into saliency-based explainable AI (XAI) methods for computer vision models could enhance their plausibility and faithfulness. We first developed new gradient-based XAI methods for object detection models to generate object-specific explanations by extending the current methods for image classification models. Interestingly, while these gradient-based methods worked well for explaining image classification models, when being used for explaining object detection models, the resulting saliency maps generally had lower faithfulness than human attention maps when performing the same task. We then developed Human Attention-Guided XAI (HAG-XAI) to learn from human attention how to best combine explanatory information from the models to enhance explanation plausibility by using trainable activation functions and smoothing kernels to maximize XAI saliency map's similarity to human attention maps. While for image classification models, HAG-XAI enhanced explanation plausibility at the expense of faithfulness, for object detection models it enhanced plausibility and faithfulness simultaneously and outperformed existing methods. The learned functions were model-specific, well generalizable to other databases.
△ Less
Submitted 5 May, 2023;
originally announced May 2023.
-
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder
Authors:
Xinmiao Lin,
Yikang Li,
Jenhao Hsiao,
Chiuman Ho,
Yu Kong
Abstract:
The popular VQ-VAE models reconstruct images through learning a discrete codebook but suffer from a significant issue in the rapid quality degradation of image reconstruction as the compression rate rises. One major reason is that a higher compression rate induces more loss of visual signals on the higher frequency spectrum which reflect the details on pixel space. In this paper, a Frequency Compl…
▽ More
The popular VQ-VAE models reconstruct images through learning a discrete codebook but suffer from a significant issue in the rapid quality degradation of image reconstruction as the compression rate rises. One major reason is that a higher compression rate induces more loss of visual signals on the higher frequency spectrum which reflect the details on pixel space. In this paper, a Frequency Complement Module (FCM) architecture is proposed to capture the missing frequency information for enhancing reconstruction quality. The FCM can be easily incorporated into the VQ-VAE structure, and we refer to the new model as Frequency Augmented VAE (FA-VAE). In addition, a Dynamic Spectrum Loss (DSL) is introduced to guide the FCMs to balance between various frequencies dynamically for optimal reconstruction. FA-VAE is further extended to the text-to-image synthesis task, and a Cross-attention Autoregressive Transformer (CAT) is proposed to obtain more precise semantic attributes in texts. Extensive reconstruction experiments with different compression rates are conducted on several benchmark datasets, and the results demonstrate that the proposed FA-VAE is able to restore more faithfully the details compared to SOTA methods. CAT also shows improved generation quality with better image-text semantic alignment.
△ Less
Submitted 3 November, 2023; v1 submitted 4 May, 2023;
originally announced May 2023.
-
Explanation Strategies for Image Classification in Humans vs. Current Explainable AI
Authors:
Ruoxi Qi,
Yueyuan Zheng,
Yi Yang,
Caleb Chen Cao,
Janet H. Hsiao
Abstract:
Explainable AI (XAI) methods provide explanations of AI models, but our understanding of how they compare with human explanations remains limited. In image classification, we found that humans adopted more explorative attention strategies for explanation than the classification task itself. Two representative explanation strategies were identified through clustering: One involved focused visual sc…
▽ More
Explainable AI (XAI) methods provide explanations of AI models, but our understanding of how they compare with human explanations remains limited. In image classification, we found that humans adopted more explorative attention strategies for explanation than the classification task itself. Two representative explanation strategies were identified through clustering: One involved focused visual scanning on foreground objects with more conceptual explanations diagnostic for inferring class labels, whereas the other involved explorative scanning with more visual explanations rated higher for effectiveness. Interestingly, XAI saliency-map explanations had the highest similarity to the explorative attention strategy in humans, and explanations highlighting discriminative features from invoking observable causality through perturbation had higher similarity to human strategies than those highlighting internal features associated with higher class score. Thus, humans differ in information and strategy use for explanations, and XAI methods that highlight features informing observable causality match better with human explanations, potentially more accessible to users.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
VideoXum: Cross-modal Visual and Textural Summarization of Videos
Authors:
Jingyang Lin,
Hang Hua,
Ming Chen,
Yikang Li,
Jenhao Hsiao,
Chiuman Ho,
Jiebo Luo
Abstract:
Video summarization aims to distill the most important information from a source video to produce either an abridged clip or a textual narrative. Traditionally, different methods have been proposed depending on whether the output is a video or text, thus ignoring the correlation between the two semantically related tasks of visual summarization and textual summarization. We propose a new joint vid…
▽ More
Video summarization aims to distill the most important information from a source video to produce either an abridged clip or a textual narrative. Traditionally, different methods have been proposed depending on whether the output is a video or text, thus ignoring the correlation between the two semantically related tasks of visual summarization and textual summarization. We propose a new joint video and text summarization task. The goal is to generate both a shortened video clip along with the corresponding textual summary from a long video, collectively referred to as a cross-modal summary. The generated shortened video clip and text narratives should be semantically well aligned. To this end, we first build a large-scale human-annotated dataset -- VideoXum (X refers to different modalities). The dataset is reannotated based on ActivityNet. After we filter out the videos that do not meet the length requirements, 14,001 long videos remain in our new dataset. Each video in our reannotated dataset has human-annotated video summaries and the corresponding narrative summaries. We then design a novel end-to-end model -- VTSUM-BILP to address the challenges of our proposed task. Moreover, we propose a new metric called VT-CLIPScore to help evaluate the semantic consistency of cross-modality summary. The proposed model achieves promising performance on this new task and establishes a benchmark for future research.
△ Less
Submitted 23 April, 2024; v1 submitted 21 March, 2023;
originally announced March 2023.
-
Exploring the Advantages of Quantum Generative Adversarial Networks in Generative Chemistry
Authors:
Po-Yu Kao,
Ya-Chu Yang,
Wei-Yin Chiang,
Jen-Yueh Hsiao,
Yudong Cao,
Alex Aliper,
Feng Ren,
Alan Aspuru-Guzik,
Alex Zhavoronkov,
Min-Hsiu Hsieh,
Yen-Chu Lin
Abstract:
De novo drug design with desired biological activities is crucial for developing novel therapeutics for patients. The drug development process is time and resource-consuming, and it has a low probability of success. Recent advances in machine learning and deep learning technology have reduced the time and cost of the discovery process and therefore, improved pharmaceutical research and development…
▽ More
De novo drug design with desired biological activities is crucial for developing novel therapeutics for patients. The drug development process is time and resource-consuming, and it has a low probability of success. Recent advances in machine learning and deep learning technology have reduced the time and cost of the discovery process and therefore, improved pharmaceutical research and development. In this paper, we explore the combination of two rapidly-developing fields with lead candidate discovery in the drug development process. First, Artificial intelligence has already been demonstrated to successfully accelerate conventional drug design approaches. Second, quantum computing has demonstrated promising potential in different applications, such as quantum chemistry, combinatorial optimizations, and machine learning. This manuscript explores hybrid quantum-classical generative adversarial networks (GAN) for small molecule discovery. We substituted each element of GAN with a variational quantum circuit (VQC) and demonstrated the quantum advantages in the small drug discovery. Utilizing a VQC in the noise generator of a GAN to generate small molecules achieves better physicochemical properties and performance in the goal-directed benchmark than the classical counterpart. Moreover, we demonstrate the potential of a VQC with only tens of learnable parameters in the generator of GAN to generate small molecules. We also demonstrate the quantum advantage of a VQC in the discriminator of GAN. In this hybrid model, the number of learnable parameters is significantly less than the classical ones, and it can still generate valid molecules. The hybrid model with only tens of training parameters in the quantum discriminator outperforms the MLP-based one in terms of both generated molecule properties and the achieved KL divergence.
△ Less
Submitted 6 February, 2023; v1 submitted 30 October, 2022;
originally announced October 2022.
-
Quantum Simulation of Preferred Tautomeric State Prediction
Authors:
Yu Shee,
Tzu-Lan Yeh,
Jen-Yueh Hsiao,
Ann Yang,
Yen-Chu Lin,
Min-Hsiu Hsieh
Abstract:
Prediction of tautomers plays an essential role in computer-aided drug discovery. However, it remains a challenging task nowadays to accurately predict the canonical tautomeric form of a given drug-like molecule. Lack of extensive tautomer databases, most likely due to the difficulty in experimental studies, hampers the development of effective empirical methods for tautomer predictions. A more ac…
▽ More
Prediction of tautomers plays an essential role in computer-aided drug discovery. However, it remains a challenging task nowadays to accurately predict the canonical tautomeric form of a given drug-like molecule. Lack of extensive tautomer databases, most likely due to the difficulty in experimental studies, hampers the development of effective empirical methods for tautomer predictions. A more accurate estimation of the stable tautomeric form can be achieved by quantum chemistry calculations. Yet, the computational cost required prevents quantum chemistry calculation as a standard tool for tautomer prediction in computer-aided drug discovery. In this paper we propose a hybrid quantum chemistry-quantum computation workflow to efficiently predict the dominant tautomeric form. Specifically, we select active-space molecular orbitals based on quantum chemistry methods. Then we utilize efficient encoding methods to map the Hamiltonian onto quantum devices to reduce the qubit resources and circuit depth. Finally, variational quantum eigensolver (VQE) algorithms are employed for ground state estimation where hardware-efficient ansatz circuits are used. To demonstrate the applicability of our methodology, we perform experiments on two tautomeric systems: acetone and Edaravone, each having 52 and 150 spin-orbitals in the STO-3G basis set, respectively. Our numerical results show that their tautomeric state prediction agrees with the CCSD benchmarks. Moreover, the required quantum resources are efficient: in the example of Edaravone, we could achieve chemical accuracy with only eight qubits and 80 two-qubit gates.
△ Less
Submitted 6 October, 2022;
originally announced October 2022.
-
Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features
Authors:
Shichao Xu,
Yikang Li,
Jenhao Hsiao,
Chiuman Ho,
Zhu Qi
Abstract:
In computer vision, multi-label recognition are important tasks with many real-world applications, but classifying previously unseen labels remains a significant challenge. In this paper, we propose a novel algorithm, Aligned Dual moDality ClaSsifier (ADDS), which includes a Dual-Modal decoder (DM-decoder) with alignment between visual and textual features, for open-vocabulary multi-label classifi…
▽ More
In computer vision, multi-label recognition are important tasks with many real-world applications, but classifying previously unseen labels remains a significant challenge. In this paper, we propose a novel algorithm, Aligned Dual moDality ClaSsifier (ADDS), which includes a Dual-Modal decoder (DM-decoder) with alignment between visual and textual features, for open-vocabulary multi-label classification tasks. Then we design a simple and yet effective method called Pyramid-Forwarding to enhance the performance for inputs with high resolutions. Moreover, the Selective Language Supervision is applied to further enhance the model performance. Extensive experiments conducted on several standard benchmarks, NUS-WIDE, ImageNet-1k, ImageNet-21k, and MS-COCO, demonstrate that our approach significantly outperforms previous methods and provides state-of-the-art performance for open-vocabulary multi-label classification, conventional multi-label classification and an extreme case called single-to-multi label classification where models trained on single-label datasets (ImageNet-1k, ImageNet-21k) are tested on multi-label ones (MS-COCO and NUS-WIDE).
△ Less
Submitted 7 October, 2023; v1 submitted 19 August, 2022;
originally announced August 2022.
-
A framework for model-assisted T x E x M exploration in maize
Authors:
Jennifer Hsiao,
Soo-Hyung Kim,
Dennis J. Timlin,
Nathaniel D. Mueller,
Abigail L. S. Swann
Abstract:
Breeding for new crop characteristics and adjusting management practices are critical avenues to mitigate yield loss and maintain yield stability under a changing climate. However, identifying high-performing plant traits and management options for different growing regions through traditional breeding practices and agronomic field trials is often time and resource-intensive. Mechanistic crop simu…
▽ More
Breeding for new crop characteristics and adjusting management practices are critical avenues to mitigate yield loss and maintain yield stability under a changing climate. However, identifying high-performing plant traits and management options for different growing regions through traditional breeding practices and agronomic field trials is often time and resource-intensive. Mechanistic crop simulation models can serve as powerful tools to help synthesize cropping information, set breeding targets, and develop adaptation strategies to sustain food production. In this study, we develop a modeling framework for a mechanistic crop model (MAIZSIM) to run many simulations within a trait x environment x management landscape and demonstrate how such a modeling framework could be used to identify ideal trait-management combinations that maximize yield and yield stability for different agro-climate regions in the US.
△ Less
Submitted 8 June, 2022; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Unentangled quantum reinforcement learning agents in the OpenAI Gym
Authors:
Jen-Yueh Hsiao,
Yuxuan Du,
Wei-Yin Chiang,
Min-Hsiu Hsieh,
Hsi-Sheng Goan
Abstract:
Classical reinforcement learning (RL) has generated excellent results in different regions; however, its sample inefficiency remains a critical issue. In this paper, we provide concrete numerical evidence that the sample efficiency (the speed of convergence) of quantum RL could be better than that of classical RL, and for achieving comparable learning performance, quantum RL could use much (at lea…
▽ More
Classical reinforcement learning (RL) has generated excellent results in different regions; however, its sample inefficiency remains a critical issue. In this paper, we provide concrete numerical evidence that the sample efficiency (the speed of convergence) of quantum RL could be better than that of classical RL, and for achieving comparable learning performance, quantum RL could use much (at least one order of magnitude) fewer trainable parameters than classical RL. Specifically, we employ the popular benchmarking environments of RL in the OpenAI Gym, and show that our quantum RL agent converges faster than classical fully-connected neural networks (FCNs) in the tasks of CartPole and Acrobot under the same optimization process. We also successfully train the first quantum RL agent that can complete the task of LunarLander in the OpenAI Gym. Our quantum RL agent only requires a single-qubit-based variational quantum circuit without entangling gates, followed by a classical neural network (NN) to post-process the measurement output. Finally, we could accomplish the aforementioned tasks on the real IBM quantum machines. To the best of our knowledge, none of the earlier quantum RL agents could do that.
△ Less
Submitted 27 March, 2022;
originally announced March 2022.
-
Roadmap of Designing Cognitive Metrics for Explainable Artificial Intelligence (XAI)
Authors:
Janet Hui-wen Hsiao,
Hilary Hei Ting Ngai,
Luyu Qiu,
Yi Yang,
Caleb Chen Cao
Abstract:
More recently, Explainable Artificial Intelligence (XAI) research has shifted to focus on a more pragmatic or naturalistic account of understanding, that is, whether the stakeholders understand the explanation. This point is especially important for research on evaluation methods for XAI systems. Thus, another direction where XAI research can benefit significantly from cognitive science and psycho…
▽ More
More recently, Explainable Artificial Intelligence (XAI) research has shifted to focus on a more pragmatic or naturalistic account of understanding, that is, whether the stakeholders understand the explanation. This point is especially important for research on evaluation methods for XAI systems. Thus, another direction where XAI research can benefit significantly from cognitive science and psychology research is ways to measure understanding of users, responses and attitudes. These measures can be used to quantify explanation quality and as feedback to the XAI system to improve the explanations. The current report aims to propose suitable metrics for evaluating XAI systems from the perspective of the cognitive states and processes of stakeholders. We elaborate on 7 dimensions, i.e., goodness, satisfaction, user understanding, curiosity & engagement, trust & reliance, controllability & interactivity, and learning curve & productivity, together with the recommended subjective and objective psychological measures. We then provide more details about how we can use the recommended measures to evaluate a visual classification XAI system according to the recommended cognitive metrics.
△ Less
Submitted 20 July, 2021;
originally announced August 2021.
-
Resisting Out-of-Distribution Data Problem in Perturbation of XAI
Authors:
Luyu Qiu,
Yi Yang,
Caleb Chen Cao,
Jing Liu,
Yueyuan Zheng,
Hilary Hei Ting Ngai,
Janet Hsiao,
Lei Chen
Abstract:
With the rapid development of eXplainable Artificial Intelligence (XAI), perturbation-based XAI algorithms have become quite popular due to their effectiveness and ease of implementation. The vast majority of perturbation-based XAI techniques face the challenge of Out-of-Distribution (OoD) data -- an artifact of randomly perturbed data becoming inconsistent with the original dataset. OoD data lead…
▽ More
With the rapid development of eXplainable Artificial Intelligence (XAI), perturbation-based XAI algorithms have become quite popular due to their effectiveness and ease of implementation. The vast majority of perturbation-based XAI techniques face the challenge of Out-of-Distribution (OoD) data -- an artifact of randomly perturbed data becoming inconsistent with the original dataset. OoD data leads to the over-confidence problem in model predictions, making the existing XAI approaches unreliable. To our best knowledge, the OoD data problem in perturbation-based XAI algorithms has not been adequately addressed in the literature. In this work, we address this OoD data problem by designing an additional module quantifying the affinity between the perturbed data and the original dataset distribution, which is integrated into the process of aggregation. Our solution is shown to be compatible with the most popular perturbation-based XAI algorithms, such as RISE, OCCLUSION, and LIME. Experiments have confirmed that our methods demonstrate a significant improvement in general cases using both computational and cognitive metrics. Especially in the case of degradation, our proposed approach demonstrates outstanding performance comparing to baselines. Besides, our solution also resolves a fundamental problem with the faithfulness indicator, a commonly used evaluation metric of XAI algorithms that appears to be sensitive to the OoD issue.
△ Less
Submitted 27 July, 2021;
originally announced July 2021.
-
An Update on a Progressively Expanded Database for Automated Lung Sound Analysis
Authors:
Fu-Shun Hsu,
Shang-Ran Huang,
Chien-Wen Huang,
Yuan-Ren Cheng,
Chun-Chieh Chen,
Jack Hsiao,
Chung-Wei Chen,
Feipei Lai
Abstract:
Purpose: We previously established an open-access lung sound database, HF_Lung_V1, and developed deep learning models for inhalation, exhalation, continuous adventitious sound (CAS), and discontinuous adventitious sound (DAS) detection. The amount of data used for training contributes to model accuracy. Herein, we collected larger quantities of data to further improve model performance. Moreover,…
▽ More
Purpose: We previously established an open-access lung sound database, HF_Lung_V1, and developed deep learning models for inhalation, exhalation, continuous adventitious sound (CAS), and discontinuous adventitious sound (DAS) detection. The amount of data used for training contributes to model accuracy. Herein, we collected larger quantities of data to further improve model performance. Moreover, the issues of noisy labels and sound overlapping were explored. Methods: HF_Lung_V1 was expanded to HF_Lung_V2 with a 1.45x increase in the number of audio files. Convolutional neural network-bidirectional gated recurrent unit network models were trained separately using the HF_Lung_V1 (V1_Train) and HF_Lung_V2 (V2_Train) training sets and then tested using the HF_Lung_V1 (V1_Test) and HF_Lung_V2 (V2_Test) test sets, respectively. Segment and event detection performance was evaluated using the F1 scores. Label quality was assessed. Moreover, the overlap ratios between inhalation, exhalation, CAS, and DAS labels were computed. Results: The model trained using V2_Train exhibited improved F1 scores in inhalation, exhalation, and CAS detection on both V1_Test and V2_Test but not in DAS detection. Poor CAS detection was attributed to the quality of CAS labels. DAS detection was strongly influenced by the overlapping of DAS labels with inhalation and exhalation labels. Conclusion: Collecting greater quantities of lung sound data is vital for developing more accurate lung sound analysis models. To build real ground-truth labels, the labels must be reworked; this process is ongoing. Furthermore, a method for addressing the sound overlapping problem in DAS detection must be formulated.
△ Less
Submitted 29 September, 2021; v1 submitted 8 February, 2021;
originally announced February 2021.
-
Benchmarking of eight recurrent neural network variants for breath phase and adventitious sound detection on a self-developed open-access lung sound database-HF_Lung_V1
Authors:
Fu-Shun Hsu,
Shang-Ran Huang,
Chien-Wen Huang,
Chao-Jung Huang,
Yuan-Ren Cheng,
Chun-Chieh Chen,
Jack Hsiao,
Chung-Wei Chen,
Li-Chin Chen,
Yen-Chun Lai,
Bi-Fang Hsu,
Nian-Jhen Lin,
Wan-Lin Tsai,
Yi-Lin Wu,
Tzu-Ling Tseng,
Ching-Ting Tseng,
Yi-Tsun Chen,
Feipei Lai
Abstract:
A reliable, remote, and continuous real-time respiratory sound monitor with automated respiratory sound analysis ability is urgently required in many clinical scenarios-such as in monitoring disease progression of coronavirus disease 2019-to replace conventional auscultation with a handheld stethoscope. However, a robust computerized respiratory sound analysis algorithm has not yet been validated…
▽ More
A reliable, remote, and continuous real-time respiratory sound monitor with automated respiratory sound analysis ability is urgently required in many clinical scenarios-such as in monitoring disease progression of coronavirus disease 2019-to replace conventional auscultation with a handheld stethoscope. However, a robust computerized respiratory sound analysis algorithm has not yet been validated in practical applications. In this study, we developed a lung sound database (HF_Lung_V1) comprising 9,765 audio files of lung sounds (duration of 15 s each), 34,095 inhalation labels, 18,349 exhalation labels, 13,883 continuous adventitious sound (CAS) labels (comprising 8,457 wheeze labels, 686 stridor labels, and 4,740 rhonchi labels), and 15,606 discontinuous adventitious sound labels (all crackles). We conducted benchmark tests for long short-term memory (LSTM), gated recurrent unit (GRU), bidirectional LSTM (BiLSTM), bidirectional GRU (BiGRU), convolutional neural network (CNN)-LSTM, CNN-GRU, CNN-BiLSTM, and CNN-BiGRU models for breath phase detection and adventitious sound detection. We also conducted a performance comparison between the LSTM-based and GRU-based models, between unidirectional and bidirectional models, and between models with and without a CNN. The results revealed that these models exhibited adequate performance in lung sound analysis. The GRU-based models outperformed, in terms of F1 scores and areas under the receiver operating characteristic curves, the LSTM-based models in most of the defined tasks. Furthermore, all bidirectional models outperformed their unidirectional counterparts. Finally, the addition of a CNN improved the accuracy of lung sound analysis, especially in the CAS detection tasks.
△ Less
Submitted 12 July, 2022; v1 submitted 5 February, 2021;
originally announced February 2021.
-
GCF-Net: Gated Clip Fusion Network for Video Action Recognition
Authors:
Jenhao Hsiao,
Jiawei Chen,
Chiuman Ho
Abstract:
In recent years, most of the accuracy gains for video action recognition have come from the newly designed CNN architectures (e.g., 3D-CNNs). These models are trained by applying a deep CNN on single clip of fixed temporal length. Since each video segment are processed by the 3D-CNN module separately, the corresponding clip descriptor is local and the inter-clip relationships are inherently implic…
▽ More
In recent years, most of the accuracy gains for video action recognition have come from the newly designed CNN architectures (e.g., 3D-CNNs). These models are trained by applying a deep CNN on single clip of fixed temporal length. Since each video segment are processed by the 3D-CNN module separately, the corresponding clip descriptor is local and the inter-clip relationships are inherently implicit. Common method that directly averages the clip-level outputs as a video-level prediction is prone to fail due to the lack of mechanism that can extract and integrate relevant information to represent the video.
In this paper, we introduce the Gated Clip Fusion Network (GCF-Net) that can greatly boost the existing video action classifiers with the cost of a tiny computation overhead. The GCF-Net explicitly models the inter-dependencies between video clips to strengthen the receptive field of local clip descriptors. Furthermore, the importance of each clip to an action event is calculated and a relevant subset of clips is selected accordingly for a video-level analysis. On a large benchmark dataset (Kinetics-600), the proposed GCF-Net elevates the accuracy of existing action classifiers by 11.49% (based on central clip) and 3.67% (based on densely sampled clips) respectively.
△ Less
Submitted 1 February, 2021;
originally announced February 2021.
-
Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition
Authors:
Jiawei Chen,
Jenson Hsiao,
Chiu Man Ho
Abstract:
Human action recognition is regarded as a key cornerstone in domains such as surveillance or video understanding. Despite recent progress in the development of end-to-end solutions for video-based action recognition, achieving state-of-the-art performance still requires using auxiliary hand-crafted motion representations, e.g., optical flow, which are usually computationally demanding. In this wor…
▽ More
Human action recognition is regarded as a key cornerstone in domains such as surveillance or video understanding. Despite recent progress in the development of end-to-end solutions for video-based action recognition, achieving state-of-the-art performance still requires using auxiliary hand-crafted motion representations, e.g., optical flow, which are usually computationally demanding. In this work, we propose to use residual frames (i.e., differences between adjacent RGB frames) as an alternative "lightweight" motion representation, which carries salient motion information and is computationally efficient. In addition, we develop a new pseudo-3D convolution module which decouples 3D convolution into 2D and 1D convolution. The proposed module exploits residual information in the feature space to better structure motions, and is equipped with a self-attention mechanism that assists to recalibrate the appearance and motion features. Empirical results confirm the efficiency and effectiveness of residual frames as well as the proposed pseudo-3D convolution module.
△ Less
Submitted 3 August, 2020;
originally announced August 2020.
-
Miniaturized optical frequency standard for next-generation portable optical clocks
Authors:
Vincent Maurice,
Zachary L. Newman,
Susannah Dickerson,
Morgan Rivers,
James Hsiao,
Phillip Greene,
Mark Mescher,
John Kitching,
Matthew T. Hummon,
Cort Johnson
Abstract:
Optical frequency standards, lasers stabilized to atomic or molecular transitions, are widely used in length metrology and laser ranging, provide a backbone for optical communications and lie at the heart of next-generation optical atomic clocks. Here we demonstrate a compact, low-power optical frequency standard based on the Doppler-free, two-photon transition in rubidium-87 at 778 nm implemented…
▽ More
Optical frequency standards, lasers stabilized to atomic or molecular transitions, are widely used in length metrology and laser ranging, provide a backbone for optical communications and lie at the heart of next-generation optical atomic clocks. Here we demonstrate a compact, low-power optical frequency standard based on the Doppler-free, two-photon transition in rubidium-87 at 778 nm implemented on a micro-optics breadboard. The optical standard achieves a fractional frequency stability of 2.9x10$^{-12}$/$\sqrtτ$ for averaging times $τ$ less than 10$^{3}$ s, has a volume of $\approx$35 cm$^3$ and operates on $\approx$450 mW of electrical power. These results demonstrate a key step towards the development of compact optical clocks and the broad dissemination of SI-traceable wavelength references.
△ Less
Submitted 29 March, 2020;
originally announced March 2020.
-
Maize yield under a changing climate: The hidden role of vapor pressure deficit
Authors:
Jennifer Hsiao,
Abigail L. S. Swann,
Soo-Hyung Kim
Abstract:
Temperatures over the next century are expected to rise to levels detrimental to crop growth and yield. As the atmosphere warms without additional water vapor input, vapor pressure deficit (VPD) increases as well. Increased temperatures and accompanied elevated VPD levels can both lead to negative impacts on crop yield. The independent importance of VPD, however, is often neglected or conflated wi…
▽ More
Temperatures over the next century are expected to rise to levels detrimental to crop growth and yield. As the atmosphere warms without additional water vapor input, vapor pressure deficit (VPD) increases as well. Increased temperatures and accompanied elevated VPD levels can both lead to negative impacts on crop yield. The independent importance of VPD, however, is often neglected or conflated with that from temperature due to a tight correlation between the two climate factors. We used a coupled process-based crop (MAIZSIM) and soil (2DSOIL) model to gain a mechanistic understanding of the independent roles temperature and VPD play in crop yield projections, as well as their interactions with rising CO2 levels and changing precipitation patterns. We found that by separating out the VPD effect from rising temperatures, VPD increases had a greater negative impact on yield compared to that from warming. The negative impact of these two factors varied with precipitation levels and influenced yield through separate mechanisms. Warmer temperatures caused yield loss mainly through shortening the growing season, while elevated VPD increased water loss and triggered several water stress responses such as reduced photosynthetic rates, lowered leaf area development, and shortened growing season length. Elevated CO2 concentrations partially alleviated yield loss under warming or increased VPD conditions through water savings, but the impact level varied with precipitation levels and was most pronounced under drier conditions. These results demonstrate the key role VPD plays in crop growth and yield, displaying a magnitude of impact comparative to temperature and CO2. A mechanistic understanding of the function of VPD and its relation with other climate factors and management practices is critical to improving crop yield projections under a changing climate.
△ Less
Submitted 7 October, 2019;
originally announced October 2019.
-
Surface-emitting electroholographic SAW modulator
Authors:
Joy C. Perkinson,
Michael G. Moebius,
Elizabeth J. Brundage,
William A. Teynor,
Steven J. Byrnes,
James C. Hsiao,
William D. Sawyer,
Dennis M. Callahan,
Ian W. Frank,
John J. LeBlanc,
Gregg E. Favalora
Abstract:
We report the design and operation of a surface-emitting surface acoustic wave (SAW) acousto-optical modulator which behaves as a cm-scale linear hologram in response to an applied electronic waveform. The modulator is formed by an optical waveguide, transducer, and out-coupling surface grating on a 1 mm-thick lithium niobate substrate. We demonstrate the ability to load and illuminate a 9-region…
▽ More
We report the design and operation of a surface-emitting surface acoustic wave (SAW) acousto-optical modulator which behaves as a cm-scale linear hologram in response to an applied electronic waveform. The modulator is formed by an optical waveguide, transducer, and out-coupling surface grating on a 1 mm-thick lithium niobate substrate. We demonstrate the ability to load and illuminate a 9-region linear hologram into the modulator's 8 mm-long interaction region using applied waveforms of 280-320 MHz. To the best of the authors' knowledge, this is the first demonstration of a monolithically-integrated, surface-emitting SAW modulator fabricated using lithographic techniques. Applications include practical implementations of a holographic display.
△ Less
Submitted 17 June, 2019;
originally announced June 2019.
-
EMHMM Simulation Study
Authors:
Antoni B. Chan,
Janet H. Hsiao
Abstract:
Eye Movement analysis with Hidden Markov Models (EMHMM) is a method for modeling eye fixation sequences using hidden Markov models (HMMs). In this report, we run a simulation study to investigate the estimation error for learning HMMs with variational Bayesian inference, with respect to the number of sequences and the sequence lengths. We also relate the estimation error measured by KL divergence…
▽ More
Eye Movement analysis with Hidden Markov Models (EMHMM) is a method for modeling eye fixation sequences using hidden Markov models (HMMs). In this report, we run a simulation study to investigate the estimation error for learning HMMs with variational Bayesian inference, with respect to the number of sequences and the sequence lengths. We also relate the estimation error measured by KL divergence and L1-norm to a corresponding distortion in the ground-truth HMM parameters.
△ Less
Submitted 24 June, 2019; v1 submitted 17 October, 2018;
originally announced October 2018.
-
Bernstein-Sato Polynomials on Normal Toric Varieties
Authors:
Jen-Chieh Hsiao,
Laura Felicia Matusevich
Abstract:
We generalize the Bernstein-Sato polynomials of Budur, Mustata and Saito to ideals in normal semigroup rings. In the case of monomial ideals, we also relate the roots of the Bernstein-Sato polynomial to the jumping coefficients of the corresponding multiplier ideals. In order to prove the latter result, we obtain a new combinatorial description for the multiplier ideals of a monomial ideal in a no…
▽ More
We generalize the Bernstein-Sato polynomials of Budur, Mustata and Saito to ideals in normal semigroup rings. In the case of monomial ideals, we also relate the roots of the Bernstein-Sato polynomial to the jumping coefficients of the corresponding multiplier ideals. In order to prove the latter result, we obtain a new combinatorial description for the multiplier ideals of a monomial ideal in a normal semigroup ring.
△ Less
Submitted 11 August, 2016;
originally announced August 2016.
-
Pattern Formation with a Compartmental Lateral Inhibition System
Authors:
Ana Sofia Rufino Ferreira,
Justin Hsia,
Murat Arcak,
Michel Maharbiz,
Adam Arkin
Abstract:
We propose a compartmental lateral inhibition system that generates contrasting patterns of gene expression between neighboring compartments. The system consists of a set of compartments interconnected by channels. Each compartment contains a colony of cells that produce diffusible molecules to be detected by the neighboring colony, and each cell is equipped with an inhibitory circuit that reduces…
▽ More
We propose a compartmental lateral inhibition system that generates contrasting patterns of gene expression between neighboring compartments. The system consists of a set of compartments interconnected by channels. Each compartment contains a colony of cells that produce diffusible molecules to be detected by the neighboring colony, and each cell is equipped with an inhibitory circuit that reduces its production when the detected signal is stronger. We develop a technique to analyze the steady-state patterns emerging from this lateral inhibition system and apply it to a specific implementation. The analysis shows that the proposed system indeed exhibits contrasting patterns within realistic parameter ranges.
△ Less
Submitted 23 July, 2014;
originally announced July 2014.
-
On the F-rationality and cohomological properties of matrix Schubert varieties
Authors:
Jen-Chieh Hsiao
Abstract:
We characterize complete intersection matrix Schubert varieties, generalizing the classical result on one-sided ladder determinantal varieties. We also give a new proof of the F-rationality of matrix Schubert varieties. Although it is known that such varieties are F-regular (hence F-rational) by the global F-regularity of Schubert varieties, our proof is of independent interest since it does not r…
▽ More
We characterize complete intersection matrix Schubert varieties, generalizing the classical result on one-sided ladder determinantal varieties. We also give a new proof of the F-rationality of matrix Schubert varieties. Although it is known that such varieties are F-regular (hence F-rational) by the global F-regularity of Schubert varieties, our proof is of independent interest since it does not require the Bott-Samelson resolution. As a consequence, this provides an alternative proof of the classical fact that Schubert varieties in flag varieties are normal and have rational singularities.
△ Less
Submitted 23 October, 2013;
originally announced October 2013.
-
Effects of Tip-Nanotube Interactions on Atomic Force Microscopy Imaging of Carbon Nanotubes
Authors:
Rouholla Alizadegan,
Albert D. Liao,
Feng Xiong,
Eric Pop,
K. Jimmy Hsia
Abstract:
We examine the effect of van der Waals (vdW) interactions between atomic force microscope (AFM) tips and individual carbon nanotubes (CNTs) supported on SiO2. Molecular dynamics (MD) simulations reveal how CNTs deform during AFM measurement, irrespective of the AFM tip material. The apparent height of a single- (double-) walled CNT can be used to estimate its diameter up to ~2 nm (~3 nm), but for…
▽ More
We examine the effect of van der Waals (vdW) interactions between atomic force microscope (AFM) tips and individual carbon nanotubes (CNTs) supported on SiO2. Molecular dynamics (MD) simulations reveal how CNTs deform during AFM measurement, irrespective of the AFM tip material. The apparent height of a single- (double-) walled CNT can be used to estimate its diameter up to ~2 nm (~3 nm), but for larger diameters the CNT cross-section is no longer circular. Our simulations were compared against CNT dimensions obtained from AFM measurements and resonant Raman spectroscopy, with good agreement for the smaller CNT di-ameters. In general, AFM measurements of large-diameter CNTs must be interpreted with care, but the reliability of the approach is improved if knowledge of the number of CNT walls is avail-able, or if additional verification (e.g. by optical techniques) can be obtained.
△ Less
Submitted 6 July, 2012;
originally announced July 2012.
-
A Counterexample for Subadditivity of Multiplier Ideals on Toric Varieties
Authors:
Jen-Chieh Hsiao
Abstract:
We construct a 3-dimensional complete intersection toric variety on which the subadditivity formula doesn't hold, answering negatively a question by Takagi and Watanabe. A combinatorial proof of the subadditivity formula on 2-dimensional normal toric varieties is also provided.
We construct a 3-dimensional complete intersection toric variety on which the subadditivity formula doesn't hold, answering negatively a question by Takagi and Watanabe. A combinatorial proof of the subadditivity formula on 2-dimensional normal toric varieties is also provided.
△ Less
Submitted 29 April, 2011;
originally announced April 2011.
-
Cartier modules on toric varieties
Authors:
Jen-Chieh Hsiao,
Karl Schwede,
Wenliang Zhang
Abstract:
Assume that $X$ is an affine toric variety of characteristic $p > 0$. Let $Δ$ be an effective toric $Q$-divisor such that $K_X+Δ$ is $Q$-Cartier with index not divisible by $p$ and let $φ_Δ:F^e_* O_X \to O_X$ be the toric map corresponding to $Δ$. We identify all ideals $I$ of $O_X$ with $φ_Δ(F^e_* I)=I$ combinatorially and also in terms of a log resolution (giving us a version of these ideals whi…
▽ More
Assume that $X$ is an affine toric variety of characteristic $p > 0$. Let $Δ$ be an effective toric $Q$-divisor such that $K_X+Δ$ is $Q$-Cartier with index not divisible by $p$ and let $φ_Δ:F^e_* O_X \to O_X$ be the toric map corresponding to $Δ$. We identify all ideals $I$ of $O_X$ with $φ_Δ(F^e_* I)=I$ combinatorially and also in terms of a log resolution (giving us a version of these ideals which can be defined in characteristic zero). Moreover, given a toric ideal $\ba$, we identify all ideals $I$ fixed by the Cartier algebra generated by $φ_Δ$ and $\ba$; this answers a question by Manuel Blickle in the toric setting.
△ Less
Submitted 13 April, 2012; v1 submitted 3 November, 2010;
originally announced November 2010.
-
Thermal Dissipation and Variability in Electrical Breakdown of Carbon Nanotube Devices
Authors:
Albert Liao,
Rouholla Alizadegan,
Zhun-Yong Ong,
Sumit Dutta,
K. Jimmy Hsia,
Eric Pop
Abstract:
We study high-field electrical breakdown and heat dissipation from carbon nanotube (CNT) devices on SiO2 substrates. The thermal "footprint" of a CNT caused by van der Waals interactions with the substrate is revealed through molecular dynamics (MD) simulations. Experiments and modeling find the CNT-substrate thermal coupling scales proportionally to CNT diameter and inversely with SiO2 surface…
▽ More
We study high-field electrical breakdown and heat dissipation from carbon nanotube (CNT) devices on SiO2 substrates. The thermal "footprint" of a CNT caused by van der Waals interactions with the substrate is revealed through molecular dynamics (MD) simulations. Experiments and modeling find the CNT-substrate thermal coupling scales proportionally to CNT diameter and inversely with SiO2 surface roughness (~d/Δ). Comparison of diffuse mismatch modeling (DMM) and data reveals the upper limit of thermal coupling ~0.4 W/K/m per unit length at room temperature, and ~0.7 W/K/m at 600 C for the largest diameter (3-4 nm) CNTs. We also find semiconducting CNTs can break down prematurely, and display more breakdown variability due to dynamic shifts in threshold voltage, which metallic CNTs are immune to; this poses a fundamental challenge for selective electrical breakdowns in CNT electronics.
△ Less
Submitted 5 November, 2010; v1 submitted 24 May, 2010;
originally announced May 2010.
-
D-module structure of local cohomology modules of toric algebras
Authors:
Jen-Chieh Hsiao
Abstract:
Let S be a toric algebra over a field K of characteristic 0 and let I be a monomial ideal of S. We show that the local cohomology modules H^i_I(S) are of finite length over the ring of differential operators D(S;K), generalizing the classical case of a polynomial algebra S. As an application, we compute the characteristic cycles of some local cohomology modules.
Let S be a toric algebra over a field K of characteristic 0 and let I be a monomial ideal of S. We show that the local cohomology modules H^i_I(S) are of finite length over the ring of differential operators D(S;K), generalizing the classical case of a polynomial algebra S. As an application, we compute the characteristic cycles of some local cohomology modules.
△ Less
Submitted 11 May, 2010;
originally announced May 2010.
-
The spontaneous emergence of ordered phases in crumpled sheets
Authors:
Yen-Chih Lin,
Ji-Ming Sun,
Jen-Hao Hsiao,
Yeukuang Hwu,
C. L. Wang,
Tzay-Ming Hong
Abstract:
X-ray tomography is performed to acquire 3D images of crumpled aluminum foils. We develop an algorithm to trace out the labyrinthian paths in the three perpendicular cross sections of the data matrices. The tangent-tangent correlation function along each path is found to decay exponentially with an effective persistence length that shortens as the crumpled ball becomes more compact. In the mean…
▽ More
X-ray tomography is performed to acquire 3D images of crumpled aluminum foils. We develop an algorithm to trace out the labyrinthian paths in the three perpendicular cross sections of the data matrices. The tangent-tangent correlation function along each path is found to decay exponentially with an effective persistence length that shortens as the crumpled ball becomes more compact. In the mean time, we observed ordered domains near the crust, similar to the lamellae phase mixed by the amorphous portion in lyotropic liquid crystals. The size and density of these domains grow with further compaction, and their orientation favors either perpendicular or parallel to the radial direction. Ordering is also identified near the core with an arbitrary orientation, exemplary of the spontaneous symmetry breaking.
△ Less
Submitted 21 December, 2009;
originally announced December 2009.
-
A solenoidal electron spectrometer for a precision measurement of the neutron $β$-asymmetry with ultracold neutrons
Authors:
B. Plaster,
R. Carr,
B. W. Filippone,
D. Harrison,
J. Hsiao,
T. M. Ito,
J. Liu,
J. W. Martin,
B. Tipton,
J. Yuan
Abstract:
We describe an electron spectrometer designed for a precision measurement of the neutron $β$-asymmetry with spin-polarized ultracold neutrons. The spectrometer consists of a 1.0-Tesla solenoidal field with two identical multiwire proportional chamber and plastic scintillator electron detector packages situated within 0.6-Tesla field-expansion regions. Select results from performance studies of t…
▽ More
We describe an electron spectrometer designed for a precision measurement of the neutron $β$-asymmetry with spin-polarized ultracold neutrons. The spectrometer consists of a 1.0-Tesla solenoidal field with two identical multiwire proportional chamber and plastic scintillator electron detector packages situated within 0.6-Tesla field-expansion regions. Select results from performance studies of the spectrometer with calibration sources are reported.
△ Less
Submitted 12 June, 2008;
originally announced June 2008.
-
Experimental Studies of Low-field Landau Quantization in Two-dimensional Electron Systems in GaAs/AlGaAs Heterostructures
Authors:
Jing-Han Chen,
D. R. Hang,
C. F. Huang,
Tsai-Yu Huang,
Jyun-Ying Lin,
S. H. Lo,
J. C. Hsiao,
Ming-Gu Lin,
M. Y. Simmons,
D. A. Ritchie,
C. -T. Liang
Abstract:
By applying a magnetic field perpendicular to GaAs/AlGaAs two-dimensional electron systems, we study the low-field Landau quantization when the thermal damping is reduced with decreasing the temperature. Magneto-oscillations following Shubnikov-de Haas (SdH) formula are observed even when their amplitudes are so large that the deviation to such a formula is expected. Our experimental results sho…
▽ More
By applying a magnetic field perpendicular to GaAs/AlGaAs two-dimensional electron systems, we study the low-field Landau quantization when the thermal damping is reduced with decreasing the temperature. Magneto-oscillations following Shubnikov-de Haas (SdH) formula are observed even when their amplitudes are so large that the deviation to such a formula is expected. Our experimental results show the importance of the positive magneto-resistance to the extension of SdH formula under the damping induced by the disorder.
△ Less
Submitted 29 October, 2006;
originally announced October 2006.
-
From semiclassical transport to quantum Hall effect under low-field Landau quantization
Authors:
D. R. Hang,
C. F. Huang,
Y. W. Zhang,
H. D. Yeh,
J. C. Hsiao,
H. L. Pang
Abstract:
The crossover from the semiclassical transport to quantum Hall effect is studied by examining a two-dimensional electron system in an AlGaAs/GaAs heterostructure. By probing the magneto-oscillations, it is shown that the semiclassical Shubnikov-de Haas (SdH) formulation can be valid even when the minima of the longitudinal resistivity approach zero. The extension of the applicable range of the S…
▽ More
The crossover from the semiclassical transport to quantum Hall effect is studied by examining a two-dimensional electron system in an AlGaAs/GaAs heterostructure. By probing the magneto-oscillations, it is shown that the semiclassical Shubnikov-de Haas (SdH) formulation can be valid even when the minima of the longitudinal resistivity approach zero. The extension of the applicable range of the SdH theory could be due to the damping effects resulting from disorder and temperature. Moreover, we observed plateau-plateau transition like behavior with such an extension. From our study, it is important to include the positive magnetoresistance to refine the SdH theory.
△ Less
Submitted 17 August, 2006;
originally announced August 2006.