-
Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models
Authors:
Cao Yuxuan,
Wu Jiayang,
Alistair Cheong Liang Chuen,
Bryan Shan Guanrong,
Theodore Lee Chong Jen,
Sherman Chann Zhi Shen
Abstract:
Traditional online content moderation systems struggle to classify modern multimodal means of communication, such as memes, a highly nuanced and information-dense medium. This task is especially hard in a culturally diverse society like Singapore, where low-resource languages are used and extensive knowledge on local context is needed to interpret online content. We curate a large collection of 11…
▽ More
Traditional online content moderation systems struggle to classify modern multimodal means of communication, such as memes, a highly nuanced and information-dense medium. This task is especially hard in a culturally diverse society like Singapore, where low-resource languages are used and extensive knowledge on local context is needed to interpret online content. We curate a large collection of 112K memes labeled by GPT-4V for fine-tuning a VLM to classify offensive memes in Singapore context. We show the effectiveness of fine-tuned VLMs on our dataset, and propose a pipeline containing OCR, translation and a 7-billion parameter-class VLM. Our solutions reach 80.62% accuracy and 0.8192 AUROC on a held-out test set, and can greatly aid human in moderating online contents. The dataset, code, and model weights have been open-sourced at https://github.com/aliencaocao/vlm-for-memes-aisg.
△ Less
Submitted 8 March, 2025; v1 submitted 25 February, 2025;
originally announced February 2025.
-
A Decoding Algorithm for Length-Control Summarization Based on Directed Acyclic Transformers
Authors:
Chenyang Huang,
Hao Zhou,
Cameron Jen,
Kangjie Zheng,
Osmar R. Zaïane,
Lili Mou
Abstract:
Length-control summarization aims to condense long texts into a short one within a certain length limit. Previous approaches often use autoregressive (AR) models and treat the length requirement as a soft constraint, which may not always be satisfied. In this study, we propose a novel length-control decoding algorithm based on the Directed Acyclic Transformer (DAT). Our approach allows for multipl…
▽ More
Length-control summarization aims to condense long texts into a short one within a certain length limit. Previous approaches often use autoregressive (AR) models and treat the length requirement as a soft constraint, which may not always be satisfied. In this study, we propose a novel length-control decoding algorithm based on the Directed Acyclic Transformer (DAT). Our approach allows for multiple plausible sequence fragments and predicts a \emph{path} to connect them. In addition, we propose a Sequence Maximum a Posteriori (SeqMAP) decoding algorithm that marginalizes different possible paths and finds the most probable summary satisfying the length budget. Our algorithm is based on beam search, which further facilitates a reranker for performance improvement. Experimental results on the Gigaword and DUC2004 datasets demonstrate our state-of-the-art performance for length-control summarization.
△ Less
Submitted 6 February, 2025;
originally announced February 2025.
-
A Diffusion Process on Riemannian Manifold for Visual Tracking
Authors:
Marcus Chen,
Cham Tat Jen,
Pang Sze Kim,
Alvina Goh
Abstract:
Robust visual tracking for long video sequences is a research area that has many important applications. The main challenges include how the target image can be modeled and how this model can be updated. In this paper, we model the target using a covariance descriptor, as this descriptor is robust to problems such as pixel-pixel misalignment, pose and illumination changes, that commonly occur in v…
▽ More
Robust visual tracking for long video sequences is a research area that has many important applications. The main challenges include how the target image can be modeled and how this model can be updated. In this paper, we model the target using a covariance descriptor, as this descriptor is robust to problems such as pixel-pixel misalignment, pose and illumination changes, that commonly occur in visual tracking. We model the changes in the template using a generative process. We introduce a new dynamical model for the template update using a random walk on the Riemannian manifold where the covariance descriptors lie in. This is done using log-transformed space of the manifold to free the constraints imposed inherently by positive semidefinite matrices. Modeling template variations and poses kinetics together in the state space enables us to jointly quantify the uncertainties relating to the kinematic states and the template in a principled way. Finally, the sequential inference of the posterior distribution of the kinematic states and the template is done using a particle filter. Our results shows that this principled approach can be robust to changes in illumination, poses and spatial affine transformation. In the experiments, our method outperformed the current state-of-the-art algorithm - the incremental Principal Component Analysis method, particularly when a target underwent fast poses changes and also maintained a comparable performance in stable target tracking cases.
△ Less
Submitted 24 March, 2013;
originally announced March 2013.
-
Numerical Investigation of Laser-Assisted Nanoimprinting on a Copper Substrate from a Perspective of Heat Transfer Analysis
Authors:
Chun-Ping Jen
Abstract:
The technique of laser-assisted nanoimprinting lithography (LAN) has been proposed to utilize an excimer laser to irradiate through a quartz mold and melts a thin polymer film on the substrate for micro- to nano-scaled fabrications. In the present study, the novel concept of that copper was adopted as the substrate instead of silicon, which is conventionally used, was proposed. The micro/nano st…
▽ More
The technique of laser-assisted nanoimprinting lithography (LAN) has been proposed to utilize an excimer laser to irradiate through a quartz mold and melts a thin polymer film on the substrate for micro- to nano-scaled fabrications. In the present study, the novel concept of that copper was adopted as the substrate instead of silicon, which is conventionally used, was proposed. The micro/nano structures on the copper substrate could be fabricated by chemical/electrochemical etching or electroforming ; following by the patterns have been transferred onto the substrate using LAN process. Alternatives of the substrate materials could lead versatile applications in micro/nano-fabrication. To demonstrate the feasibility of this concept numerically, this study introduced optical multiple reflection theory to perform both analytical and numerical modeling during the process and to predict the thermal response theoretically.
△ Less
Submitted 7 May, 2008;
originally announced May 2008.
-
Cell Trapping Utilizing Insulator-based Dielectrophoresis in The Open-Top Microchannels
Authors:
Chun-Ping Jen,
Yao-Hung Huang,
Teng-Wen Chen
Abstract:
The ability to manipulate or separate a biological small particle, such as a living cell and embryo, is fundamental needed to many biological and medical applications. The insulator-based dielectrophoresis (iDEP) trapping is composed of conductless tetragon structures in micro-chip. In this study, a lower conductive material of photoresist was adopted as a structure in open-top microchannel inst…
▽ More
The ability to manipulate or separate a biological small particle, such as a living cell and embryo, is fundamental needed to many biological and medical applications. The insulator-based dielectrophoresis (iDEP) trapping is composed of conductless tetragon structures in micro-chip. In this study, a lower conductive material of photoresist was adopted as a structure in open-top microchannel instead of a metallic wire to squeeze the electric field in a conducting solution, therefore, creating a high field gradient with a local maximum. The microchip with the open-top microchannels was designed and fabricated herein. The insulator-based DEP trapping microchip with the open-top microchannels was designed and fabricated in this work. The cells trapped by DEP force could be further treated or cultured in the open-top microchannel ; however, those trapped in the microchip with enclosed microchannels could not be proceeded easily.
△ Less
Submitted 7 May, 2008;
originally announced May 2008.