-
A Persuasion-Based Prompt Learning Approach to Improve Smishing Detection through Data Augmentation
Authors:
Ho Sung Shim,
Hyoungjun Park,
Kyuhan Lee,
Jang-Sun Park,
Seonhye Kang
Abstract:
Smishing, which aims to illicitly obtain personal information from unsuspecting victims, holds significance due to its negative impacts on our society. In prior studies, as a tool to counteract smishing, machine learning (ML) has been widely adopted, which filters and blocks smishing messages before they reach potential victims. However, a number of challenges remain in ML-based smishing detection…
▽ More
Smishing, which aims to illicitly obtain personal information from unsuspecting victims, holds significance due to its negative impacts on our society. In prior studies, as a tool to counteract smishing, machine learning (ML) has been widely adopted, which filters and blocks smishing messages before they reach potential victims. However, a number of challenges remain in ML-based smishing detection, with the scarcity of annotated datasets being one major hurdle. Specifically, given the sensitive nature of smishing-related data, there is a lack of publicly accessible data that can be used for training and evaluating ML models. Additionally, the nuanced similarities between smishing messages and other types of social engineering attacks such as spam messages exacerbate the challenge of smishing classification with limited resources. To tackle this challenge, we introduce a novel data augmentation method utilizing a few-shot prompt learning approach. What sets our approach apart from extant methods is the use of the principles of persuasion, a psychology theory which explains the underlying mechanisms of smishing. By designing prompts grounded in the persuasion principles, our augmented dataset could effectively capture various, important aspects of smishing messages, enabling ML models to be effectively trained. Our evaluation within a real-world context demonstrates that our augmentation approach produces more diverse and higher-quality smishing data instances compared to other cutting-edging approaches, leading to substantial improvements in the ability of ML models to detect the subtle characteristics of smishing messages. Moreover, our additional analyses reveal that the performance improvement provided by our approach is more pronounced when used with ML models that have a larger number of parameters, demonstrating its effectiveness in training large-scale ML models.
△ Less
Submitted 5 November, 2024; v1 submitted 18 October, 2024;
originally announced November 2024.
-
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Authors:
Jinmiao Huang,
Waseem Gharbieh,
Qianhui Wan,
Han Suk Shim,
Chul Lee
Abstract:
Current keyword spotting systems are typically trained with a large amount of pre-defined keywords. Recognizing keywords in an open-vocabulary setting is essential for personalizing smart device interaction. Towards this goal, we propose a pure MLP-based neural network that is based on MLPMixer - an MLP model architecture that effectively replaces the attention mechanism in Vision Transformers. We…
▽ More
Current keyword spotting systems are typically trained with a large amount of pre-defined keywords. Recognizing keywords in an open-vocabulary setting is essential for personalizing smart device interaction. Towards this goal, we propose a pure MLP-based neural network that is based on MLPMixer - an MLP model architecture that effectively replaces the attention mechanism in Vision Transformers. We investigate different ways of adapting the MLPMixer architecture to the QbyE open-vocabulary keyword spotting task. Comparisons with the state-of-the-art RNN and CNN models show that our method achieves better performance in challenging situations (10dB and 6dB environments) on both the publicly available Hey-Snips dataset and a larger scale internal dataset with 400 speakers. Our proposed model also has a smaller number of parameters and MACs compared to the baseline models.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Query-by-Example Keyword Spotting system using Multi-head Attention and Softtriple Loss
Authors:
Jinmiao Huang,
Waseem Gharbieh,
Han Suk Shim,
Eugene Kim
Abstract:
This paper proposes a neural network architecture for tackling the query-by-example user-defined keyword spotting task. A multi-head attention module is added on top of a multi-layered GRU for effective feature extraction, and a normalized multi-head attention module is proposed for feature aggregation. We also adopt the softtriple loss - a combination of triplet loss and softmax loss - and showca…
▽ More
This paper proposes a neural network architecture for tackling the query-by-example user-defined keyword spotting task. A multi-head attention module is added on top of a multi-layered GRU for effective feature extraction, and a normalized multi-head attention module is proposed for feature aggregation. We also adopt the softtriple loss - a combination of triplet loss and softmax loss - and showcase its effectiveness. We demonstrate the performance of our model on internal datasets with different languages and the public Hey-Snips dataset. We compare the performance of our model to a baseline system and conduct an ablation study to show the benefit of each component in our architecture. The proposed work shows solid performance while preserving simplicity.
△ Less
Submitted 7 May, 2021; v1 submitted 13 February, 2021;
originally announced February 2021.
-
Robust Uncalibrated Stereo Rectification with Constrained Geometric Distortions (USR-CGD)
Authors:
Hyunsuk Ko,
Han Suk Shim,
Ouk Choi,
C. -C. Jay Kuo
Abstract:
A novel algorithm for uncalibrated stereo image-pair rectification under the constraint of geometric distortion, called USR-CGD, is presented in this work. Although it is straightforward to define a rectifying transformation (or homography) given the epipolar geometry, many existing algorithms have unwanted geometric distortions as a side effect. To obtain rectified images with reduced geometric d…
▽ More
A novel algorithm for uncalibrated stereo image-pair rectification under the constraint of geometric distortion, called USR-CGD, is presented in this work. Although it is straightforward to define a rectifying transformation (or homography) given the epipolar geometry, many existing algorithms have unwanted geometric distortions as a side effect. To obtain rectified images with reduced geometric distortions while maintaining a small rectification error, we parameterize the homography by considering the influence of various kinds of geometric distortions. Next, we define several geometric measures and incorporate them into a new cost function for parameter optimization. Finally, we propose a constrained adaptive optimization scheme to allow a balanced performance between the rectification error and the geometric error. Extensive experimental results are provided to demonstrate the superb performance of the proposed USR-CGD method, which outperforms existing algorithms by a significant margin.
△ Less
Submitted 31 March, 2016;
originally announced March 2016.
-
Optical - Near-Infrared catalogue for the AKARI North Ecliptic Pole Deep Field
Authors:
Nagisa Oi,
Hideo Matsuhara,
Kazumi Murata,
Tomotsugu Goto,
Takehiko Wada,
Toshinobu Takagi,
Youichi Ohyama,
Matthew Malkan,
Myungshin Im,
Hyunjin Shim Shim,
Stephen Serjeant,
Chris Pearson
Abstract:
Aims. We present an 8-band (u*, g', r', i', z', Y, J, Ks) optical to NIR deep photometric catalog based on the observations made with MegaCam and WIRCam at CFHT, and compute photometric redshifts, zp in the North Ecliptic Pole (NEP) region. Our catalog provides us to identify the counterparts, and zp for AKARI NIR/MIR sources.
Results. The estimated 4sigma detection limits within an 1" aperture…
▽ More
Aims. We present an 8-band (u*, g', r', i', z', Y, J, Ks) optical to NIR deep photometric catalog based on the observations made with MegaCam and WIRCam at CFHT, and compute photometric redshifts, zp in the North Ecliptic Pole (NEP) region. Our catalog provides us to identify the counterparts, and zp for AKARI NIR/MIR sources.
Results. The estimated 4sigma detection limits within an 1" aperture radius are 26.7, 25.9, 25.1, and 24.1 mag [AB] for g', r', i', and z'-bands and 23.4, 23.0, and 22.7 mag for Y, J, and Ks-bands, respectively. There are a total of 85797 sources in the band-merged catalog. An astrometric accuracy of this catalog determined by examining coordinate offsets with regard to 2MASS is 0.013" with a root mean square offset of 0.32". We distinguish 5441 secure stars from extended sources using u*-J vs. g'-Ks colors, combined with the SExtractor stellarity index of the images. Comparing with galaxy spectroscopic redshifts, we find a photometric redshift dispersion, sigma_(dz/(1+z)), of 0.032 and catastrophic failure rate, dz/(1+z)>0.15, of 5.8% at z<1, while a dispersion of 0.117 and a catastrophic failures rate of 16.6% at z>1. We extend estimate of the zp uncertainty over the full magnitude/redshift space with a redshift probability distribution function and find that our redshift are highly accurate with z'<22 at zp<2.5 and for fainter sources with z'<24 at z<1. From the investigation of photometric properties of AKARI infrared sources (23354 sources) using the g'z'Ks diagram, <5% of AKARI sources with optical counterparts are classified as high-z (1.4<z<2.5) star-forming galaxies. Among the high-z star-forming galaxies, AKARI MIR detected sources seem to be affected by stronger dust extinction compared with sources with non-detections in the AKARI MIR bands. The full, electronic version of our catalog with zp will be available at the CDS.
△ Less
Submitted 31 March, 2014;
originally announced March 2014.