-
Learning to Plan via Supervised Contrastive Learning and Strategic Interpolation: A Chess Case Study
Authors:
Andrew Hamara,
Greg Hamerly,
Pablo Rivas,
Andrew C. Freeman
Abstract:
Modern chess engines achieve superhuman performance through deep tree search and regressive evaluation, while human players rely on intuition to select candidate moves followed by a shallow search to validate them. To model this intuition-driven planning process, we train a transformer encoder using supervised contrastive learning to embed board states into a latent space structured by positional…
▽ More
Modern chess engines achieve superhuman performance through deep tree search and regressive evaluation, while human players rely on intuition to select candidate moves followed by a shallow search to validate them. To model this intuition-driven planning process, we train a transformer encoder using supervised contrastive learning to embed board states into a latent space structured by positional evaluation. In this space, distance reflects evaluative similarity, and visualized trajectories display interpretable transitions between game states. We demonstrate that move selection can occur entirely within this embedding space by advancing toward favorable regions, without relying on deep search. Despite using only a 6-ply beam search, our model achieves an estimated Elo rating of 2593. Performance improves with both model size and embedding dimensionality, suggesting that latent planning may offer a viable alternative to traditional search. Although we focus on chess, the proposed embedding-based planning method can be generalized to other perfect-information games where state evaluations are learnable. All source code is available at https://github.com/andrewhamara/SOLIS.
△ Less
Submitted 5 June, 2025;
originally announced June 2025.
-
Neural Pruning for 3D Scene Reconstruction: Efficient NeRF Acceleration
Authors:
Tianqi Ding,
Dawei Xiang,
Pablo Rivas,
Liang Dong
Abstract:
Neural Radiance Fields (NeRF) have become a popular 3D reconstruction approach in recent years. While they produce high-quality results, they also demand lengthy training times, often spanning days. This paper studies neural pruning as a strategy to address these concerns. We compare pruning approaches, including uniform sampling, importance-based methods, and coreset-based techniques, to reduce t…
▽ More
Neural Radiance Fields (NeRF) have become a popular 3D reconstruction approach in recent years. While they produce high-quality results, they also demand lengthy training times, often spanning days. This paper studies neural pruning as a strategy to address these concerns. We compare pruning approaches, including uniform sampling, importance-based methods, and coreset-based techniques, to reduce the model size and speed up training. Our findings show that coreset-driven pruning can achieve a 50% reduction in model size and a 35% speedup in training, with only a slight decrease in accuracy. These results suggest that pruning can be an effective method for improving the efficiency of NeRF models in resource-limited settings.
△ Less
Submitted 7 April, 2025; v1 submitted 1 April, 2025;
originally announced April 2025.
-
Harvesting Information Across the Horizon
Authors:
S. Wang,
M. R. Preciado Rivas,
R. B. Mann
Abstract:
The effect of black holes on entanglement harvesting has been of considerable interest over the past decade. Research involving stationary Unruh-DeWitt (UDW) detectors near a (2+1)-dimensional Bañados-Teitelboim-Zanelli (BTZ) black hole has uncovered phenomena such as entanglement shadows, entanglement amplification through black hole rotation, and differences between bipartite and tripartite enta…
▽ More
The effect of black holes on entanglement harvesting has been of considerable interest over the past decade. Research involving stationary Unruh-DeWitt (UDW) detectors near a (2+1)-dimensional Bañados-Teitelboim-Zanelli (BTZ) black hole has uncovered phenomena such as entanglement shadows, entanglement amplification through black hole rotation, and differences between bipartite and tripartite entanglement. For a (1+1)-dimensional Schwarzschild black hole, it has been shown that two infalling UDW detectors can harvest entanglement from the scalar quantum vacuum even when separated by an event horizon. In this paper, we calculate the mutual information between two UDW detectors coupled to a massless quantum scalar field, with the detectors starting at rest and falling radially into a non-rotating (2+1)-dimensional BTZ black hole. The trajectory of the detectors includes regions where both detectors are switched on outside of the horizon; where one detector is switched on inside of the horizon while the other switches on outside; and where both detectors switch on inside of the horizon. We investigate different black hole masses, detector energy gaps, widths and temporal separations of the detector switching functions, and field boundary conditions. We find that black holes--even the simplest kind having constant curvature--significantly affect the correlation properties of quantum fields in the vacuum state. These correlations, both outside and inside the horizon, can be mapped out by infalling detectors.
△ Less
Submitted 7 April, 2025; v1 submitted 31 March, 2025;
originally announced April 2025.
-
Leveraging OpenFlamingo for Multimodal Embedding Analysis of C2C Car Parts Data
Authors:
Maisha Binte Rashid,
Pablo Rivas
Abstract:
In this paper, we aim to investigate the capabilities of multimodal machine learning models, particularly the OpenFlamingo model, in processing a large-scale dataset of consumer-to-consumer (C2C) online posts related to car parts. We have collected data from two platforms, OfferUp and Craigslist, resulting in a dataset of over 1.2 million posts with their corresponding images. The OpenFlamingo mod…
▽ More
In this paper, we aim to investigate the capabilities of multimodal machine learning models, particularly the OpenFlamingo model, in processing a large-scale dataset of consumer-to-consumer (C2C) online posts related to car parts. We have collected data from two platforms, OfferUp and Craigslist, resulting in a dataset of over 1.2 million posts with their corresponding images. The OpenFlamingo model was used to extract embeddings for the text and image of each post. We used $k$-means clustering on the joint embeddings to identify underlying patterns and commonalities among the posts. We have found that most clusters contain a pattern, but some clusters showed no internal patterns. The results provide insight into the fact that OpenFlamingo can be used for finding patterns in large datasets but needs some modification in the architecture according to the dataset.
△ Less
Submitted 20 March, 2025;
originally announced March 2025.
-
A Framework for Evaluating Vision-Language Model Safety: Building Trust in AI for Public Sector Applications
Authors:
Maisha Binte Rashid,
Pablo Rivas
Abstract:
Vision-Language Models (VLMs) are increasingly deployed in public sector missions, necessitating robust evaluation of their safety and vulnerability to adversarial attacks. This paper introduces a novel framework to quantify adversarial risks in VLMs. We analyze model performance under Gaussian, salt-and-pepper, and uniform noise, identifying misclassification thresholds and deriving composite noi…
▽ More
Vision-Language Models (VLMs) are increasingly deployed in public sector missions, necessitating robust evaluation of their safety and vulnerability to adversarial attacks. This paper introduces a novel framework to quantify adversarial risks in VLMs. We analyze model performance under Gaussian, salt-and-pepper, and uniform noise, identifying misclassification thresholds and deriving composite noise patches and saliency patterns that highlight vulnerable regions. These patterns are compared against the Fast Gradient Sign Method (FGSM) to assess their adversarial effectiveness. We propose a new Vulnerability Score that combines the impact of random noise and adversarial attacks, providing a comprehensive metric for evaluating model robustness.
△ Less
Submitted 22 February, 2025;
originally announced February 2025.
-
Exploring Visual Embedding Spaces Induced by Vision Transformers for Online Auto Parts Marketplaces
Authors:
Cameron Armijo,
Pablo Rivas
Abstract:
This study examines the capabilities of the Vision Transformer (ViT) model in generating visual embeddings for images of auto parts sourced from online marketplaces, such as Craigslist and OfferUp. By focusing exclusively on single-modality data, the analysis evaluates ViT's potential for detecting patterns indicative of illicit activities. The workflow involves extracting high-dimensional embeddi…
▽ More
This study examines the capabilities of the Vision Transformer (ViT) model in generating visual embeddings for images of auto parts sourced from online marketplaces, such as Craigslist and OfferUp. By focusing exclusively on single-modality data, the analysis evaluates ViT's potential for detecting patterns indicative of illicit activities. The workflow involves extracting high-dimensional embeddings from images, applying dimensionality reduction techniques like Uniform Manifold Approximation and Projection (UMAP) to visualize the embedding space, and using K-Means clustering to categorize similar items. Representative posts nearest to each cluster centroid provide insights into the composition and characteristics of the clusters. While the results highlight the strengths of ViT in isolating visual patterns, challenges such as overlapping clusters and outliers underscore the limitations of single-modal approaches in this domain. This work contributes to understanding the role of Vision Transformers in analyzing online marketplaces and offers a foundation for future advancements in detecting fraudulent or illegal activities.
△ Less
Submitted 8 February, 2025;
originally announced February 2025.
-
Data-Dependent Generalization Bounds for Parameterized Quantum Models Under Noise
Authors:
Bikram Khanal,
Pablo Rivas
Abstract:
Quantum machine learning offers a transformative approach to solving complex problems, but the inherent noise hinders its practical implementation in near-term quantum devices. This obstacle makes it difficult to understand the generalizability of quantum circuit models. Designing robust quantum machine learning models under noise requires a principled understanding of complexity and generalizatio…
▽ More
Quantum machine learning offers a transformative approach to solving complex problems, but the inherent noise hinders its practical implementation in near-term quantum devices. This obstacle makes it difficult to understand the generalizability of quantum circuit models. Designing robust quantum machine learning models under noise requires a principled understanding of complexity and generalization, extending beyond classical capacity measures. This study investigates the generalization properties of parameterized quantum machine learning models under the influence of noise. We present a data-dependent generalization bound grounded in the quantum Fisher information matrix. We leverage statistical learning theory to relate the parameter space volumes and training sizes to estimate the generalization capability of the trained model. We provide a structured characterization of complexity in quantum models by integrating local parameter neighborhoods and effective dimensions defined through quantum Fisher information matrix eigenvalues. We also analyze the tightness of the bound and discuss the tradeoff between model expressiveness and generalization performance.
△ Less
Submitted 3 February, 2025; v1 submitted 16 December, 2024;
originally announced December 2024.
-
NLP-Guided Synthesis: Transitioning from Sequential Programs to Distributed Programs
Authors:
Arun Sanjel,
Bikram Khanal,
Greg Speegle,
Pablo Rivas
Abstract:
As the need for large-scale data processing grows, distributed programming frameworks like PySpark have become increasingly popular. However, the task of converting traditional, sequential code to distributed code remains a significant hurdle, often requiring specialized knowledge and substantial time investment. While existing tools have made strides in automating this conversion, they often fall…
▽ More
As the need for large-scale data processing grows, distributed programming frameworks like PySpark have become increasingly popular. However, the task of converting traditional, sequential code to distributed code remains a significant hurdle, often requiring specialized knowledge and substantial time investment. While existing tools have made strides in automating this conversion, they often fall short in terms of speed, flexibility, and overall applicability. In this paper, we introduce ROOP, a groundbreaking tool designed to address these challenges. Utilizing a BERT-based Natural Language Processing (NLP) model, ROOP automates the translation of Python code to its PySpark equivalent, offering a streamlined solution for leveraging distributed computing resources. We evaluated ROOP using a diverse set of 14 Python programs comprising 26 loop fragments. Our results are promising: ROOP achieved a near-perfect translation accuracy rate, successfully converting 25 out of the 26 loop fragments. Notably, for simpler operations, ROOP demonstrated remarkable efficiency, completing translations in as little as 44 seconds. Moreover, ROOP incorporates a built-in testing mechanism to ensure the functional equivalence of the original and translated code, adding an extra layer of reliability. This research opens up new avenues for automating the transition from sequential to distributed programming, making the process more accessible and efficient for developers.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
Authors:
Andrew Hamara,
Pablo Rivas
Abstract:
This study investigates ImageBind's ability to generate meaningful fused multimodal embeddings for online auto parts listings. We propose a simplistic embedding fusion workflow that aims to capture the overlapping information of image/text pairs, ultimately combining the semantics of a post into a joint embedding. After storing such fused embeddings in a vector database, we experiment with dimensi…
▽ More
This study investigates ImageBind's ability to generate meaningful fused multimodal embeddings for online auto parts listings. We propose a simplistic embedding fusion workflow that aims to capture the overlapping information of image/text pairs, ultimately combining the semantics of a post into a joint embedding. After storing such fused embeddings in a vector database, we experiment with dimensionality reduction and provide empirical evidence to convey the semantic quality of the joint embeddings by clustering and examining the posts nearest to each cluster centroid. Additionally, our initial findings with ImageBind's emergent zero-shot cross-modal retrieval suggest that pure audio embeddings can correlate with semantically similar marketplace listings, indicating potential avenues for future research.
△ Less
Submitted 30 August, 2024;
originally announced September 2024.
-
Learning Robust Observable to Address Noise in Quantum Machine Learning
Authors:
Bikram Khanal,
Pablo Rivas
Abstract:
Quantum Machine Learning (QML) has emerged as a promising field that combines the power of quantum computing with the principles of machine learning. One of the significant challenges in QML is dealing with noise in quantum systems, especially in the Noisy Intermediate-Scale Quantum (NISQ) era. Noise in quantum systems can introduce errors in quantum computations and degrade the performance of qua…
▽ More
Quantum Machine Learning (QML) has emerged as a promising field that combines the power of quantum computing with the principles of machine learning. One of the significant challenges in QML is dealing with noise in quantum systems, especially in the Noisy Intermediate-Scale Quantum (NISQ) era. Noise in quantum systems can introduce errors in quantum computations and degrade the performance of quantum algorithms. In this paper, we propose a framework for learning observables that are robust against noisy channels in quantum systems. We demonstrate that it is possible to learn observables that remain invariant under the effects of noise and show that this can be achieved through a machine-learning approach. We present a toy example using a Bell state under a depolarization channel to illustrate the concept of robust observables. We then describe a machine-learning framework for learning such observables across six two-qubit quantum circuits and five noisy channels. Our results show that it is possible to learn observables that are more robust to noise than conventional observables. We discuss the implications of this finding for quantum machine learning, including potential applications in enhancing the stability of QML models in noisy environments. By developing techniques for learning robust observables, we can improve the performance and reliability of quantum machine learning models in the presence of noise, contributing to the advancement of practical QML applications in the NISQ era.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
Generalization Error Bound for Quantum Machine Learning in NISQ Era -- A Survey
Authors:
Bikram Khanal,
Pablo Rivas,
Arun Sanjel,
Korn Sooksatra,
Ernesto Quevedo,
Alejandro Rodriguez
Abstract:
Despite the mounting anticipation for the quantum revolution, the success of Quantum Machine Learning (QML) in the Noisy Intermediate-Scale Quantum (NISQ) era hinges on a largely unexplored factor: the generalization error bound, a cornerstone of robust and reliable machine learning models. Current QML research, while exploring novel algorithms and applications extensively, is predominantly situat…
▽ More
Despite the mounting anticipation for the quantum revolution, the success of Quantum Machine Learning (QML) in the Noisy Intermediate-Scale Quantum (NISQ) era hinges on a largely unexplored factor: the generalization error bound, a cornerstone of robust and reliable machine learning models. Current QML research, while exploring novel algorithms and applications extensively, is predominantly situated in the context of noise-free, ideal quantum computers. However, Quantum Circuit (QC) operations in NISQ-era devices are susceptible to various noise sources and errors. In this article, we conduct a Systematic Mapping Study (SMS) to explore the state-of-the-art generalization bound for supervised QML in NISQ-era and analyze the latest practices in the field. Our study systematically summarizes the existing computational platforms with quantum hardware, datasets, optimization techniques, and the common properties of the bounds found in the literature. We further present the performance accuracy of various approaches in classical benchmark datasets like the MNIST and IRIS datasets. The SMS also highlights the limitations and challenges in QML in the NISQ era and discusses future research directions to advance the field. Using a detailed Boolean operators query in five reliable indexers, we collected 544 papers and filtered them to a small set of 37 relevant articles. This filtration was done following the best practice of SMS with well-defined research questions and inclusion and exclusion criteria.
△ Less
Submitted 3 February, 2025; v1 submitted 11 September, 2024;
originally announced September 2024.
-
Bottleneck-based Encoder-decoder ARchitecture (BEAR) for Learning Unbiased Consumer-to-Consumer Image Representations
Authors:
Pablo Rivas,
Gisela Bichler,
Tomas Cerny,
Laurie Giddens,
Stacie Petter
Abstract:
Unbiased representation learning is still an object of study under specific applications and contexts. Novel architectures are usually crafted to resolve particular problems using mixtures of fundamental pieces. This paper presents different image feature extraction mechanisms that work together with residual connections to encode perceptual image information in an autoencoder configuration. We us…
▽ More
Unbiased representation learning is still an object of study under specific applications and contexts. Novel architectures are usually crafted to resolve particular problems using mixtures of fundamental pieces. This paper presents different image feature extraction mechanisms that work together with residual connections to encode perceptual image information in an autoencoder configuration. We use image data that aims to support a larger research agenda dealing with issues regarding criminal activity in consumer-to-consumer online platforms. Preliminary results suggest that the proposed architecture can learn rich spaces using ours and other image datasets resolving important challenges that are identified.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Authors:
Jorge Yero Salazar,
Pablo Rivas,
Renato Borras-Chavez,
Sarah Kienle
Abstract:
This paper examines the challenges and advancements in recognizing seals within their natural habitats using conventional photography, underscored by the emergence of machine learning technologies. We used the leopard seal, \emph{Hydrurga leptonyx}, a key species within Antarctic ecosystems, to review the different available methods found. As apex predators, Leopard seals are characterized by thei…
▽ More
This paper examines the challenges and advancements in recognizing seals within their natural habitats using conventional photography, underscored by the emergence of machine learning technologies. We used the leopard seal, \emph{Hydrurga leptonyx}, a key species within Antarctic ecosystems, to review the different available methods found. As apex predators, Leopard seals are characterized by their significant ecological role and elusive nature so studying them is crucial to understand the health of their ecosystem. Traditional methods of monitoring seal species are often constrained by the labor-intensive and time-consuming processes required for collecting data, compounded by the limited insights these methods provide. The advent of machine learning, particularly through the application of vision transformers, heralds a new era of efficiency and precision in species monitoring. By leveraging state-of-the-art approaches in detection, segmentation, and recognition within digital imaging, this paper presents a synthesis of the current landscape, highlighting both the cutting-edge methodologies and the predominant challenges faced in accurately identifying seals through photographic data.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
AI Safety in Practice: Enhancing Adversarial Robustness in Multimodal Image Captioning
Authors:
Maisha Binte Rashid,
Pablo Rivas
Abstract:
Multimodal machine learning models that combine visual and textual data are increasingly being deployed in critical applications, raising significant safety and security concerns due to their vulnerability to adversarial attacks. This paper presents an effective strategy to enhance the robustness of multimodal image captioning models against such attacks. By leveraging the Fast Gradient Sign Metho…
▽ More
Multimodal machine learning models that combine visual and textual data are increasingly being deployed in critical applications, raising significant safety and security concerns due to their vulnerability to adversarial attacks. This paper presents an effective strategy to enhance the robustness of multimodal image captioning models against such attacks. By leveraging the Fast Gradient Sign Method (FGSM) to generate adversarial examples and incorporating adversarial training techniques, we demonstrate improved model robustness on two benchmark datasets: Flickr8k and COCO. Our findings indicate that selectively training only the text decoder of the multimodal architecture shows performance comparable to full adversarial training while offering increased computational efficiency. This targeted approach suggests a balance between robustness and training costs, facilitating the ethical deployment of multimodal AI systems across various domains.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
An Automated SQL Query Grading System Using An Attention-Based Convolutional Neural Network
Authors:
Donald R. Schwartz,
Pablo Rivas
Abstract:
Grading SQL queries can be a time-consuming, tedious and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have their own limitations. This paper describes our novel approach to automating the process of grading SQL queries. Unlike previous approaches, we employ a unique co…
▽ More
Grading SQL queries can be a time-consuming, tedious and challenging task, especially as the number of student submissions increases. Several systems have been introduced in an attempt to mitigate these challenges, but those systems have their own limitations. This paper describes our novel approach to automating the process of grading SQL queries. Unlike previous approaches, we employ a unique convolutional neural network architecture that employs a parameter-sharing approach for different machine learning tasks that enables the architecture to induce different knowledge representations of the data to increase its potential for understanding SQL statements.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing
Authors:
Nurul Rafi,
Pablo Rivas
Abstract:
Research in neural models inspired by mammal's visual cortex has led to many spiking neural networks such as pulse-coupled neural networks (PCNNs). These models are oscillating, spatio-temporal models stimulated with images to produce several time-based responses. This paper reviews PCNN's state of the art, covering its mathematical formulation, variants, and other simplifications found in the lit…
▽ More
Research in neural models inspired by mammal's visual cortex has led to many spiking neural networks such as pulse-coupled neural networks (PCNNs). These models are oscillating, spatio-temporal models stimulated with images to produce several time-based responses. This paper reviews PCNN's state of the art, covering its mathematical formulation, variants, and other simplifications found in the literature. We present several applications in which PCNN architectures have successfully addressed some fundamental image processing and computer vision challenges, including image segmentation, edge detection, medical imaging, image fusion, image compression, object recognition, and remote sensing. Results achieved in these applications suggest that the PCNN architecture generates useful perceptual information relevant to a wide variety of computer vision tasks.
△ Less
Submitted 31 May, 2024;
originally announced June 2024.
-
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Authors:
Ernesto Quevedo,
Jorge Yero,
Rachel Koerner,
Pablo Rivas,
Tomas Cerny
Abstract:
Concerns regarding the propensity of Large Language Models (LLMs) to produce inaccurate outputs, also known as hallucinations, have escalated. Detecting them is vital for ensuring the reliability of applications relying on LLM-generated content. Current methods often demand substantial resources and rely on extensive LLMs or employ supervised learning with multidimensional features or intricate li…
▽ More
Concerns regarding the propensity of Large Language Models (LLMs) to produce inaccurate outputs, also known as hallucinations, have escalated. Detecting them is vital for ensuring the reliability of applications relying on LLM-generated content. Current methods often demand substantial resources and rely on extensive LLMs or employ supervised learning with multidimensional features or intricate linguistic and semantic analyses difficult to reproduce and largely depend on using the same LLM that hallucinated. This paper introduces a supervised learning approach employing two simple classifiers utilizing only four numerical features derived from tokens and vocabulary probabilities obtained from other LLM evaluators, which are not necessarily the same. The method yields promising results, surpassing state-of-the-art outcomes in multiple tasks across three different benchmarks. Additionally, we provide a comprehensive examination of the strengths and weaknesses of our approach, highlighting the significance of the features utilized and the LLM employed as an evaluator. We have released our code publicly at https://github.com/Baylor-AI/HalluDetect.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Authors:
Corinne Aars,
Lauren Adams,
Xiaokan Tian,
Zhaoyu Wang,
Colton Wismer,
Jason Wu,
Pablo Rivas,
Korn Sooksatra,
Matthew Fendt
Abstract:
This study presents the development and evaluation of a ByT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented w…
▽ More
This study presents the development and evaluation of a ByT5-based multilingual translation model tailored for translating the Bible into underrepresented languages. Utilizing the comprehensive Johns Hopkins University Bible Corpus, we trained the model to capture the intricate nuances of character-based and morphologically rich languages. Our results, measured by the BLEU score and supplemented with sample translations, suggest the model can improve accessibility to sacred texts. It effectively handles the distinctive biblical lexicon and structure, thus bridging the linguistic divide. The study also discusses the model's limitations and suggests pathways for future enhancements, focusing on expanding access to sacred literature across linguistic boundaries.
△ Less
Submitted 30 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
On the Challenges of Creating Datasets for Analyzing Commercial Sex Advertisements to Assess Human Trafficking Risk and Organized Activity
Authors:
Pablo Rivas,
Tomas Cerny,
Alejandro Rodriguez Perez,
Javier Turek,
Laurie Giddens,
Gisela Bichler,
Stacie Petter
Abstract:
Our study addresses the challenges of building datasets to understand the risks associated with organized activities and human trafficking through commercial sex advertisements. These challenges include data scarcity, rapid obsolescence, and privacy concerns. Traditional approaches, which are not automated and are difficult to reproduce, fall short in addressing these issues. We have developed a r…
▽ More
Our study addresses the challenges of building datasets to understand the risks associated with organized activities and human trafficking through commercial sex advertisements. These challenges include data scarcity, rapid obsolescence, and privacy concerns. Traditional approaches, which are not automated and are difficult to reproduce, fall short in addressing these issues. We have developed a reproducible and automated methodology to analyze five million advertisements. In the process, we identified further challenges in dataset creation within this sensitive domain. This paper presents a streamlined methodology to assist researchers in constructing effective datasets for combating organized crime, allowing them to focus on advancing detection technologies.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
On Adversarial Examples for Text Classification by Perturbing Latent Representations
Authors:
Korn Sooksatra,
Bikram Khanal,
Pablo Rivas
Abstract:
Recently, with the advancement of deep learning, several applications in text classification have advanced significantly. However, this improvement comes with a cost because deep learning is vulnerable to adversarial examples. This weakness indicates that deep learning is not very robust. Fortunately, the input of a text classifier is discrete. Hence, it can prevent the classifier from state-of-th…
▽ More
Recently, with the advancement of deep learning, several applications in text classification have advanced significantly. However, this improvement comes with a cost because deep learning is vulnerable to adversarial examples. This weakness indicates that deep learning is not very robust. Fortunately, the input of a text classifier is discrete. Hence, it can prevent the classifier from state-of-the-art attacks. Nonetheless, previous works have generated black-box attacks that successfully manipulate the discrete values of the input to find adversarial examples. Therefore, instead of changing the discrete values, we transform the input into its embedding vector containing real values to perform the state-of-the-art white-box attacks. Then, we convert the perturbed embedding vector back into a text and name it an adversarial example. In summary, we create a framework that measures the robustness of a text classifier by using the gradients of the classifier.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Is ReLU Adversarially Robust?
Authors:
Korn Sooksatra,
Greg Hamerly,
Pablo Rivas
Abstract:
The efficacy of deep learning models has been called into question by the presence of adversarial examples. Addressing the vulnerability of deep learning models to adversarial examples is crucial for ensuring their continued development and deployment. In this work, we focus on the role of rectified linear unit (ReLU) activation functions in the generation of adversarial examples. ReLU functions a…
▽ More
The efficacy of deep learning models has been called into question by the presence of adversarial examples. Addressing the vulnerability of deep learning models to adversarial examples is crucial for ensuring their continued development and deployment. In this work, we focus on the role of rectified linear unit (ReLU) activation functions in the generation of adversarial examples. ReLU functions are commonly used in deep learning models because they facilitate the training process. However, our empirical analysis demonstrates that ReLU functions are not robust against adversarial examples. We propose a modified version of the ReLU function, which improves robustness against adversarial examples. Our results are supported by an experiment, which confirms the effectiveness of our proposed modification. Additionally, we demonstrate that applying adversarial training to our customized model further enhances its robustness compared to a general model.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
A Review on Machine Learning Algorithms for Dust Aerosol Detection using Satellite Data
Authors:
Nurul Rafi,
Pablo Rivas
Abstract:
Dust storms are associated with certain respiratory illnesses across different areas in the world. Researchers have devoted time and resources to study the elements surrounding dust storm phenomena. This paper reviews the efforts of those who have investigated dust aerosols using sensors onboard of satellites using machine learning-based approaches. We have reviewed the most common issues revolvin…
▽ More
Dust storms are associated with certain respiratory illnesses across different areas in the world. Researchers have devoted time and resources to study the elements surrounding dust storm phenomena. This paper reviews the efforts of those who have investigated dust aerosols using sensors onboard of satellites using machine learning-based approaches. We have reviewed the most common issues revolving dust aerosol modeling using different datasets and different sensors from a historical perspective. Our findings suggest that multi-spectral approaches based on linear and non-linear combinations of spectral bands are some of the most successful for visualization and quantitative analysis; however, when researchers have leveraged machine learning, performance has been improved and new opportunities to solve unique problems arise.
△ Less
Submitted 14 April, 2024;
originally announced April 2024.
-
A Modified Depolarization Approach for Efficient Quantum Machine Learning
Authors:
Bikram Khanal,
Pablo Rivas
Abstract:
Quantum Computing in the Noisy Intermediate-Scale Quantum (NISQ) era has shown promising applications in machine learning, optimization, and cryptography. Despite the progress, challenges persist due to system noise, errors, and decoherence that complicate the simulation of quantum systems. The depolarization channel is a standard tool for simulating a quantum system's noise. However, modeling suc…
▽ More
Quantum Computing in the Noisy Intermediate-Scale Quantum (NISQ) era has shown promising applications in machine learning, optimization, and cryptography. Despite the progress, challenges persist due to system noise, errors, and decoherence that complicate the simulation of quantum systems. The depolarization channel is a standard tool for simulating a quantum system's noise. However, modeling such noise for practical applications is computationally expensive when we have limited hardware resources, as is the case in the NISQ era. We propose a modified representation for a single-qubit depolarization channel with two Kraus operators based only on X and Z Pauli matrices. Our approach reduces the computational complexity from six to four matrix multiplications per execution of a channel. Experiments on a Quantum Machine Learning (QML) model on the Iris dataset across various circuit depths and depolarization rates validate that our approach maintains the model's accuracy while improving efficiency. This simplified noise model enables more scalable simulations of quantum circuits under depolarization, advancing capabilities in the NISQ era.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements
Authors:
Alejandro Rodriguez Perez,
Pablo Rivas
Abstract:
This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques. We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models. Focusing on tasks like Human Trafficking Risk Prediction (HTRP) and Organized Acti…
▽ More
This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques. We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models. Focusing on tasks like Human Trafficking Risk Prediction (HTRP) and Organized Activity Detection (OAD), we employ cutting-edge Transformer models for analysis. A key contribution is the implementation of an interpretability framework using Integrated Gradients, providing explainable insights crucial for law enforcement. This work not only fills a critical gap in the literature but also offers a scalable, machine learning-driven approach to combat human exploitation online. It serves as a foundation for future research and practical applications, emphasizing the role of machine learning in addressing complex social issues.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Dynamics and Control of Bubble-Propelled Microrobots
Authors:
David P. Rivas,
Max Sokolich,
Harrison Muller,
Sambeeta Das
Abstract:
Having the advantage of being relatively fast and powerful, as well as readily fabricated, spherical bubble-propelled microrobots are particularly well suited for applications such as cargo delivery, micromanipulation, and biological or environmental remediation. However, there have been limited examples of control and manipulation with these microrobots and few studies on their dynamics. Here we…
▽ More
Having the advantage of being relatively fast and powerful, as well as readily fabricated, spherical bubble-propelled microrobots are particularly well suited for applications such as cargo delivery, micromanipulation, and biological or environmental remediation. However, there have been limited examples of control and manipulation with these microrobots and few studies on their dynamics. Here we investigate the bubble formation and dynamics of both hemispherically coated Janus microrobots as well as GLAD "patchy" microrobots which not only provide for an interesting comparison, but also exhibit useful properties in their own right. Specifically, we find that the patchy microrobots have a tendency to produce smaller bubbles and undergo smoother motion, properties that are beneficial for applications such as precise micro-manipulation, for example. We demonstrate manipulation and assemble of passive spheres on a substrate as well as at an air-liquid interface. We also characterize the propulsion and bubble formation of both types of microrobots and find that previously proposed theories insufficiently describe their motion and bubble bursting mechanism. Additionally, we observe that the microrobots, which reside at the air-liquid interface, demonstrate positive gravitaxis towards the droplet edges which we attribute to a torque resulting from opposing downward and buoyant forces on the microrobot.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Driven Topological Transitions in Active Nematic Films
Authors:
David P. Rivas,
Tyler N. Shendruk,
Robert R. Henry,
Daniel H. Reich,
Robert L. Leheny
Abstract:
The topological properties of many materials are central to their behavior, with the dynamics of topological defects being particularly important to intrinsically out-of-equilibrium, active materials. In this paper, local manipulation of the ordering, dynamics, and topological properties of microtubule-based extensile active nematic films is demonstrated in a joint experimental and simulation stud…
▽ More
The topological properties of many materials are central to their behavior, with the dynamics of topological defects being particularly important to intrinsically out-of-equilibrium, active materials. In this paper, local manipulation of the ordering, dynamics, and topological properties of microtubule-based extensile active nematic films is demonstrated in a joint experimental and simulation study. Hydrodynamic stresses created by magnetically actuated rotation of disk-shaped colloids in proximity to the films compete with internal stresses in the active nematic, enabling local control of the motion of the +1/2 charge topological defects that are intrinsic to spontaneously turbulent active films. Sufficiently large applied stresses drive the formation of +1 charge topological vortices in the director field through the merger of two +1/2 defects. The directed motion of the defects is accompanied by ordering of the vorticity and velocity of the active flows within the film that is qualitatively unlike the response of passive viscous films. Many features of the film's response to the disk are captured by Lattice Boltzmann simulations, leading to insight into the anomalous viscoelastic nature of the active nematic. The topological vortex formation is accompanied by a rheological instability in the film that leads to significant increase in the flow velocities. Comparison of the velocity profile in vicinity of the vortex with fluid-dynamics calculations provides an estimate of film viscosity.
△ Less
Submitted 31 October, 2019;
originally announced October 2019.