-
Exposing Reliability Degradation and Mitigation in Approximate DNNs under Permanent Faults
Authors:
Ayesha Siddique,
Khaza Anuarul Hoque
Abstract:
Approximate computing is known for enhancing deep neural network accelerators' energy efficiency by introducing inexactness with a tolerable accuracy loss. However, small accuracy variations may increase the sensitivity of these accelerators towards undesired subtle disturbances, such as permanent faults. The impact of permanent faults in accurate deep neural network (AccDNN) accelerators has been…
▽ More
Approximate computing is known for enhancing deep neural network accelerators' energy efficiency by introducing inexactness with a tolerable accuracy loss. However, small accuracy variations may increase the sensitivity of these accelerators towards undesired subtle disturbances, such as permanent faults. The impact of permanent faults in accurate deep neural network (AccDNN) accelerators has been thoroughly investigated in the literature. Conversely, the impact of permanent faults and their mitigation in approximate DNN (AxDNN) accelerators is vastly under-explored. Towards this, we first present an extensive fault resilience analysis of approximate multi-layer perceptrons (MLPs) and convolutional neural networks (CNNs) using the state-of-the-art Evoapprox8b multipliers in GPU and TPU accelerators. Then, we propose a novel fault mitigation method, i.e., fault-aware retuning of weights (Fal-reTune). Fal-reTune retunes the weights using a weight mapping function in the presence of faults for improved classification accuracy. To evaluate the fault resilience and the effectiveness of our proposed mitigation method, we used the most widely used MNIST, Fashion-MNIST, and CIFAR10 datasets. Our results demonstrate that the permanent faults exacerbate the accuracy loss in AxDNNs compared to the AccDNN accelerators. For instance, a permanent fault in AxDNNs can lead to 56\% accuracy loss, whereas the same faulty bit can lead to only 4\% accuracy loss in AccDNN accelerators. We empirically show that our proposed Fal-reTune mitigation method improves the performance of AxDNNs up to 98%, even with fault rates of up to 50%. Furthermore, we observe that the fault resilience in AxDNNs is orthogonal to their energy efficiency.
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Improving Reliability of Spiking Neural Networks through Fault Aware Threshold Voltage Optimization
Authors:
Ayesha Siddique,
Khaza Anuarul Hoque
Abstract:
Spiking neural networks have made breakthroughs in computer vision by lending themselves to neuromorphic hardware. However, the neuromorphic hardware lacks parallelism and hence, limits the throughput and hardware acceleration of SNNs on edge devices. To address this problem, many systolic-array SNN accelerators (systolicSNNs) have been proposed recently, but their reliability is still a major con…
▽ More
Spiking neural networks have made breakthroughs in computer vision by lending themselves to neuromorphic hardware. However, the neuromorphic hardware lacks parallelism and hence, limits the throughput and hardware acceleration of SNNs on edge devices. To address this problem, many systolic-array SNN accelerators (systolicSNNs) have been proposed recently, but their reliability is still a major concern. In this paper, we first extensively analyze the impact of permanent faults on the SystolicSNNs. Then, we present a novel fault mitigation method, i.e., fault-aware threshold voltage optimization in retraining (FalVolt). FalVolt optimizes the threshold voltage for each layer in retraining to achieve the classification accuracy close to the baseline in the presence of faults. To demonstrate the effectiveness of our proposed mitigation, we classify both static (i.e., MNIST) and neuromorphic datasets (i.e., N-MNIST and DVS Gesture) on a 256x256 systolicSNN with stuck-at faults. We empirically show that the classification accuracy of a systolicSNN drops significantly even at extremely low fault rates (as low as 0.012\%). Our proposed FalVolt mitigation method improves the performance of systolicSNNs by enabling them to operate at fault rates of up to 60\%, with a negligible drop in classification accuracy (as low as 0.1\%). Our results show that FalVolt is 2x faster compared to other state-of-the-art techniques common in artificial neural networks (ANNs), such as fault-aware pruning and retraining without threshold voltage optimization.
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Security-Aware Approximate Spiking Neural Networks
Authors:
Syed Tihaam Ahmad,
Ayesha Siddique,
Khaza Anuarul Hoque
Abstract:
Deep Neural Networks (DNNs) and Spiking Neural Networks (SNNs) are both known for their susceptibility to adversarial attacks. Therefore, researchers in the recent past have extensively studied the robustness and defense of DNNs and SNNs under adversarial attacks. Compared to accurate SNNs (AccSNN), approximate SNNs (AxSNNs) are known to be up to 4X more energy-efficient for ultra-low power applic…
▽ More
Deep Neural Networks (DNNs) and Spiking Neural Networks (SNNs) are both known for their susceptibility to adversarial attacks. Therefore, researchers in the recent past have extensively studied the robustness and defense of DNNs and SNNs under adversarial attacks. Compared to accurate SNNs (AccSNN), approximate SNNs (AxSNNs) are known to be up to 4X more energy-efficient for ultra-low power applications. Unfortunately, the robustness of AxSNNs under adversarial attacks is yet unexplored. In this paper, we first extensively analyze the robustness of AxSNNs with different structural parameters and approximation levels under two gradient-based and two neuromorphic attacks. Then, we propose two novel defense methods, i.e., precision scaling and approximate quantization-aware filtering (AQF), for securing AxSNNs. We evaluated the effectiveness of these two defense methods using both static and neuromorphic datasets. Our results demonstrate that AxSNNs are more prone to adversarial attacks than AccSNNs, but precision scaling and AQF significantly improve the robustness of AxSNNs. For instance, a PGD attack on AxSNN results in a 72\% accuracy loss compared to AccSNN without any attack, whereas the same attack on the precision-scaled AxSNN leads to only a 17\% accuracy loss in the static MNIST dataset (4X robustness improvement). Similarly, a Sparse Attack on AxSNN leads to a 77\% accuracy loss when compared to AccSNN without any attack, whereas the same attack on an AxSNN with AQF leads to only a 2\% accuracy loss in the neuromorphic DVS128 Gesture dataset (38X robustness improvement).
△ Less
Submitted 12 January, 2023;
originally announced January 2023.
-
Don't follow the leader: Independent thinkers create scientific innovation
Authors:
Sean Kelty,
Raiyan Abdul Baten,
Adiba Mahbub Proma,
Ehsan Hoque,
Johan Bollen,
Gourab Ghoshal
Abstract:
Academic success is distributed unequally; a few top scientists receive the bulk of attention, citations, and resources. However, do these ``superstars" foster leadership in scientific innovation? We introduce three information-theoretic measures that quantify novelty, innovation, and impact from scholarly citation networks, and compare the scholarly output of scientists who are either not connect…
▽ More
Academic success is distributed unequally; a few top scientists receive the bulk of attention, citations, and resources. However, do these ``superstars" foster leadership in scientific innovation? We introduce three information-theoretic measures that quantify novelty, innovation, and impact from scholarly citation networks, and compare the scholarly output of scientists who are either not connected or strongly connected to superstar scientists. We find that while connected scientists do indeed publish more, garner more citations, and produce more diverse content, this comes at a cost of lower innovation and higher redundancy of ideas. Further, once one removes papers co-authored with superstars, the academic output of these connected scientists diminishes. In contrast, authors that produce innovative content without the benefit of collaborations with scientific superstars produce papers that connect a greater diversity of concepts, publish more, and have comparable citation rates, once one controls for transferred prestige of superstars. On balance, our results indicate that academia pays a price by focusing attention and resources on superstars.
△ Less
Submitted 6 January, 2023;
originally announced January 2023.
-
NADBenchmarks -- a compilation of Benchmark Datasets for Machine Learning Tasks related to Natural Disasters
Authors:
Adiba Mahbub Proma,
Md Saiful Islam,
Stela Ciko,
Raiyan Abdul Baten,
Ehsan Hoque
Abstract:
Climate change has increased the intensity, frequency, and duration of extreme weather events and natural disasters across the world. While the increased data on natural disasters improves the scope of machine learning (ML) in this field, progress is relatively slow. One bottleneck is the lack of benchmark datasets that would allow ML researchers to quantify their progress against a standard metri…
▽ More
Climate change has increased the intensity, frequency, and duration of extreme weather events and natural disasters across the world. While the increased data on natural disasters improves the scope of machine learning (ML) in this field, progress is relatively slow. One bottleneck is the lack of benchmark datasets that would allow ML researchers to quantify their progress against a standard metric. The objective of this short paper is to explore the state of benchmark datasets for ML tasks related to natural disasters, categorizing them according to the disaster management cycle. We compile a list of existing benchmark datasets introduced in the past five years. We propose a web platform - NADBenchmarks - where researchers can search for benchmark datasets for natural disasters, and we develop a preliminary version of such a platform using our compiled list. This paper is intended to aid researchers in finding benchmark datasets to train their ML models on, and provide general directions for topics where they can contribute new benchmark datasets.
△ Less
Submitted 20 December, 2022;
originally announced December 2022.
-
Family of nonstandard integrable and superintegrable classical Hamiltonian systems in non-vanishing magnetic fields
Authors:
Md Fazlul Hoque,
Libor Šnobl
Abstract:
In this paper we present the construction of all nonstandard integrable systems in magnetic fields whose integrals have leading order structure corresponding to the case (i) of Theorem 1 in [A Marchesiello and L Šnobl 2022 {\it J. Phys. A: Math. Theor.} {\bf 55} 145203]. We find that the resulting systems can be written as one family with several parameters. For certain limits of these parameters…
▽ More
In this paper we present the construction of all nonstandard integrable systems in magnetic fields whose integrals have leading order structure corresponding to the case (i) of Theorem 1 in [A Marchesiello and L Šnobl 2022 {\it J. Phys. A: Math. Theor.} {\bf 55} 145203]. We find that the resulting systems can be written as one family with several parameters. For certain limits of these parameters the system belongs to intersections with already known standard systems separating in Cartesian and / or cylindrical coordinates and the number of independent integrals of motion increases, thus the system becomes minimally superintegrable. These results generalize the particular example presented in section 3 of [A Marchesiello and L Šnobl 2022 {\it J. Phys. A: Math. Theor.} {\bf 55} 145203].
△ Less
Submitted 1 April, 2023; v1 submitted 10 December, 2022;
originally announced December 2022.
-
Thermal Dissipation Resulting from Everyday Interactions as a Sensing Modality -- The MIDAS Touch
Authors:
Farooq Dar,
Hilary Emenike,
Zhigang Yin,
Mohan Liyanage,
Rajesh Sharma,
Agustin Zuniga,
Mohammad A. Hoque,
Marko Radeta,
Petteri Nurmi,
Huber Flores
Abstract:
We contribute MIDAS as a novel sensing solution for characterizing everyday objects using thermal dissipation. MIDAS takes advantage of the fact that anytime a person touches an object it results in heat transfer. By capturing and modeling the dissipation of the transferred heat, e.g., through the decrease in the captured thermal radiation, MIDAS can characterize the object and determine its mater…
▽ More
We contribute MIDAS as a novel sensing solution for characterizing everyday objects using thermal dissipation. MIDAS takes advantage of the fact that anytime a person touches an object it results in heat transfer. By capturing and modeling the dissipation of the transferred heat, e.g., through the decrease in the captured thermal radiation, MIDAS can characterize the object and determine its material. We validate MIDAS through extensive empirical benchmarks and demonstrate that MIDAS offers an innovative sensing modality that can recognize a wide range of materials with up to 83% accuracy and generalize to variations in the people interacting with objects. We also demonstrate that MIDAS can detect thermal dissipation through objects, up to 2 mm thickness, and support analysis of multiple objects that are interacted with
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
On the thermal and mechanical properties of Mg$_{0.2}$Co$_{0.2}$Ni$_{0.2}$Cu$_{0.2}$Zn$_{0.2}$O across the high-entropy to entropy-stabilized transition
Authors:
Christina M. Rost,
Daniel L. Schmuckler,
Clifton Bumgardner,
Md Shafkat Bin Hoque,
David R. Diercks,
John T. Gaskins,
Jon-Paul Maria,
Geoffrey L. Brennecka,
Xiadong Li,
Patrick E. Hopkins
Abstract:
As various property studies continue to emerge on high entropy and entropy-stabilized ceramics, we seek further understanding of property changes across the phase boundary between \enquote{high-entropy} and \enquote{entropy-stabilized}. The thermal and mechanical properties of bulk ceramic entropy stabilized oxide composition Mg$_{0.2}$Co$_{0.2}$Ni$_{0.2}$Cu$_{0.2}$Zn$_{0.2}$O are investigated acr…
▽ More
As various property studies continue to emerge on high entropy and entropy-stabilized ceramics, we seek further understanding of property changes across the phase boundary between \enquote{high-entropy} and \enquote{entropy-stabilized}. The thermal and mechanical properties of bulk ceramic entropy stabilized oxide composition Mg$_{0.2}$Co$_{0.2}$Ni$_{0.2}$Cu$_{0.2}$Zn$_{0.2}$O are investigated across this critical transition temperature via the transient plane-source method, temperature-dependent X-ray diffraction, and nano-indentation. Thermal conductivity remains constant within uncertainty across the multi-to-single phase transition at a value of ~2.5 W/mK, while the linear coefficient of thermal expansion increases nearly 24 % from 10.8 to 14.1 x 10$^{-6}$ K$^{-1}$. Mechanical softening is also observed across the transition.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
On a conjecture of Franu\v sić and Jadrijevi\' c: Counter-examples
Authors:
Kalyan Chakraborty,
Shubham Gupta,
Azizul Hoque
Abstract:
Let $d\equiv 2\pmod 4$ be a square-free integer such that $x^2 - dy^2 =- 1$ and $x^2 - dy^2 = 6$ are solvable in integers. We prove the existence of infinitely many quadruples in $\mathbb{Z}[\sqrt{d}]$ with the property $D(n)$ when $n \in \{(4m + 1) + 4k\sqrt{d}, (4m + 1) + (4k + 2)\sqrt{d}, (4m + 3) + 4k\sqrt{d}, (4m + 3) + (4k + 2)\sqrt{d}, (4m + 2) + (4k + 2)\sqrt{d}\}$ for…
▽ More
Let $d\equiv 2\pmod 4$ be a square-free integer such that $x^2 - dy^2 =- 1$ and $x^2 - dy^2 = 6$ are solvable in integers. We prove the existence of infinitely many quadruples in $\mathbb{Z}[\sqrt{d}]$ with the property $D(n)$ when $n \in \{(4m + 1) + 4k\sqrt{d}, (4m + 1) + (4k + 2)\sqrt{d}, (4m + 3) + 4k\sqrt{d}, (4m + 3) + (4k + 2)\sqrt{d}, (4m + 2) + (4k + 2)\sqrt{d}\}$ for $m, k \in \mathbb{Z}$. As a consequence, we provide few counter examples to a conjecture of Franu\v sić and Jadrijevi\' c (see Conjecture 1.1).
△ Less
Submitted 9 November, 2022;
originally announced November 2022.
-
Adversarial De-confounding in Individualised Treatment Effects Estimation
Authors:
Vinod Kumar Chauhan,
Soheila Molaei,
Marzia Hoque Tania,
Anshul Thakur,
Tingting Zhu,
David A. Clifton
Abstract:
Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised tr…
▽ More
Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised treatment effects (ITE) estimation. This paper proposes disentangled representations with adversarial training to selectively balance the confounders in the binary treatment setting for the ITE estimation. The adversarial training of treatment policy selectively encourages treatment-agnostic balanced representations for the confounders and helps to estimate the ITE in the observational studies via counterfactual inference. Empirical results on synthetic and real-world datasets, with varying degrees of confounding, prove that our proposed approach improves the state-of-the-art methods in achieving lower error in the ITE estimation.
△ Less
Submitted 24 January, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
OpenCQA: Open-ended Question Answering with Charts
Authors:
Shankar Kantharaj,
Xuan Long Do,
Rixie Tiffany Ko Leong,
Jia Qing Tan,
Enamul Hoque,
Shafiq Joty
Abstract:
Charts are very popular to analyze data and convey important insights. People often analyze visualizations to answer open-ended questions that require explanatory answers. Answering such questions are often difficult and time-consuming as it requires a lot of cognitive and perceptual efforts. To address this challenge, we introduce a new task called OpenCQA, where the goal is to answer an open-end…
▽ More
Charts are very popular to analyze data and convey important insights. People often analyze visualizations to answer open-ended questions that require explanatory answers. Answering such questions are often difficult and time-consuming as it requires a lot of cognitive and perceptual efforts. To address this challenge, we introduce a new task called OpenCQA, where the goal is to answer an open-ended question about a chart with descriptive texts. We present the annotation process and an in-depth analysis of our dataset. We implement and evaluate a set of baselines under three practical settings. In the first setting, a chart and the accompanying article is provided as input to the model. The second setting provides only the relevant paragraph(s) to the chart instead of the entire article, whereas the third setting requires the model to generate an answer solely based on the chart. Our analysis of the results show that the top performing models generally produce fluent and coherent text while they struggle to perform complex logical and arithmetic reasoning.
△ Less
Submitted 12 October, 2022;
originally announced October 2022.
-
Energy and Time Based Topology Control Approach to Enhance the Lifetime of WSN in an economic zone
Authors:
Tanvir Hossain,
Md. Ershadul Haque,
Abdullah Al Mamun,
Samiul Ul Hoque,
Al Amin Fahim
Abstract:
An economic zone requires continuous monitoring and controlling by an autonomous surveillance system for heightening its production competency and security. Wireless sensor network (WSN) has swiftly grown popularity over the world for uninterruptedly monitoring and controlling a system. Sensor devices, the main elements of WSN, are given limited amount of energy, which leads the network to limited…
▽ More
An economic zone requires continuous monitoring and controlling by an autonomous surveillance system for heightening its production competency and security. Wireless sensor network (WSN) has swiftly grown popularity over the world for uninterruptedly monitoring and controlling a system. Sensor devices, the main elements of WSN, are given limited amount of energy, which leads the network to limited lifespan. Therefore, the most significant challenge is to increase the lifespan of a WSN system. Topology control mechanism (TCM) is a renowned method to enhance the lifespan of WSN. This paper proposes an approach to extend the lifetime of WSN for an economic area, targeting an economic zone in Bangladesh. Observations are made on the performance of the network lifetime considering the individual combinations of the TCM protocols and comparative investigation between the time and energy triggering strategy of TCM protocols. Results reveal the network makes a better performance in the case of A3 protocol while using the topology maintenance protocols with both time and energy triggering methods. Moreover, the performance of the A3 and DGETRec is superior to the other combinations of TCM protocols. Hence, the WSN system can be able to serve better connectivity coverage in the target economic zone.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Lehmer sequence approach to the divisibility of class numbers of imaginary quadratic fields
Authors:
Kalyan Chakraborty,
Azizul Hoque
Abstract:
Let $k\geq 3$ and $n\geq 3$ be odd integers, and let $m\geq 0$ be any integer. For a prime number $\ell$, we prove that the class number of the imaginary quadratic field $\mathbb{Q}(\sqrt{\ell^{2m}-2k^n})$ is either divisible by $n$ or by a specific divisor of $n$. Applying this result, we construct an infinite family of certain tuples of imaginary quadratic fields of the form…
▽ More
Let $k\geq 3$ and $n\geq 3$ be odd integers, and let $m\geq 0$ be any integer. For a prime number $\ell$, we prove that the class number of the imaginary quadratic field $\mathbb{Q}(\sqrt{\ell^{2m}-2k^n})$ is either divisible by $n$ or by a specific divisor of $n$. Applying this result, we construct an infinite family of certain tuples of imaginary quadratic fields of the form $$\left(\mathbb{Q}(\sqrt{d}), \mathbb{Q}(\sqrt{d+1}), \mathbb{Q}(\sqrt{4d+1}), \mathbb{Q}(\sqrt{2d+4}), \mathbb{Q}(\sqrt{2d+16}), \cdots, \mathbb{Q}(\sqrt{2d+4^t}) \right)$$ with $d\in \mathbb{Z}$ and $1\leq 4^t\leq 2|d|$ whose class numbers are all divisible by $n$. Our proofs use some deep results about primitive divisors of Lehmer sequences.
△ Less
Submitted 2 October, 2022;
originally announced October 2022.
-
Self-Supervised Visuo-Tactile Pretraining to Locate and Follow Garment Features
Authors:
Justin Kerr,
Huang Huang,
Albert Wilcox,
Ryan Hoque,
Jeffrey Ichnowski,
Roberto Calandra,
Ken Goldberg
Abstract:
Humans make extensive use of vision and touch as complementary senses, with vision providing global information about the scene and touch measuring local information during manipulation without suffering from occlusions. While prior work demonstrates the efficacy of tactile sensing for precise manipulation of deformables, they typically rely on supervised, human-labeled datasets. We propose Self-S…
▽ More
Humans make extensive use of vision and touch as complementary senses, with vision providing global information about the scene and touch measuring local information during manipulation without suffering from occlusions. While prior work demonstrates the efficacy of tactile sensing for precise manipulation of deformables, they typically rely on supervised, human-labeled datasets. We propose Self-Supervised Visuo-Tactile Pretraining (SSVTP), a framework for learning multi-task visuo-tactile representations in a self-supervised manner through cross-modal supervision. We design a mechanism that enables a robot to autonomously collect precisely spatially-aligned visual and tactile image pairs, then train visual and tactile encoders to embed these pairs into a shared latent space using cross-modal contrastive loss. We apply this latent space to downstream perception and control of deformable garments on flat surfaces, and evaluate the flexibility of the learned representations without fine-tuning on 5 tasks: feature classification, contact localization, anomaly detection, feature search from a visual query (e.g., garment feature localization under occlusion), and edge following along cloth edges. The pretrained representations achieve a 73-100% success rate on these 5 tasks.
△ Less
Submitted 31 July, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
SEER: Sustainable E-commerce with Environmental-impact Rating
Authors:
Md Saiful Islam,
Adiba Mahbub,
Caleb Wohn,
Karen Berger,
Serena Uong,
Varun Kumar,
Katrina Smith Korfmacher,
Ehsan Hoque
Abstract:
With online shopping gaining massive popularity over the past few years, e-commerce platforms can play a significant role in tackling climate change and other environmental problems. In this study, we report that the "attitude-behavior" gap identified by prior sustainable consumption literature also exists in an online setting. We propose SEER, a concept design for online shopping websites to help…
▽ More
With online shopping gaining massive popularity over the past few years, e-commerce platforms can play a significant role in tackling climate change and other environmental problems. In this study, we report that the "attitude-behavior" gap identified by prior sustainable consumption literature also exists in an online setting. We propose SEER, a concept design for online shopping websites to help consumers make more sustainable choices. We introduce explainable environmental impact ratings to increase knowledge, trust, and convenience for consumers willing to purchase eco-friendly products. In our quasi-randomized case-control experiment with 98 subjects across the United States, we found that the case group using SEER demonstrates significantly more eco-friendly consumption behavior than the control group using a traditional e-commerce setting. While there are challenges in generating reliable explanations and environmental ratings for products, if implemented, in the United States alone, SEER has the potential to reduce approximately 2.88 million tonnes of carbon emission every year.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
TruVR: Trustworthy Cybersickness Detection using Explainable Machine Learning
Authors:
Ripan Kumar Kundu,
Rifatul Islam,
Prasad Calyam,
Khaza Anuarul Hoque
Abstract:
Cybersickness can be characterized by nausea, vertigo, headache, eye strain, and other discomforts when using virtual reality (VR) systems. The previously reported machine learning (ML) and deep learning (DL) algorithms for detecting (classification) and predicting (regression) VR cybersickness use black-box models; thus, they lack explainability. Moreover, VR sensors generate a massive amount of…
▽ More
Cybersickness can be characterized by nausea, vertigo, headache, eye strain, and other discomforts when using virtual reality (VR) systems. The previously reported machine learning (ML) and deep learning (DL) algorithms for detecting (classification) and predicting (regression) VR cybersickness use black-box models; thus, they lack explainability. Moreover, VR sensors generate a massive amount of data, resulting in complex and large models. Therefore, having inherent explainability in cybersickness detection models can significantly improve the model's trustworthiness and provide insight into why and how the ML/DL model arrived at a specific decision. To address this issue, we present three explainable machine learning (xML) models to detect and predict cybersickness: 1) explainable boosting machine (EBM), 2) decision tree (DT), and 3) logistic regression (LR). We evaluate xML-based models with publicly available physiological and gameplay datasets for cybersickness. The results show that the EBM can detect cybersickness with an accuracy of 99.75% and 94.10% for the physiological and gameplay datasets, respectively. On the other hand, while predicting the cybersickness, EBM resulted in a Root Mean Square Error (RMSE) of 0.071 for the physiological dataset and 0.27 for the gameplay dataset. Furthermore, the EBM-based global explanation reveals exposure length, rotation, and acceleration as key features causing cybersickness in the gameplay dataset. In contrast, galvanic skin responses and heart rate are most significant in the physiological dataset. Our results also suggest that EBM-based local explanation can identify cybersickness-causing factors for individual samples. We believe the proposed xML-based cybersickness detection method can help future researchers understand, analyze, and design simpler cybersickness detection and reduction models.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
Impact analysis of recovery cases due to COVID19 using LSTM deep learning model
Authors:
Md Ershadul Haque,
Samiul Hoque
Abstract:
The present world is badly affected by novel coronavirus (COVID-19). Using medical kits to identify the coronavirus affected persons are very slow. What happens in the next, nobody knows. The world is facing erratic problem and do not know what will happen in near future. This paper is trying to make prognosis of the coronavirus recovery cases using LSTM (Long Short Term Memory). This work exploit…
▽ More
The present world is badly affected by novel coronavirus (COVID-19). Using medical kits to identify the coronavirus affected persons are very slow. What happens in the next, nobody knows. The world is facing erratic problem and do not know what will happen in near future. This paper is trying to make prognosis of the coronavirus recovery cases using LSTM (Long Short Term Memory). This work exploited data of 258 regions, their latitude and longitude and the number of death of 403 days ranging from 22-01-2020 to 27-02-2021. Specifically, advanced deep learning-based algorithms known as the LSTM, play a great effect on extracting highly essential features for time series data (TSD) analysis.There are lots of methods which already use to analyze propagation prediction. The main task of this paper culminates in analyzing the spreading of Coronavirus across worldwide recovery cases using LSTM deep learning-based architectures.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Rice Leaf Disease Classification and Detection Using YOLOv5
Authors:
Md Ershadul Haque,
Ashikur Rahman,
Iftekhar Junaeid,
Samiul Ul Hoque,
Manoranjan Paul
Abstract:
A staple food in more than a hundred nations worldwide is rice (Oryza sativa). The cultivation of rice is vital to global economic growth. However, the main issue facing the agricultural industry is rice leaf disease. The quality and quantity of the crops have declined, and this is the main cause. As farmers in any country do not have much knowledge about rice leaf disease, they cannot diagnose ri…
▽ More
A staple food in more than a hundred nations worldwide is rice (Oryza sativa). The cultivation of rice is vital to global economic growth. However, the main issue facing the agricultural industry is rice leaf disease. The quality and quantity of the crops have declined, and this is the main cause. As farmers in any country do not have much knowledge about rice leaf disease, they cannot diagnose rice leaf disease properly. That's why they cannot take proper care of rice leaves. As a result, the production is decreasing. From literature survey, it has seen that YOLOv5 exhibit the better result compare to others deep learning method. As a result of the continual advancement of object detection technology, YOLO family algorithms, which have extraordinarily high precision and better speed have been used in various scene recognition tasks to build rice leaf disease monitoring systems. We have annotate 1500 collected data sets and propose a rice leaf disease classification and detection method based on YOLOv5 deep learning. We then trained and evaluated the YOLOv5 model. The simulation outcomes show improved object detection result for the augmented YOLOv5 network proposed in this article. The required levels of recognition precision, recall, mAP value, and F1 score are 90\%, 67\%, 76\%, and 81\% respectively are considered as performance metrics.
△ Less
Submitted 4 September, 2022;
originally announced September 2022.
-
DramatVis Personae: Visual Text Analytics for Identifying Social Biases in Creative Writing
Authors:
Md Naimul Hoque,
Bhavya Ghai,
Niklas Elmqvist
Abstract:
Implicit biases and stereotypes are often pervasive in different forms of creative writing such as novels, screenplays, and children's books. To understand the kind of biases writers are concerned about and how they mitigate those in their writing, we conducted formative interviews with nine writers. The interviews suggested that despite a writer's best interest, tracking and managing implicit bia…
▽ More
Implicit biases and stereotypes are often pervasive in different forms of creative writing such as novels, screenplays, and children's books. To understand the kind of biases writers are concerned about and how they mitigate those in their writing, we conducted formative interviews with nine writers. The interviews suggested that despite a writer's best interest, tracking and managing implicit biases such as a lack of agency, supporting or submissive roles, or harmful language for characters representing marginalized groups is challenging as the story becomes longer and complicated. Based on the interviews, we developed DramatVis Personae (DVP), a visual analytics tool that allows writers to assign social identities to characters, and evaluate how characters and different intersectional social identities are represented in the story. To evaluate DVP, we first conducted think-aloud sessions with three writers and found that DVP is easy-to-use, naturally integrates into the writing process, and could potentially help writers in several critical bias identification tasks. We then conducted a follow-up user study with 11 writers and found that participants could answer questions related to bias detection more efficiently using DVP in comparison to a simple text editor.
△ Less
Submitted 1 September, 2022;
originally announced September 2022.
-
A Survey on Open Information Extraction from Rule-based Model to Large Language Model
Authors:
Pai Liu,
Wenyang Gao,
Wenjie Dong,
Lin Ai,
Ziwei Gong,
Songfang Huang,
Zongsheng Li,
Ehsan Hoque,
Julia Hirschberg,
Yue Zhang
Abstract:
Open Information Extraction (OpenIE) represents a crucial NLP task aimed at deriving structured information from unstructured text, unrestricted by relation type or domain. This survey paper provides an overview of OpenIE technologies spanning from 2007 to 2024, emphasizing a chronological perspective absent in prior surveys. It examines the evolution of task settings in OpenIE to align with the a…
▽ More
Open Information Extraction (OpenIE) represents a crucial NLP task aimed at deriving structured information from unstructured text, unrestricted by relation type or domain. This survey paper provides an overview of OpenIE technologies spanning from 2007 to 2024, emphasizing a chronological perspective absent in prior surveys. It examines the evolution of task settings in OpenIE to align with the advances in recent technologies. The paper categorizes OpenIE approaches into rule-based, neural, and pre-trained large language models, discussing each within a chronological framework. Additionally, it highlights prevalent datasets and evaluation metrics currently in use. Building on this extensive review, the paper outlines potential future directions in terms of datasets, information sources, output formats, methodologies, and evaluation metrics.
△ Less
Submitted 23 October, 2024; v1 submitted 18 August, 2022;
originally announced August 2022.
-
Data-Driven Machine Learning to Predict Mechanical Properties of Monolayer TMDs
Authors:
Prottay Malakar,
Md Shajedul Hoque Thakur,
Shahriar Muhammad Nahid,
Md Mahbubul Islam
Abstract:
The understanding of the material properties of the layered transition metal dichalcogenides (TMDs) is critical for their applications in structural composites. The data-driven machine learning (ML) based approaches are being developed in contrast to traditional experimental or computational approach to predict and understand materials properties under varied operating conditions. In this study, w…
▽ More
The understanding of the material properties of the layered transition metal dichalcogenides (TMDs) is critical for their applications in structural composites. The data-driven machine learning (ML) based approaches are being developed in contrast to traditional experimental or computational approach to predict and understand materials properties under varied operating conditions. In this study, we used two ML algorithms such as Long Short-Term Memory (LSTM) and Feed Forward Neural Network (FFNN) combined with molecular dynamics (MD) simulations to predict the mechanical properties of MX2 (M = Mo, W, and X = S, Se) TMDs. The LSTM model is found to be capable of predicting the entire stress-strain response whereas the FFNN is used to predict the material properties such as fracture stress, fracture strain, and Young's modulus. The effects of operating temperature, chiral orientation, and pre-existing crack size on the mechanical properties are thoroughly investigated. We carried out 1440 MD simulations to produce the input dataset for the neural network models. Our results indicate that both LSTM and FFNN are capable of predicting the mechanical response of monolayer TMDs under different conditions with more than 95% accuracy. The FFNN model exhibits lower computational cost than LSTM; however, the capability of LSTM model to predict the entire stress-strain curve is advantageous to assess material properties. The study paves the pathway toward extending this approach to predict other important properties, such as optical, electrical, and magnetic properties of TMDs.
△ Less
Submitted 10 August, 2022;
originally announced August 2022.
-
Weighted Scaling Approach for Metabolomics Data Analysis
Authors:
Biplab Biswas,
Nishith Kumar,
Md Aminul Hoque,
Md Ashad Alam
Abstract:
Systematic variation is a common issue in metabolomics data analysis. Therefore, different scaling and normalization techniques are used to preprocess the data for metabolomics data analysis. Although several scaling methods are available in the literature, however, choice of scaling, transformation and/or normalization technique influence the further statistical analysis. It is challenging to cho…
▽ More
Systematic variation is a common issue in metabolomics data analysis. Therefore, different scaling and normalization techniques are used to preprocess the data for metabolomics data analysis. Although several scaling methods are available in the literature, however, choice of scaling, transformation and/or normalization technique influence the further statistical analysis. It is challenging to choose the appropriate scaling technique for downstream analysis to get accurate results or to make a proper decision. Moreover, the existing scaling techniques are sensitive to outliers or extreme values. To fill the gap, our objective is to introduce a robust scaling approach that is not influenced by outliers as well as provides more accurate results for downstream analysis. Here, we introduced a new weighted scaling approach that is robust against outliers however, where no additional outlier detection/treatment step is needed in data preprocessing and also compared it with the conventional scaling and normalization techniques through artificial and real metabolomics datasets. We evaluated the performance of the proposed method in comparison to the other existing conventional scaling techniques using metabolomics data analysis in both the absence and presence of different percentages of outliers. Results show that in most cases, the proposed scaling technique performs better than the traditional scaling methods in both the absence and presence of outliers. The proposed method improves the further downstream metabolomics analysis. The R function of the proposed robust scaling method is available at https://github.com/nishithkumarpaul/robustScaling/blob/main/wscaling.R
△ Less
Submitted 1 August, 2022;
originally announced August 2022.
-
Direct Visualization of Localized Vibrations at Complex Grain Boundaries
Authors:
Eric R. Hoglund,
De-Liang Bao,
Andrew O'Hara,
Thomas W. Pfeifer,
Md Shafkat Bin Hoque,
Sara Makarem,
James M. Howe,
Sokrates T. Pantelides,
Patrick E. Hopkins,
Jordan A. Hachtel
Abstract:
Grain boundaries (GBs) are a prolific microstructural feature that dominates the functionality of a wide class of materials. The change in functionality at a GB is a direct result of unique local atomic arrangements, different from those in the grain, that have driven extensive experimental and theoretical studies correlating atomic-scale GB structures to macroscopic electronic, infrared-optical,…
▽ More
Grain boundaries (GBs) are a prolific microstructural feature that dominates the functionality of a wide class of materials. The change in functionality at a GB is a direct result of unique local atomic arrangements, different from those in the grain, that have driven extensive experimental and theoretical studies correlating atomic-scale GB structures to macroscopic electronic, infrared-optical, and thermal properties. Here, we examine a SrTiO3 GB using atomic-resolution aberration-corrected scanning transmission electron microscopy (STEM) and ultra-high-energy-resolution monochromated electron energy-loss spectroscopy (EELS), in conjunction with density functional theory (DFT) calculations. This combination enables the direct correlation of the GB structure, composition, and chemical bonding with atomic vibrations within the GB dislocation-cores. We observe that nonstoichiometry and changes in coordination and bonding at the GB leads to a redistribution of vibrational states at the GB and its dislocation-cores relative to the bounding grains. The access to localized vibrations within GBs provided by ultrahigh spatial/spectral resolution EELS correlated with atomic coordination, bonding, and stoichiometry and validated by theory, provides a direct route to quantifying the impact of individual boundaries on macroscopic properties.
△ Less
Submitted 24 August, 2022; v1 submitted 30 July, 2022;
originally announced August 2022.
-
A Flexible Schema-Guided Dialogue Management Framework: From Friendly Peer to Virtual Standardized Cancer Patient
Authors:
Benjamin Kane,
Catherine Giugno,
Lenhart Schubert,
Kurtis Haut,
Caleb Wohn,
Ehsan Hoque
Abstract:
A schema-guided approach to dialogue management has been shown in recent work to be effective in creating robust customizable virtual agents capable of acting as friendly peers or task assistants. However, successful applications of these methods in open-ended, mixed-initiative domains remain elusive -- particularly within medical domains such as virtual standardized patients, where such complex i…
▽ More
A schema-guided approach to dialogue management has been shown in recent work to be effective in creating robust customizable virtual agents capable of acting as friendly peers or task assistants. However, successful applications of these methods in open-ended, mixed-initiative domains remain elusive -- particularly within medical domains such as virtual standardized patients, where such complex interactions are commonplace -- and require more extensive and flexible dialogue management capabilities than previous systems provide. In this paper, we describe a general-purpose schema-guided dialogue management framework used to develop SOPHIE, a virtual standardized cancer patient that allows a doctor to conveniently practice for interactions with patients. We conduct a crowdsourced evaluation of conversations between medical students and SOPHIE. Our agent is judged to produce responses that are natural, emotionally appropriate, and consistent with her role as a cancer patient. Furthermore, it significantly outperforms an end-to-end neural model fine-tuned on a human standardized patient corpus, attesting to the advantages of a schema-guided approach.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Learning Switching Criteria for Sim2Real Transfer of Robotic Fabric Manipulation Policies
Authors:
Satvik Sharma,
Ellen Novoseller,
Vainavi Viswanath,
Zaynah Javed,
Rishi Parikh,
Ryan Hoque,
Ashwin Balakrishna,
Daniel S. Brown,
Ken Goldberg
Abstract:
Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behav…
▽ More
Simulation-to-reality transfer has emerged as a popular and highly successful method to train robotic control policies for a wide variety of tasks. However, it is often challenging to determine when policies trained in simulation are ready to be transferred to the physical world. Deploying policies that have been trained with very little simulation data can result in unreliable and dangerous behaviors on physical hardware. On the other hand, excessive training in simulation can cause policies to overfit to the visual appearance and dynamics of the simulator. In this work, we study strategies to automatically determine when policies trained in simulation can be reliably transferred to a physical robot. We specifically study these ideas in the context of robotic fabric manipulation, in which successful sim2real transfer is especially challenging due to the difficulties of precisely modeling the dynamics and visual appearance of fabric. Results in a fabric smoothing task suggest that our switching criteria correlate well with performance in real. In particular, our confidence-based switching criteria achieve average final fabric coverage of 87.2-93.7% within 55-60% of the total training budget. See https://tinyurl.com/lsc-case for code and supplemental materials.
△ Less
Submitted 2 July, 2022;
originally announced July 2022.
-
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision
Authors:
Ryan Hoque,
Lawrence Yunliang Chen,
Satvik Sharma,
Karthik Dharmarajan,
Brijen Thananjeyan,
Pieter Abbeel,
Ken Goldberg
Abstract:
Commercial and industrial deployments of robot fleets at Amazon, Nimble, Plus One, Waymo, and Zoox query remote human teleoperators when robots are at risk or unable to make task progress. With continual learning, interventions from the remote pool of humans can also be used to improve the robot fleet control policy over time. A central question is how to effectively allocate limited human attenti…
▽ More
Commercial and industrial deployments of robot fleets at Amazon, Nimble, Plus One, Waymo, and Zoox query remote human teleoperators when robots are at risk or unable to make task progress. With continual learning, interventions from the remote pool of humans can also be used to improve the robot fleet control policy over time. A central question is how to effectively allocate limited human attention. Prior work addresses this in the single-robot, single-human setting; we formalize the Interactive Fleet Learning (IFL) setting, in which multiple robots interactively query and learn from multiple human supervisors. We propose Return on Human Effort (ROHE) as a new metric and Fleet-DAgger, a family of IFL algorithms. We present an open-source IFL benchmark suite of GPU-accelerated Isaac Gym environments for standardized evaluation and development of IFL algorithms. We compare a novel Fleet-DAgger algorithm to 4 baselines with 100 robots in simulation. We also perform a physical block-pushing experiment with 4 ABB YuMi robot arms and 2 remote humans. Experiments suggest that the allocation of humans to robots significantly affects the performance of the fleet, and that the novel Fleet-DAgger algorithm can achieve up to 8.8x higher ROHE than baselines. See https://tinyurl.com/fleet-dagger for supplemental material.
△ Less
Submitted 16 November, 2022; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Chirp-Based Over-the-Air Computation for Long-Range Federated Edge Learning
Authors:
Safi Shams Muhtasimul Hoque,
Mohammad Hassan Adeli,
Alphan Sahin
Abstract:
In this study, we propose circularly-shifted chirp (CSC)-based majority vote (MV) (CSC-MV), a power-efficient over-the-air computation (OAC) scheme, to achieve long-range federated edge learning (FEEL). The proposed approach maps the votes (i.e., the sign of the local gradients) from the edge devices (EDs) to the linear CSCs constructed with a discrete Fourier transform-spread orthogonal frequency…
▽ More
In this study, we propose circularly-shifted chirp (CSC)-based majority vote (MV) (CSC-MV), a power-efficient over-the-air computation (OAC) scheme, to achieve long-range federated edge learning (FEEL). The proposed approach maps the votes (i.e., the sign of the local gradients) from the edge devices (EDs) to the linear CSCs constructed with a discrete Fourier transform-spread orthogonal frequency division multiplexing (DFT-s-OFDM) transmitter. At the edge server (ES), the MV is calculated with an energy detector. We compare our proposed scheme with one-bit broadband digital aggregation (OBDA) and show that the output-power back-off (OBO) requirement of the transmitters with an adjacent-channel-leakage ratio (ACLR) constraint for CSC-MV is lower than the one with OBDA. For example, with an ACLR constraint of -22 dB, CSC-MV can have an OBO requirement of 6-7 dB less than the one with OBDA. When the power amplifier (PA) non-linearity is considered, we demonstrate that CSC-MV outperforms OBDA in terms of test accuracy for both homogeneous and heterogeneous data distributions, without using channel state information (CSI) at the ES and EDs.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
Geometrical magnetoresistance effect and mobility in graphene field-effect transistors
Authors:
Isabel Harrysson Rodrigues,
Andrey Generalov,
Anamul Md Hoque,
Miika Soikkeli,
Anton Murros,
Sanna Arpiainen,
Andrei Vorobiev
Abstract:
Further development of the graphene field-effect transistors (GFETs) for high-frequency electronics requires accurate evaluation and study of the mobility of charge carriers in a specific device. Here, we demonstrate that the mobility in the GFETs can be directly characterized and studied using the geometrical magnetoresistance (gMR) effect. The method is free from the limitations of other approac…
▽ More
Further development of the graphene field-effect transistors (GFETs) for high-frequency electronics requires accurate evaluation and study of the mobility of charge carriers in a specific device. Here, we demonstrate that the mobility in the GFETs can be directly characterized and studied using the geometrical magnetoresistance (gMR) effect. The method is free from the limitations of other approaches since it does not require an assumption of the constant mobility and the knowledge of the gate capacitance. Studies of a few sets of GFETs in the wide range of transverse magnetic fields indicate that the gMR effect dominates up to approximately 0.55 T. In higher fields, the physical magnetoresistance effect starts to contribute. The advantages of the gMR approach allowed us to interpret the measured dependencies of mobility on the gate voltage, i.e., carrier concentration, and identify the corresponding scattering mechanisms. In particular, the range of the fairly constant mobility is associated with the dominating Coulomb scattering. The decrease in mobility at higher carrier concentrations is associated with the contribution of the phonon scattering. Analysis shows that the gMR mobility is typically 2-3 times higher than that found via the commonly used drain resistance model. The latter underestimates the mobility since it does not take the interfacial capacitance into account.
△ Less
Submitted 6 June, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Evaluating Performance of Machine Learning Models for Diabetic Sensorimotor Polyneuropathy Severity Classification using Biomechanical Signals during Gait
Authors:
Fahmida Haque,
Mamun Bin Ibne Reaz,
Muhammad Enamul Hoque Chowdhury,
Serkan Kiranyaz,
Mohamed Abdelmoniem,
Emadeddin Hussein,
Mohammed Shaat,
Sawal Hamid Md Ali,
Ahmad Ashrif A Bakar,
Geetika Srivastava,
Mohammad Arif Sobhan Bhuiyan,
Mohd Hadri Hafiz Mokhtar,
Edi Kurniawan
Abstract:
Diabetic sensorimotor polyneuropathy (DSPN) is one of the prevalent forms of neuropathy affected by diabetic patients that involves alterations in biomechanical changes in human gait. In literature, for the last 50 years, researchers are trying to observe the biomechanical changes due to DSPN by studying muscle electromyography (EMG), and ground reaction forces (GRF). However, the literature is co…
▽ More
Diabetic sensorimotor polyneuropathy (DSPN) is one of the prevalent forms of neuropathy affected by diabetic patients that involves alterations in biomechanical changes in human gait. In literature, for the last 50 years, researchers are trying to observe the biomechanical changes due to DSPN by studying muscle electromyography (EMG), and ground reaction forces (GRF). However, the literature is contradictory. In such a scenario, we are proposing to use Machine learning techniques to identify DSPN patients by using EMG, and GRF data. We have collected a dataset consists of three lower limb muscles EMG (tibialis anterior (TA), vastus lateralis (VL), gastrocnemius medialis (GM) and 3-dimensional GRF components (GRFx, GRFy, and GRFz). Raw EMG and GRF signals were preprocessed, and a newly proposed feature extraction technique scheme from literature was applied to extract the best features from the signals. The extracted feature list was ranked using Relief feature ranking techniques, and highly correlated features were removed. We have trained different ML models to find out the best-performing model and optimized that model. We trained the optimized ML models for different combinations of muscles and GRF components features, and the performance matrix was evaluated. This study has found ensemble classifier model was performing in identifying DSPN Severity, and we optimized it before training. For EMG analysis, we have found the best accuracy of 92.89% using the Top 14 features for features from GL, VL and TA muscles combined. In the GRF analysis, the model showed 94.78% accuracy by using the Top 15 features for the feature combinations extracted from GRFx, GRFy and GRFz signals. The performance of ML-based DSPN severity classification models, improved significantly, indicating their reliability in DSPN severity classification, for biomechanical data.
△ Less
Submitted 21 May, 2022;
originally announced May 2022.
-
Charge to Spin Conversion in van der Waals Metal NbSe2
Authors:
Anamul Md. Hoque,
Bing Zhao,
Dmitrii Khokhriakov,
Prasanta Muduli,
Saroj P. Dash1
Abstract:
Quantum materials with a large charge current-induced spin polarization are promising for next-generation all-electrical spintronic science and technology. Van der Waals metals with high spin-orbit coupling and novel spin textures have attracted significant attention for an efficient charge to spin conversion process. Here, we demonstrate the electrical generation of spin polarization in NbSe2 up…
▽ More
Quantum materials with a large charge current-induced spin polarization are promising for next-generation all-electrical spintronic science and technology. Van der Waals metals with high spin-orbit coupling and novel spin textures have attracted significant attention for an efficient charge to spin conversion process. Here, we demonstrate the electrical generation of spin polarization in NbSe2 up to room temperature. To probe the current-induced spin polarization in NbSe2, we used a graphene-based non-local spin-valve device, where the spin-polarization in NbSe2 is efficiently injected and detected using non-local spin-switch and Hanle spin precession measurements. A significantly higher charge-spin conversion in NbSe2 is observed at a lower temperature, below the superconducting transition temperature Tc ~ 7 K of NbSe2. However, the charge-spin conversion signal could only be observed with a higher bias current above the superconducting critical current, limiting the observation of the signal only to the non-superconducting state of NbSe2. Systematic measurements provide the possible origins of the spin polarization to be predominantly due to the spin Hall effect or Rashba-Edelstein effect in NbSe2, considering different symmetry allowed charge-spin conversion processes.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Generalized Mersenne Numbers of the form $cx^2$
Authors:
Azizul Hoque
Abstract:
Generalized Mersenne numbers are defined as $M_{p,n} = p^n - p + 1$, where $p$ is any prime and $n$ is any positive integer. Here, we prove that for each pair $(c, p)$ with $c\geq 1$ an integer, there is at most one $M_{p, n}$ of the form $cx^2$ with a few exceptions.
Generalized Mersenne numbers are defined as $M_{p,n} = p^n - p + 1$, where $p$ is any prime and $n$ is any positive integer. Here, we prove that for each pair $(c, p)$ with $c\geq 1$ an integer, there is at most one $M_{p, n}$ of the form $cx^2$ with a few exceptions.
△ Less
Submitted 12 May, 2022;
originally announced May 2022.
-
Structural, Dielectric, and Electrical Transport Properties of Al3+ Substituted Nanocrystalline Ni-Cu Spinel Ferrites Prepared Through the Sol-Gel Route
Authors:
M. M. Rahman,
N. Hasan,
M. A. Hoque,
M. B. Hossen,
M. Arifuzzaman
Abstract:
In this study, a series of nanocrystalline ferrites of Ni0.7Cu0.30AlxFe2-xO4 (x=0.00 to 0.10 with a step of 0.02) has been synthesized through the sol-gel auto combustion technique. The structural, morphological, dielectric, and electrical properties of the Ni-Cu spinel ferrite nanoparticles are analyzed due to the substitution of Al3+ content. The crystalline and structural characteristics of the…
▽ More
In this study, a series of nanocrystalline ferrites of Ni0.7Cu0.30AlxFe2-xO4 (x=0.00 to 0.10 with a step of 0.02) has been synthesized through the sol-gel auto combustion technique. The structural, morphological, dielectric, and electrical properties of the Ni-Cu spinel ferrite nanoparticles are analyzed due to the substitution of Al3+ content. The crystalline and structural characteristics of the prepared nanoparticles (NPs) have been studied employing the x-ray diffraction (XRD) spectra and FTIR analysis. The extracted XRD patterns assure the single-phase cubic spinel structure of all samples with homogeneity and no impurity, which indicates the yielding of high crystalline NPs. A slight decrease of average grain size with increment of Al3+ content is observed in the surface morphological study carried out by the field emission scanning electron microscopy (FESEM). The studied materials are found in semi-spherical shapes, showing the multi-domain grains separated by grain boundaries with some agglomerations. The chemical composition study for the synthesized NI-Cu spinel ferrites using energy dispersive x-ray (EDX) ensures the presence of each component in appropriate proportions in each sample.
△ Less
Submitted 12 May, 2022;
originally announced May 2022.
-
Quadratic symmetry algebras and spectrum of the 3D nondegenerate quantum superintegrable system
Authors:
Mohasena Ahamed,
Md Fazlul Hoque
Abstract:
In this paper, we present the quadratic associative symmetry algebra of the 3D nondegenerate maximally quantum superintegrable system. This is the complete symmetry algebra of the system. It is demonstrated that the symmetry algebra contains suitable quadratic subalgebras, each of which is generated by three generators with relevant structure constants, which may depend on central elements. We con…
▽ More
In this paper, we present the quadratic associative symmetry algebra of the 3D nondegenerate maximally quantum superintegrable system. This is the complete symmetry algebra of the system. It is demonstrated that the symmetry algebra contains suitable quadratic subalgebras, each of which is generated by three generators with relevant structure constants, which may depend on central elements. We construct corresponding Casimir operators and present finite-dimensional unirreps and structure functions via the realizations of these subalgebras in the context of deformed oscillators. By imposing constraints on the structure functions, we obtain the spectrum of the 3D nondegenerate superintegrable system. We also show that this model is multiseparable and admits separation of variables in cylindrical polar and paraboloidal coordinates. We derive the physical spectrum by solving the Schrödinger equation of the system and compare the result with those obtained from algebraic derivations.
△ Less
Submitted 22 July, 2022; v1 submitted 9 May, 2022;
originally announced May 2022.
-
Chart Question Answering: State of the Art and Future Directions
Authors:
Enamul Hoque,
Parsa Kavehzadeh,
Ahmed Masry
Abstract:
Information visualizations such as bar charts and line charts are very common for analyzing data and discovering critical insights. Often people analyze charts to answer questions that they have in mind. Answering such questions can be challenging as they often require a significant amount of perceptual and cognitive effort. Chart Question Answering (CQA) systems typically take a chart and a natur…
▽ More
Information visualizations such as bar charts and line charts are very common for analyzing data and discovering critical insights. Often people analyze charts to answer questions that they have in mind. Answering such questions can be challenging as they often require a significant amount of perceptual and cognitive effort. Chart Question Answering (CQA) systems typically take a chart and a natural language question as input and automatically generate the answer to facilitate visual data analysis. Over the last few years, there has been a growing body of literature on the task of CQA. In this survey, we systematically review the current state-of-the-art research focusing on the problem of chart question answering. We provide a taxonomy by identifying several important dimensions of the problem domain including possible inputs and outputs of the task and discuss the advantages and limitations of proposed solutions. We then summarize various evaluation techniques used in the surveyed papers. Finally, we outline the open challenges and future research opportunities related to chart question answering.
△ Less
Submitted 21 May, 2022; v1 submitted 8 May, 2022;
originally announced May 2022.
-
SciEv: Finding Scientific Evidence Papers for Scientific News
Authors:
Md Reshad Ul Hoque,
Jiang Li,
Jian Wu
Abstract:
In the past decade, many scientific news media that report scientific breakthroughs and discoveries emerged, bringing science and technology closer to the general public. However, not all scientific news article cites proper sources, such as original scientific papers. A portion of scientific news articles contain misinterpreted, exaggerated, or distorted information that deviates from facts asser…
▽ More
In the past decade, many scientific news media that report scientific breakthroughs and discoveries emerged, bringing science and technology closer to the general public. However, not all scientific news article cites proper sources, such as original scientific papers. A portion of scientific news articles contain misinterpreted, exaggerated, or distorted information that deviates from facts asserted in the original papers. Manually identifying proper citations is laborious and costly. Therefore, it is necessary to automatically search for pertinent scientific papers that could be used as evidence for a given piece of scientific news. We propose a system called SciEv that searches for scientific evidence papers given a scientific news article. The system employs a 2-stage query paradigm with the first stage retrieving candidate papers and the second stage reranking them. The key feature of SciEv is it uses domain knowledge entities (DKEs) to find candidates in the first stage, which proved to be more effective than regular keyphrases. In the reranking stage, we explore different document representations for news articles and candidate papers. To evaluate our system, we compiled a pilot dataset consisting of 100 manually curated (news,paper) pairs from ScienceAlert and similar websites. To our best knowledge, this is the first dataset of this kind. Our experiments indicate that the transformer model performs the best for DKE extraction. The system achieves a P@1=50%, P@5=71%, and P@10=74% when it uses a TFIDF-based text representation. The transformer-based re-ranker achieves a comparable performance but costs twice as much time. We will collect more data and test the system for user experience.
△ Less
Submitted 29 April, 2022;
originally announced May 2022.
-
Diophantine triples with the property $D(n)$ for distinct $n$
Authors:
Kalyan Chakraborty,
Shubham Gupta,
Azizul Hoque
Abstract:
We prove that for every integer $n$, there exist infinitely many $D(n)$-triples which are also $D(t)$-triples for $t\in\mathbb{Z}$ with $n\ne t$. We also prove that there are infinitely many triples with the property $D(-1)$ in $\mathbb{Z}[i]$ which are also $D(n)$-triple in $\mathbb{Z}[i]$ for two distinct $n$'s other than $n = -1$ and these triples are not equivalent to any triple with the prope…
▽ More
We prove that for every integer $n$, there exist infinitely many $D(n)$-triples which are also $D(t)$-triples for $t\in\mathbb{Z}$ with $n\ne t$. We also prove that there are infinitely many triples with the property $D(-1)$ in $\mathbb{Z}[i]$ which are also $D(n)$-triple in $\mathbb{Z}[i]$ for two distinct $n$'s other than $n = -1$ and these triples are not equivalent to any triple with the property $D(1)$.
△ Less
Submitted 29 April, 2022;
originally announced April 2022.
-
Learning to Fold Real Garments with One Arm: A Case Study in Cloud-Based Robotics Research
Authors:
Ryan Hoque,
Kaushik Shivakumar,
Shrey Aeron,
Gabriel Deza,
Aditya Ganapathi,
Adrian Wong,
Johnny Lee,
Andy Zeng,
Vincent Vanhoucke,
Ken Goldberg
Abstract:
Autonomous fabric manipulation is a longstanding challenge in robotics, but evaluating progress is difficult due to the cost and diversity of robot hardware. Using Reach, a cloud robotics platform that enables low-latency remote execution of control policies on physical robots, we present the first systematic benchmarking of fabric manipulation algorithms on physical hardware. We develop 4 novel l…
▽ More
Autonomous fabric manipulation is a longstanding challenge in robotics, but evaluating progress is difficult due to the cost and diversity of robot hardware. Using Reach, a cloud robotics platform that enables low-latency remote execution of control policies on physical robots, we present the first systematic benchmarking of fabric manipulation algorithms on physical hardware. We develop 4 novel learning-based algorithms that model expert actions, keypoints, reward functions, and dynamic motions, and we compare these against 4 learning-free and inverse dynamics algorithms on the task of folding a crumpled T-shirt with a single robot arm. The entire lifecycle of data collection, model training, and policy evaluation is performed remotely without physical access to the robot workcell. Results suggest a new algorithm combining imitation learning with analytic methods achieves 84% of human-level performance on the folding task. See https://sites.google.com/berkeley.edu/cloudfolding for all data, code, models, and supplemental material.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Orientation Controlled Anisotropy in Single Crystals of Quasi-1D BaTiS3
Authors:
Boyang Zhao,
Md Shafkat Bin Hoque,
Gwan Yeong Jung,
Hongyan Mei,
Shantanu Singh,
Guodong Ren,
Milena Milich,
Qinai Zhao,
Nan Wang,
Huandong Chen,
Shanyuan Niu,
Sang-Jun Lee,
Cheng-Tai Kuo,
Jun-Sik Lee,
John A. Tomko,
Han Wang,
Mikhail Kats,
Rohan Mishra,
Patrick E Hopkins,
J. Ravichandran
Abstract:
Low-dimensional materials with chain-like (one-dimensional) or layered (twodimensional) structures are of significant interest due to their anisotropic electrical, optical, thermal properties. One material with chain-like structure, BaTiS3 (BTS), was recently shown to possess giant in-plane optical anisotropy and glass-like thermal conductivity. To understand the origin of these effects, it is nec…
▽ More
Low-dimensional materials with chain-like (one-dimensional) or layered (twodimensional) structures are of significant interest due to their anisotropic electrical, optical, thermal properties. One material with chain-like structure, BaTiS3 (BTS), was recently shown to possess giant in-plane optical anisotropy and glass-like thermal conductivity. To understand the origin of these effects, it is necessary to fully characterize the optical, thermal, and electronic anisotropy of BTS. To this end, BTS crystals with different orientations (aand c-axis orientations) were grown by chemical vapor transport. X-ray absorption spectroscopy (XAS) was used to characterize the local structure and electronic anisotropy of BTS. Fourier transform infrared (FTIR) reflection/transmission spectra show a large inplane optical anisotropy in the a-oriented crystals, while the c-axis oriented crystals were nearly isotropic in-plane. BTS platelet crystals are promising uniaxial materials for IR optics with their optic axis parallel to the c-axis. The thermal conductivity measurements revealed a thermal anisotropy of ~4.5 between the c- and a-axis. Time-domain Brillouin scattering showed that the longitudinal sound speed along the two axes is nearly the same suggesting that the thermal anisotropy is a result of different phonon scattering rates.
△ Less
Submitted 7 April, 2022;
originally announced April 2022.
-
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Authors:
Ahmed Masry,
Do Xuan Long,
Jia Qing Tan,
Shafiq Joty,
Enamul Hoque
Abstract:
Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in their questions. However, most existing datasets do not focus on such complex reasoning questions as their questions are template-based and answers come from a f…
▽ More
Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in their questions. However, most existing datasets do not focus on such complex reasoning questions as their questions are template-based and answers come from a fixed-vocabulary. In this work, we present a large-scale benchmark covering 9.6K human-written questions as well as 23.1K questions generated from human-written chart summaries. To address the unique challenges in our benchmark involving visual and logical reasoning over charts, we present two transformer-based models that combine visual features and the data table of the chart in a unified way to answer questions. While our models achieve the state-of-the-art results on the previous datasets as well as on our benchmark, the evaluation also reveals several challenges in answering complex reasoning questions.
△ Less
Submitted 19 March, 2022;
originally announced March 2022.
-
Auto-Gait: Automatic Ataxia Risk Assessment with Computer Vision on Gait Task Videos
Authors:
Wasifur Rahman,
Masum Hasan,
Md Saiful Islam,
Titilayo Olubajo,
Jeet Thaker,
Abdelrahman Abdelkader,
Phillip Yang,
Tetsuo Ashizawa,
Ehsan Hoque
Abstract:
In this paper, we investigated whether we can 1) detect participants with ataxia-specific gait characteristics (risk-prediction), and 2) assess severity of ataxia from gait (severity-assessment) using computer vision. We created a dataset of 155 videos from 89 participants, 24 controls and 65 diagnosed with (or are pre-manifest) spinocerebellar ataxias (SCAs), performing the gait task of the Scale…
▽ More
In this paper, we investigated whether we can 1) detect participants with ataxia-specific gait characteristics (risk-prediction), and 2) assess severity of ataxia from gait (severity-assessment) using computer vision. We created a dataset of 155 videos from 89 participants, 24 controls and 65 diagnosed with (or are pre-manifest) spinocerebellar ataxias (SCAs), performing the gait task of the Scale for the Assessment and Rating of Ataxia (SARA) from 11 medical sites located in 8 different states across the United States. We develop a computer vision pipeline to detect, track, and separate out the participants from their surroundings and construct several features from their body pose coordinates to capture gait characteristics like step width, step length, swing, stability, speed, etc. Our risk-prediction model achieves 83.06% accuracy and an 80.23% F1 score. Similarly, our severity-assessment model achieves a mean absolute error (MAE) score of 0.6225 and a Pearson's correlation coefficient score of 0.7268. Our models still performed competitively when evaluated on data from sites not used during training. Furthermore, through feature importance analysis, we found that our models associate wider steps, decreased walking speed, and increased instability with greater ataxia severity, which is consistent with previously established clinical knowledge. Our models create possibilities for remote ataxia assessment in non-clinical settings in the future, which could significantly improve accessibility of ataxia care. Furthermore, our underlying dataset was assembled from a geographically diverse cohort, highlighting its potential to further increase equity. The code used in this study is open to the public, and the anonymized body pose landmark dataset is also available upon request.
△ Less
Submitted 15 April, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Chart-to-Text: A Large-Scale Benchmark for Chart Summarization
Authors:
Shankar Kantharaj,
Rixie Tiffany Ko Leong,
Xiang Lin,
Ahmed Masry,
Megh Thakkar,
Enamul Hoque,
Shafiq Joty
Abstract:
Charts are commonly used for exploring data and communicating insights. Generating natural language summaries from charts can be very helpful for people in inferring key insights that would otherwise require a lot of cognitive and perceptual efforts. We present Chart-to-text, a large-scale benchmark with two datasets and a total of 44,096 charts covering a wide range of topics and chart types. We…
▽ More
Charts are commonly used for exploring data and communicating insights. Generating natural language summaries from charts can be very helpful for people in inferring key insights that would otherwise require a lot of cognitive and perceptual efforts. We present Chart-to-text, a large-scale benchmark with two datasets and a total of 44,096 charts covering a wide range of topics and chart types. We explain the dataset construction process and analyze the datasets. We also introduce a number of state-of-the-art neural models as baselines that utilize image captioning and data-to-text generation techniques to tackle two problem variations: one assumes the underlying data table of the chart is available while the other needs to extract data from chart images. Our analysis with automatic and human evaluation shows that while our best models usually generate fluent summaries and yield reasonable BLEU scores, they also suffer from hallucinations and factual errors as well as difficulties in correctly explaining complex patterns and trends in charts.
△ Less
Submitted 14 April, 2022; v1 submitted 12 March, 2022;
originally announced March 2022.
-
BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition
Authors:
Md. Saif Hassan Onim,
Hussain Nyeem,
Koushik Roy,
Mahmudul Hasan,
Abtahi Ishmam,
Md. Akiful Hoque Akif,
Tareque Bashar Ovi
Abstract:
The development of the Automatic License Plate Recognition (ALPR) system has received much attention for the English license plate. However, despite being the sixth largest population around the world, no significant progress can be tracked in the Bengali language countries or states for the ALPR system addressing their more alarming traffic management with inadequate road-safety measures. This pa…
▽ More
The development of the Automatic License Plate Recognition (ALPR) system has received much attention for the English license plate. However, despite being the sixth largest population around the world, no significant progress can be tracked in the Bengali language countries or states for the ALPR system addressing their more alarming traffic management with inadequate road-safety measures. This paper reports a computationally efficient and reasonably accurate Automatic License Plate Recognition (ALPR) system for Bengali characters with a new end-to-end DNN model that we call Bengali License Plate Network(BLPnet). The cascaded architecture for detecting vehicle regions prior to vehicle license plate (VLP) in the model is proposed to eliminate false positives resulting in higher detection accuracy of VLP. Besides, a lower set of trainable parameters is considered for reducing the computational cost making the system faster and more compatible for a real-time application. With a Computational Neural Network (CNN)based new Bengali OCR engine and word-mapping process, the model is characters rotation invariant, and can readily extract, detect and output the complete license plate number of a vehicle. The model feeding with17 frames per second (fps) on real-time video footage can detect a vehicle with the Mean Squared Error (MSE) of 0.0152, and the mean license plate character recognition accuracy of 95%. While compared to the other models, an improvement of 5% and 20% were recorded for the BLPnetover the prominent YOLO-based ALPR model and the Tesseract model for the number-plate detection accuracy and time requirement, respectively.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Effect of powder bed fusion process parameters on microstructural and mechanical properties of FeCrNi MEA: An atomistic study
Authors:
Ishat Raihan Jamil,
Ali Muhit Mustaquim,
Mahmudul Islam,
Md Shajedul Hoque Thakur,
Mohammad Nasim Hasan
Abstract:
In our study, molecular dynamics (MD) simulations of laser powder bed fusion (LPBF) have been conducted on equimolar FeNiCr medium entropy alloy (MEA) powders. With the development of newer LPBF technologies capable of printing at the microscale, an even deeper understanding of the underlying atomistic effects of the process parameters on the microstructural and mechanical properties of the manufa…
▽ More
In our study, molecular dynamics (MD) simulations of laser powder bed fusion (LPBF) have been conducted on equimolar FeNiCr medium entropy alloy (MEA) powders. With the development of newer LPBF technologies capable of printing at the microscale, an even deeper understanding of the underlying atomistic effects of the process parameters on the microstructural and mechanical properties of the manufactured FeNiCr MEA products is required. In accordance with previous literature, the parameters of the LPBF process have been systematically varied, including layer resolution from 1 to 6, laser power from 100 μW to 220 μW, bed temperature from 300 K to 1200 K, and laser scan speed from 0.5 Å/ps to 0.0625 Å/ps. Consistent with prior macroscopic experimental findings, the atomistic results suggest that additive manufacturing using thinner layers imparts higher ultimate tensile strength (UTS) than fabricating with thicker layers. The latter, however, requires a shorter process time but induces keyhole defect formation if the laser-induced temperature is not sufficiently high enough. Increasing the temperature proves useful in mitigating this problem. Enhancement of UTS for the multi-rowed powders has been observed by raising the substrate temperature to 600 K or laser power to 160 μW during production. Beyond these critical limits, however, the UTS of the product diminishes due to the emergence of multiple vacancies. The results of our present study will help researchers to find a good balance between the production speed and strength of additive manufactured products at the nanoscale.
△ Less
Submitted 17 February, 2022;
originally announced February 2022.
-
Adaptive Template Enhancement for Improved Person Recognition using Small Datasets
Authors:
Su Yang,
Sanaul Hoque,
Farzin Deravi
Abstract:
A novel instance-based method for the classification of electroencephalography (EEG) signals is presented and evaluated in this paper. The non-stationary nature of the EEG signals, coupled with the demanding task of pattern recognition with limited training data as well as the potentially noisy signal acquisition conditions, have motivated the work reported in this study. The proposed adaptive tem…
▽ More
A novel instance-based method for the classification of electroencephalography (EEG) signals is presented and evaluated in this paper. The non-stationary nature of the EEG signals, coupled with the demanding task of pattern recognition with limited training data as well as the potentially noisy signal acquisition conditions, have motivated the work reported in this study. The proposed adaptive template enhancement mechanism transforms the feature-level instances by treating each feature dimension separately, hence resulting in improved class separation and better query-class matching. The proposed new instance-based learning algorithm is compared with a few related algorithms in a number of scenarios. A clinical grade 64-electrode EEG database, as well as a low-quality (high-noise level) EEG database obtained with a low-cost system using a single dry sensor have been used for evaluations in biometric person recognition. The proposed approach demonstrates significantly improved classification accuracy in both identification and verification scenarios. In particular, this new method is seen to provide a good classification performance for noisy EEG data, indicating its potential suitability for a wide range of applications.
△ Less
Submitted 3 January, 2022;
originally announced January 2022.
-
Over-the-Air Computation with DFT-spread OFDM for Federated Edge Learning
Authors:
Alphan Sahin,
Bryson Everette,
Safi Shams Muhtasimul Hoque
Abstract:
In this study, we propose an over-the-air computation (AirComp) scheme for federated edge learning (FEEL) without channel state information (CSI) at the edge devices (EDs) or the edge server (ES). The proposed scheme relies on non-coherent communication techniques for achieving distributed training by majority vote (MV). In this work, the votes, i.e., the signs of the local gradients, from the EDs…
▽ More
In this study, we propose an over-the-air computation (AirComp) scheme for federated edge learning (FEEL) without channel state information (CSI) at the edge devices (EDs) or the edge server (ES). The proposed scheme relies on non-coherent communication techniques for achieving distributed training by majority vote (MV). In this work, the votes, i.e., the signs of the local gradients, from the EDs are represented with the pulse-position modulation (PPM) symbols constructed with discrete Fourier transform (DFT)-spread orthogonal frequency division multiplexing (OFDM) (DFT-s-OFDM). By taking the delay spread and time-synchronization errors into account, the MV at the ES is obtained with an energy detector. Hence, the proposed scheme does not require CSI at the EDs and ES. We also prove the convergence of the distributed training when the MV is obtained with the proposed scheme under fading channel. Through simulations, we show that the proposed scheme provides a high test accuracy in fading channels while resulting in lower peak-to-mean envelope power ratio (PMEPR) symbols.
△ Less
Submitted 26 December, 2021;
originally announced December 2021.
-
XDWM: A 2D Domain Wall Memory
Authors:
Arifa Hoque,
Alex K. Jones,
Sanjukta Bhanja
Abstract:
Domain-Wall Memory (DWM) structures typically bundle nanowires shifted together for parallel access. Ironically, this organization does not allow the natural shifting of DWM to realize \textit{logical shifting} within data elements. We describe a novel 2-D DWM cross-point (X-Cell) that allows two individual nanowires placed orthogonally to share the X-Cell. Each nanowire can operate independently…
▽ More
Domain-Wall Memory (DWM) structures typically bundle nanowires shifted together for parallel access. Ironically, this organization does not allow the natural shifting of DWM to realize \textit{logical shifting} within data elements. We describe a novel 2-D DWM cross-point (X-Cell) that allows two individual nanowires placed orthogonally to share the X-Cell. Each nanowire can operate independently while sharing the value at the X-Cell. Using X-Cells, we propose an orthogonal nanowire in the Y dimension overlaid on a bundle of X dimension nanowires for a cross-DWM or XDWM. We demonstrate that the bundle shifts correctly in the X-Direction, and that data can be logically shifted in the Y-direction providing novel data movement and supporting processing-in-memory. We conducted studies on the requirements for physical cell dimensions and shift currents for XDWM. Due to the non-standard domain, our micro-magnetic studies demonstrate that XDWM introduces a shift current penalty of 6.25% while shifting happens in one nanowire compared to a standard nanowire. We also demonstrate correct shifting using nanowire bundles in both the X- and Y- dimensions. Using magnetic simulation to derive the values for SPICE simulation we show the maximum leakage current between nanowires when shifting the bundle together is $\le3$% indicating that sneak paths are not problematic for XDWM.
△ Less
Submitted 23 December, 2021;
originally announced December 2021.
-
Domain Adaptation with Pre-trained Transformers for Query Focused Abstractive Text Summarization
Authors:
Md Tahmid Rahman Laskar,
Enamul Hoque,
Jimmy Xiangji Huang
Abstract:
The Query Focused Text Summarization (QFTS) task aims at building systems that generate the summary of the text document(s) based on the given query. A key challenge in addressing this task is the lack of large labeled data for training the summarization model. In this paper, we address this challenge by exploring a series of domain adaptation techniques. Given the recent success of pre-trained tr…
▽ More
The Query Focused Text Summarization (QFTS) task aims at building systems that generate the summary of the text document(s) based on the given query. A key challenge in addressing this task is the lack of large labeled data for training the summarization model. In this paper, we address this challenge by exploring a series of domain adaptation techniques. Given the recent success of pre-trained transformer models in a wide range of natural language processing tasks, we utilize such models to generate abstractive summaries for the QFTS task for both single-document and multi-document scenarios. For domain adaptation, we apply a variety of techniques using pre-trained transformer-based summarization models including transfer learning, weakly supervised learning, and distant supervision. Extensive experiments on six datasets show that our proposed approach is very effective in generating abstractive summaries for the QFTS task while setting a new state-of-the-art result in several datasets across a set of automatic and human evaluation metrics.
△ Less
Submitted 22 December, 2021;
originally announced December 2021.
-
It's Time to Do Something: Mitigating the Negative Impacts of Computing Through a Change to the Peer Review Process
Authors:
Brent Hecht,
Lauren Wilcox,
Jeffrey P. Bigham,
Johannes Schöning,
Ehsan Hoque,
Jason Ernst,
Yonatan Bisk,
Luigi De Russis,
Lana Yarosh,
Bushra Anjum,
Danish Contractor,
Cathy Wu
Abstract:
The computing research community needs to work much harder to address the downsides of our innovations. Between the erosion of privacy, threats to democracy, and automation's effect on employment (among many other issues), we can no longer simply assume that our research will have a net positive impact on the world. While bending the arc of computing innovation towards societal benefit may at firs…
▽ More
The computing research community needs to work much harder to address the downsides of our innovations. Between the erosion of privacy, threats to democracy, and automation's effect on employment (among many other issues), we can no longer simply assume that our research will have a net positive impact on the world. While bending the arc of computing innovation towards societal benefit may at first seem intractable, we believe we can achieve substantial progress with a straightforward step: making a small change to the peer review process. As we explain below, we hypothesize that our recommended change will force computing researchers to more deeply consider the negative impacts of their work. We also expect that this change will incentivize research and policy that alleviates computing's negative impacts.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Creativity in social networks is enhanced by 'Goldilocks' dispersal of ideators' visibility
Authors:
Raiyan Abdul Baten,
Richard N. Aslin,
Gourab Ghoshal,
Ehsan Hoque
Abstract:
Recent works suggest that striking a balance between maximizing idea stimulation and minimizing idea redundancy can elevate creativity in self-organizing social networks. We explore whether dispersing the visibility of idea generators can help achieve such a trade-off. We employ popularity signals (follower counts) of participants as an external source of variation in network structures, which we…
▽ More
Recent works suggest that striking a balance between maximizing idea stimulation and minimizing idea redundancy can elevate creativity in self-organizing social networks. We explore whether dispersing the visibility of idea generators can help achieve such a trade-off. We employ popularity signals (follower counts) of participants as an external source of variation in network structures, which we control across four randomized study conditions. We observe that popularity signals influence inspiration-seeking ties, partly by biasing people's perception of their peers' creativity. Networks that partially disperse the ideators' visibility using this external signal show reduced idea-redundancy and elevated creativity. However, extreme dispersal leads to inferior creativity by narrowing the range of idea stimulation. Our work holds future-of-work implications for elevating creativity.
△ Less
Submitted 6 December, 2021;
originally announced December 2021.
-
Is Approximation Universally Defensive Against Adversarial Attacks in Deep Neural Networks?
Authors:
Ayesha Siddique,
Khaza Anuarul Hoque
Abstract:
Approximate computing is known for its effectiveness in improvising the energy efficiency of deep neural network (DNN) accelerators at the cost of slight accuracy loss. Very recently, the inexact nature of approximate components, such as approximate multipliers have also been reported successful in defending adversarial attacks on DNNs models. Since the approximation errors traverse through the DN…
▽ More
Approximate computing is known for its effectiveness in improvising the energy efficiency of deep neural network (DNN) accelerators at the cost of slight accuracy loss. Very recently, the inexact nature of approximate components, such as approximate multipliers have also been reported successful in defending adversarial attacks on DNNs models. Since the approximation errors traverse through the DNN layers as masked or unmasked, this raises a key research question-can approximate computing always offer a defense against adversarial attacks in DNNs, i.e., are they universally defensive? Towards this, we present an extensive adversarial robustness analysis of different approximate DNN accelerators (AxDNNs) using the state-of-the-art approximate multipliers. In particular, we evaluate the impact of ten adversarial attacks on different AxDNNs using the MNIST and CIFAR-10 datasets. Our results demonstrate that adversarial attacks on AxDNNs can cause 53% accuracy loss whereas the same attack may lead to almost no accuracy loss (as low as 0.06%) in the accurate DNN. Thus, approximate computing cannot be referred to as a universal defense strategy against adversarial attacks.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.