Showing 1–15 of 15 results for author: Shakya, S

  1. arXiv:2507.00467  [pdf, ps, other]

    cs.LG cs.AI

    Diversity Conscious Refined Random Forest

    Authors: Sijan Bhattarai, Saurav Bhandari, Girija Bhusal, Saroj Shakya, Tapendra Pandey

    Abstract: Random Forest (RF) is a widely used ensemble learning technique known for its robust classification performance across diverse domains. However, it often relies on hundreds of trees and all input features, leading to high inference cost and model redundancy. In this work, our goal is to grow trees dynamically only on informative features and then enforce maximal diversity by clustering and retaini…

    Submitted 5 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2502.03614  [pdf, other]

    cs.LG cs.AI cs.CR

    A Novel Zero-Touch, Zero-Trust, AI/ML Enablement Framework for IoT Network Security

    Authors: Sushil Shakya, Robert Abbas, Sasa Maric

    Abstract: The IoT facilitates a connected, intelligent, and sustainable society; therefore, it is imperative to protect the IoT ecosystem. The IoT-based 5G and 6G will leverage the use of machine learning and artificial intelligence (ML/AI) more to pave the way for autonomous and collaborative secure IoT networks. Zero-touch, zero-trust IoT security with AI and machine learning (ML) enablement frameworks of…

    Submitted 5 February, 2025; originally announced February 2025.

  3. arXiv:2411.05890  [pdf, other]

    cs.CR cs.LG

    A Comparative Analysis of Machine Learning Models for DDoS Detection in IoT Networks

    Authors: Sushil Shakya, Robert Abbas

    Abstract: This paper presents the detection of DDoS attacks in IoT networks using machine learning models. Their rapid growth has made them highly susceptible to various forms of cyberattacks, many of whose security procedures are implemented in an irregular manner. It evaluates the efficacy of different machine learning models, such as XGBoost, K-Nearest Neighbours, Stochastic Gradient Descent, and Naïve B…

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: 6 pages, 6 figures

  4. arXiv:2410.22566  [pdf, other]

    eess.IV cs.CV

    Deep Priors for Video Quality Prediction

    Authors: Siddharath Narayan Shakya, Parimala Kancharla

    Abstract: In this work, we designed a completely blind video quality assessment algorithm using the deep video prior. This work mainly explores the utility of deep video prior in estimating the visual quality of the video. In our work, we have used a single distorted video and a reference video pair to learn the deep video prior. At inference time, the learned deep prior is used to restore the original vide…

    Submitted 5 November, 2024; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2024 conference tiny paper

  5. arXiv:2410.17421  [pdf]

    cs.CY

    From an attention economy to an ecology of attending. A manifesto

    Authors: Gunter Bombaerts, Tom Hannes, Martin Adam, Alessandra Aloisi, Joel Anderson, Lawrence Berger, Stefano Davide Bettera, Enrico Campo, Laura Candiotto, Silvia Caprioglio Panizza, Yves Citton, Diego D’Angelo, Matthew Dennis, Nathalie Depraz, Peter Doran, Wolfgang Drechsler, Bill Duane, William Edelglass, Iris Eisenberger, Beverley Foulks McGuire, Antony Fredriksson, Karamjit S. Gill, Peter D. Hershock, Soraj Hongladarom, Beth Jacobs, et al. (30 additional authors not shown)

    Abstract: As the signatories of this manifesto, we denounce the attention economy as inhumane and a threat to our sociopolitical and ecological well-being. We endorse policymakers' efforts to address the negative consequences of the attention economy's technology, but add that these approaches are often limited in their criticism of the systemic context of human attention. Starting from Buddhist philosophy,…

    Submitted 22 October, 2024; originally announced October 2024.

    Comments: 21 pages, 1 figure

  6. Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet

    Authors: Manish Dhakal, Arman Chhetri, Aman Kumar Gupta, Prabin Lamichhane, Suraj Pandey, Subarna Shakya

    Abstract: This paper presents an end-to-end deep learning model for Automatic Speech Recognition (ASR) that transcribes Nepali speech to text. The model was trained and tested on the OpenSLR (audio, text) dataset. The majority of the audio dataset have silent gaps at both ends which are clipped during dataset preprocessing for a more uniform mapping of audio frames and their corresponding texts. Mel Frequen…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted at 2022 International Conference on Inventive Computation Technologies (ICICT), IEEE

    Journal ref: 2022 International Conference on Inventive Computation Technologies (ICICT), pp. 515-521

  7. Contextual Spelling Correction with Language Model for Low-resource Setting

    Authors: Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

    Abstract: The task of Spell Correction(SC) in low-resource languages presents a significant challenge due to the availability of only a limited corpus of data and no annotated spelling correction datasets. To tackle these challenges a small-scale word-based transformer LM is trained to provide the SC model with contextual understanding. Further, the probabilistic error rules are extracted from the corpus in…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 8 pages

  8. arXiv:2404.18071  [pdf]

    cs.CL cs.LG

    Can Perplexity Predict Fine-tuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali

    Authors: Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah, Subarna Shakya

    Abstract: The impact of subword tokenization on language model performance is well-documented for perplexity, with finer granularity consistently reducing this intrinsic metric. However, research on how different tokenization schemes affect a model's understanding capabilities remains limited, particularly for non-Latin script languages. Addressing this gap, we conducted a comprehensive evaluation of six di…

    Submitted 9 June, 2025; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: 11 pages

  9. arXiv:2310.13290  [pdf, other]

    cs.CL

    Interpreting Indirect Answers to Yes-No Questions in Multiple Languages

    Authors: Zijie Wang, Md Mosharaf Hossain, Shivam Mathur, Terry Cruz Melo, Kadir Bulut Ozler, Keun Hee Park, Jacob Quintero, MohammadHossein Rezaei, Shreya Nupur Shakya, Md Nayem Uddin, Eduardo Blanco

    Abstract: Yes-no questions expect a yes or no for an answer, but people often skip polar keywords. Instead, they answer with long explanations that must be interpreted. In this paper, we focus on this challenging problem and release new benchmarks in eight languages. We present a distant supervision approach to collect training data. We also demonstrate that direct answers (i.e., with polar keywords) are us…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  10. arXiv:2208.06130  [pdf]

    cs.CR

    Analysis, Detection, and Classification of Android Malware using System Calls

    Authors: Shubham Shakya, Mayank Dave

    Abstract: With the increasing popularity of Android in the last decade, Android is popular among users as well as attackers. The vast number of android users grabs the attention of attackers on android. Due to the continuous evolution of the variety and attacking techniques of android malware, our detection methods should need an update too. Most of the researcher's works are based on static features, and v…

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: 18 pages, 7 tables, 3 figures

  11. Age Range Estimation using MTCNN and VGG-Face Model

    Authors: Dipesh Gyawali, Prashanga Pokharel, Ashutosh Chauhan, Subodh Chandra Shakya

    Abstract: The Convolutional Neural Network has amazed us with its usage on several applications. Age range estimation using CNN is emerging due to its application in myriad of areas which makes it a state-of-the-art area for research and improve the estimation accuracy. A deep CNN model is used for identification of people's age range in our proposed work. At first, we extracted only face images from image…

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 6 pages, 10 figures

    Journal ref: 11th IEEE International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2020

  12. arXiv:1910.09129  [pdf, other]

    cs.IR cs.CL cs.LG

    A Comparison of Semantic Similarity Methods for Maximum Human Interpretability

    Authors: Pinky Sitikhu, Kritish Pahi, Pujan Thapa, Subarna Shakya

    Abstract: The inclusion of semantic information in any similarity measures improves the efficiency of the similarity measure and provides human interpretable results for further analysis. The similarity calculation method that focuses on features related to the text's words only, will give less accurate results. This paper presents three different methods that not only focus on the text's words but also inc…

    Submitted 30 October, 2019; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: Accepted in IEEE International Conference on Artificial Intelligence for Transforming Business and Society

  13. arXiv:1910.03474  [pdf, other]

    cs.CL cs.LG stat.ML

    Fine-grained Sentiment Classification using BERT

    Authors: Manish Munikar, Sushil Shakya, Aakash Shrestha

    Abstract: Sentiment classification is an important process in understanding people's perception towards a product, service, or topic. Many natural language processing models have been proposed to solve the sentiment classification problem. However, most of them have focused on binary sentiment classification. In this paper, we use a promising deep learning model called BERT to solve the fine-grained sentime…

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Submitted to IEEE International Conference on Artificial Intelligence for Transforming Business and Society

  14. arXiv:1904.04767  [pdf, other]

    quant-ph cs.ET

    Quanvolutional Neural Networks: Powering Image Recognition with Quantum Circuits

    Authors: Maxwell Henderson, Samriddhi Shakya, Shashindra Pradhan, Tristan Cook

    Abstract: Convolutional neural networks (CNNs) have rapidly risen in popularity for many machine learning applications, particularly in the field of image recognition. Much of the benefit generated from these networks comes from their ability to extract features from the data in a hierarchical manner. These features are extracted using various transformational layers, notably the convolutional layer which g…

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 7 pages, 3 figures

  15. arXiv:1409.3512  [pdf]

    cs.CL

    Word Sense Disambiguation using WSD specific Wordnet of Polysemy Words

    Authors: Udaya Raj Dhungana, Subarna Shakya, Kabita Baral, Bharat Sharma

    Abstract: This paper presents a new model of WordNet that is used to disambiguate the correct sense of polysemy word based on the clue words. The related words for each sense of a polysemy word as well as single sense word are referred to as the clue words. The conventional WordNet organizes nouns, verbs, adjectives and adverbs together into sets of synonyms called synsets each expressing a different concep…

    Submitted 10 September, 2014; originally announced September 2014.