-
Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics
Authors:
Asifullah Khan,
Muhammad Zaeem Khan,
Saleha Jamshed,
Sadia Ahmad,
Aleesha Zainab,
Kaynat Khatib,
Faria Bibi,
Abdul Rehman
Abstract:
This survey paper outlines key developments in the field of Large Language Models (LLMs), including improvements in their reasoning skills, adaptability to various tasks, computational efficiency, and ability to make ethical decisions. The techniques that have been most effective in bridging the gap between human and machine communication include Chain-of-Thought prompting, Instruction Tuning, and Reinforcement Learning from Human Feedback. Improvements in multimodal learning and few-shot or zero-shot techniques have further empowered LLMs to handle complex jobs with minimal input. They also do more with less by applying scaling and optimization techniques that conserve computing power. This survey also offers a broader perspective on recent advancements in LLMs, going beyond isolated aspects such as model architecture or ethical concerns. It categorizes emerging methods that enhance LLM reasoning, efficiency, and ethical alignment. It also identifies underexplored areas such as interpretability, cross-modal integration, and sustainability. Despite recent progress, challenges such as huge computational costs, biases, and ethical risks persist. Addressing these requires bias mitigation, transparent decision-making, and clear ethical guidelines. Future research will focus on enhancing models' ability to handle multiple inputs, thereby making them more intelligent, safe, and reliable.
Submitted 14 June, 2025;
originally announced June 2025.
-
Dynamic Logistic Ensembles with Recursive Probability and Automatic Subset Splitting for Enhanced Binary Classification
Authors:
Mohammad Zubair Khan,
David Li
Abstract:
This paper presents a novel approach to binary classification using dynamic logistic ensemble models. The proposed method addresses the challenges posed by datasets containing inherent internal clusters that lack explicit feature-based separations. By extending traditional logistic regression, we develop an algorithm that automatically partitions the dataset into multiple subsets, constructing an ensemble of logistic models to enhance classification accuracy. A key innovation in this work is the recursive probability calculation, derived through algebraic manipulation and mathematical induction, which enables scalable and efficient model construction. Compared to traditional ensemble methods such as Bagging and Boosting, our approach maintains interpretability while offering competitive performance. Furthermore, we systematically employ maximum likelihood and cost functions to facilitate the analytical derivation of recursive gradients as functions of ensemble depth. The effectiveness of the proposed approach is validated on a custom dataset created by introducing noise and shifting data to simulate group structures, resulting in significant performance improvements as layers are added. Implemented in Python, this work balances computational efficiency with theoretical rigor, providing a robust and interpretable solution for complex classification tasks with broad implications for machine learning applications. Code at https://github.com/ensemble-art/Dynamic-Logistic-Ensembles
Submitted 26 November, 2024;
originally announced November 2024.
-
Contactless Human Activity Recognition using Deep Learning with Flexible and Scalable Software Define Radio
Authors:
Muhammad Zakir Khan,
Jawad Ahmad,
Wadii Boulila,
Matthew Broadbent,
Syed Aziz Shah,
Anis Koubaa,
Qammer H. Abbasi
Abstract:
Ambient computing is gaining popularity as a major technological advancement for the future. The modern era has witnessed a surge of advances in healthcare systems, with viable radio frequency solutions proposed for remote and unobtrusive human activity recognition (HAR). Specifically, this study investigates the use of Wi-Fi channel state information (CSI) as a novel method of ambient sensing that can be employed as a contactless means of recognizing human activity in indoor environments. These methods avoid the additional costly hardware required for vision-based systems, which are privacy-intrusive, by (re)using Wi-Fi CSI for various safety and security applications. In an experiment that used a Universal Software Radio Peripheral (USRP) to collect CSI samples, a subject performed six distinct activities, including no activity, standing, sitting, and leaning forward, across different areas of the room. Additionally, more CSI samples were collected while the subject walked in two different directions. This study presents a Wi-Fi CSI-based HAR system that assesses and contrasts deep learning approaches, namely convolutional neural network (CNN), long short-term memory (LSTM), and hybrid (LSTM+CNN), employed for accurate activity recognition. The experimental results indicate that LSTM surpasses current models and achieves an average accuracy of 95.3% in multi-activity classification when compared to CNN and hybrid techniques. Future research needs to study the significance of resilience in diverse and dynamic environments to identify the activity of multiple users.
Submitted 18 April, 2023;
originally announced April 2023.
-
A Low-Complexity Diversity-Preserving Universal Bit-Flipping Enhanced Hard Decision Decoder for Arbitrary Linear Codes
Authors:
Praveen Sai Bere,
Mohammed Zafar Ali Khan,
Lajos Hanzo
Abstract:
V2X (Vehicle-to-everything) communication relies on short messages for short-range transmissions over a fading wireless channel, yet requires high reliability and low latency.
Hard-decision decoding sacrifices the preservation of diversity order, leading to pronounced performance degradation in fading channels. By contrast, soft-decision decoding retains diversity order, albeit at the cost of increased computational complexity.
We introduce a novel enhanced hard-decision decoder, termed the Diversity Flip decoder (DFD), designed to preserve the diversity order. Moreover, it exhibits 'universal' applicability to all linear block codes. For a $\mathscr{C}(n,k)$ code having a minimum distance ${d_{\min}}$, the proposed decoder incurs a worst-case complexity order of $2^{({d_{\min}}-1)}-1$. Notably, for codes having low ${d_{\min}}$, this complexity represents a significant reduction compared to the popular soft- and hard-decision decoding algorithms. Due to its capability of maintaining diversity at a low complexity, it is eminently suitable for applications such as V2X (Vehicle-to-everything), IoT (Internet of Things), mMTC (Massive Machine-Type Communications), URLLC (Ultra-Reliable Low-Latency Communications) and WBAN (Wireless Body Area Networks) for efficient decoding with favorable performance characteristics. The simulation results provided for various known codes and decoding algorithms validate the performance versus complexity benefits of the proposed decoder.
Submitted 7 August, 2024; v1 submitted 7 April, 2023;
originally announced April 2023.
-
Comparing the Performance of NLP Toolkits and Evaluation measures in Legal Tech
Authors:
Muhammad Zohaib Khan
Abstract:
Recent developments in Natural Language Processing have led to the introduction of state-of-the-art Neural Language Models, enabled with unsupervised transferable learning, using different pretraining objectives. While these models achieve excellent results on downstream NLP tasks, various domain adaptation techniques can improve their performance on domain-specific tasks. We compare and analyze the pretrained Neural Language Models XLNet (autoregressive) and BERT (autoencoder) on legal tasks. Results show that the XLNet model performs better on our Sequence Classification task of Legal Opinions Classification, whereas BERT produces better results on the NER task. We use domain-specific pretraining and additional legal vocabulary to adapt the BERT model further to the legal domain. We prepared multiple variants of the BERT model, using both methods and their combination. Comparing our variants of the BERT model, specialized in the legal domain, we conclude that both additional pretraining and vocabulary techniques enhance the BERT model's performance on the Legal Opinions Classification task. Additional legal vocabulary improves BERT's performance on the NER task. Combining the pretraining and vocabulary techniques further improves the final results. Our Legal-Vocab-BERT Model gives the best results on the Legal Opinions Task, outperforming the larger pretrained general Language Models, i.e., BERT-Base and XLNet-Base.
Submitted 12 March, 2021;
originally announced March 2021.
-
Parity Check Codes for Second Order Diversity
Authors:
Aaqib A. Patel,
Abdul Mateen Ahmed,
Mohammed Zafar Ali Khan
Abstract:
Block codes are typically not used for fading channels, as soft-decision decoding is computationally intensive and hard-decision decoding results in performance loss. In this paper we propose a diversity-preserving hard-decision decoding scheme for parity check codes (PCC) over Rayleigh fading channels. The proposed flip decoding scheme has linear complexity in the block length. Theoretical analysis and simulation results verify the correctness of the proposed detection scheme.
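One well-known way to build a flip-style hard-decision decoder for a single parity check code is to flip the least reliable bit whenever the parity check fails (often called Wagner decoding). The sketch below illustrates that idea only; whether it matches the scheme proposed in the paper is an assumption, and the function name `flip_decode_spc` and the BPSK mapping (bit 0 sent as +1, bit 1 sent as -1) are illustrative choices.

```python
def flip_decode_spc(received):
    """Decode one even-parity codeword from real-valued channel outputs.

    Assumes BPSK mapping bit 0 -> +1.0, bit 1 -> -1.0 (an illustrative choice).
    """
    # Hard decision on every symbol: a negative sample is read as bit 1.
    bits = [1 if r < 0 else 0 for r in received]
    # Even parity holds when the number of ones is even.
    if sum(bits) % 2 == 1:
        # Parity failed: flip the bit whose sample is closest to zero,
        # i.e. the least reliable hard decision. One linear pass.
        weakest = min(range(len(received)), key=lambda i: abs(received[i]))
        bits[weakest] ^= 1
    return bits


# Codeword 0 1 1 0 is sent as +1 -1 -1 +1; the second symbol is badly faded.
decoded = flip_decode_spc([0.9, 0.1, -1.2, 0.8])
```

Both the hard-decision pass and the search for the weakest sample visit each symbol once, so the work grows linearly with the block length.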
Submitted 17 December, 2020;
originally announced December 2020.
-
Crowd Size using CommSense Instrument for COVID-19 Echo Period
Authors:
Santu Sardar,
Amit K. Mishra,
Mohammed Z. A. Khan
Abstract:
The period after the COVID-19 wave is called the Echo-period. Estimation of crowd size in an outdoor environment is essential in the Echo-period. Making a simple and flexible working system for the same is the need of the hour. This article proposes and evaluates a non-intrusive, passive, and cost-effective solution for crowd size estimation in an outdoor environment. We call the proposed system LTE communication infrastructure based environment sensing, or LTE-CommSense. This system does not need any active signal transmission, as it uses the LTE transmitted signal. It is therefore a power-efficient, simple, low-footprint device. Importantly, the personal identity of the people in the crowd cannot be obtained using this method. First, the system uses practical data to determine whether the outdoor environment is empty or not. If not, it tries to estimate the number of people occupying the near-range locality. Performance evaluation with practical data confirms the feasibility of this proposed approach.
Submitted 20 October, 2020;
originally announced November 2020.
-
Multiclass Model for Agriculture development using Multivariate Statistical method
Authors:
N Deepa,
Mohammad Zubair Khan,
Prabadevi B,
Durai Raj Vincent P M,
Praveen Kumar Reddy Maddikunta,
Thippa Reddy Gadekallu
Abstract:
Mahalanobis Taguchi System (MTS) is a multivariate statistical method extensively used for feature selection and binary classification problems. The calculation of the orthogonal array and signal-to-noise ratio in MTS makes the algorithm complicated when a larger number of factors are involved in the classification problem. Also, the decision is based on the accuracy of normal and abnormal observations of the dataset. In this paper, a multiclass model using the Improved Mahalanobis Taguchi System (IMTS) is proposed based on normal observations and Mahalanobis distance for agriculture development. Twenty-six input factors relevant to crop cultivation have been identified and clustered into six main factors for the development of the model. The multiclass model is developed with the consideration of the relative importance of the factors. An objective function is defined for the classification of three crops, namely paddy, sugarcane and groundnut. The classification results are verified against the results obtained from the agriculture experts working in the field. The proposed classifier provides 100% accuracy, recall, precision and 0% error rate when compared with other traditional classifier models.
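As a rough illustration of the Mahalanobis distance measured from a group of normal observations, the sketch below handles the two-feature case with a closed-form 2x2 inverse. It is not the IMTS procedure from the paper: the function name `mahalanobis_sq_2d`, the sample data, and the use of the plain (unscaled) squared distance with a sample covariance matrix are all assumptions made for the example.

```python
def mahalanobis_sq_2d(normal_group, x):
    """Squared Mahalanobis distance of x from a 2-feature reference group."""
    n = len(normal_group)
    # Mean of each feature over the reference ("normal") observations.
    m0 = sum(p[0] for p in normal_group) / n
    m1 = sum(p[1] for p in normal_group) / n
    # Sample covariance matrix entries (divide by n - 1).
    s00 = sum((p[0] - m0) ** 2 for p in normal_group) / (n - 1)
    s11 = sum((p[1] - m1) ** 2 for p in normal_group) / (n - 1)
    s01 = sum((p[0] - m0) * (p[1] - m1) for p in normal_group) / (n - 1)
    det = s00 * s11 - s01 * s01  # must be nonzero for the inverse to exist
    d0, d1 = x[0] - m0, x[1] - m1
    # d^T S^{-1} d, using the closed-form inverse of a symmetric 2x2 matrix.
    return (s11 * d0 * d0 - 2 * s01 * d0 * d1 + s00 * d1 * d1) / det


# Hypothetical normal observations with positively correlated features.
normal = [(1, 2), (2, 1), (3, 4), (4, 3)]
along = mahalanobis_sq_2d(normal, (4, 4))    # follows the correlation
against = mahalanobis_sq_2d(normal, (4, 1))  # cuts across the correlation
```

The two test points are equally far from the group mean in Euclidean terms, but the one that cuts across the correlation of the normal group receives the larger distance, which is what makes the measure useful for separating abnormal observations.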
Submitted 7 October, 2020; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Performance Evaluation of LTE-CommSense System for Discrimination of Presence of Multiple Objects in Outdoor Environment
Authors:
Santu Sardar,
Amit K. Mishra,
Mohammed Zafar Ali Khan
Abstract:
LTE-CommSense is a novel instrumentation scheme which analyzes channel-affected reference signals of the LTE downlink signal to obtain knowledge about environmental change. This work presents the characterization of the LTE-CommSense instrument to detect the presence or absence of objects in an outdoor environment. Additionally, we analyze its capability of detecting and distinguishing when multiple objects are present. For performance evaluation and characterization of this instrument, we derive object detection accuracy, FAR, FRR and resolution, which we believe are the most important figures of merit in this case. As the operation of LTE-CommSense is to detect events instead of objects, we redefine the concept of resolution for LTE-CommSense. Two different proposals to represent the redefined resolution, viz. Neyman-Pearson principle based and Cramer-Rao principle based resolution, are presented here. All the performance metrics are derived using practical data captured using an SDR platform modeled as an LTE-CommSense receiver. We observe that LTE-CommSense provides better performance in detecting the presence or absence of objects at near range.
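The abstract does not expand FAR and FRR. The sketch below reads them as false acceptance rate and false rejection rate, which is an assumption, and shows how such rates could be computed from binary presence labels. The function name `far_frr` and the label lists are hypothetical.

```python
def far_frr(truth, predicted):
    """Return (FAR, FRR) for binary labels, where 1 means "object present".

    FAR: share of truly empty cases that were flagged as occupied.
    FRR: share of truly occupied cases that were flagged as empty.
    Both classes must appear in `truth`, otherwise a division by zero occurs.
    """
    false_accepts = sum(1 for t, p in zip(truth, predicted) if t == 0 and p == 1)
    false_rejects = sum(1 for t, p in zip(truth, predicted) if t == 1 and p == 0)
    negatives = truth.count(0)
    positives = truth.count(1)
    return false_accepts / negatives, false_rejects / positives


# Hypothetical ground truth and detector output for eight captures.
truth = [0, 0, 0, 0, 1, 1, 1, 1]
predicted = [0, 1, 0, 0, 1, 1, 0, 1]
far, frr = far_frr(truth, predicted)
```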
Submitted 15 April, 2019;
originally announced May 2019.
-
Vehicle Detection and Classification using LTE-CommSense
Authors:
Santu Sardar,
Amit K. Mishra,
Mohammed Zafar Ali Khan
Abstract:
We demonstrate a vehicle detection and classification method based on a Long Term Evolution (LTE) communication infrastructure based environment sensing instrument, termed LTE-CommSense by the authors. This technology is a novel passive sensing system which focuses on the reference signals embedded in the sub-frames of the LTE resource grid. It compares the received signal with the expected reference signal, extracts the evaluated channel state information (CSI) and analyzes it to estimate the change in the environment. For vehicle detection and subsequent classification, our setup is similar to a passive radar in forward scattering radar (FSR) mode. Instead of processing the radio frequency (RF) signals directly, we take advantage of the processing that happens in an LTE receiver user equipment (UE). We tap into the channel estimation and equalization block and extract the CSI value. The CSI value reflects the property of the communication channel between the communication base station (eNodeB) and the UE. We use CSI values for the with-vehicle and without-vehicle cases in an outdoor open-road environment. Being a receiver-only system, it needs no transmission and is not subject to the related regulations. Therefore, this system is low cost, power efficient and difficult to detect. Also, most of its processing will be done by the existing LTE communication receiver (UE). In this paper, we establish our claim by analyzing field-collected data. A live LTE downlink (DL) signal is captured using an LTE UE modeled with software defined radio (SDR). The detection analysis and classification performance show promising results and confirm that LTE-CommSense is capable of detecting and classifying different types of vehicles in an outdoor road environment.
Submitted 15 April, 2019;
originally announced April 2019.
-
Development and Evaluation of Recurrent Neural Network based Models for Hourly Traffic Volume and AADT Prediction
Authors:
MD Zadid Khan,
Sakib Mahmud Khan,
Mashrur Chowdhury,
Kakan Dey
Abstract:
The prediction of high-resolution hourly traffic volumes of a given roadway is essential for transportation planning. Traditionally, Automatic Traffic Recorders (ATR) are used to collect this hourly volume data. These large datasets are time series data characterized by long-term temporal dependencies and missing values. Regarding the temporal dependencies, all roadways are characterized by seasonal variations that can be weekly, monthly or yearly, depending on the cause of the variation. Regarding the missing data in a time-series sequence, traditional time series forecasting models perform poorly under the influence of seasonal variations. To address this limitation, robust, Recurrent Neural Network (RNN) based, multi-step ahead forecasting models are developed for time-series in this study. The simple RNN, the Gated Recurrent Unit (GRU) and the Long Short-Term Memory (LSTM) units are used to develop the model and evaluate its performance. Two approaches are used to address the missing value issue: masking and imputation, in conjunction with the RNN models. Six different imputation algorithms are then used to identify the best model. The analysis indicates that the LSTM model performs better than simple RNN and GRU models, and imputation performs better than masking to predict future traffic volume. Based on analysis using 92 ATRs, the LSTM-Median model is deemed the best model in all scenarios for hourly traffic volume and AADT prediction, with an average RMSE of 274 and MAPE of 18.91% for hourly traffic volume prediction and average RMSE of 824 and MAPE of 2.10% for AADT prediction.
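To make the imputation step and the two reported error measures concrete, here is a minimal sketch of median imputation, RMSE and MAPE. It is not the paper's pipeline: filling gaps with the median of the whole observed series, the function names, and the toy values are all assumptions made for illustration.

```python
import math
from statistics import median


def impute_median(series):
    """Replace missing entries (None) with the median of the observed values."""
    fill = median(v for v in series if v is not None)
    return [fill if v is None else v for v in series]


def rmse(actual, predicted):
    """Root mean square error, in the same units as the data."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def mape(actual, predicted):
    """Mean absolute percentage error in percent; actual values must be nonzero."""
    ratios = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * ratios / len(actual)


# Hypothetical hourly counts with two missing readings.
hourly = [100, None, 300, 200, None]
filled = impute_median(hourly)

# Hypothetical observed versus predicted volumes.
error_rmse = rmse([100, 200], [110, 190])
error_mape = mape([100, 200], [110, 190])
```

RMSE is expressed in vehicles and is dominated by large absolute misses, while MAPE is relative to the observed volume, which is one reason the two measures can rank the hourly and AADT predictions differently.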
Submitted 25 November, 2018; v1 submitted 15 August, 2018;
originally announced August 2018.
-
Development of Statewide AADT Estimation Model from Short-Term Counts: A Comparative Study for South Carolina
Authors:
Sakib Mahmud Khan,
Sababa Islam,
MD Zadid Khan,
Kakan Dey,
Mashrur Chowdhury,
Nathan Huynh
Abstract:
Annual Average Daily Traffic (AADT) is an important parameter used in traffic engineering analysis. Departments of Transportation (DOTs) continually collect traffic counts using both permanent count stations (i.e., Automatic Traffic Recorders or ATRs) and temporary short-term count stations. In South Carolina, 87% of the ATRs are located on interstates and arterial highways. For most secondary highways (i.e., collectors and local roads), AADT is estimated based on short-term counts. This paper develops AADT estimation models for different roadway functional classes with two machine learning techniques: Artificial Neural Network (ANN) and Support Vector Regression (SVR). The models aim to predict AADT from short-term counts. The results are first compared against each other to identify the best model. Then, the results of the best model are compared against a regression method and a factor-based method. The comparison reveals the superiority of SVR for AADT estimation for different roadway functional classes over all other methods. Among all developed models for different functional roadway classes, the SVR-based model shows a minimum root mean square error (RMSE) of 0.22 and a mean absolute percentage error (MAPE) of 11.3% for the interstate/expressway functional class. This model also shows a higher R-squared value compared to the traditional factor-based model and regression model. SVR models are validated for each roadway functional class using the 2016 ATR data and selected short-term count data collected by the South Carolina Department of Transportation (SCDOT). The validation results show that the SVR-based AADT estimation models can be used by the SCDOT as a reliable option to predict AADT from short-term counts.
Submitted 29 November, 2017;
originally announced December 2017.
-
Utility Maximization Framework for Opportunistic Wireless Electric Vehicle Charging
Authors:
MD Zadid Khan,
Mashrur Chowdhury,
Sakib Mahmud Khan,
Ilya Safro,
Hayato Ushijima-Mwesigwa
Abstract:
This is an extended abstract, it has no separate abstract section
Submitted 6 December, 2017; v1 submitted 22 August, 2017;
originally announced August 2017.
-
Cloud Cognitive Radio HetNets with Limited Feedback
Authors:
Sandeep Babasaheb Dhavane,
Mohammed Zafar Ali Khan
Abstract:
In this paper we propose a cloud-based interweave cognitive radio HetNet that combines the gain of cloud-based radio, namely an increased rate for cell-edge users, with the better spectral efficiency of cognitive radio. Simulation results for limited feedback show an approximately 100% increase in rate for primary and 300% for secondary cell-edge users, at the same outage, in the cloud-based network over a conventional cognitive radio network.
Submitted 19 May, 2016; v1 submitted 20 April, 2016;
originally announced April 2016.
-
Capacity and Spectral Efficiency of Interference Avoiding Cognitive Radio with Imperfect Detection
Authors:
Aaqib Patel,
Md. Zafar Ali Khan,
S. N. Merchant,
U. B. Desai
Abstract:
In this paper, we consider a model in which the unlicensed or Secondary User (SU) equipped with a Cognitive Radio (CR) (together referred to as CR) interweaves its transmission with that of the licensed or Primary User (PU). In this model, when the CR detects the PU to be (i) busy, it does not transmit; and (ii) idle, it transmits. Two situations based on the CR's detection of the PU are considered, where the CR detects the PU (i) perfectly, referred to as the "ideal case"; and (ii) imperfectly, referred to as the "non-ideal case". For both cases we bring out the rate region, the sum capacity of PU and CR, and the spectral efficiency factor, defined as the ratio of the sum capacity of PU and CR to the capacity of PU without CR. We consider the Rayleigh fading channel to provide insight into our results. For the ideal case we study the effect of PU occupancy on the spectral efficiency factor. For the non-ideal case, in addition to the effect of occupancy, we study the effect of false alarm and missed detection on the rate region and spectral efficiency factor. We characterize the set of values of false alarm and missed detection probabilities for which the system benefits, in the form of admissible regions. We show that false alarm has a more profound effect on the spectral efficiency factor than missed detection. We also show that when PU occupancy is small, the effects of both false alarm and missed detection decrease. Finally, for the standard detection techniques, viz. energy detection, matched filter and magnitude squared coherence, we show that the matched filter performs best, followed by magnitude squared coherence, followed by energy detection, with respect to the spectral efficiency factor.
Submitted 15 May, 2012;
originally announced May 2012.
-
Single-Symbol Maximum Likelihood Decodable Linear STBCs
Authors:
Md. Zafar Ali Khan,
B. Sundar Rajan
Abstract:
Space-Time block codes (STBC) from Orthogonal Designs (OD) and Co-ordinate Interleaved Orthogonal Designs (CIOD) have been attracting wider attention due to their amenability to fast (single-symbol) ML decoding, and full rate with full rank over quasi-static fading channels. However, these codes are instances of single-symbol decodable codes, and it is natural to ask whether there exist codes other than STBCs from ODs and CIODs that allow single-symbol decoding.
In this paper, the above question is answered in the affirmative by characterizing all linear STBCs that allow single-symbol ML decoding (not necessarily full-diversity) over quasi-static fading channels, calling them single-symbol decodable designs (SDD). The class SDD includes ODs and CIODs as proper subclasses. Further, among the SDD, the class of those that offer full diversity, called Full-rank SDD (FSDD), is characterized and classified.
Submitted 8 February, 2006;
originally announced February 2006.