-
Predictive Analysis of Tuberculosis Treatment Outcomes Using Machine Learning: A Karnataka TB Data Study at a Scale
Authors:
SeshaSai Nath Chinagudaba,
Darshan Gera,
Krishna Kiran Vamsi Dasu,
Uma Shankar S,
Kiran K,
Anil Singarajpure,
Shivayogappa. U,
Somashekar N,
Vineet Kumar Chadda,
Sharath B N
Abstract:
Tuberculosis (TB) remains a global health threat, ranking among the leading causes of mortality worldwide. In this context, machine learning (ML) has emerged as a transformative force, providing innovative solutions to the complexities associated with TB treatment. This study explores how machine learning, especially with tabular data, can be used to predict TB treatment outcomes more accurately. It transforms this prediction task into a binary classification problem, generating risk scores from patient data sourced from NIKSHAY, India's national TB control program, which includes over 500,000 patient records.
Data preprocessing is a critical component of the study, and the model achieved a recall of 98% and an AUC-ROC score of 0.95 on the validation set, which includes 20,000 patient records. We also explore the use of Natural Language Processing (NLP) for improved model learning. Our results, corroborated by various metrics and ablation studies, validate the effectiveness of our approach. The study concludes by discussing the potential ramifications of our research on TB eradication efforts and proposing potential avenues for future work. This study marks a significant stride in the battle against TB, showcasing the potential of machine learning in healthcare.
Submitted 13 March, 2024;
originally announced March 2024.
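The abstract above reports recall and AUC-ROC for a binary classifier that outputs risk scores. As a minimal sketch of what those two metrics measure, the following pure-Python functions compute them from labels and scores. The function names, the 0.5 threshold, and the toy data are assumptions made for illustration; they are not taken from the paper, and this is not the authors' evaluation code.

```python
def recall_at_threshold(labels, scores, threshold=0.5):
    """Fraction of true positives (label 1) whose score reaches the threshold.

    The 0.5 default is an assumed cut-off, not one reported by the paper.
    """
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    return tp / (tp + fn)


def auc_roc(labels, scores):
    """AUC-ROC as the probability that a random positive outranks a random negative.

    Ties count as half a win. This pairwise form is O(n_pos * n_neg), which is
    fine for a small illustration but slow for large validation sets.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 marks the outcome the classifier should flag.
toy_labels = [1, 1, 0, 0, 1]
toy_scores = [0.9, 0.4, 0.3, 0.6, 0.8]

toy_recall = recall_at_threshold(toy_labels, toy_scores)
toy_auc = auc_roc(toy_labels, toy_scores)
```

Recall depends on the chosen threshold, while AUC-ROC summarises ranking quality across all thresholds, which is why the two are usually reported together.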
-
BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
Authors:
Jie He,
Simon Chi Lok U,
Víctor Gutiérrez-Basulto,
Jeff Z. Pan
Abstract:
Unsupervised commonsense reasoning (UCR) is becoming increasingly popular as the construction of commonsense reasoning datasets is expensive, and they are inevitably limited in their scope. A popular approach to UCR is to fine-tune language models with external knowledge (e.g., knowledge graphs), but this usually requires a large number of training examples. In this paper, we propose to transform the downstream multiple choice question answering task into a simpler binary classification task by ranking all candidate answers according to their reasonableness. To this end, for training the model, we convert the knowledge graph triples into reasonable and unreasonable texts. Extensive experimental results show the effectiveness of our approach on various multiple choice question answering benchmarks. Furthermore, compared with existing UCR approaches using KGs, ours is less data hungry. Our code is available at https://github.com/probe2/BUCA.
Submitted 11 April, 2025; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Securing the data in cloud using Algebra Homomorphic Encryption scheme based on updated Elgamal(AHEE)
Authors:
Fahina,
Shwetha U,
Poorna,
Supriya,
Rama Moorthy H,
Dr. Vasudeva
Abstract:
Cloud computing is a broad and diverse phenomenon. Users can store huge amounts of data in cloud storage for future use. Most cloud service providers store data either in plaintext or in a secured form, but the client does not know which method is used. Homomorphic encryption allows operations on ciphertext, generating an encrypted result which, when decrypted, matches the result of the same operations performed on the plaintext. This is sometimes a desirable feature in modern communication system architectures. There are several homomorphic algorithms; one of them is AHEE. Users need to use their own encryption algorithm to secure their data if required. The data needs to be decrypted whenever it is to be processed. In this paper, we focus on providing security to the client by applying the AHEE algorithm at the client side to any data to be stored in the cloud.
Submitted 1 February, 2022;
originally announced February 2022.
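The homomorphic property described in the abstract above (operating on ciphertexts yields the encryption of the operated plaintexts) can be illustrated with textbook ElGamal, which is multiplicatively homomorphic. The sketch below is not the AHEE scheme from the paper, whose details the abstract does not give; the parameters `P` and `G` and all function names are assumptions chosen for illustration, and the modulus is far too small for real use.

```python
import secrets

# Toy parameters, assumed for illustration only: NOT secure.
P = 467  # small prime modulus
G = 2    # base element


def keygen():
    """Return (private_key, public_key) for textbook ElGamal over Z_P*."""
    x = secrets.randbelow(P - 2) + 1  # private key in [1, P-2]
    return x, pow(G, x, P)


def encrypt(public_key, message):
    """Encrypt an integer message in [1, P-1] as the pair (g^k, m * h^k) mod P."""
    k = secrets.randbelow(P - 2) + 1  # fresh randomness per ciphertext
    return pow(G, k, P), (message * pow(public_key, k, P)) % P


def decrypt(private_key, ciphertext):
    """Recover m = c2 * (c1^x)^(-1) mod P, inverting via Fermat's little theorem."""
    c1, c2 = ciphertext
    shared = pow(c1, private_key, P)
    return (c2 * pow(shared, P - 2, P)) % P


def multiply(ct_a, ct_b):
    """Component-wise product of two ciphertexts; decrypts to the product of the plaintexts mod P."""
    return (ct_a[0] * ct_b[0]) % P, (ct_a[1] * ct_b[1]) % P


private_key, public_key = keygen()
ct_product = multiply(encrypt(public_key, 6), encrypt(public_key, 7))
product = decrypt(private_key, ct_product)  # equals (6 * 7) mod P
```

The party that calls `multiply` never sees 6, 7, or the private key, which is the point of computing on encrypted data held by an untrusted store.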
-
On the Secrecy Capacity of 2-user Gaussian Z-Interference Channel with Shared Key
Authors:
Somalatha U,
Parthajit Mohapatra
Abstract:
In this paper, the role of a finite-rate secret key in enhancing the secrecy performance of the system is studied when users operate in interference-limited scenarios. To address this problem, a 2-user Gaussian Z-IC with a secrecy constraint at the receiver is considered. One of the fundamental problems here is how to use the secret key as part of the encoding process. The paper proposes novel achievable schemes, which differ from each other in how the key is used in the encoding process. The first achievable scheme uses one part of the key for a one-time pad and the remaining part for wiretap coding. The encoding is performed such that the receiver experiencing interference can decode some part of the interference without violating the secrecy constraint. As a special case of the derived result, one can obtain the secrecy rate region when the key is used entirely for the one-time pad or as part of the wiretap coding. The second scheme uses the shared key to encrypt the message with a one-time pad and, in contrast to the previous case, no interference is decoded at the receiver. The paper also derives an outer bound on the sum rate and on the secrecy rate of the transmitter that causes interference. The main novelty in deriving the outer bounds lies in the selection of the side information provided to the receiver and in the use of the secrecy constraint at the receiver. The derived outer bounds are found to be tight depending on the channel conditions and the rate of the key. The scaling behaviour of the key rate is also explored for the different schemes using the notion of secure GDOF. The optimality of the different schemes is characterized for some specific cases. The developed results show the importance of key rate splitting in enhancing the secrecy performance of the system when users operate in an interference-limited environment.
Submitted 19 May, 2021;
originally announced May 2021.