-
Towards Leveraging Large Language Models for Automated Medical Q&A Evaluation
Authors:
Jack Krolik,
Herprit Mahal,
Feroz Ahmad,
Gaurav Trivedi,
Bahador Saket
Abstract:
This paper explores the potential of using Large Language Models (LLMs) to automate the evaluation of responses in medical Question and Answer (Q\&A) systems, a crucial form of Natural Language Processing. Traditionally, human evaluation has been indispensable for assessing the quality of these responses. However, manual evaluation by medical professionals is time-consuming and costly. Our study e…
▽ More
This paper explores the potential of using Large Language Models (LLMs) to automate the evaluation of responses in medical Question and Answer (Q\&A) systems, a crucial form of Natural Language Processing. Traditionally, human evaluation has been indispensable for assessing the quality of these responses. However, manual evaluation by medical professionals is time-consuming and costly. Our study examines whether LLMs can reliably replicate human evaluations by using questions derived from patient data, thereby saving valuable time for medical experts. While the findings suggest promising results, further research is needed to address more specific or complex questions that were beyond the scope of this initial investigation.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Predicting suicidal behavior among Indian adults using childhood trauma, mental health questionnaires and machine learning cascade ensembles
Authors:
Akash K Rao,
Gunjan Y Trivedi,
Riri G Trivedi,
Anshika Bajpai,
Gajraj Singh Chauhan,
Vishnu K Menon,
Kathirvel Soundappan,
Hemalatha Ramani,
Neha Pandya,
Varun Dutt
Abstract:
Among young adults, suicide is India's leading cause of death, accounting for an alarming national suicide rate of around 16%. In recent years, machine learning algorithms have emerged to predict suicidal behavior using various behavioral traits. But to date, the efficacy of machine learning algorithms in predicting suicidal behavior in the Indian context has not been explored in literature. In th…
▽ More
Among young adults, suicide is India's leading cause of death, accounting for an alarming national suicide rate of around 16%. In recent years, machine learning algorithms have emerged to predict suicidal behavior using various behavioral traits. But to date, the efficacy of machine learning algorithms in predicting suicidal behavior in the Indian context has not been explored in literature. In this study, different machine learning algorithms and ensembles were developed to predict suicide behavior based on childhood trauma, different mental health parameters, and other behavioral factors. The dataset was acquired from 391 individuals from a wellness center in India. Information regarding their childhood trauma, psychological wellness, and other mental health issues was acquired through standardized questionnaires. Results revealed that cascade ensemble learning methods using a support vector machine, decision trees, and random forest were able to classify suicidal behavior with an accuracy of 95.04% using data from childhood trauma and mental health questionnaires. The study highlights the potential of using these machine learning ensembles to identify individuals with suicidal tendencies so that targeted interinterventions could be provided efficiently.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
FPGA Implementation of Simplified Spiking Neural Network
Authors:
Shikhar Gupta,
Arpan Vyas,
Gaurav Trivedi
Abstract:
Spiking Neural Networks (SNN) are third-generation Artificial Neural Networks (ANN) which are close to the biological neural system. In recent years SNN has become popular in the area of robotics and embedded applications, therefore, it has become imperative to explore its real-time and energy-efficient implementations. SNNs are more powerful than their predecessors because they encode temporal in…
▽ More
Spiking Neural Networks (SNN) are third-generation Artificial Neural Networks (ANN) which are close to the biological neural system. In recent years SNN has become popular in the area of robotics and embedded applications, therefore, it has become imperative to explore its real-time and energy-efficient implementations. SNNs are more powerful than their predecessors because they encode temporal information and use biologically plausible plasticity rules. In this paper, a simpler and computationally efficient SNN model using FPGA architecture is described. The proposed model is validated on a Xilinx Virtex 6 FPGA and analyzes a fully connected network which consists of 800 neurons and 12,544 synapses in real-time.
△ Less
Submitted 2 October, 2020;
originally announced October 2020.
-
PowerPlanningDL: Reliability-Aware Framework for On-Chip Power Grid Design using Deep Learning
Authors:
Sukanta Dey,
Sukumar Nandi,
Gaurav Trivedi
Abstract:
With the increase in the complexity of chip designs, VLSI physical design has become a time-consuming task, which is an iterative design process. Power planning is that part of the floorplanning in VLSI physical design where power grid networks are designed in order to provide adequate power to all the underlying functional blocks. Power planning also requires multiple iterative steps to create th…
▽ More
With the increase in the complexity of chip designs, VLSI physical design has become a time-consuming task, which is an iterative design process. Power planning is that part of the floorplanning in VLSI physical design where power grid networks are designed in order to provide adequate power to all the underlying functional blocks. Power planning also requires multiple iterative steps to create the power grid network while satisfying the allowed worst-case IR drop and Electromigration (EM) margin. For the first time, this paper introduces Deep learning (DL)-based framework to approximately predict the initial design of the power grid network, considering different reliability constraints. The proposed framework reduces many iterative design steps and speeds up the total design cycle. Neural Network-based multi-target regression technique is used to create the DL model. Feature extraction is done, and the training dataset is generated from the floorplans of some of the power grid designs extracted from the IBM processor. The DL model is trained using the generated dataset. The proposed DL-based framework is validated using a new set of power grid specifications (obtained by perturbing the designs used in the training phase). The results show that the predicted power grid design is closer to the original design with minimal prediction error (~2%). The proposed DL-based approach also improves the design cycle time with a speedup of ~6X for standard power grid benchmarks.
△ Less
Submitted 24 July, 2020; v1 submitted 4 May, 2020;
originally announced May 2020.
-
An Interactive Tool for Natural Language Processing on Clinical Text
Authors:
Gaurav Trivedi,
Phuong Pham,
Wendy Chapman,
Rebecca Hwa,
Janyce Wiebe,
Harry Hochheiser
Abstract:
Natural Language Processing (NLP) systems often make use of machine learning techniques that are unfamiliar to end-users who are interested in analyzing clinical records. Although NLP has been widely used in extracting information from clinical text, current systems generally do not support model revision based on feedback from domain experts.
We present a prototype tool that allows end users to…
▽ More
Natural Language Processing (NLP) systems often make use of machine learning techniques that are unfamiliar to end-users who are interested in analyzing clinical records. Although NLP has been widely used in extracting information from clinical text, current systems generally do not support model revision based on feedback from domain experts.
We present a prototype tool that allows end users to visualize and review the outputs of an NLP system that extracts binary variables from clinical text. Our tool combines multiple visualizations to help the users understand these results and make any necessary corrections, thus forming a feedback loop and helping improve the accuracy of the NLP models. We have tested our prototype in a formative think-aloud user study with clinicians and researchers involved in colonoscopy research. Results from semi-structured interviews and a System Usability Scale (SUS) analysis show that the users are able to quickly start refining NLP models, despite having very little or no experience with machine learning. Observations from these sessions suggest revisions to the interface to better support review workflow and interpretation of results.
△ Less
Submitted 7 July, 2017; v1 submitted 6 July, 2017;
originally announced July 2017.
-
A Representation of Symmetry Generators for the Type IIB Superstring on a Plane Wave in the U(4) Formalism
Authors:
Gautam Trivedi
Abstract:
We calculate the symmetry currents for the type IIB superstring on a maximally supersymmetric plane wave background using the N=(2,2) superconformally covariant U(4) formulation developed by Berkovits, Maldacena and Maoz. An explicit realization of the U(4) generators together with 16 fermionic generators is obtained in terms of the N=(2,2) worldsheet fields. Because the action is no longer quad…
▽ More
We calculate the symmetry currents for the type IIB superstring on a maximally supersymmetric plane wave background using the N=(2,2) superconformally covariant U(4) formulation developed by Berkovits, Maldacena and Maoz. An explicit realization of the U(4) generators together with 16 fermionic generators is obtained in terms of the N=(2,2) worldsheet fields. Because the action is no longer quadratic, we use a light-cone version to display the currents in terms of the covariant worldsheet variables.
△ Less
Submitted 7 April, 2003; v1 submitted 3 March, 2003;
originally announced March 2003.
-
Correlation Functions in Berkovits' Pure Spinor Formulation
Authors:
Gautam Trivedi
Abstract:
We use Berkovits' pure spinor quantization to compute various three-point tree correlation functions in position-space for the Type IIB superstring. We solve the constraint equations for the vertex operators and obtain explicit expressions for the graviton and axion components of the vertex operators. Using these operators we compute tree level correlation functions in flat space and discuss the…
▽ More
We use Berkovits' pure spinor quantization to compute various three-point tree correlation functions in position-space for the Type IIB superstring. We solve the constraint equations for the vertex operators and obtain explicit expressions for the graviton and axion components of the vertex operators. Using these operators we compute tree level correlation functions in flat space and discuss their extension to the AdS5 X S5 background.
△ Less
Submitted 12 November, 2002; v1 submitted 21 May, 2002;
originally announced May 2002.