-
NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment
Authors:
Antonia Karamolegkou,
Angana Borah,
Eunjung Cho,
Sagnik Ray Choudhury,
Martina Galletti,
Rajarshi Ghosh,
Pranav Gupta,
Oana Ignat,
Priyanka Kargupta,
Neema Kotonya,
Hemank Lamba,
Sun-Joo Lee,
Arushi Mangla,
Ishani Mondal,
Deniz Nazarova,
Poli Nemkova,
Dina Pisarevskaya,
Naquee Rizwan,
Nazanin Sabri,
Dominik Stammbach,
Anna Steinberg,
David Tomás,
Steven R Wilson,
Bowen Yi,
Jessica H Zhu
, et al. (7 additional authors not shown)
Abstract:
Recent advancements in large language models (LLMs) have unlocked unprecedented possibilities across a range of applications. However, as a community, we believe that the field of Natural Language Processing (NLP) has a growing need to approach deployment with greater intentionality and responsibility. In alignment with the broader vision of AI for Social Good (Tomašev et al., 2020), this paper examines the role of NLP in addressing pressing societal challenges. Through a cross-disciplinary analysis of social goals and emerging risks, we highlight promising research directions and outline challenges that must be addressed to ensure responsible and equitable progress in NLP4SG research.
Submitted 28 May, 2025;
originally announced May 2025.
-
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)
Authors:
Subba Reddy Oota,
Akshett Jindal,
Ishani Mondal,
Khushbu Pahwa,
Satya Sai Srinath Namburi,
Manish Shrivastava,
Maneesh Singh,
Bapi S. Raju,
Manish Gupta
Abstract:
Transformer-based language models, though not explicitly trained to mimic brain recordings, have demonstrated surprising alignment with brain activity. Progress in these models, through increased size, instruction-tuning, and multimodality, has led to better representational alignment with neural data. Recently, a new class of instruction-tuned multimodal LLMs (MLLMs) has emerged, showing remarkable zero-shot capabilities in open-ended multimodal vision tasks. However, it is unknown whether MLLMs, when prompted with natural instructions, lead to better brain alignment and effectively capture instruction-specific representations. To address this, we first investigate brain alignment, i.e., measuring the degree of predictivity of neural visual activity using text output response embeddings from MLLMs as participants engage in watching natural scenes. Experiments with 10 different instructions show that MLLMs exhibit significantly better brain alignment than vision-only models and perform comparably to non-instruction-tuned multimodal models like CLIP. We also find that while these MLLMs are effective at generating high-quality responses suitable to the task-specific instructions, not all instructions are relevant for brain alignment. Further, by varying instructions, we make the MLLMs encode instruction-specific visual concepts related to the input image. This analysis shows that MLLMs effectively capture count-related and recognition-related concepts, demonstrating strong alignment with brain activity. Notably, the majority of the explained variance of the brain encoding models is shared between MLLM embeddings of image captioning and other instructions. These results suggest that enhancing MLLMs' ability to capture task-specific information could lead to better differentiation between various types of instructions, thereby improving their precision in predicting brain responses.
Submitted 26 May, 2025;
originally announced May 2025.
-
Analysis of non-linear fractal functions on PCF self-similar sets
Authors:
Aaryan Dharmesh Shah,
Sangita Jha,
Anarul Islam Mondal
Abstract:
This article deals with (1) the construction of a general non-linear fractal interpolation function on PCF self-similar sets, (2) the energy and normal derivatives of uniform non-linear fractal functions, and (3) estimation of the bound on the box dimension of the proposed fractal functions on the Sierpinski gasket and the von Koch curve. Here, we present a more general framework to construct the attractor and the functions on the PCF self-similar sets using the Edelstein contraction, which broadens the class of functions. En route, we calculate the upper and lower box dimensions of the graph of the non-linear interpolant. Finally, we provide several graphical and numerical examples to illustrate the construction and estimate the dimensions for different data sets.
Submitted 15 May, 2025;
originally announced May 2025.
-
Complex electronic topography and magnetotransport in an in-plane ferromagnetic kagome metal
Authors:
Anup Pradhan Sakhya,
Richa Pokharel Madhogaria,
Barun Ghosh,
Nabil Atlam,
Milo Sprague,
Mazharul Islam Mondal,
Himanshu Sheokand,
Arun K. Kumay,
Shirin Mozaffari,
Rui Xue,
Yong P. Chen,
David G. Mandrus,
Arun Bansil,
Madhab Neupane
Abstract:
The intricate interplay between flat bands, Dirac cones, and magnetism in kagome materials has recently attracted significant attention from materials scientists, particularly in compounds belonging to the RMn6Sn6 family (R = Sc, Y, rare earths), due to their inherent magnetic frustration. Here, we present a detailed investigation of the ferromagnetic (FM) kagome magnet ScMn6(Sn0.78Ga0.22)6 using angle-resolved photoemission spectroscopy (ARPES), magnetotransport measurements, and density functional theory (DFT) calculations. Our findings reveal a paramagnetic-to-FM transition at 375 K, with the in-plane direction serving as the easy magnetization axis. Notably, ARPES measurements reveal a Dirac cone near the Fermi energy, while the Hall resistivity exhibits a substantial contribution from the anomalous Hall effect. Additionally, we observe a flat band spanning a substantial portion of the Brillouin zone, arising from the destructive interference of wave functions in the Mn kagome lattice. Theoretical calculations reveal that the gap in the Dirac cone can be modulated by altering the orientation of the magnetic moment. An out-of-plane orientation produces a gap of approximately 15 meV, while an in-plane alignment leads to a gapless state, as corroborated by ARPES measurements. This comprehensive analysis provides valuable insights into the electronic structure of magnetic kagome materials and paves the way for exploring novel topological phases in this material class.
Submitted 14 May, 2025;
originally announced May 2025.
-
Electronic structure of a layered altermagnetic compound CoNb4Se8
Authors:
Anup Pradhan Sakhya,
Mazharul Islam Mondal,
Milo Sprague,
Resham Babu Regmi,
Arun K Kumay,
Himanshu Sheokand,
Igor I. Mazin,
Nirmal J. Ghimire,
Madhab Neupane
Abstract:
Recently, there has been a growing interest in altermagnetism, a novel form of magnetism, characterized by unique spin-splitting even in the absence of both net magnetic moments and spin-orbit coupling. Despite numerous theoretical predictions, experimental evidence of such spin-splitting in real materials remains limited. In this study, we use angle-resolved photoemission spectroscopy (ARPES) combined with density functional theory (DFT) calculations to investigate the electronic band structure of the altermagnet candidate CoNb4Se8. This material features an ordered sublattice of intercalated Co atoms within NbSe2 layers. Magnetization and electrical resistivity measurements reveal the onset of antiferromagnetism below 168 K. Temperature-dependent ARPES data, supported by DFT calculations, uncover spin-split bands along the M-Γ-M high-symmetry direction. The observation of spin splitting in this high-temperature altermagnet opens new avenues for exploring its electronic properties and potential applications in spintronic technologies.
Submitted 20 March, 2025;
originally announced March 2025.
-
Diverse electronic topography in a distorted kagome metal LaTi3Bi4
Authors:
Anup Pradhan Sakhya,
Brenden R. Ortiz,
Barun Ghosh,
Milo Sprague,
Mazharul Islam Mondal,
Matthew Matzelle,
Nabil Atlam,
Arun K Kumay,
David G. Mandrus,
Jonathan D. Denlinger,
Arun Bansil,
Madhab Neupane
Abstract:
Recent reports on a family of kagome metals of the form LnTi3Bi4 (Ln = lanthanide) have stoked interest due to the combination of highly anisotropic magnetism and a rich electronic structure. The electronic structure near the Fermi level is proposed to exhibit Dirac points and van Hove singularities. In this manuscript, we use angle-resolved photoemission spectroscopy measurements in combination with density functional theory calculations to investigate the electronic structure of a newly discovered kagome metal, LaTi3Bi4. Our results reveal multiple van Hove singularities (VHSs), with one VHS located in the vicinity of the Fermi level. We clearly observe two flat bands, which originate from the destructive interference of wave functions within the Ti kagome motif. These flat bands and VHSs originate from Ti d orbitals and are very responsive to the polarization of the incident beam. We notice a significant anisotropy in the electronic structure, resulting from the breaking of six-fold rotational symmetry in this material. Our findings establish this new family of Ti-based kagome materials as a promising platform to explore novel emerging phenomena in the wider LnTi3Bi4 (Ln = lanthanide) family of materials.
Submitted 19 March, 2025;
originally announced March 2025.
-
Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations
Authors:
Ishani Mondal,
Jack W. Stokes,
Sujay Kumar Jauhar,
Longqi Yang,
Mengting Wan,
Xiaofeng Xu,
Xia Song,
Jennifer Neville
Abstract:
LLMs often fail to meet the specialized needs of distinct user groups due to their one-size-fits-all training paradigm (Lucy et al., 2024), and there is limited research on what personalization aspects each group expects. To address these limitations, we propose a group-aware personalization framework, Group Preference Alignment (GPA), that identifies context-specific variations in conversational preferences across user groups and then steers LLMs to address those preferences. Our approach consists of two steps: (1) Group-Aware Preference Extraction, where maximally divergent user-group preferences are extracted from real-world conversation logs and distilled into interpretable rubrics, and (2) Tailored Response Generation, which leverages these rubrics through two methods: (a) Context-Tuned Inference (GPA-CT), which dynamically adjusts responses via context-dependent prompt instructions, and (b) Rubric-Finetuning Inference (GPA-FT), which uses the rubrics to generate contrastive synthetic data for personalization of group-specific models via alignment. Experiments demonstrate that our framework significantly improves alignment of the output with respect to user preferences and outperforms baseline methods, while maintaining robust performance on standard benchmarks.
Submitted 11 March, 2025;
originally announced March 2025.
-
Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators
Authors:
Feng Gu,
Zongxia Li,
Carlos Rafael Colon,
Benjamin Evans,
Ishani Mondal,
Jordan Lee Boyd-Graber
Abstract:
Event annotation is important for identifying market changes, monitoring breaking news, and understanding sociological trends. Although expert annotators set the gold standards, human coding is expensive and inefficient. Unlike information extraction experiments that focus on single contexts, we evaluate a holistic workflow that removes irrelevant documents, merges documents about the same event, and annotates the events. Although LLM-based automated annotations are better than traditional TF-IDF-based methods or Event Set Curation, they are still not reliable annotators compared to human experts. However, adding LLMs to assist experts for Event Set Curation can reduce the time and mental effort required for Variable Annotation. When using LLMs to extract event variables to assist expert annotators, they agree more with the extracted variables than fully automated LLMs for annotation.
Submitted 5 April, 2025; v1 submitted 9 March, 2025;
originally announced March 2025.
-
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement
Authors:
Ishani Mondal,
Zongxia Li,
Yufang Hou,
Anandhavelu Natarajan,
Aparna Garimella,
Jordan Boyd-Graber
Abstract:
Automating the creation of scientific diagrams from academic papers can significantly streamline the development of tutorials, presentations, and posters, thereby saving time and accelerating the process. Current text-to-image models struggle with generating accurate and visually appealing diagrams from long-context inputs. We propose SciDoc2Diagram, a task that extracts relevant information from scientific papers and generates diagrams, along with a benchmarking dataset, SciDoc2DiagramBench. We develop a multi-step pipeline SciDoc2Diagrammer that generates diagrams based on user intentions using intermediate code generation. We observed that initial diagram drafts were often incomplete or unfaithful to the source, leading us to develop SciDoc2Diagrammer-Multi-Aspect-Feedback (MAF), a refinement strategy that significantly enhances factual correctness and visual appeal and outperforms existing models on both automatic and human judgement.
Submitted 15 October, 2024; v1 submitted 28 September, 2024;
originally announced September 2024.
-
Observation of paramagnetic spin-degeneracy lifting in EuZn2Sb2
Authors:
Milo X. Sprague,
Sabin Regmi,
Barun Ghosh,
Anup Pradhan Sakhya,
Mazharul Islam Mondal,
Iftakhar Bin Elius,
Nathan Valadez,
Bahadur Singh,
Tetiana Romanova,
Dariusz Kaczorowski,
Arun Bansil,
Madhab Neupane
Abstract:
Taken together, time-reversal and spatial inversion symmetries impose a two-fold spin degeneracy of the electronic states in crystals. In centrosymmetric materials, this degeneracy can be lifted by introducing magnetism, either via an externally applied field or through internal magnetization. However, a correlated alignment of spins, even in the paramagnetic phase, can lift the spin degeneracy of electronic states. Here, we report an in-depth study of the electronic band structure of the Eu-ternary pnictide EuZn2Sb2 through a combination of high-resolution angle-resolved photoemission spectroscopy measurements and first principles calculations. An analysis of the photoemission lineshapes over a range of incident photon energies and sample temperatures is shown to reveal the presence of band spin degeneracy-lifting in the paramagnetic phase. Our ARPES results are in good agreement with theoretical ferromagnetic-phase calculations, which indicates the importance of ferromagnetic fluctuations in the system. Through our calculations, we predict that spin-polarized bands in EuZn2Sb2 generate a single pair of Weyl nodes. Our observation of band-splitting in EuZn2Sb2 provides a key step toward realizing time-reversal symmetry breaking physics in the absence of long-range magnetic order.
Submitted 19 July, 2024;
originally announced July 2024.
-
Is your benchmark truly adversarial? AdvScore: Evaluating Human-Grounded Adversarialness
Authors:
Yoo Yeon Sung,
Maharshi Gor,
Eve Fleisig,
Ishani Mondal,
Jordan Lee Boyd-Graber
Abstract:
Adversarial datasets should validate AI robustness by providing samples on which humans perform well, but models do not. However, as models evolve, datasets can become obsolete. Measuring whether a dataset remains adversarial is hindered by the lack of a standardized metric for measuring adversarialness. We propose AdvScore, a human-grounded evaluation metric that assesses a dataset's adversarialness by capturing models' and humans' varying abilities while also identifying poor examples. We then use AdvScore to motivate a new dataset creation pipeline for realistic and high-quality adversarial samples, enabling us to collect an adversarial question answering (QA) dataset, AdvQA. We apply AdvScore using 9,347 human responses and ten language models' predictions to track model improvement over five years, from 2020 to 2024. AdvScore thus provides guidance for achieving robustness comparable with human capabilities. Furthermore, it helps determine to what extent adversarial datasets continue to pose challenges, ensuring that, rather than reflecting outdated or overly artificial difficulties, they effectively test model capabilities.
Submitted 18 February, 2025; v1 submitted 24 June, 2024;
originally announced June 2024.
-
Electronic structure of a nodal line semimetal candidate TbSbTe
Authors:
Iftakhar Bin Elius,
Jacob F Casey,
Sabin Regmi,
Volodymyr Buturlim,
Anup Pradhan Sakhya,
Milo Sprague,
Mazharul Islam Mondal,
Nathan Valadez,
Arun K Kumay,
Justin Scrivens,
Yenugonda Venkateswara,
Shovan Dan,
Tetiana Romanova,
Arjun K Pathak,
Krzysztof Gofryk,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
The LnSbTe (Ln = lanthanides) family, like isostructural ZrSiS-type compounds, has emerged as a fertile playground for exploring the interaction of electronic correlations and magnetic ordering with the nodal line band topology. Here, we report a detailed electronic band structure investigation of TbSbTe, corroborated by electrical transport, thermodynamic, and magnetic studies. Temperature-dependent magnetic susceptibility and thermodynamic transport studies indicate the onset of antiferromagnetic ordering below TN = 5.1 K. The electronic band structure study, carried out with high-resolution angle-resolved photoemission spectroscopy (ARPES) measurements aided by density functional theory based first-principles calculations, reveals the presence of nodal lines along the Γ-X high-symmetry direction, forming a diamond-shaped nodal plane around the Γ high-symmetry point. A strongly photon-energy-dependent nodal feature located at the X point of the surface Brillouin zone, indicating an extended nodal line along the X-R direction, is also observed. This study elucidates the intricate interplay among symmetry-protected band characteristics, the influence of spin-orbit coupling, magnetism, and topological properties.
Submitted 13 June, 2024;
originally announced June 2024.
-
How much reliable is ChatGPT's prediction on Information Extraction under Input Perturbations?
Authors:
Ishani Mondal,
Abhilasha Sancheti
Abstract:
In this paper, we assess the robustness (reliability) of ChatGPT under input perturbations for one of the most fundamental tasks of Information Extraction (IE), i.e., Named Entity Recognition (NER). Amid the hype, the majority of researchers have vouched for its language understanding and generation capabilities, but little attention has been paid to understanding its robustness: how input perturbations affect 1) the predictions, 2) the confidence of predictions, and 3) the quality of the rationale behind its predictions. We perform a systematic analysis of ChatGPT's robustness (under both zero-shot and few-shot setups) on two NER datasets using both automatic and human evaluation. Based on automatic evaluation metrics, we find that 1) ChatGPT is more brittle on Drug or Disease replacements (rare entities) compared to perturbations on widely known Person or Location entities, 2) the quality of explanations for the same entity differs considerably under different types of "Entity-Specific" and "Context-Specific" perturbations, and the quality can be significantly improved using in-context learning, and 3) it is overconfident for the majority of its incorrect predictions, and hence could mislead end-users.
Submitted 7 April, 2024;
originally announced April 2024.
-
PEDANTS: Cheap but Effective and Interpretable Answer Equivalence
Authors:
Zongxia Li,
Ishani Mondal,
Yijun Liang,
Huy Nghiem,
Jordan Lee Boyd-Graber
Abstract:
Question answering (QA) can only make progress if we know if an answer is correct, but current answer correctness (AC) metrics struggle with verbose, free-form answers from large language models (LLMs). There are two challenges with current short-form QA evaluations: a lack of diverse styles of evaluation data and an over-reliance on expensive and slow LLMs. LLM-based scorers correlate better with humans, but this expensive task has only been tested on limited QA datasets. We rectify these issues by providing rubrics and datasets for evaluating machine QA adopted from the Trivia community. We also propose an efficient and interpretable QA evaluation that is more stable than exact match and neural methods (BERTScore).
Submitted 11 October, 2024; v1 submitted 16 February, 2024;
originally announced February 2024.
-
MunTTS: A Text-to-Speech System for Mundari
Authors:
Varun Gumma,
Rishav Hada,
Aditya Yadavalli,
Pamir Gogoi,
Ishani Mondal,
Vivek Seshadri,
Kalika Bali
Abstract:
We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austro-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to build a speech synthesis system. We begin our study by gathering a substantial dataset of Mundari text and speech and training end-to-end speech models. We also delve into the methods used for training our models, ensuring they are efficient and effective despite the data constraints. We evaluate our system with native speakers and objective metrics, demonstrating its potential as a tool for preserving and promoting the Mundari language in the digital age.
Submitted 28 January, 2024;
originally announced January 2024.
-
CFMatch: Aligning Automated Answer Equivalence Evaluation with Expert Judgments For Open-Domain Question Answering
Authors:
Zongxia Li,
Ishani Mondal,
Yijun Liang,
Huy Nghiem,
Jordan Boyd-Graber
Abstract:
Question answering (QA) can only make progress if we know if an answer is correct, but for many of the most challenging and interesting QA examples, current evaluation metrics to determine answer equivalence (AE) often do not align with human judgments, particularly for more verbose, free-form answers from large language models (LLMs). There are two challenges: a lack of data and models that are too big. LLM-based scorers can correlate better with human judges, but this task has only been tested on limited QA datasets, and even when available, updating the model is limited because LLMs are large and often expensive. We rectify both of these issues by providing clear and consistent guidelines for evaluating AE in machine QA adopted from professional human QA contests. We also introduce a combination of standard evaluation and a more efficient, robust, and lightweight discriminate AE classifier-based matching method (CFMatch, smaller than 1 MB), trained and validated to more accurately evaluate answer correctness in accordance with adopted expert AE rules that are more aligned with human judgments.
Submitted 29 June, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Authors:
Yoo Yeon Sung,
Ishani Mondal,
Jordan Boyd-Graber
Abstract:
Dynamic adversarial question generation, where humans write examples to stump a model, aims to create examples that are realistic and informative. However, the advent of large language models (LLMs) has been a double-edged sword for human authors: more people are interested in seeing and pushing the limits of these models, but because the models are so much stronger an opponent, they are harder to defeat. To understand how these models impact the adversarial question writing process, we enrich the writing guidance with LLMs and retrieval models so that authors can reason about why their questions are not adversarial. While authors could create interesting, challenging adversarial questions, they sometimes resort to tricks that result in poor questions that are ambiguous, subjective, or confusing not just to a computer but also to humans. To address these issues, we propose new metrics and incentives for eliciting good, challenging questions and present a new dataset of adversarially authored questions.
Submitted 20 January, 2024;
originally announced January 2024.
-
Observation of multiple van Hove singularities and correlated electronic states in a new topological ferromagnetic kagome metal NdTi3Bi4
Authors:
Mazharul Islam Mondal,
Anup Pradhan Sakhya,
Milo Sprague,
Brenden R. Ortiz,
Matthew Matzelle,
Barun Ghosh,
Nathan Valadez,
Iftakhar Bin Elius,
Arun Bansil,
Madhab Neupane
Abstract:
Kagome materials have attracted enormous research interest recently owing to their diverse topological phases and manifestation of electronic correlation due to their inherent geometric frustration. Here, we report the electronic structure of a new distorted kagome metal NdTi3Bi4 using a combination of angle-resolved photoemission spectroscopy (ARPES) measurements and density functional theory (DFT) calculations. We discover the presence of two flat bands which are found to originate from the kagome structure formed by Ti atoms with major contribution from Ti dxy and Ti dx2-y2 orbitals. We also observed multiple van Hove singularities (VHSs) in its electronic structure, with one VHS lying near the Fermi level EF. In addition, the presence of a surface Dirac cone at the Γ point and a linear Dirac-like state at the K point with its Dirac node lying very close to EF indicates its topological nature. Our findings reveal NdTi3Bi4 as a potential material to understand the interplay of topology, magnetism, and electron correlation.
Submitted 19 November, 2023;
originally announced November 2023.
-
Complex Fermiology and Electronic Structure of Antiferromagnet EuSnP
Authors:
Milo Sprague,
Anup Pradhan Sakhya,
Sabin Regmi,
Mazharul Islam Mondal,
Iftakhar Bin Elius,
Nathan Valadez,
Kali Booth,
Tetiana Romanova,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
We studied the electronic structure of a layered antiferromagnetic metal, EuSnP, in both the paramagnetic and antiferromagnetic phases using angle-resolved photoemission spectroscopy (ARPES) alongside density functional theory (DFT) based first-principles calculations. The temperature dependence of the magnetic susceptibility measurements exhibits an antiferromagnetic transition at a Néel temperature of 21 K. Employing high-resolution ARPES, the valence band structure was measured at several temperatures above and below the Néel temperature, which produced identical spectra independent of temperature. Through analysis of the ARPES results presented here, we attribute the temperature-independent spectra to the weak coupling of the Sn and P conduction electrons with Eu 4f states.
Submitted 5 November, 2023;
originally announced November 2023.
-
Electronic structure in a rare-earth based nodal-line semimetal candidate PrSbTe
Authors:
Sabin Regmi,
Iftakhar Bin Elius,
Anup Pradhan Sakhya,
Milo Sprague,
Mazharul Islam Mondal,
Nathan Valadez,
Volodymyr Buturlim,
Kali Booth,
Tetiana Romanova,
Krzysztof Gofryk,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
Nodal line semimetals feature topologically protected band crossings between the bulk valence and conduction bands that extend along a finite dimension in the form of a line or a loop. While ZrSiS and similar materials have attracted extensive research as hosts for the nodal line semimetallic phase, an alternative avenue has emerged in the form of isostructural rare-earth (RE) based RESbTe materials. Such systems have intriguing potential for harboring magnetic ordering and electronic correlations owing to the presence of 4f electrons intrinsic to the RE elements. In this study, we have carried out angle-resolved photoemission spectroscopy (ARPES) and thermodynamic measurements in conjunction with first-principles computations on PrSbTe to elucidate its electronic structure and topological characteristics. Magnetic and thermal characterizations indicate the presence of well-localized 4f states with the absence of any discernible phase transition down to 2 K. The ARPES results reveal the presence of gapless Dirac crossings that correspond to a nodal line along the X-R direction in the three-dimensional Brillouin zone. Furthermore, a Dirac crossing that makes up a nodal line, which forms a diamond-shaped nodal plane centered at the center of the Brillouin zone, is also identified within the experimental resolution. This study on the electronic structure of PrSbTe contributes to the understanding of the pivotal role played by spin-orbit coupling in the context of the RESbTe family of materials.
Submitted 1 May, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Observation of flat and weakly dispersing bands in a van der Waals semiconductor Nb3Br8 with breathing kagome lattice
Authors:
Sabin Regmi,
Anup Pradhan Sakhya,
Tharindu Fernando,
Yuzhou Zhao,
Dylan Jeff,
Milo Sprague,
Favian Gonzalez,
Iftakhar Bin Elius,
Mazharul Islam Mondal,
Nathan Valadez,
Damani Jarrett,
Alexis Agosto,
Jihui Yang,
Jiun-Haw Chu,
Saiful I. Khondaker,
Xiaodong Xu,
Ting Cao,
Madhab Neupane
Abstract:
Niobium halides, Nb3X8 (X = Cl, Br, I), which are predicted two-dimensional magnets, have recently attracted attention due to their breathing kagome geometry. Here, we have studied the electronic structure of Nb3Br8 by using angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations. ARPES results show the presence of multiple flat and weakly dispersing bands. These bands are well explained by the theoretical calculations, which show they have Nb d character, indicating that they originate from the Nb atoms forming the breathing kagome plane. This van der Waals material can be easily thinned down via mechanical exfoliation to the ultrathin limit, and such ultrathin samples are stable, as shown by time-dependent Raman spectroscopy measurements at room temperature. These results demonstrate that Nb3Br8 is an excellent material not only for studying breathing kagome induced flat band physics and its connection with magnetism, but also for heterostructure fabrication for application purposes.
Submitted 9 September, 2023;
originally announced September 2023.
-
Observation of multiple flat bands and topological Dirac states in a new titanium based slightly distorted kagome metal YbTi3Bi4
Authors:
Anup Pradhan Sakhya,
Brenden R. Ortiz,
Barun Ghosh,
Milo Sprague,
Mazharul Islam Mondal,
Matthew Matzelle,
Iftakhar Bin Elius,
Nathan Valadez,
David G. Mandrus,
Arun Bansil,
Madhab Neupane
Abstract:
Kagome lattices have emerged as an ideal platform for exploring various exotic quantum phenomena such as correlated topological phases, frustrated lattice geometry, unconventional charge density wave orders, Chern quantum phases, superconductivity, etc. In particular, the vanadium based nonmagnetic kagome metals AV3Sb5 (A = K, Rb, and Cs) have seen a flurry of research interest due to the discovery of multiple competing orders. Here, we report the discovery of a new Ti based kagome metal YbTi3Bi4 and employ angle-resolved photoemission spectroscopy (ARPES) and magnetotransport, in combination with density functional theory calculations, to investigate its electronic structure. We reveal spectroscopic evidence of multiple flat bands arising from the kagome lattice of Ti with predominant Ti 3d character. Through our calculations of the Z2 indices, we have identified that the system exhibits topological nontriviality with surface Dirac cones at the Gamma point and a quasi two-dimensional Dirac state at the K point, which is further confirmed by our ARPES measured band dispersion. These results establish YbTi3Bi4 as a novel platform for exploring the intersection of nontrivial topology and electron correlation effects in this newly discovered Ti based kagome lattice.
Submitted 3 September, 2023;
originally announced September 2023.
-
Observation of momentum-dependent charge density wave gap in a layered antiferromagnet GdTe3
Authors:
Sabin Regmi,
Iftakhar Bin Elius,
Anup Pradhan Sakhya,
Dylan Jeff,
Milo Sprague,
Mazharul Islam Mondal,
Damani Jarrett,
Nathan Valadez,
Alexis Agosto,
Tetiana Romanova,
Jiun-Haw Chu,
Saiful I. Khondaker,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
Charge density wave (CDW) ordering has been an important topic of study for a long time owing to its connection with other exotic phases such as superconductivity and magnetism. The RTe3 (R = rare-earth elements) family of materials provides a fertile ground to study the dynamics of CDW in van der Waals layered materials, and the presence of magnetism in these materials allows exploration of the interplay between CDW and long-range magnetic ordering. Here, we have carried out a high-resolution angle-resolved photoemission spectroscopy (ARPES) study of a CDW material GdTe3, which is antiferromagnetic below 12 K, along with thermodynamic, electrical transport, magnetic, and Raman measurements. Our Raman spectroscopy measurements show the presence of a CDW amplitude mode at room temperature, which remains prominent when the sample is thinned down to four layers by exfoliation. Our ARPES data show a two-fold symmetric Fermi surface with both gapped and ungapped regions, indicative of partial nesting. The gap is momentum dependent: it is maximum along Γ-Z and gradually decreases going towards Γ-M. Our study provides a platform to study the dynamics of CDW and its interaction with other physical orders in two and three dimensions.
Submitted 1 November, 2023; v1 submitted 7 June, 2023;
originally announced June 2023.
-
InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction
Authors:
Ishani Mondal,
Michelle Yuan,
Anandhavelu N,
Aparna Garimella,
Francis Ferraro,
Andrew Blair-Stanek,
Benjamin Van Durme,
Jordan Boyd-Graber
Abstract:
Learning template-based information extraction (IE) from documents is a crucial yet difficult task. Prior template-based IE approaches assume foreknowledge of the domain templates; however, real-world IE does not have predefined schemas and is a figure-it-out-as-you-go phenomenon. To quickly bootstrap templates in a real-world setting, we need to induce template slots from documents with zero or minimal supervision. Since the purpose of question answering intersects with the goal of information extraction, we use automatic question generation to induce template slots from the documents and investigate how a tiny amount of proxy human supervision on the fly (termed InteractiveIE) can further boost the performance. Extensive experiments on biomedical and legal documents, where obtaining training data is expensive, reveal encouraging trends of performance improvement using InteractiveIE over an AI-only baseline.
Submitted 17 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Observation of flat bands and Dirac-like bands in a weakly correlated semimetal YRu2Si2
Authors:
Anup Pradhan Sakhya,
Sabin Regmi,
Milo Sprague,
Mazharul Islam Mondal,
Iftakhar Bin Elius,
Nathan Valadez,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
Condensed matter systems with flat bands have been the center of research interest in recent years as they provide a platform for the emergence of exotic many-body states, such as superconductivity, ferromagnetism, and the fractional quantum Hall effect. However, experimental realization of materials possessing flat bands near the Fermi level is very rare. Here, we report the experimental observation of flat bands in a weakly correlated system YRu2Si2 employing angle-resolved photoemission spectroscopy (ARPES), which is supported by first-principles calculations. These flat bands originate from Ru d orbitals and are found to be sensitive to the polarization of light. In addition, ARPES data revealed surface and bulk Dirac-like bands. The observed ARPES data is in excellent agreement with the density functional theory results. The presence of both flat bands and Dirac-like bands in YRu2Si2 suggests a unique synergy of correlation and topology in this material, which belongs to the centrosymmetric tetragonal ThCr2Si2-type structure, thus establishing a new platform to investigate flat band physics in combination with non-trivial topological states in a weakly correlated system.
Submitted 16 April, 2023;
originally announced April 2023.
-
A study on sensitivity and stability analysis of non-stationary $α$-fractal functions
Authors:
Anarul Islam Mondal,
Sangita Jha
Abstract:
This article aims to study fractal interpolation functions corresponding to a sequence of iterated function systems (IFSs). For a suitable choice of a sequence of IFS parameters, the corresponding non-stationary fractal function is a better approximant for a non-smooth approximand. In this regard, we first construct the non-stationary interpolant in the Lipschitz space and study some topological properties of the associated non-linear fractal operator. Next, we discuss the stability of the interpolant under small perturbations. We also investigate the sensitivity with respect to perturbations of the IFS parameters by providing an upper bound on the errors incurred in the approximation process. Finally, we study the continuous dependence of the proposed interpolant on different IFS parameters.
Submitted 21 March, 2023;
originally announced March 2023.
-
Non-stationary $α$-fractal functions and their dimensions in various function spaces
Authors:
Anarul Islam Mondal,
Sangita Jha
Abstract:
In this article, we study the novel concept of non-stationary iterated function systems (IFSs) introduced by Massopust in 2019. First, using a sequence of different contractive operators, we construct non-stationary $α$-fractal functions on the space of all continuous functions. Next, we provide some elementary properties of the fractal operator associated with the non-stationary $α$-fractal functions. Further, we show that the proposed interpolant generalizes the existing stationary interpolant in the sense of IFS. For a class of functions defined on an interval, we derive conditions on the IFS parameters so that the corresponding non-stationary $α$-fractal functions are elements of some standard spaces like bounded variation space, convex Lipschitz space, and other function spaces. Finally, we discuss the dimensional analysis of the corresponding non-stationary $α$-fractal functions on these spaces.
Submitted 17 March, 2023;
originally announced March 2023.
-
Intent Identification and Entity Extraction for Healthcare Queries in Indic Languages
Authors:
Ankan Mullick,
Ishani Mondal,
Sourjyadip Ray,
R Raghav,
G Sai Chaitanya,
Pawan Goyal
Abstract:
Scarcity of data and technological limitations for resource-poor languages in developing countries like India pose a threat to the development of sophisticated NLU systems for healthcare. To assess the current status of various state-of-the-art language models in healthcare, this paper studies the problem by initially proposing two different healthcare datasets, Indian Healthcare Query Intent-WebMD and 1mg (IHQID-WebMD and IHQID-1mg), and one real-world Indian hospital query dataset in English and multiple Indic languages (Hindi, Bengali, Tamil, Telugu, Marathi and Gujarati), which are annotated with the query intents as well as entities. Our aim is to detect query intents and extract corresponding entities. We perform extensive experiments on a set of models in various realistic settings and explore two scenarios based on access to English data only (less costly) and access to target language data (more expensive). We analyze context-specific practical relevance through empirical analysis. The results, expressed in terms of overall F1 score, show that our approach is practically useful for identifying intents and entities.
Submitted 19 February, 2023;
originally announced February 2023.
-
Explaining (Sarcastic) Utterances to Enhance Affect Understanding in Multimodal Dialogues
Authors:
Shivani Kumar,
Ishani Mondal,
Md Shad Akhtar,
Tanmoy Chakraborty
Abstract:
Conversations emerge as the primary medium for exchanging ideas and conceptions. From the listener's perspective, identifying various affective qualities, such as sarcasm, humour, and emotions, is paramount for comprehending the true connotation of the emitted utterance. However, one of the major hurdles faced in learning these affect dimensions is the presence of figurative language, viz. irony, metaphor, or sarcasm. We hypothesize that any detection system constituting the exhaustive and explicit presentation of the emitted utterance would improve the overall comprehension of the dialogue. To this end, we explore the task of Sarcasm Explanation in Dialogues (SED), which aims to unfold the hidden irony behind sarcastic utterances. We propose MOSES, a deep neural network, which takes a multimodal (sarcastic) dialogue instance as an input and generates a natural language sentence as its explanation. Subsequently, we leverage the generated explanation for various natural language understanding tasks in a conversational dialogue setup, such as sarcasm detection, humour identification, and emotion recognition. Our evaluation shows that MOSES outperforms the state-of-the-art system for SED by an average of ~2% on different evaluation metrics, such as ROUGE, BLEU, and METEOR. Further, we observe that leveraging the generated explanation advances three downstream tasks for affect classification: an average improvement of ~14% F1-score in the sarcasm detection task and ~2% in the humour identification and emotion recognition tasks. We also perform extensive analyses to assess the quality of the results.
Submitted 22 November, 2022; v1 submitted 20 November, 2022;
originally announced November 2022.
-
Observation of gapless nodal-line states in NdSbTe
Authors:
Sabin Regmi,
Robert Smith,
Anup Pradhan Sakhya,
Milo Sprague,
Mazharul Islam Mondal,
Iftakhar Bin Elius,
Nathan Valadez,
Andrzej Ptok,
Dariusz Kaczorowski,
Madhab Neupane
Abstract:
Lanthanide (Ln) based systems in the ZrSiS-type nodal-line semimetals have been subjects of research investigations as grounds for studying the interplay of topology with possible magnetic ordering and electronic correlations that may originate from the presence of Ln 4f electrons. In this study, we carried out a thorough investigation of a LnSbTe system, NdSbTe, by using angle-resolved photoemission spectroscopy along with first-principles calculations and thermodynamic measurements. We experimentally detect the presence of multiple gapless nodal-line states, which is well supported by first-principles calculations. A dispersive and an almost non-dispersive nodal line exist along the bulk X-R direction. Another nodal line is present well below the Fermi level along the Γ-M direction, which is formed by bands with high Fermi velocity that seem to be sensitive to light polarization. Our study provides an insight into the electronic structure of a new LnSbTe material system that will aid in understanding the connection of Ln elements with topological electronic structure in these systems.
Submitted 27 April, 2023; v1 submitted 30 September, 2022;
originally announced October 2022.
-
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Authors:
Yizhong Wang,
Swaroop Mishra,
Pegah Alipoormolabashi,
Yeganeh Kordi,
Amirreza Mirzaei,
Anjana Arunkumar,
Arjun Ashok,
Arut Selvan Dhanasekaran,
Atharva Naik,
David Stap,
Eshaan Pathak,
Giannis Karamanolakis,
Haizhi Gary Lai,
Ishan Purohit,
Ishani Mondal,
Jacob Anderson,
Kirby Kuznia,
Krima Doshi,
Maitreya Patel,
Kuntal Kumar Pal,
Mehrad Moradshahi,
Mihir Parmar,
Mirali Purohit,
Neeraj Varshney,
Phani Rohitha Kaza
, et al. (15 additional authors not shown)
Abstract:
How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting, and text composition. This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions -- training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones. Furthermore, we build Tk-Instruct, a transformer model trained to follow a variety of in-context instructions (plain language task definitions or k-shot examples). Our experiments show that Tk-Instruct outperforms existing instruction-following models such as InstructGPT by over 9% on our benchmark despite being an order of magnitude smaller. We further analyze generalization as a function of various scaling parameters, such as the number of observed tasks, the number of instances per task, and model sizes. We hope our dataset and model facilitate future progress towards more general-purpose NLP models.
Submitted 24 October, 2022; v1 submitted 15 April, 2022;
originally announced April 2022.
-
Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?
Authors:
Ishani Mondal,
Kabir Ahuja,
Mohit Jain,
Jacki O Neil,
Kalika Bali,
Monojit Choudhury
Abstract:
The COVID-19 pandemic has brought out both the best and worst of language technology (LT). On one hand, conversational agents for information dissemination and basic diagnosis have seen widespread use and, arguably, had an important role in combating the pandemic. On the other hand, it has also become clear that such technologies are readily available for only a handful of languages, and the vast majority of the Global South is completely bereft of these benefits. What is the state of LT, especially conversational agents, for healthcare across the world's languages? And what would it take to ensure global readiness of LT before the next pandemic? In this paper, we try to answer these questions through a survey of existing literature and resources, as well as through a rapid chatbot building exercise for 15 Asian and African languages with varying amounts of resource availability. The study confirms the pitiful state of LT even for languages with large speaker bases, such as Sinhala and Hausa, and identifies the gaps that could help us prioritize research and investment strategies in LT for healthcare.
Submitted 6 April, 2022;
originally announced April 2022.
-
Multi-Objective Few-shot Learning for Fair Classification
Authors:
Ishani Mondal,
Procheta Sen,
Debasis Ganguly
Abstract:
In this paper, we propose a general framework for mitigating the disparities of the predicted classes with respect to secondary attributes within the data (e.g., race, gender). Our proposed method involves learning a multi-objective function that, in addition to learning the primary objective of predicting the primary class labels from the data, also employs a clustering-based heuristic to minimize the disparities of the class label distribution with respect to the cluster memberships, with the assumption that each cluster should ideally map to a distinct combination of attribute values. Experiments demonstrate effective mitigation of cognitive biases on a benchmark dataset without the use of annotations of secondary attribute values (the zero-shot case) or with the use of a small number of attribute value annotations (the few-shot case).
Submitted 5 October, 2021;
originally announced October 2021.
-
Modeling Pedagogical Learning Environment with Hybrid Model based on ICT
Authors:
Al Maruf Hassan,
Istiak Ahmed Mondal
Abstract:
Pedagogy is a method that handles the ethos and culture of instruction from educators and the learning of learners. Pedagogy of Information and Communications Technology (ICT) refers to the interactions among the teacher, children, and learning environment based on ICT. It is a discipline that deals with the theory and practice of teaching strategies, teaching actions, teaching judgments, and decisions. It also involves understanding the needs of students as well as the background and interests of each individual. In this paper, we have designed the pedagogical learning environment from the perspective of ICT education. In our methodology, the pedagogy for ICT education includes the interaction among different elements. The methodology aims to bring convenience into the educational environment in different ways. We are also building a hybrid model for the ICT development program. The hybrid model represents the combination of standards, stages, year level, and class level, and brings them under one umbrella. We have constructed the pedagogical learning environment theoretically from the perspective of ICT education, with consideration of outcome-based ICT learning. Outcome-based education is a fundamental element for building any nation around the globe.
Submitted 27 August, 2021; v1 submitted 9 August, 2021;
originally announced August 2021.
-
End-to-End NLP Knowledge Graph Construction
Authors:
Ishani Mondal,
Yufang Hou,
Charles Jochim
Abstract:
This paper studies the end-to-end construction of an NLP Knowledge Graph (KG) from scientific papers. We focus on extracting four types of relations: evaluatedOn between tasks and datasets, evaluatedBy between tasks and evaluation metrics, as well as coreferent and related relations between the same type of entities. For instance, F1-score is coreferent with F-measure. We introduce novel methods for each of these relation types and apply our final framework (SciNLP-KG) to 30,000 NLP papers from ACL Anthology to build a large-scale KG, which can facilitate automatically constructing scientific leaderboards for the NLP community. The results of our experiments indicate that the resulting KG contains high-quality information.
Submitted 2 June, 2021;
originally announced June 2021.
-
BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification
Authors:
Ishani Mondal
Abstract:
Healthcare predictive analytics aids medical decision-making, diagnosis prediction and drug review analysis. Prediction accuracy is therefore an important criterion, which in turn necessitates robust predictive language models. However, deep learning models have been shown to be vulnerable to slightly perturbed input instances that humans are unlikely to misclassify. Recent efforts to generate adversaries using rule-based synonyms and BERT-MLMs have appeared in the general domain, but the ever-increasing biomedical literature poses unique challenges. We propose BBAEG (Biomedical BERT-based Adversarial Example Generation), a black-box attack algorithm for biomedical text classification that leverages the strengths of domain-specific synonym replacement for biomedical named entities, BERT-MLM predictions, spelling variation and number replacement. Through automatic and human evaluation on two datasets, we demonstrate that BBAEG mounts a stronger attack with better language fluency and semantic coherence than prior work.
Submitted 5 April, 2021;
originally announced April 2021.
-
BERTChem-DDI : Improved Drug-Drug Interaction Prediction from text using Chemical Structure Information
Authors:
Ishani Mondal
Abstract:
Traditional biomedical versions of embeddings obtained from pre-trained language models have recently shown state-of-the-art results for relation extraction (RE) tasks in the medical domain. In this paper, we explore how to incorporate domain knowledge, available in the form of the molecular structure of drugs, for predicting Drug-Drug Interaction from a textual corpus. We propose a method, BERTChem-DDI, to efficiently combine drug embeddings obtained from the rich chemical structure of drugs with an off-the-shelf, domain-specific BioBERT embedding-based RE architecture. Experiments conducted on the DDIExtraction 2013 corpus clearly indicate that this strategy improves over other strong baseline architectures by 3.4% macro F1-score.
Submitted 21 December, 2020;
originally announced December 2020.
-
Medical Entity Linking using Triplet Network
Authors:
Ishani Mondal,
Sukannya Purkayastha,
Sudeshna Sarkar,
Pawan Goyal,
Jitesh Pillai,
Amitava Bhattacharyya,
Mahanandeeshwar Gattu
Abstract:
Entity linking (or normalization) is an essential task in text mining that maps entity mentions in medical text to standard entities in a given Knowledge Base (KB). This task is of great importance in the medical domain. It can also be used for merging different medical and clinical ontologies. In this paper, we focus on the problem of disease linking or normalization. This task is executed in two phases: candidate generation and candidate scoring. We present an approach to rank the candidate Knowledge Base entries based on their similarity with the disease mention. We make use of the Triplet Network for candidate ranking. While existing methods have used carefully generated sieves and external resources for candidate generation, we introduce a robust and portable candidate generation scheme that does not rely on hand-crafted rules. Experimental results on the standard benchmark NCBI disease dataset demonstrate that our system outperforms the prior methods by a significant margin.
Submitted 21 December, 2020;
originally announced December 2020.
-
Towards Incorporating Entity-specific Knowledge Graph Information in Predicting Drug-Drug Interactions
Authors:
Ishani Mondal
Abstract:
Off-the-shelf biomedical embeddings obtained from various recently released pre-trained language models (such as BERT and XLNet) have demonstrated state-of-the-art results (in terms of accuracy) for various natural language understanding (NLU) tasks in the biomedical domain. Relation Classification (RC) is one of the most critical of these tasks. In this paper, we explore how to incorporate domain knowledge of biomedical entities (such as drugs, diseases, and genes), obtained from Knowledge Graph (KG) embeddings, for predicting Drug-Drug Interaction from a textual corpus. We propose a new method, BERTKG-DDI, to combine drug embeddings obtained from their interactions with other biomedical entities with a domain-specific BioBERT embedding-based RC architecture. Experiments conducted on the DDIExtraction 2013 corpus clearly indicate that this strategy improves over other baseline architectures by 4.1% macro F1-score.
Submitted 21 December, 2020;
originally announced December 2020.
-
ALEX: Active Learning based Enhancement of a Model's Explainability
Authors:
Ishani Mondal,
Debasis Ganguly
Abstract:
An active learning (AL) algorithm seeks to construct an effective classifier with a minimal number of labeled examples in a bootstrapping manner. While standard AL heuristics exist, such as selecting for annotation those points on which a classification model yields its least confident predictions, there has been no empirical investigation of whether these heuristics lead to models that are more interpretable to humans. In the era of data-driven learning, this is an important research direction to pursue. This paper describes our work-in-progress towards developing an AL selection function that, in addition to model effectiveness, also seeks to improve the interpretability of a model during the bootstrapping steps. Concretely, our proposed selection function trains an 'explainer' model in addition to the classifier model, and favours those instances where a different part of the data is used, on average, to explain the predicted class. Initial experiments showed encouraging trends, suggesting that such a heuristic can lead to more effective and more explainable end-to-end data-driven classifiers.
Submitted 2 September, 2020;
originally announced September 2020.