-
Evaluating the Explainability of Neural Rankers
Authors:
Saran Pandian,
Debasis Ganguly,
Sean MacAvaney
Abstract:
Information retrieval models have witnessed a paradigm shift from unsupervised statistical approaches to feature-based supervised approaches to completely data-driven ones that make use of the pre-training of large language models. While the increasing complexity of the search models have been able to demonstrate improvements in effectiveness (measured in terms of relevance of top-retrieved result…
▽ More
Information retrieval models have witnessed a paradigm shift from unsupervised statistical approaches to feature-based supervised approaches to completely data-driven ones that make use of the pre-training of large language models. While the increasing complexity of the search models have been able to demonstrate improvements in effectiveness (measured in terms of relevance of top-retrieved results), a question worthy of a thorough inspection is - "how explainable are these models?", which is what this paper aims to evaluate. In particular, we propose a common evaluation platform to systematically evaluate the explainability of any ranking model (the explanation algorithm being identical for all the models that are to be evaluated). In our proposed framework, each model, in addition to returning a ranked list of documents, also requires to return a list of explanation units or rationales for each document. This meta-information from each document is then used to measure how locally consistent these rationales are as an intrinsic measure of interpretability - one that does not require manual relevance assessments. Additionally, as an extrinsic measure, we compute how relevant these rationales are by leveraging sub-document level relevance assessments. Our findings show a number of interesting observations, such as sentence-level rationales are more consistent, an increase in complexity mostly leads to less consistent explanations, and that interpretability measures offer a complementary dimension of evaluation of IR systems because consistency is not well-correlated with nDCG at top ranks.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Stacking of Hyperparameter Tuned Models for Tagging Coding Problems
Authors:
Sathya Krishnan TS,
S. Lakshmana Pandian,
P. Shunmugapriya
Abstract:
Coding problems are problems that require a solution in the form of a computer program. Coding problems are popular among students and professionals as it enhances their skills and career opportunities. An AI system that would help those who practice coding problems would be highly useful and there is a huge potential for such a system. In this work, we propose a model which uses stacking of hyper…
▽ More
Coding problems are problems that require a solution in the form of a computer program. Coding problems are popular among students and professionals as it enhances their skills and career opportunities. An AI system that would help those who practice coding problems would be highly useful and there is a huge potential for such a system. In this work, we propose a model which uses stacking of hyperparameter tuned boosting models to achieve impressive metric scores of 77.8% accuracy and 0.815 PR-AUC on the dataset that was scraped from Codeforces and Leetcode. We open source the dataset and the models developed for this work.
△ Less
Submitted 6 July, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Grounding Explainability Within the Context of Global South in XAI
Authors:
Deepa Singh,
Michal Slupczynski,
Ajit G. Pillai,
Vinoth Pandian Sermuga Pandian
Abstract:
In this position paper, we propose building a broader and deeper understanding around Explainability in AI by 'grounding' it in social contexts, the socio-technical systems operate in. We situate our understanding of grounded explainability in the 'Global South' in general and India in particular and express the need for more research within the global south context when it comes to explainability…
▽ More
In this position paper, we propose building a broader and deeper understanding around Explainability in AI by 'grounding' it in social contexts, the socio-technical systems operate in. We situate our understanding of grounded explainability in the 'Global South' in general and India in particular and express the need for more research within the global south context when it comes to explainability and AI.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
Intra-night optical variability monitoring of $γ$-ray emitting blazars
Authors:
K. Subbu Ulaganatha Pandian,
A. Natarajan,
C. S. Stalin,
Ashwani Pandey,
S. Muneer,
B. Natarajan
Abstract:
We present the results obtained from our campaign to characterize the intra-night-optical variability properties of blazars detected by the {\it Fermi} Large Area Telescope. This involves R-band monitoring observations of a sample of 18 blazars, that includes five flat-spectrum radio quasars (FSRQs) and thirteen BL Lac objects (BL Lacs) covering the redshift range z = 0.085$-$1.184. Our observatio…
▽ More
We present the results obtained from our campaign to characterize the intra-night-optical variability properties of blazars detected by the {\it Fermi} Large Area Telescope. This involves R-band monitoring observations of a sample of 18 blazars, that includes five flat-spectrum radio quasars (FSRQs) and thirteen BL Lac objects (BL Lacs) covering the redshift range z = 0.085$-$1.184. Our observations, carried out using the 1.3 m J.C. Bhattacharya Telescope cover a total of 40 nights ($\sim$200 hrs) between the period 2016 December and 2020 March. We characterized variability using the power enhanced $F-$test. We found a duty cycle (DC) of the variability of about 11\% for FSRQs and 12\% for BL Lacs. Dividing the sample into different sub-classes based on the position of the synchrotron peak in their broadband spectral energy distribution (SED), we found DC of $\sim$16\%, $\sim$10\% and $\sim$7\% for low-synchrotron peaked (LSP), intermediate synchrotron peaked (ISP) and high synchrotron peaked (HSP) blazars. Such high DC of variability in LSP blazars could be understood in the context of the R-band tracing the falling part (contributed by high energy electrons) of the synchrotron component of the broadband SED. Also, the R-band tracing the rising synchrotron part (produced by low energy electrons) in the case of ISP and HSP blazars, could cause lesser variability in them. Thus, the observed high DC of variability in LSP blazars relative to ISP and HSP blazars is in accordance with the leptonic model of emission from blazar jets.
△ Less
Submitted 15 February, 2022;
originally announced February 2022.
-
Efficient Scaling of Dynamic Graph Neural Networks
Authors:
Venkatesan T. Chakaravarthy,
Shivmaran S. Pandian,
Saurabh Raje,
Yogish Sabharwal,
Toyotaro Suzumura,
Shashanka Ubaru
Abstract:
We present distributed algorithms for training dynamic Graph Neural Networks (GNN) on large scale graphs spanning multi-node, multi-GPU systems. To the best of our knowledge, this is the first scaling study on dynamic GNN. We devise mechanisms for reducing the GPU memory usage and identify two execution time bottlenecks: CPU-GPU data transfer; and communication volume. Exploiting properties of dyn…
▽ More
We present distributed algorithms for training dynamic Graph Neural Networks (GNN) on large scale graphs spanning multi-node, multi-GPU systems. To the best of our knowledge, this is the first scaling study on dynamic GNN. We devise mechanisms for reducing the GPU memory usage and identify two execution time bottlenecks: CPU-GPU data transfer; and communication volume. Exploiting properties of dynamic graphs, we design a graph difference-based strategy to significantly reduce the transfer time. We develop a simple, but effective data distribution technique under which the communication volume remains fixed and linear in the input size, for any number of GPUs. Our experiments using billion-size graphs on a system of 128 GPUs shows that: (i) the distribution scheme achieves up to 30x speedup on 128 GPUs; (ii) the graph-difference technique reduces the transfer time by a factor of up to 4.1x and the overall execution time by up to 40%
△ Less
Submitted 16 September, 2021;
originally announced September 2021.
-
Unleashing the power of disruptive and emerging technologies amid COVID-19: A detailed review
Authors:
Sonali Agarwal,
Narinder Singh Punn,
Sanjay Kumar Sonbhadra,
M. Tanveer,
P. Nagabhushan,
K K Soundra Pandian,
Praveer Saxena
Abstract:
The unprecedented outbreak of the novel coronavirus (COVID-19), during early December 2019 in Wuhan, China, has quickly evolved into a global pandemic, became a matter of grave concern, and placed government agencies worldwide in a precarious position. The scarcity of resources and lack of experiences to endure the COVID-19 pandemic, combined with the fear of future consequences has established th…
▽ More
The unprecedented outbreak of the novel coronavirus (COVID-19), during early December 2019 in Wuhan, China, has quickly evolved into a global pandemic, became a matter of grave concern, and placed government agencies worldwide in a precarious position. The scarcity of resources and lack of experiences to endure the COVID-19 pandemic, combined with the fear of future consequences has established the need for adoption of emerging and future technologies to address the upcoming challenges. Since the last five months, the amount of pandemic impact has reached its pinnacle that is altering everyone's life; and humans are now bound to adopt safe ways to survive under the risk of being affected. Technological advances are now accelerating faster than ever before to stay ahead of the consequences and acquire new capabilities to build a safer world. Thus, there is a rising need to unfold the power of emerging, future and disruptive technologies to explore all possible ways to fight against COVID-19. In this review article, we attempt to study all emerging, future, and disruptive technologies that can be utilized to mitigate the impact of COVID-19. Building on background insights, detailed technological specific use cases to fight against COVID-19 have been discussed in terms of their strengths, weaknesses, opportunities, and threats (SWOT). As concluding remarks, we highlight prioritized research areas and upcoming opportunities to blur the lines between the physical, digital, and biological domain-specific challenges and also illuminate collaborative research directions for moving towards a post-COVID-19 world.
△ Less
Submitted 19 April, 2021; v1 submitted 23 May, 2020;
originally announced May 2020.
-
BlackBox Toolkit: Intelligent Assistance to UI Design
Authors:
Vinoth Pandian Sermuga Pandian,
Sarah Suleri
Abstract:
User Interface (UI) design is an creative process that involves considerable reiteration and rework. Designers go through multiple iterations of different prototyping fidelities to create a UI design. In this research, we propose to modify the UI design process by assisting it with artificial intelligence (AI). We propose to enable AI to perform repetitive tasks for the designer while allowing the…
▽ More
User Interface (UI) design is an creative process that involves considerable reiteration and rework. Designers go through multiple iterations of different prototyping fidelities to create a UI design. In this research, we propose to modify the UI design process by assisting it with artificial intelligence (AI). We propose to enable AI to perform repetitive tasks for the designer while allowing the designer to take command of the creative process. This approach makes the machine act as a black box that intelligently assists the designers in creating UI design. We believe this approach would greatly benefit designers in co-creating design solutions with AI.
△ Less
Submitted 7 April, 2020; v1 submitted 4 April, 2020;
originally announced April 2020.
-
NASA-TLX Web App: An Online Tool to Analyse Subjective Workload
Authors:
Vinoth Pandian Sermuga Pandian,
Sarah Suleri
Abstract:
NASA Task Load Index (NASA-TLX) is a widely used assessment technique to compute subjective workload experienced during a task. It evaluates workload using six dimensions: mental demand, physical demand, temporal demand, frustration, effort, and performance. This paper presents a web app to assist experimenters in using NASA-TLX to commute subjective workload. The web app enables the experimenter…
▽ More
NASA Task Load Index (NASA-TLX) is a widely used assessment technique to compute subjective workload experienced during a task. It evaluates workload using six dimensions: mental demand, physical demand, temporal demand, frustration, effort, and performance. This paper presents a web app to assist experimenters in using NASA-TLX to commute subjective workload. The web app enables the experimenter to conduct various experiments simultaneously and offers the participants a concise interface to provide their subjective evaluation. It performs the calculations at the backend and provides the computed results comprehensively. The web app provides a dashboard for the experimenter to visualize and export the summary of results. Qualitative feedback from 12 experimenters indicated that the NASA-TLX web app is relevant, helpful, and easy to use.
△ Less
Submitted 27 January, 2020;
originally announced January 2020.
-
On Optimizing Distributed Tucker Decomposition for Sparse Tensors
Authors:
Venkatesan T. Chakaravarthy,
Jee W. Choi,
Douglas J. Joseph,
Prakash Murali,
Shivmaran S. Pandian,
Yogish Sabharwal,
Dheeraj Sreedhar
Abstract:
The Tucker decomposition generalizes the notion of Singular Value Decomposition (SVD) to tensors, the higher dimensional analogues of matrices. We study the problem of constructing the Tucker decomposition of sparse tensors on distributed memory systems via the HOOI procedure, a popular iterative method. The scheme used for distributing the input tensor among the processors (MPI ranks) critically…
▽ More
The Tucker decomposition generalizes the notion of Singular Value Decomposition (SVD) to tensors, the higher dimensional analogues of matrices. We study the problem of constructing the Tucker decomposition of sparse tensors on distributed memory systems via the HOOI procedure, a popular iterative method. The scheme used for distributing the input tensor among the processors (MPI ranks) critically influences the HOOI execution time. Prior work has proposed different distribution schemes: an offline scheme based on sophisticated hypergraph partitioning method and simple, lightweight alternatives that can be used real-time. While the hypergraph based scheme typically results in faster HOOI execution time, being complex, the time taken for determining the distribution is an order of magnitude higher than the execution time of a single HOOI iteration. Our main contribution is a lightweight distribution scheme, which achieves the best of both worlds. We show that the scheme is near-optimal on certain fundamental metrics associated with the HOOI procedure and as a result, near-optimal on the computational load (FLOPs). Though the scheme may incur higher communication volume, the computation time is the dominant factor and as the result, the scheme achieves better performance on the overall HOOI execution time. Our experimental evaluation on large real-life tensors (having up to 4 billion elements) shows that the scheme outperforms the prior schemes on the HOOI execution time by a factor of up to 3x. On the other hand, its distribution time is comparable to the prior lightweight schemes and is typically lesser than the execution time of a single HOOI iteration.
△ Less
Submitted 18 January, 2020; v1 submitted 25 April, 2018;
originally announced April 2018.
-
Graphene based Supercapacitors with Improved Specific Capacitance and Fast Charging Time at High Current Density
Authors:
Santhakumar Kannappan,
Karthikeyan Kaliyappan,
Rajesh Kumar Manian,
Amaresh Samuthira Pandian,
Hao Yang,
Yun Sung Lee,
Jae-Hyung Jang,
Wu Lu
Abstract:
Graphene is a promising material for energy storage, especially for high performance supercapacitors. For real time high power applications, it is critical to have high specific capacitance with fast charging time at high current density. Using a modified Hummer's method and tip sonication for graphene synthesis, here we show graphene-based supercapacitors with high stability and significantly-imp…
▽ More
Graphene is a promising material for energy storage, especially for high performance supercapacitors. For real time high power applications, it is critical to have high specific capacitance with fast charging time at high current density. Using a modified Hummer's method and tip sonication for graphene synthesis, here we show graphene-based supercapacitors with high stability and significantly-improved electrical double layer capacitance and energy density with fast charging and discharging time at a high current density, due to enhanced ionic electrolyte accessibility in deeper regions. The discharge capacitance and energy density values, 195 Fg-1 and 83.4 Whkg-1, are achieved at a current density of 2.5 Ag-1. The time required to discharge 64.18 Whkg-1 at 5 A/g is around 25 sec. At 7.5 Ag-1 current density, the cell can deliver a specific capacitance of about 137 Fg-1 and maintain 98 % of its initial value after 10,000 cycles, suggesting that the stable performance of supercapacitors at high current rates is suitable for fast charging-discharging applications. We attribute this superior performance to the highly porous nature of graphene prepared with minimum restacking due to crimple nature wrinkles and the improved current collecting method.
△ Less
Submitted 6 November, 2013;
originally announced November 2013.
-
Achieving Both High Power and Energy Density in Electrochemical Supercapacitors with Nanoporous Graphene Materials
Authors:
Hao Yang,
Santhakumar Kannappan,
Amaresh S. Pandian,
Jae-Hyung Jang,
Yun Sung Lee,
Wu Lu
Abstract:
Supercapacitors, based on the fast ion transportation, are specialized to provide high power, long stability, and efficient energy storage with highly porous electrode materials. However, their low energy density and specific capacitance prevent them from many applications that require long duration. Using a scalable nanoporous graphene synthesis method involving a simple annealing process in hydr…
▽ More
Supercapacitors, based on the fast ion transportation, are specialized to provide high power, long stability, and efficient energy storage with highly porous electrode materials. However, their low energy density and specific capacitance prevent them from many applications that require long duration. Using a scalable nanoporous graphene synthesis method involving a simple annealing process in hydrogen, here we show graphene supercapacitors capable of achieving a high energy density comparable to what Li-ion batteries can offer, but a much higher power density. Ultra-high specific gravimetric and volumetric capacitances are achieved with highly porous graphene electrodes. Moreover, the supercapacitors assembled with graphene electrodes show excellent stability. Our results demonstrate that by synthesizing graphene materials with an ideal pore size, uniformity, and good ion accessibility, the performance of supercapacitors can be revolutionized.
△ Less
Submitted 6 November, 2013;
originally announced November 2013.
-
A Color Image Digital Watermarking Scheme Based on SOFM
Authors:
J. Anitha,
S. Immanuel Alex Pandian
Abstract:
Digital watermarking technique has been presented and widely researched to solve some important issues in the digital world, such as copyright protection, copy protection and content authentication. Several robust watermarking schemes based on vector quantization (VQ) have been presented. In this paper, we present a new digital image watermarking method based on SOFM vector quantizer for color ima…
▽ More
Digital watermarking technique has been presented and widely researched to solve some important issues in the digital world, such as copyright protection, copy protection and content authentication. Several robust watermarking schemes based on vector quantization (VQ) have been presented. In this paper, we present a new digital image watermarking method based on SOFM vector quantizer for color images. This method utilizes the codebook partition technique in which the watermark bit is embedded into the selected VQ encoded block. The main feature of this scheme is that the watermark exists both in VQ compressed image and in the reconstructed image. The watermark extraction can be performed without the original image. The watermark is hidden inside the compressed image, so much transmission time and storage space can be saved when the compressed data are transmitted over the Internet. Simulation results demonstrate that the proposed method has robustness against various image processing operations without sacrificing compression performance and the computational speed.
△ Less
Submitted 13 January, 2011;
originally announced January 2011.
-
Enhanced Authentication and Locality Aided - Destination Mobility in Dynamic Routing Protocol for MANET
Authors:
Sudhakar Sengan,
S. Chenthur Pandian
Abstract:
In our proposed model, the route selection is a function of following parameters: hop count, trust level of node and security level of application. In this paper, to focus on secure neighbor detection, trust factor evaluation, operational mode, route discovery and route selection. The paper mainly address the security of geographic routing. The watchdog identifies misbehaving nodes, while the Path…
▽ More
In our proposed model, the route selection is a function of following parameters: hop count, trust level of node and security level of application. In this paper, to focus on secure neighbor detection, trust factor evaluation, operational mode, route discovery and route selection. The paper mainly address the security of geographic routing. The watchdog identifies misbehaving nodes, while the Pathselector avoids routing packets through these nodes. The watchdog, the pathselector is run by each server. In order to keep the source informed about the destination's mobility, the destination keeps sending the alert message to its previous hop telling that it has changed its position and any reference to it for data packet forwarding be informed to the VHR server.
△ Less
Submitted 10 April, 2010;
originally announced April 2010.