Showing 1–26 of 26 results for author: Kane, A

  1. arXiv:2505.14588  [pdf, ps, other]

    econ.GN

    Generative AI at the Crossroads: Light Bulb, Dynamo, or Microscope?

    Authors: Martin Baily, David Byrne, Aidan Kane, Paul Soto

    Abstract: With the advent of generative AI (genAI), the potential scope of artificial intelligence has increased dramatically, but the future effect of genAI on productivity remains uncertain with the effect of the technology on the innovation process a crucial open question. Some inventions, such as the light bulb, temporarily raise productivity growth as adoption spreads, but the effect fades when the mar…

    Submitted 16 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2505.04616  [pdf, other]

    cs.CV

    Person Recognition at Altitude and Range: Fusion of Face, Body Shape and Gait

    Authors: Feng Liu, Nicholas Chimitt, Lanqing Guo, Jitesh Jain, Aditya Kane, Minchul Kim, Wes Robbins, Yiyang Su, Dingqiang Ye, Xingguang Zhang, Jie Zhu, Siddharth Satyakam, Christopher Perry, Stanley H. Chan, Arun Ross, Humphrey Shi, Zhangyang Wang, Anil Jain, Xiaoming Liu

    Abstract: We address the problem of whole-body person recognition in unconstrained environments. This problem arises in surveillance scenarios such as those in the IARPA Biometric Recognition and Identification at Altitude and Range (BRIAR) program, where biometric data is captured at long standoff distances, elevated viewing angles, and under adverse atmospheric conditions (e.g., turbulence and high wind v…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 18 pages, 12 figures

  3. arXiv:2504.16922  [pdf, other]

    cs.CV cs.AI cs.LG

    Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light

    Authors: Ali Hassani, Fengzhe Zhou, Aditya Kane, Jiannan Huang, Chieh-Yun Chen, Min Shi, Steven Walton, Markus Hoehnerbach, Vijay Thakkar, Michael Isaev, Qinsheng Zhang, Bing Xu, Haicheng Wu, Wen-mei Hwu, Ming-Yu Liu, Humphrey Shi

    Abstract: Many sparse attention mechanisms such as Neighborhood Attention have typically failed to consistently deliver speedup over the self attention baseline. This is largely due to the level of complexity in attention infrastructure, and the rapid evolution of AI hardware architecture. At the same time, many state-of-the-art foundational models, particularly in computer vision, are heavily bound by atte…

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: https://github.com/SHI-Labs/NATTEN/

  4. arXiv:2311.17722  [pdf, other]

    cs.CL cs.LG

    SenTest: Evaluating Robustness of Sentence Encoders

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

    Abstract: Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due to their rich and effective sentence representations. Having effective sentence representations is paramount in multiple tasks, such as information retrieval, re…

    Submitted 29 November, 2023; originally announced November 2023.

  5. arXiv:2311.02428  [pdf, other]

    cs.CV cs.LG

    Task Arithmetic with LoRA for Continual Learning

    Authors: Rajas Chitale, Ankit Vaidya, Aditya Kane, Archana Ghotkar

    Abstract: Continual learning refers to the problem where the training data is available in sequential chunks, termed "tasks". The majority of progress in continual learning has been stunted by the problem of catastrophic forgetting, which is caused by sequential training of the model on streams of data. Moreover, it becomes computationally expensive to sequentially train large models multiple times. To miti…

    Submitted 4 November, 2023; originally announced November 2023.

  6. arXiv:2306.14030  [pdf, other]

    cs.CL cs.LG

    My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks

    Authors: Tanmay Chavan, Omkar Gokhale, Aditya Kane, Shantanu Patankar, Raviraj Joshi

    Abstract: The research on code-mixed data is limited due to the unavailability of dedicated code-mixed datasets and pre-trained language models. In this work, we focus on the low-resource Indian language Marathi which lacks any prior work in code-mixing. We present L3Cube-MeCorpus, a large code-mixed Marathi-English (Mr-En) corpus with 10 million social media sentences for pretraining. We also release L3Cub…

    Submitted 20 July, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

  7. arXiv:2303.03487  [pdf, other]

    cs.CL cs.AI

    Two-stage Pipeline for Multilingual Dialect Detection

    Authors: Ankit Vaidya, Aditya Kane

    Abstract: Dialect Identification is a crucial task for localizing various Large Language Models. This paper outlines our approach to the VarDial 2023 shared task. Here we have to identify three or two dialects from three languages each which results in a 9-way classification for Track-1 and 6-way classification for Track-2 respectively. Our proposed approach consists of a two-stage system and outperforms ot…

    Submitted 28 March, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  8. arXiv:2212.10039  [pdf, other]

    cs.CL

    A Twitter BERT Approach for Offensive Language Detection in Marathi

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Raviraj Joshi

    Abstract: Automated offensive language detection is essential in combating the spread of hate speech, particularly in social media. This paper describes our work on Offensive Language Identification in low resource Indic language Marathi. The problem is formulated as a text classification task to identify a tweet as offensive or non-offensive. We evaluate different mono-lingual and multi-lingual BERT models…

    Submitted 20 December, 2022; originally announced December 2022.

  9. arXiv:2210.08209  [pdf, other]

    cs.CL

    Large Language Models for Multi-label Propaganda Detection

    Authors: Tanmay Chavan, Aditya Kane

    Abstract: The spread of propaganda through the internet has increased drastically over the past years. Lately, propaganda detection has started gaining importance because of the negative impact it has on society. In this work, we describe our approach for the WANLP 2022 shared task which handles the task of propaganda detection in a multi-label setting. The task demands the model to label the given text as…

    Submitted 20 October, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

  10. arXiv:2210.08207  [pdf, other]

    cs.CL

    Temporal Word Meaning Disambiguation using TimeLMs

    Authors: Mihir Godbole, Parth Dandavate, Aditya Kane

    Abstract: Meaning of words constantly changes given the events in modern civilization. Large Language Models use word embeddings, which are often static and thus cannot cope with this semantic change. Thus, it is important to resolve ambiguity in word meanings. This paper is an effort in this direction, where we explore methods for word sense disambiguation for the EvoNLP shared task. We conduct rigorous abl…

    Submitted 17 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

  11. arXiv:2210.04267  [pdf, other]

    cs.CL cs.AI

    Spread Love Not Hate: Undermining the Importance of Hateful Pre-training for Hate Speech Detection

    Authors: Omkar Gokhale, Aditya Kane, Shantanu Patankar, Tanmay Chavan, Raviraj Joshi

    Abstract: Pre-training large neural language models, such as BERT, has led to impressive gains on many natural language processing (NLP) tasks. Although this method has proven to be effective for many domains, it might not always provide desirable benefits. In this paper, we study the effects of hateful pre-training on low-resource hate speech classification tasks. While previous studies on the English lang…

    Submitted 11 December, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

  12. arXiv:2209.10320  [pdf, other]

    cs.CV

    Continual VQA for Disaster Response Systems

    Authors: Aditya Kane, V Manushree, Sahil Khose

    Abstract: Visual Question Answering (VQA) is a multi-modal task that involves answering questions from an input image, semantically understanding the contents of the image and answering it in natural language. Using VQA for disaster management is an important line of research due to the scope of problems that are answered by the VQA system. However, the main challenge is the delay caused by the generation o…

    Submitted 10 November, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted at Tackling Climate Change with Machine Learning workshop at NeurIPS 2022

  13. arXiv:2209.03661  [pdf, other]

    cs.CL

    Efficient Gender Debiasing of Pre-trained Indic Language Models

    Authors: Neeraja Kirtane, V Manushree, Aditya Kane

    Abstract: The gender bias present in the data on which language models are pre-trained gets reflected in the systems that use these models. The model's intrinsic gender bias shows an outdated and unequal view of women in our culture and encourages discrimination. Therefore, in order to establish more equitable systems and increase fairness, it is crucial to identify and mitigate the bias existing in these m…

    Submitted 8 September, 2022; originally announced September 2022.

  14. arXiv:2205.15025  [pdf, other]

    cs.CV cs.AI cs.CL

    An Efficient Modern Baseline for FloodNet VQA

    Authors: Aditya Kane, Sahil Khose

    Abstract: Designing efficient and reliable VQA systems remains a challenging problem, more so in the case of disaster management and response systems. In this work, we revisit fundamental combination methods like concatenation, addition and element-wise multiplication with modern image and text feature abstraction models. We design a simple and efficient system which outperforms pre-existing methods on the…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Under review, 4 pages, 2 figures, 1 table

  15. arXiv:2205.09402  [pdf]

    cs.LG

    Predictive Maintenance using Machine Learning

    Authors: Archit P. Kane, Ashutosh S. Kore, Advait N. Khandale, Sarish S. Nigade, Pranjali P. Joshi

    Abstract: Predictive maintenance (PdM) is a concept, which is implemented to effectively manage maintenance plans of the assets by predicting their failures with data driven techniques. In these scenarios, data is collected over a certain period of time to monitor the state of equipment. The objective is to find some correlations and patterns that can help predict and ultimately prevent failures. Equipment…

    Submitted 19 May, 2022; originally announced May 2022.

  16. arXiv:2203.11899  [pdf, other]

    cs.CL

    Transformer based ensemble for emotion detection

    Authors: Aditya Kane, Shantanu Patankar, Sahil Khose, Neeraja Kirtane

    Abstract: Detecting emotions in languages is important to accomplish a complete interaction between humans and machines. This paper describes our contribution to the WASSA 2022 shared task which handles this crucial task of emotion detection. We have to identify the following emotions: sadness, surprise, neutral, anger, fear, disgust, joy based on a given essay text. We are using an ensemble of ELECTRA and…

    Submitted 10 April, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted at WASSA, ACL 2022

  17. arXiv:2004.00108  [pdf, ps, other]

    cs.CR

    How to transform the Apple's application 'Find My' into a toolbox for whistleblowers

    Authors: Amadou Moctar Kane

    Abstract: The recent introduction of Find My app by Apple will open a large window of opportunities for whistleblowers. Based on a short range Bluetooth signals, an EC P-224 encryption, and an end-to-end encrypted manner using iCloud Keychain, Find My app is probably the first application broadcasting a large number of anonymous public key on this scale. Hence, this new Apple's application may introduce a r…

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: 18 pages

  18. Ionic Tuning of Cobaltites at the Nanoscale

    Authors: Dustin A. Gilbert, Alexander J. Grutter, Peyton D. Murray, Rajesh V. Chopdekar, Alexander M. Kane, Aleksey L. Ionin, Michael S. Lee, Steven R. Spurgeon, Brian J. Kirby, Brian B. Maranville, Alpha T. N'Diaye, Apurva Mehta, Elke Arenholz, Kai Liu, Yayoi Takamura, Julie A. Borchers

    Abstract: Control of materials through custom design of ionic distributions represents a powerful new approach to develop future technologies ranging from spintronic logic and memory devices to energy storage. Perovskites have shown particular promise for ionic devices due to their high ion mobility and sensitivity to chemical stoichiometry. In this work, we demonstrate a solid-state approach to control of…

    Submitted 23 September, 2018; originally announced September 2018.

    Journal ref: Phys. Rev. Materials 2, 104402 (2018)

  19. arXiv:1806.05711  [pdf, ps, other]

    cs.CR

    An eco-friendly Ecash with recycled banknotes

    Authors: Amadou Moctar Kane

    Abstract: By comparing cryptocurrencies with other existing payment methods, including banknotes and bank cards, it is clear that the use of Bitcoin and its competitors (Ethereum, ...) is almost insignificant in world trade. We may also note that these cryptocurrencies have become tools of speculation, which is the antithesis of their primary purpose. Based essentially on the security of electronic sign…

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: 10 pages

  20. arXiv:1710.08977  [pdf]

    physics.ed-ph

    Making Physics Courses Accessible for Blind Students: strategies for course administration, class meetings and course materials

    Authors: Megan Holt, Daniel Gillen, Chelsea Cook, Christa Hixson Miller, Sacha D. Nandlall, Kevin Setter, Cary Supalo, Paul Thorman, Suzanne Amador Kane

    Abstract: The Americans with Disabilities Act (ADA) mandates that U.S. institutions of higher education provide "reasonable accommodations" to students with disabilities to ensure equal educational opportunities. However, despite the key role of physics as a gateway to Science, Technology, Engineering and Mathematics (STEM) studies, only limited resources exist for teaching physics to students who are blind…

    Submitted 17 July, 2018; v1 submitted 24 October, 2017; originally announced October 2017.

  21. arXiv:1612.08894  [pdf, other]

    cs.CV

    Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

    Authors: Konstantinos Kamnitsas, Christian Baumgartner, Christian Ledig, Virginia F. J. Newcombe, Joanna P. Simpson, Andrew D. Kane, David K. Menon, Aditya Nori, Antonio Criminisi, Daniel Rueckert, Ben Glocker

    Abstract: Significant advances have been made towards building accurate automatic segmentation systems for a variety of biomedical applications using machine learning. However, the performance of these systems often degrades when they are applied on new data that differ from the training data, for example, due to variations in imaging protocols. Manually annotating new data for each test domain is not a fea…

    Submitted 28 December, 2016; originally announced December 2016.

  22. arXiv:1606.06644  [pdf, ps, other]

    cs.CR cs.CY

    How DNA Cryptography can help whistleblowers and refugees

    Authors: Amadou Moctar Kane

    Abstract: The recent progress in DNA sequencing will probably revolutionize the world of electronic. Hence, we went from DNA sequencing that only research centers could realize, to portable, tiny and inexpensive tools. So, it is likely that in a few years these DNA sequencers will be included in our smartphones. The purpose of this paper is to support this revolution, by using the DNA cryptography, hash f…

    Submitted 11 April, 2016; originally announced June 2016.

    Comments: 22 pages

  23. Efficient Multi-Scale 3D CNN with Fully Connected CRF for Accurate Brain Lesion Segmentation

    Authors: Konstantinos Kamnitsas, Christian Ledig, Virginia F. J. Newcombe, Joanna P. Simpson, Andrew D. Kane, David K. Menon, Daniel Rueckert, Ben Glocker

    Abstract: We propose a dual pathway, 11-layers deep, three-dimensional Convolutional Neural Network for the challenging task of brain lesion segmentation. The devised architecture is the result of an in-depth analysis of the limitations of current networks proposed for similar applications. To overcome the computational burden of processing 3D medical scans, we have devised an efficient and effective dense…

    Submitted 8 January, 2017; v1 submitted 18 March, 2016; originally announced March 2016.

    Comments: This version was accepted in the journal Medical Image Analysis (MedIA)

  24. arXiv:1507.06235  [pdf, other]

    cs.IR

    The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search

    Authors: Richard Zanibbi, Kenny Davila, Andrew Kane, Frank Tompa

    Abstract: With the ever-increasing quantity and variety of data worldwide, the Web has become a rich repository of mathematical formulae. This necessitates the creation of robust and scalable systems for Mathematical Information Retrieval, where users search for mathematical information using individual formulae (query-by-expression) or a combination of keywords and formulae. Often, the pages that best sati…

    Submitted 22 July, 2015; originally announced July 2015.

    Comments: 10 pages

    ACM Class: H.2.4; H.3.3; H.3.4

  25. Physical Removal of Metallic Carbon Nanotubes from Nanotube Network Devices Using a Thermal and Fluidic Process

    Authors: Alexandra C. Ford, Michael Shaughnessy, Bryan M. Wong, Alexander A. Kane, Oleksandr V. Kuznetsov, Karen L. Krafcik, W. E. Billups, Robert H. Hauge, François Léonard

    Abstract: Electronic and optoelectronic devices based on thin films of carbon nanotubes are currently limited by the presence of metallic nanotubes. Here we present a novel approach based on nanotube alkyl functionalization to physically remove the metallic nanotubes from such network devices. The process relies on preferential thermal desorption of the alkyls from the semiconducting nanotubes and the subse…

    Submitted 1 February, 2013; originally announced February 2013.

  26. arXiv:0808.1134  [pdf]

    physics.bio-ph q-bio.PE

    A biophysical model of prokaryotic diversity in geothermal hot springs

    Authors: Anna Klales, James Duncan, Elizabeth Janus Nett, Suzanne Amador Kane

    Abstract: Recent field investigations of photosynthetic bacteria living in geothermal hot spring environments have revealed surprisingly complex ecosystems, with an unexpected level of genetic diversity. One case of particular interest involves the distribution along hot spring thermal gradients of genetically distinct bacterial strains that differ in their preferred temperatures for reproduction and phot…

    Submitted 7 August, 2008; originally announced August 2008.