Computation and Language

Authors and titles for January 2022

Total of 453 entries
[1] arXiv:2201.00075 [pdf, other]
Title: How do lexical semantics affect translation? An empirical study
Vivek Subramanian, Dhanasekar Sundararaman
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[2] arXiv:2201.00083 [pdf, other]
Title: Automated Fake News Detection using cross-checking with reliable sources
Zahra Ghadiri, Milad Ranjbar, Fakhteh Ghanbarnejad, Sadegh Raeisi
Comments: 12 Pages, 5 Figures
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
[3] arXiv:2201.00118 [pdf, other]
Title: Semantic Search for Large Scale Clinical Ontologies
Duy-Hoa Ngo, Madonna Kemp, Donna Truran, Bevan Koopman, Alejandro Metke-Jimenez
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[4] arXiv:2201.00136 [pdf, other]
Title: Zero-shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
Zi-Yi Dou, Nanyun Peng
Comments: AAAI 2022
Subjects: Computation and Language (cs.CL)
[5] arXiv:2201.00318 [pdf, other]
Title: On Sensitivity of Deep Learning Based Text Classification Algorithms to Practical Input Perturbations
Aamir Miyajiwala, Arnav Ladkat, Samiksha Jagadale, Raviraj Joshi
Comments: Accepted at Computing Conference 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[6] arXiv:2201.00374 [pdf, other]
Title: Topical Classification of Food Safety Publications with a Knowledge Base
Piotr Sowinski, Katarzyna Wasielewska-Michniewska, Maria Ganzha, Marcin Paprzycki
Subjects: Computation and Language (cs.CL)
[7] arXiv:2201.00455 [pdf, other]
Title: Actor-Critic Network for Q&A in an Adversarial Environment
Bejan Sadeghian
Comments: 6 pages, 3 figures, 3 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2201.00490 [pdf, other]
Title: Learning with Latent Structures in Natural Language Processing: A Survey
Zhaofeng Wu
Subjects: Computation and Language (cs.CL)
[9] arXiv:2201.00558 [pdf, other]
Title: Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models
Made Nindyatama Nityasya, Haryo Akbarianto Wibowo, Rendi Chevi, Radityo Eko Prasojo, Alham Fikri Aji
Comments: 14 pages, 3 figures, submitted to Elsevier
Subjects: Computation and Language (cs.CL)
[10] arXiv:2201.00598 [pdf, other]
Title: Toxicity Detection for Indic Multilingual Social Media Content
Manan Jhaveri, Devanshu Ramaiya, Harveen Singh Chadha
Comments: It was meant for IEEE BigM conference
Subjects: Computation and Language (cs.CL)
[11] arXiv:2201.00768 [pdf, other]
Title: Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Marwan Omar, Soohyeon Choi, DaeHun Nyang, David Mohaisen
Comments: Survey; 2 figures, 4 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[12] arXiv:2201.00912 [pdf, other]
Title: An Adversarial Benchmark for Fake News Detection Models
Lorenzo Jaime Yu Flores, Yiding Hao
Comments: 6 pages, 2 figures, Presented at AAAI 2022, Workshop on Adversarial Machine Learning and Beyond
Subjects: Computation and Language (cs.CL)
[13] arXiv:2201.00965 [pdf, html, other]
Title: Semantics-Preserved Distortion for Personal Privacy Protection in Information Management
Jiajia Li, Lu Yang, Letian Peng, Shitou Zhang, Ping Wang, Zuchao Li, Hai Zhao
Subjects: Computation and Language (cs.CL)
[14] arXiv:2201.00987 [pdf, other]
Title: MDFEND: Multi-domain Fake News Detection
Qiong Nan, Juan Cao, Yongchun Zhu, Yanyan Wang, Jintao Li
Comments: CIKM 2021 short paper. 5 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[15] arXiv:2201.00989 [pdf, other]
Title: DigNet: Digging Clues from Local-Global Interactive Graph for Aspect-level Sentiment Classification
Bowen Xing, Ivor Tsang
Comments: submitted to Journal of Artificial Intelligence Research (JAIR)
Subjects: Computation and Language (cs.CL)
[16] arXiv:2201.01140 [pdf, other]
Title: Predicting Influenza A Viral Host Using PSSM and Word Embeddings
Yanhua Xu, Dominik Wojtczak
Comments: Accepted for publication at CIBCB 2021. V1: accepted version + minor correction to table 1; V2: corrected a minor typo; V3: update the formula of error rate; V4: replacing 'nested cv' with 'nested k-fold cv' for better clarity
Journal-ref: 2021 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[17] arXiv:2201.01251 [pdf, other]
Title: Multi-Stage Episodic Control for Strategic Exploration in Text Games
Jens Tuyls, Shunyu Yao, Sham Kakade, Karthik Narasimhan
Comments: ICLR 2022 (Spotlight) - this https URL
Subjects: Computation and Language (cs.CL)
[18] arXiv:2201.01337 [pdf, other]
Title: ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling
Alexandre Alcoforado, Thomas Palmeira Ferraz, Rodrigo Gerber, Enzo Bustos, André Seidel Oliveira, Bruno Miguel Veloso, Fabio Levy Siqueira, Anna Helena Reali Costa
Comments: Accepted at PROPOR 2022: 15th International Conference on Computational Processing of Portuguese
Journal-ref: In: Pinheiro V. et al. (eds) Computational Processing of the Portuguese Language. PROPOR 2022. Lecture Notes in Computer Science, vol 13208. Springer, Cham
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[19] arXiv:2201.01364 [pdf, other]
Title: A Discriminative Hierarchical PLDA-based Model for Spoken Language Recognition
Luciana Ferrer, Diego Castan, Mitchell McLaren, Aaron Lawson
Journal-ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 2396-2410, 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[20] arXiv:2201.01405 [pdf, other]
Title: Mining Adverse Drug Reactions from Unstructured Mediums at Scale
Hasham Ul Haq, Veysel Kocaman, David Talby
Comments: Accepted to W3PHIAI workshop at AAAI-22
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[21] arXiv:2201.01420 [pdf, other]
Title: Hyperparameter-free Continuous Learning for Domain Classification in Natural Language Understanding
Ting Hua, Yilin Shen, Changsheng Zhao, Yen-Chang Hsu, Hongxia Jin
Journal-ref: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2669--2678
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[22] arXiv:2201.01559 [pdf, other]
Title: Monitoring Energy Trends through Automatic Information Extraction
Dilek Küçük
Comments: 5 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[23] arXiv:2201.01631 [pdf, other]
Title: SMDT: Selective Memory-Augmented Neural Document Translation
Xu Zhang, Jian Yang, Haoyang Huang, Shuming Ma, Dongdong Zhang, Jinlong Li, Furu Wei
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[24] arXiv:2201.01693 [pdf, other]
Title: Strategies of Effective Digitization of Commentaries and Sub-commentaries: Towards the Construction of Textual History
Diptesh Kanojia, Malhar Kulkarni, Sayali Ghodekar, Eivind Kahrs, Pushpak Bhattacharyya
Comments: Accepted at TCDK @ SSSU 2020; ISBN: 978-93-83097-43-2; Pages 477--489
Subjects: Computation and Language (cs.CL)
[25] arXiv:2201.01700 [pdf, other]
Title: Some Strategies to Capture Karaka-Yogyata with Special Reference to apadana
Swaraja Salaskar, Diptesh Kanojia, Malhar Kulkarni
Comments: Published at SOIL-Tech 2019
Subjects: Computation and Language (cs.CL)
[26] arXiv:2201.01706 [pdf, other]
Title: Multi Document Reading Comprehension
Avi Chawla
Comments: 10 pages, 6 Figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[27] arXiv:2201.01747 [pdf, other]
Title: Semi-automatic WordNet Linking using Word Embeddings
Kevin Patel, Diptesh Kanojia, Pushpak Bhattacharyya
Comments: Published at GWC 2018
Subjects: Computation and Language (cs.CL)
[28] arXiv:2201.01787 [pdf, other]
Title: Does Entity Abstraction Help Generative Transformers Reason?
Nicolas Gontier, Siva Reddy, Christopher Pal
Comments: TMLR 2022; 28 pages; 9 tables; 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[29] arXiv:2201.01800 [pdf, other]
Title: KUDO Interpreter Assist: Automated Real-time Support for Remote Interpretation
Claudio Fantinuoli, Giulia Marchesini, David Landan, Lukas Horak
Comments: Accepted at TC43
Subjects: Computation and Language (cs.CL)
[30] arXiv:2201.01837 [pdf, other]
Title: Frame Shift Prediction
Zheng-Xin Yong, Patrick D. Watson, Tiago Timponi Torrent, Oliver Czulo, Collin F. Baker
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[31] arXiv:2201.01845 [pdf, other]
Title: Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation
Zoey Liu, Emily Prud'hommeaux
Comments: Published in TACL (this https URL)
Subjects: Computation and Language (cs.CL)
[32] arXiv:2201.01880 [pdf, other]
Title: Automatic Related Work Generation: A Meta Study
Xiangci Li, Jessica Ouyang
Subjects: Computation and Language (cs.CL)
[33] arXiv:2201.01956 [pdf, other]
Title: HuSpaCy: an industrial-strength Hungarian natural language processing toolkit
György Orosz, Zsolt Szántó, Péter Berkecz, Gergő Szabó, Richárd Farkas
Comments: Camera-ready manuscript: - Fixed various grammatical errors. - Restructured the evaluation section. - Updated scores in accordance with the v0.4.2 release
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
[34] arXiv:2201.01995 [pdf, other]
Title: Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu
Comments: 5 pages, 1 figure
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[35] arXiv:2201.01997 [pdf, other]
Title: An exploratory experiment on Hindi, Bengali hate-speech detection and transfer learning using neural networks
Tung Minh Phung, Jan Cloos
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[36] arXiv:2201.02009 [pdf, other]
Title: PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation
Juncheng Wan, Jian Yang, Shuming Ma, Dongdong Zhang, Weinan Zhang, Yong Yu, Zhoujun Li
Comments: 13 pages
Subjects: Computation and Language (cs.CL)
[37] arXiv:2201.02026 [pdf, other]
Title: Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis
Liat Ein-Dor, Ilya Shnayderman, Artem Spector, Lena Dankin, Ranit Aharonov, Noam Slonim
Comments: Published in AAAI 2022
Subjects: Computation and Language (cs.CL)
[38] arXiv:2201.02049 [pdf, other]
Title: Forming Predictive Features of Tweets for Decision-Making Support
Bohdan M. Pavlyshenko
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[39] arXiv:2201.02080 [pdf, other]
Title: BERN2: an advanced neural biomedical named entity recognition and normalization tool
Mujeen Sung, Minbyul Jeong, Yonghwa Choi, Donghyeon Kim, Jinhyuk Lee, Jaewoo Kang
Comments: Published in Bioinformatics 2022. Web service available at this http URL. Code available at this https URL
Subjects: Computation and Language (cs.CL)
[40] arXiv:2201.02113 [pdf, other]
Title: ConTrip: Consensus Sentiment review Analysis and Platform ratings in a single score
José Bonet, José Bonet
Comments: 4 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[41] arXiv:2201.02257 [pdf, other]
Title: Applying Word Embeddings to Measure Valence in Information Operations Targeting Journalists in Brazil
David A. Broniatowski
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[42] arXiv:2201.02312 [pdf, other]
Title: A Transfer Learning Pipeline for Educational Resource Discovery with Application in Leading Paragraph Generation
Irene Li, Thomas George, Alexander Fabbri, Tammy Liao, Benjamin Chen, Rina Kawamura, Richard Zhou, Vanessa Yan, Swapnil Hingmire, Dragomir Radev
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[43] arXiv:2201.02321 [pdf, other]
Title: An Unsupervised Masking Objective for Abstractive Multi-Document News Summarization
Nikolai Vogler, Songlin Li, Yujie Xu, Yujian Mi, Taylor Berg-Kirkpatrick
Subjects: Computation and Language (cs.CL)
[44] arXiv:2201.02387 [pdf, other]
Title: The Defeat of the Winograd Schema Challenge
Vid Kocijan, Ernest Davis, Thomas Lukasiewicz, Gary Marcus, Leora Morgenstern
Subjects: Computation and Language (cs.CL)
[45] arXiv:2201.02419 [pdf, other]
Title: Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[46] arXiv:2201.02489 [pdf, other]
Title: Semantic-based Data Augmentation for Math Word Problems
Ailisi Li, Jiaqing Liang, Yanghua Xiao
Subjects: Computation and Language (cs.CL)
[47] arXiv:2201.02504 [pdf, other]
Title: Repairing Adversarial Texts through Perturbation
Guoliang Dong, Jingyi Wang, Jun Sun, Sudipta Chattopadhyay, Xinyu Wang, Ting Dai, Jie Shi, Jin Song Dong
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[48] arXiv:2201.02510 [pdf, other]
Title: Predicting Patient Readmission Risk from Medical Text via Knowledge Graph Enhanced Multiview Graph Convolution
Qiuhao Lu, Thien Huu Nguyen, Dejing Dou
Comments: SIGIR 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[49] arXiv:2201.02517 [pdf, other]
Title: Development of an Extractive Clinical Question Answering Dataset with Multi-Answer and Multi-Focus Questions
Sungrim Moon, Huan He, Hongfang Liu, Jungwei W. Fan
Comments: 2 tables, 5 figures
Journal-ref: JMIR AI 2023;2:e41818
Subjects: Computation and Language (cs.CL)
[50] arXiv:2201.02550 [pdf, other]
Title: Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition
Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[51] arXiv:2201.02662 [pdf, other]
Title: Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow
Maarten Sap, Anna Jafarpour, Yejin Choi, Noah A. Smith, James W. Pennebaker, Eric Horvitz
Comments: Equal contribution from Sap and Jafarpour; in review; version 2
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[52] arXiv:2201.02710 [pdf, other]
Title: A New Amharic Speech Emotion Dataset and Classification Benchmark
Ephrem A. Retta, Eiad Almekhlafi, Richard Sutcliffe, Mustafa Mhamed, Haider Ali, Jun Feng
Comments: 16 pages, 12 tables, 6 figures
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[53] arXiv:2201.02715 [pdf, other]
Title: Low-Rank Constraints for Fast Inference in Structured Models
Justin T. Chiu, Yuntian Deng, Alexander M. Rush
Comments: 22 pages. Published at NeurIPS 2021
Subjects: Computation and Language (cs.CL)
[54] arXiv:2201.02732 [pdf, other]
Title: C2-CRS: Coarse-to-Fine Contrastive Learning for Conversational Recommender System
Yuanhang Zhou, Kun Zhou, Wayne Xin Zhao, Cheng Wang, Peng Jiang, He Hu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[55] arXiv:2201.02733 [pdf, other]
Title: Testing the Robustness of a BiLSTM-based Structural Story Classifier
Aftab Hussain, Sai Durga Prasad Nanduri, Sneha Seenuvasavarathan
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[56] arXiv:2201.02734 [pdf, other]
Title: Building Human-like Communicative Intelligence: A Grounded Perspective
Marina Dubova
Journal-ref: Cognitive Systems Research, 72, 63-79 (2022)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[57] arXiv:2201.02735 [pdf, other]
Title: A Deep Learning Approach to Integrate Human-Level Understanding in a Chatbot
Afia Fairoose Abedin, Amirul Islam Al Mamun, Rownak Jahan Nowrin, Amitabha Chakrabarty, Moin Mostakim, Sudip Kumar Naskar
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[58] arXiv:2201.02737 [pdf, other]
Title: Cognitive Computing to Optimize IT Services
Abbas Raza Ali
Comments: 2018 IEEE 17th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[59] arXiv:2201.02738 [pdf, other]
Title: Traffic event description based on Twitter data using Unsupervised Learning Methods for Indian road conditions
Yasaswi Sri Chandra Gandhi Kilaru, Indrajit Ghosh
Comments: 13th International (Online) Conference on Transportation Planning and Implementation Methodologies for Developing Countries
Subjects: Computation and Language (cs.CL)
[60] arXiv:2201.02739 [pdf, other]
Title: Adaptive Beam Search to Enhance On-device Abstractive Summarization
Harichandana B S S, Sumit Kumar
Comments: Accepted at IEEE INDICON 2021, 19-21 December, 2021, India
Subjects: Computation and Language (cs.CL)
[61] arXiv:2201.02740 [pdf, other]
Title: Best of Both Worlds: A Hybrid Approach for Multi-Hop Explanation with Declarative Facts
Shane Storks, Qiaozi Gao, Aishwarya Reganti, Govind Thattai
Comments: Accepted to CLeaR Workshop @ AAAI 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[62] arXiv:2201.02792 [pdf, other]
Title: Defining maximum acceptable latency of AI-enhanced CAI tools
Claudio Fantinuoli, Maddalena Montecchio
Comments: Accepted at techLing2021
Subjects: Computation and Language (cs.CL)
[63] arXiv:2201.02797 [pdf, html, other]
Title: A Unified Review of Deep Learning for Automated Medical Coding
Shaoxiong Ji, Wei Sun, Xiaobo Li, Hang Dong, Ara Taalas, Yijia Zhang, Honghan Wu, Esa Pitkänen, Pekka Marttinen
Comments: ACM Computing Surveys
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[64] arXiv:2201.02816 [pdf, other]
Title: Clustering Text Using Attention
Lovedeep Singh
Comments: 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[65] arXiv:2201.02846 [pdf, other]
Title: Coherence-Based Distributed Document Representation Learning for Scientific Documents
Shicheng Tan, Shu Zhao, Yanping Zhang
Subjects: Computation and Language (cs.CL)
[66] arXiv:2201.02977 [pdf, other]
Title: Indian Language Wordnets and their Linkages with Princeton WordNet
Diptesh Kanojia, Kevin Patel, Pushpak Bhattacharyya
Comments: Published at LREC 2018
Subjects: Computation and Language (cs.CL)
[67] arXiv:2201.02993 [pdf, other]
Title: Rethink the Evaluation for Attack Strength of Backdoor Attacks in Natural Language Processing
Lingfeng Shen, Haiyun Jiang, Lemao Liu, Shuming Shi
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[68] arXiv:2201.03017 [pdf, other]
Title: Zero-Shot and Few-Shot Classification of Biomedical Articles in Context of the COVID-19 Pandemic
Simon Lupart, Benoit Favre, Vassilina Nikoulina, Salah Ait-Mokhtar
Comments: to be published at the AAAI-22 Workshop on Scientific Document Understanding
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[69] arXiv:2201.03026 [pdf, other]
Title: An Ensemble Approach to Acronym Extraction using Transformers
Prashant Sharma, Hadeel Saadany, Leonardo Zilio, Diptesh Kanojia, Constantin Orăsan
Comments: Published at SDU@AAAI-22
Subjects: Computation and Language (cs.CL)
[70] arXiv:2201.03035 [pdf, other]
Title: Medication Error Detection Using Contextual Language Models
Yu Jiang, Christian Poellabauer
Comments: AAAI-22 workshop: W3PHIAI-22
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[71] arXiv:2201.03107 [pdf, other]
Title: Projection: A Mixed-Initiative Research Process
Austin Silveria
Subjects: Computation and Language (cs.CL)
[72] arXiv:2201.03110 [pdf, other]
Title: Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning
Aditya Siddhant, Ankur Bapna, Orhan Firat, Yuan Cao, Mia Xu Chen, Isaac Caswell, Xavier Garcia
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[73] arXiv:2201.03115 [pdf, other]
Title: Semantic and sentiment analysis of selected Bhagavad Gita translations using BERT-based language framework
Rohitash Chandra, Venkatesh Kulkarni
Journal-ref: IEEE Access, 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[74] arXiv:2201.03173 [pdf, other]
Title: Quantifying Gender Bias in Consumer Culture
Reihane Boghrati, Jonah Berger
Subjects: Computation and Language (cs.CL)
[75] arXiv:2201.03174 [pdf, other]
Title: Style, Content, and the Success of Ideas
Reihane Boghrati, Jonah Berger, Grant Packard
Subjects: Computation and Language (cs.CL)
[76] arXiv:2201.03188 [pdf, other]
Title: Writing Style Aware Document-level Event Extraction
Zhuo Xu, Yue Wang, Lu Bai, Lixin Cui
Comments: This paper has been submitted to Pattern Recognition Letters
Subjects: Computation and Language (cs.CL)
[77] arXiv:2201.03239 [pdf, other]
Title: There is no rose without a thorn: Finding weaknesses on BlenderBot 2.0 in terms of Model, Data and User-Centric Approach
Jungseob Lee, Midan Shim, Suhyune Son, Chanjun Park, Yujin Kim, Heuiseok Lim
Comments: English and Extension Version of "Empirical study on BlenderBot 2.0 errors analysis in terms of model, data and dialogue" (Journal of the Korea Convergence Society)
Subjects: Computation and Language (cs.CL)
[78] arXiv:2201.03327 [pdf, html, other]
Title: Latency Adjustable Transformer Encoder for Language Understanding
Sajjad Kachuee, Mohammad Sharifkhani
Subjects: Computation and Language (cs.CL)
[79] arXiv:2201.03335 [pdf, other]
Title: DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population
Ningyu Zhang, Xin Xu, Liankuan Tao, Haiyang Yu, Hongbin Ye, Shuofei Qiao, Xin Xie, Xiang Chen, Zhoubo Li, Lei Li, Xiaozhuan Liang, Yunzhi Yao, Shumin Deng, Peng Wang, Wen Zhang, Zhenru Zhang, Chuanqi Tan, Qiang Chen, Feiyu Xiong, Fei Huang, Guozhou Zheng, Huajun Chen
Comments: Accepted by EMNLP 2022 System Demonstrations and the project website is this http URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[80] arXiv:2201.03366 [pdf, other]
Title: Morphological Analysis of Japanese Hiragana Sentences using the BI-LSTM CRF Model
Jun Izutsu, Kanako Komiya
Comments: 13 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[81] arXiv:2201.03382 [pdf, other]
Title: BERT for Sentiment Analysis: Pre-trained and Fine-Tuned Alternatives
Frederico Souza, João Filho
Comments: 10 pages, 1 figure, 3 tables. Accepted at International Conference on the Computational Processing of Portuguese (PROPOR 2022), but not yet published
Subjects: Computation and Language (cs.CL)
[82] arXiv:2201.03423 [pdf, other]
Title: A Survey of Plagiarism Detection Systems: Case of Use with English, French and Arabic Languages
Mehdi Abdelhamid, Faical Azouaou, Sofiane Batata
Comments: 26 pages, 2 figures, 19 tables
Subjects: Computation and Language (cs.CL)
[83] arXiv:2201.03425 [pdf, other]
Title: Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers
Johannes Schneider, Robin Richner, Micha Riser
Comments: International Journal of Artificial Intelligence in Education (Accepted)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[84] arXiv:2201.03445 [pdf, other]
Title: NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese
Sidney Evaldo Leal, Magali Sanches Duran, Carolina Evaristo Scarton, Nathan Siegle Hartmann, Sandra Maria Aluísio
Comments: 26 pages
Subjects: Computation and Language (cs.CL)
[85] arXiv:2201.03511 [pdf, other]
Title: A study on cross-corpus speech emotion recognition and data augmentation
Norbert Braunschweiler, Rama Doddipatla, Simon Keizer, Svetlana Stoyanchev
Comments: Accepted at ASRU 2021
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[86] arXiv:2201.03514 [pdf, other]
Title: Black-Box Tuning for Language-Model-as-a-Service
Tianxiang Sun, Yunfan Shao, Hong Qian, Xuanjing Huang, Xipeng Qiu
Comments: Accepted by ICML 2022. Camera-ready version
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[87] arXiv:2201.03521 [pdf, other]
Title: Polish Natural Language Inference and Factivity -- an Expert-based Dataset and Benchmarks
Daniel Ziembicki, Anna Wróblewska, Karolina Seweryn
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[88] arXiv:2201.03533 [pdf, other]
Title: SCROLLS: Standardized CompaRison Over Long Language Sequences
Uri Shaham, Elad Segal, Maor Ivgi, Avia Efrat, Ori Yoran, Adi Haviv, Ankit Gupta, Wenhan Xiong, Mor Geva, Jonathan Berant, Omer Levy
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[89] arXiv:2201.03655 [pdf, other]
Title: A Likelihood Ratio based Domain Adaptation Method for E2E Models
Chhavi Choudhury, Ankur Gandhe, Xiaohan Ding, Ivan Bulyko
Comments: Submitted to ICASSP 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[90] arXiv:2201.03677 [pdf, other]
Title: Homepage2Vec: Language-Agnostic Website Embedding and Classification
Sylvain Lugeon, Tiziano Piccardi, Robert West
Comments: Published in Proc. of ICWSM 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[91] arXiv:2201.03679 [pdf, other]
Title: Informal Persian Universal Dependency Treebank
Roya Kabiri, Simin Karimi, Mihai Surdeanu
Subjects: Computation and Language (cs.CL)
[92] arXiv:2201.03713 [pdf, other]
Title: CVSS Corpus and Massively Multilingual Speech-to-Speech Translation
Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen
Comments: LREC 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[93] arXiv:2201.03742 [pdf, other]
Title: Explaining Predictive Uncertainty by Looking Back at Model Explanations
Hanjie Chen, Wanyu Du, Yangfeng Ji
Subjects: Computation and Language (cs.CL)
[94] arXiv:2201.03761 [pdf, other]
Title: Prior Knowledge Enhances Radiology Report Generation
Song Wang, Liyan Tang, Mingquan Lin, George Shih, Ying Ding, Yifan Peng
Comments: 10 pages, 4 figures, accepted by AMIA 2022 Informatics Summit
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[95] arXiv:2201.03804 [pdf, other]
Title: CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J. Barezi, Peng Xu, Cheuk Tung Shadow Yiu, Rita Frieske, Holy Lovenia, Genta Indra Winata, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung
Comments: 6 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[96] arXiv:2201.03829 [pdf, other]
Title: Quantifying Robustness to Adversarial Word Substitutions
Yuting Yang, Pei Huang, FeiFei Ma, Juan Cao, Meishan Zhang, Jian Zhang, Jintao Li
Subjects: Computation and Language (cs.CL)
[97] arXiv:2201.03848 [pdf, other]
Title: Turkish Sentiment Analysis Using Machine Learning Methods: Application on Online Food Order Site Reviews
Özlem Aktaş, Berkay Coşkuner, İlker Soner
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[98] arXiv:2201.03857 [pdf, other]
Title: The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild
Taja Kuzman, Peter Rupnik, Nikola Ljubešić
Subjects: Computation and Language (cs.CL)
[99] arXiv:2201.03941 [pdf, other]
Title: Sentiment Analysis with Deep Learning Models: A Comparative Study on a Decade of Sinhala Language Facebook Data
Gihan Weeraprameshwara, Vihanga Jayawickrama, Nisansa de Silva, Yudhanjaya Wijeratne
Comments: 8 pages, LaTeX; typos corrected
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[100] arXiv:2201.04227 [pdf, other]
Title: A Feature Extraction based Model for Hate Speech Identification
Salar Mohtaj, Vera Schmitt, Sebastian Möller
Comments: Accepted at FIRE 2021 - Hate Speech and offensive content detection (HASOC) Track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[101] arXiv:2201.04275 [pdf, other]
Title: PhysNLU: A Language Resource for Evaluating Natural Language Understanding and Explanation Coherence in Physics
Jordan Meadows, Zili Zhou, Andre Freitas
Subjects: Computation and Language (cs.CL)
[102] arXiv:2201.04337 [pdf, other]
Title: PromptBERT: Improving BERT Sentence Embeddings with Prompts
Ting Jiang, Jian Jiao, Shaohan Huang, Zihan Zhang, Deqing Wang, Fuzhen Zhuang, Furu Wei, Haizhen Huang, Denvy Deng, Qi Zhang
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[103] arXiv:2201.04356 [pdf, other]
Title: Computational analyses of the topics, sentiments, literariness, creativity and beauty of texts in a large Corpus of English Literature
Arthur M. Jacobs, Annette Kinder
Comments: 37 pages, 12 figures
Subjects: Computation and Language (cs.CL)
[104] arXiv:2201.04427 [pdf, other]
Title: Differentiating Geographic Movement Described in Text Documents
Scott Pezanowski, Alan M. MacEachren, Prasenjit Mitra
Journal-ref: Transactions in GIS, 00, 1-26 (2021)
Subjects: Computation and Language (cs.CL)
[105] arXiv:2201.04450 [pdf, other]
Title: Biaffine Discourse Dependency Parsing
Yingxue Fu
Subjects: Computation and Language (cs.CL)
[106] arXiv:2201.04467 [pdf, other]
Title: How Does Data Corruption Affect Natural Language Understanding Models? A Study on GLUE datasets
Aarne Talman, Marianna Apidianaki, Stergios Chatzikyriakidis, Jörg Tiedemann
Comments: *SEM 2022 camera ready version
Subjects: Computation and Language (cs.CL)
[107] arXiv:2201.04723 [pdf, other]
Title: Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents
Eric Michael Smith, Orion Hsu, Rebecca Qian, Stephen Roller, Y-Lan Boureau, Jason Weston
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[108] arXiv:2201.04810 [pdf, other]
Title: Recognizing semantic relation in sentence pairs using Tree-RNNs and Typed dependencies
Jeena Kleenankandy, K A Abdul Nazeer
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[109] arXiv:2201.04826 [pdf, other]
Title: Document-level Relation Extraction with Context Guided Mention Integration and Inter-pair Reasoning
Chao Zhao, Daojian Zeng, Lu Xu, Jianhua Dai
Subjects: Computation and Language (cs.CL)
[110] arXiv:2201.04831 [pdf, other]
Title: Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-based Sentiment Analysis
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Hua Jin, Dacheng Tao
Comments: Accepted by IEEE TKDE 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[111] arXiv:2201.04843 [pdf, other]
Title: Multi-task Pre-training Language Model for Semantic Network Completion
Da Li, Sen Yang, Kele Xu, Ming Yi, Yukai He, Huaimin Wang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[112] arXiv:2201.04877 [pdf, other]
Title: A Quadratic 0-1 Programming Approach for Word Sense Disambiguation
Boliang Lin
Comments: 9 pages, 3 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[113] arXiv:2201.04913 [pdf, other]
Title: Compressing Word Embeddings Using Syllables
Laurent Mertens, Joost Vennekens
Comments: 19 pages, 3 figures, 11 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[114] arXiv:2201.05017 [pdf, other]
Title: Towards Automated Error Analysis: Learning to Characterize Errors
Tong Gao, Shivang Singh, Raymond J. Mooney
Comments: 12 pages, 11 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[115] arXiv:2201.05041 [pdf, other]
Title: LARD: Large-scale Artificial Disfluency Generation
T. Passali, T. Mavropoulos, G. Tsoumakas, G. Meditskos, S. Vrochidis
Comments: Accepted at LREC 2022
Subjects: Computation and Language (cs.CL)
[116] arXiv:2201.05051 [pdf, other]
Title: Speech Resources in the Tamasheq Language
Marcely Zanon Boito, Fethi Bougares, Florentin Barbier, Souhir Gahbiche, Loïc Barrault, Mickael Rouvier, Yannick Estève
Comments: Accepted to LREC 2022
Subjects: Computation and Language (cs.CL)
[117] arXiv:2201.05061 [pdf, other]
Title: Feature-rich multiplex lexical networks reveal mental strategies of early language learning
Salvatore Citraro, Michael S. Vitevitch, Massimo Stella, Giulio Rossetti
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[118] arXiv:2201.05088 [pdf, other]
Title: Grow-and-Clip: Informative-yet-Concise Evidence Distillation for Answer Explanation
Yuyan Chen, Yanghua Xiao, Bang Liu
Comments: Accepted to ICDE 2022 (Research Track)
Subjects: Computation and Language (cs.CL)
[119] arXiv:2201.05123 [pdf, other]
Title: NorDiaChange: Diachronic Semantic Change Dataset for Norwegian
Andrey Kutuzov, Samia Touileb, Petter Mæhlum, Tita Ranveig Enstad, Alexandra Wittemann
Comments: LREC'2022 proceedings
Subjects: Computation and Language (cs.CL)
[120] arXiv:2201.05173 [pdf, other]
Title: The Combinatorics of Salva Veritate Principles
Norman E. Trushaev
Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL)
[121] arXiv:2201.05177 [pdf, other]
Title: Making a (Counterfactual) Difference One Rationale at a Time
Mitchell Plyler, Michael Green, Min Chi
Journal-ref: Advances in Neural Information Processing Systems 2021
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[122] arXiv:2201.05230 [pdf, other]
Title: NLP in Human Rights Research -- Extracting Knowledge Graphs About Police and Army Units and Their Commanders
Daniel Bauer (1), Tom Longley (2), Yueen Ma (1), Tony Wilson (2) ((1) Department of Computer Science, Columbia University, (2) Security Force Monitor, Human Rights Institute, Columbia Law School)
Comments: Equal contributions. For associated text corpus see this https URL
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[123] arXiv:2201.05273 [pdf, other]
Title: Pretrained Language Models for Text Generation: A Survey
Junyi Li, Tianyi Tang, Wayne Xin Zhao, Jian-Yun Nie, Ji-Rong Wen
Comments: Under review
Subjects: Computation and Language (cs.CL)
[124] arXiv:2201.05294 [pdf, other]
Title: Multi-Narrative Semantic Overlap Task: Evaluation and Benchmark
Naman Bansal, Mousumi Akter, Shubhra Kanti Karmaker Santu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[125] arXiv:2201.05302 [pdf, other]
Title: Applying a Generic Sequence-to-Sequence Model for Simple and Effective Keyphrase Generation
Md Faisal Mahbub Chowdhury, Gaetano Rossiello, Michael Glass, Nandana Mihindukulasooriya, Alfio Gliozzo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[126] arXiv:2201.05313 [pdf, other]
Title: ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization
Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki
Subjects: Computation and Language (cs.CL)
[127] arXiv:2201.05320 [pdf, other]
Title: CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor, Ori Yoran, Ronan Le Bras, Chandra Bhagavatula, Yoav Goldberg, Yejin Choi, Jonathan Berant
Comments: Presented as Oral at NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128] arXiv:2201.05337 [pdf, other]
Title: A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang, Haolin Song, Shaoyu Li, Ming Zhou, Dawei Song
Comments: Accepted by ACM Computing Surveys Journal
Subjects: Computation and Language (cs.CL)
[129] arXiv:2201.05363 [pdf, other]
Title: Polarity and Subjectivity Detection with Multitask Learning and BERT Embedding
Ranjan Satapathy, Shweta Pardeshi, Erik Cambria
Comments: 10 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130] arXiv:2201.05382 [pdf, other]
Title: Mental Health Assessment for the Chatbots
Yong Shan, Jinchao Zhang, Zekang Li, Yang Feng, Jie Zhou
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[131] arXiv:2201.05411 [pdf, other]
Title: Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
Yinyi Wei, Tong Mo, Yongtao Jiang, Weiping Li, Wen Zhao
Subjects: Computation and Language (cs.CL)
[132] arXiv:2201.05575 [pdf, other]
Title: Reasoning Through Memorization: Nearest Neighbor Knowledge Graph Embeddings
Peng Wang, Xin Xie, Xiaohan Wang, Ningyu Zhang
Comments: NLPCC 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[133] arXiv:2201.05590 [pdf, other]
Title: Czech Grammar Error Correction with a Large and Diverse Corpus
Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen
Comments: Published in TACL, MIT Press
Subjects: Computation and Language (cs.CL)
[134] arXiv:2201.05601 [pdf, other]
Title: A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
Vésteinn Snæbjarnarson, Haukur Barri Símonarson, Pétur Orri Ragnarsson, Svanhvít Lilja Ingólfsdóttir, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson, Hafsteinn Einarsson
Subjects: Computation and Language (cs.CL)
[135] arXiv:2201.05609 [pdf, other]
Title: Multilingual Open Text Release 1: Public Domain News in 44 Languages
Chester Palen-Michel, June Kim, Constantine Lignos
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[136] arXiv:2201.05613 [pdf, other]
Title: The Dark Side of the Language: Pre-trained Transformers in the DarkNet
Leonardo Ranaldi, Aria Nourbakhsh, Arianna Patrizi, Elena Sofia Ruzzetti, Dario Onorati, Francesca Fallucchi, Fabio Massimo Zanzotto
Journal-ref: 2023.ranlp-1.102
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[137] arXiv:2201.05692 [pdf, other]
Title: Model Stability with Continuous Data Updates
Huiting Liu, Avinesh P.V.S., Siddharth Patwardhan, Peter Grasch, Sachin Agarwal
Subjects: Computation and Language (cs.CL)
[138] arXiv:2201.05700 [pdf, other]
Title: Cost-Effective Training in Low-Resource Neural Machine Translation
Sai Koneru, Danni Liu, Jan Niehues
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139] arXiv:2201.05721 [pdf, other]
Title: Extracting Space Situational Awareness Events from News Text
Zhengnan Xie, Alice Saebom Kwak, Enfa George, Laura W. Dozal, Hoang Van, Moriba Jah, Roberto Furfaro, Peter Jansen
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[140] arXiv:2201.05742 [pdf, other]
Title: Kformer: Knowledge Injection in Transformer Feed-Forward Layers
Yunzhi Yao, Shaohan Huang, Li Dong, Furu Wei, Huajun Chen, Ningyu Zhang
Comments: Accepted by NLPCC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[141] arXiv:2201.05767 [pdf, other]
Title: Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems
Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti
Comments: Accepted to EMNLP 2022 as a long paper (Findings). Model code is available at this https URL
Journal-ref: Findings of the Association for Computational Linguistics: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[142] arXiv:2201.05780 [pdf, other]
Title: A Dual Prompt Learning Framework for Few-Shot Dialogue State Tracking
Yuting Yang, Wenqiang Lei, Pei Huang, Juan Cao, Jintao Li, Tat-Seng Chua
Subjects: Computation and Language (cs.CL)
[143] arXiv:2201.05793 [pdf, other]
Title: A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases
Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam
Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[144] arXiv:2201.05878 [pdf, other]
Title: Automatic Lexical Simplification for Turkish
Ahmet Yavuz Uluslu
Subjects: Computation and Language (cs.CL)
[145] arXiv:2201.05880 [pdf, other]
Title: Reasoning over Hybrid Chain for Table-and-Text Open Domain QA
Wanjun Zhong, Junjie Huang, Qian Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
Subjects: Computation and Language (cs.CL)
[146] arXiv:2201.05891 [pdf, other]
Title: Automatic Correction of Syntactic Dependency Annotation Differences
Andrew Zupon, Andrew Carnie, Michael Hammond, Mihai Surdeanu
Subjects: Computation and Language (cs.CL)
[147] arXiv:2201.05899 [pdf, other]
Title: Unobserved Local Structures Make Compositional Generalization Hard
Ben Bogin, Shivanshu Gupta, Jonathan Berant
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[148] arXiv:2201.05922 [pdf, other]
Title: Addressing the Challenges of Cross-Lingual Hate Speech Detection
Irina Bigoulaeva, Viktor Hangya, Iryna Gurevych, Alexander Fraser
Subjects: Computation and Language (cs.CL)
[149] arXiv:2201.05955 [pdf, other]
Title: WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi
Comments: EMNLP Findings camera-ready
Subjects: Computation and Language (cs.CL)
[150] arXiv:2201.05966 [pdf, other]
Title: UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[151] arXiv:2201.05979 [pdf, other]
Title: SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples
Hao Wang, Yangguang Li, Zhen Huang, Yong Dou, Lingpeng Kong, Jing Shao
Comments: 7 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[152] arXiv:2201.05981 [pdf, other]
Title: Double Retrieval and Ranking for Accurate Question Answering
Zeyu Zhang, Thuy Vu, Alessandro Moschitti
Subjects: Computation and Language (cs.CL)
[153] arXiv:2201.05984 [pdf, other]
Title: In Situ Answer Sentence Selection at Web-scale
Zeyu Zhang, Thuy Vu, Alessandro Moschitti
Subjects: Computation and Language (cs.CL)
[154] arXiv:2201.06009 [pdf, other]
Title: Memory-assisted prompt editing to improve GPT-3 after deployment
Aman Madaan, Niket Tandon, Peter Clark, Yiming Yang
Comments: EMNLP 2022. This version updates the title to be consistent with EMNLP camera ready
Subjects: Computation and Language (cs.CL)
[155] arXiv:2201.06025 [pdf, other]
Title: COLD: A Benchmark for Chinese Offensive Language Detection
Jiawen Deng, Jingyan Zhou, Hao Sun, Chujie Zheng, Fei Mi, Helen Meng, Minlie Huang
Comments: 19 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[156] arXiv:2201.06028 [pdf, other]
Title: Natural Language Deduction through Search over Statement Compositions
Kaj Bostrom, Zayne Sprague, Swarat Chaudhuri, Greg Durrett
Comments: Findings of EMNLP 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[157] arXiv:2201.06125 [pdf, other]
Title: Temporal Relation Extraction with a Graph-Based Deep Biaffine Attention Model
Bo-Ying Su, Shang-Ling Hsu, Kuan-Yin Lai, Amarnath Gupta
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[158] arXiv:2201.06134 [pdf, other]
Title: The Ninth Advances in Cognitive Systems (ACS) Conference
Mark Burstein, Mohan Sridharan, David McDonald
Subjects: Computation and Language (cs.CL)
[159] arXiv:2201.06170 [pdf, other]
Title: Evaluation of HTR models without Ground Truth Material
Phillip Benjamin Ströbel, Simon Clematide, Martin Volk, Raphael Schwitter, Tobias Hodel, David Schoch
Comments: Accepted at LREC 2022. Final version submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[160] arXiv:2201.06199 [pdf, other]
Title: Proficiency Matters Quality Estimation in Grammatical Error Correction
Yujin Takahashi, Masahiro Kaneko, Masato Mita, Mamoru Komachi
Comments: 6 pages (4 pages + references)
Subjects: Computation and Language (cs.CL)
[161] arXiv:2201.06206 [pdf, other]
Title: SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning
Yushi Bai, Xin Lv, Juanzi Li, Lei Hou, Yincen Qu, Zelin Dai, Feiyu Xiong
Comments: EMNLP 2022. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[162] arXiv:2201.06219 [pdf, other]
Title: An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets
Yuqiao Wen, Guoqing Luo, Lili Mou
Comments: Accepted by LREC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[163] arXiv:2201.06223 [pdf, other]
Title: Korean-Specific Dataset for Table Question Answering
Changwook Jun, Jooyoung Choi, Myoseop Sim, Hyun Kim, Hansol Jang, Kyungkoo Min
Comments: 7 pages including references and 4 figures
Subjects: Computation and Language (cs.CL)
[164] arXiv:2201.06225 [pdf, other]
Title: Interactive Contrastive Learning for Self-supervised Entity Alignment
Kaisheng Zeng, Zhenhao Dong, Lei Hou, Yixin Cao, Minghao Hu, Jifan Yu, Xin Lv, Juanzi Li, Ling Feng
Comments: Accepted by CIKM 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[165] arXiv:2201.06230 [pdf, other]
Title: Generalizable Neuro-symbolic Systems for Commonsense Question Answering
Alessandro Oltramari, Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee
Comments: In Pascal Hitzler, Md Kamruzzaman Sarker (eds.), Neuro-Symbolic Artificial Intelligence: The State of the Art. Frontiers in Artificial Intelligence and Applications Vol. 342, IOS Press, Amsterdam, 2022. arXiv admin note: text overlap with arXiv:2003.04707
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[166] arXiv:2201.06286 [pdf, other]
Title: MuLVE, A Multi-Language Vocabulary Evaluation Data Set
Anik Jacobsen, Salar Mohtaj, Sebastian Möller
Comments: Submitted to LREC 2022
Journal-ref: Proceedings of the Language Resources and Evaluation Conference. 2022; 673-679
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[167] arXiv:2201.06302 [pdf, other]
Title: On the Context-Free Ambiguity of Emoji
Justyna Czestochowska, Kristina Gligoric, Maxime Peyrard, Yann Mentha, Michal Bien, Andrea Grutter, Anita Auer, Aris Xanthos, Robert West
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[168] arXiv:2201.06309 [pdf, other]
Title: Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition
Pengfei Liu, Kun Li, Helen Meng
Comments: Published in INTERSPEECH-2020
Journal-ref: INTERSPEECH 2020
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[169] arXiv:2201.06313 [pdf, other]
Title: A Deep Convolutional Neural Networks Based Multi-Task Ensemble Model for Aspect and Polarity Classification in Persian Reviews
Milad Vazan, Fatemeh Sadat Masoumi, Sepideh Saeedi Majd
Subjects: Computation and Language (cs.CL)
[170] arXiv:2201.06348 [pdf, other]
Title: Chatbot System Architecture
Moataz Mohammed, Mostafa M. Aref
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[171] arXiv:2201.06384 [pdf, other]
Title: Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbations
Chris Emmery, Ákos Kádár, Grzegorz Chrupała, Walter Daelemans
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[172] arXiv:2201.06469 [pdf, other]
Title: Handling Compounding in Mobile Keyboard Input
Andreas Kabel, Keith Hall, Tom Ouyang, David Rybach, Daan van Esch, Françoise Beaufays
Comments: 7 pages
Subjects: Computation and Language (cs.CL)
[173] arXiv:2201.06496 [pdf, other]
Title: ArCovidVac: Analyzing Arabic Tweets About COVID-19 Vaccination
Hamdy Mubarak, Sabit Hassan, Shammur Absar Chowdhury, Firoj Alam
Comments: 8 pages, 9 figures
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[174] arXiv:2201.06499 [pdf, other]
Title: RuMedBench: A Russian Medical Language Understanding Benchmark
Pavel Blinov, Arina Reshetnikova, Aleksandr Nesterov, Galina Zubkova, Vladimir Kokh
Comments: 11 pages, code available at this https URL; Published in the proceedings of the 20th International Conference on Artificial Intelligence in Medicine, Halifax, Canada
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[175] arXiv:2201.06573 [pdf, other]
Title: PerPaDa: A Persian Paraphrase Dataset based on Implicit Crowdsourcing Data Collection
Salar Mohtaj, Fatemeh Tavakkoli, Habibollah Asghari
Comments: Submitted to LREC 2022
Journal-ref: Proceedings of the Language Resources and Evaluation Conference. 2022; 5090-5096
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[176] arXiv:2201.06642 [pdf, other]
Title: Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji, Pedro Ortiz Suarez, Laurent Romary, Benoît Sagot
Comments: 12 pages, 6 figures, 2 tables
Subjects: Computation and Language (cs.CL)
[177] arXiv:2201.06657 [pdf, other]
Title: A Literature Survey of Recent Advances in Chatbots
Guendalina Caldarini, Sardar Jaf, Kenneth McGarry
Journal-ref: Information 2022, 13(1), 41
Subjects: Computation and Language (cs.CL)
[178] arXiv:2201.06665 [pdf, other]
Title: Text characterization based on recurrence networks
Bárbara C. e Souza, Filipi N. Silva, Henrique F. de Arruda, Giovana D. da Silva, Luciano da F. Costa, Diego R. Amancio
Journal-ref: Information Sciences (2023)
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[179] arXiv:2201.06674 [pdf, other]
Title: TYPIC: A Corpus of Template-Based Diagnostic Comments on Argumentation
Shoichi Naito, Shintaro Sawada, Chihiro Nakagawa, Naoya Inoue, Kenshi Yamaguchi, Iori Shimizu, Farjana Sultana Mim, Keshav Singh, Kentaro Inui
Comments: LREC2022. The dataset is available at this https URL
Subjects: Computation and Language (cs.CL)
[180] arXiv:2201.06721 [pdf, other]
Title: Selecting and combining complementary feature representations and classifiers for hate speech detection
Rafael M. O. Cruz, Woshington V. de Sousa, George D. C. Cavalcanti
Comments: Accepted for publication in the Online Social Networks and Media (OSNEM) journal
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[181] arXiv:2201.06723 [pdf, other]
Title: Emojis as Anchors to Detect Arabic Offensive Language and Hate Speech
Hamdy Mubarak, Sabit Hassan, Shammur Absar Chowdhury
Subjects: Computation and Language (cs.CL)
[182] arXiv:2201.06724 [pdf, other]
Title: Youling: an AI-Assisted Lyrics Creation System
Rongsheng Zhang, Xiaoxi Mao, Le Li, Lin Jiang, Lin Chen, Zhiwei Hu, Yadong Xi, Changjie Fan, Minlie Huang
Comments: Accepted by the EMNLP 2020 demo track
Subjects: Computation and Language (cs.CL)
[183] arXiv:2201.06731 [pdf, other]
Title: Dialog Intent Induction via Density-based Deep Clustering Ensemble
Jiashu Pu, Guandan Chen, Yongzhu Chang, Xiaoxi Mao
Comments: accepted by AAAI-22 W16: Dialog System Technology Challenge (DSTC10)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[184] arXiv:2201.06741 [pdf, other]
Title: HashSet -- A Dataset For Hashtag Segmentation
Prashant Kodali, Akshala Bhatnagar, Naman Ahuja, Manish Shrivastava, Ponnurangam Kumaraguru
Subjects: Computation and Language (cs.CL)
[185] arXiv:2201.06757 [pdf, other]
Title: Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration
Bálint Csanády, András Lukács
Comments: 7 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186] arXiv:2201.06774 [pdf, other]
Title: Hierarchical Neural Network Approaches for Long Document Classification
Snehal Khandve, Vedangi Wagh, Apurva Wani, Isha Joshi, Raviraj Joshi
Comments: Accepted at International Conference on Machine Learning and Computing (ICMLC) 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[187] arXiv:2201.06777 [pdf, other]
Title: COPA-SSE: Semi-structured Explanations for Commonsense Reasoning
Ana Brassard, Benjamin Heinzerling, Pride Kavumba, Kentaro Inui
Comments: 6 pages, 6 figures, LREC 2022. Data available at this https URL
Subjects: Computation and Language (cs.CL)
[188] arXiv:2201.06849 [pdf, other]
Title: Toward Self-learning End-to-End Task-Oriented Dialog Systems
Xiaoying Zhang, Baolin Peng, Jianfeng Gao, Helen Meng
Subjects: Computation and Language (cs.CL)
[189] arXiv:2201.06876 [pdf, other]
Title: Syntax-based data augmentation for Hungarian-English machine translation
Attila Nagy, Patrick Nanys, Balázs Frey Konrád, Bence Bial, Judit Ács
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[190] arXiv:2201.06885 [pdf, other]
Title: Evidence-aware Fake News Detection with Graph Neural Networks
Weizhi Xu, Junfei Wu, Qiang Liu, Shu Wu, Liang Wang
Comments: Accepted by TheWebConf 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191] arXiv:2201.06907 [pdf, other]
Title: Improve Sentence Alignment by Divide-and-conquer
Wu Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[192] arXiv:2201.07099 [pdf, other]
Title: What Makes the Story Forward? Inferring Commonsense Explanations as Prompts for Future Event Generation
Li Lin, Yixin Cao, Lifu Huang, Shu'ang Li, Xuming Hu, Lijie Wen, Jianmin Wang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[193] arXiv:2201.07105 [pdf, other]
Title: Beyond modeling: NLP Pipeline for efficient environmental policy analysis
Jordi Planas, Daniel Firebanks-Quevedo, Galina Naydenova, Ramansh Sharma, Cristina Taylor, Kathleen Buckingham, Rong Fang
Comments: Accepted at Fragile Earth workshop proceedings at KDD 2021
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[194] arXiv:2201.07112 [pdf, other]
Title: Sectioning of Biomedical Abstracts: A Sequence of Sequence Classification Task
Mehmet Efruz Karabulut, K. Vijay-Shanker
Comments: 9 pages, 2 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[195] arXiv:2201.07126 [pdf, other]
Title: Instance-aware Prompt Learning for Language Understanding and Generation
Feihu Jin, Jinliang Lu, Jiajun Zhang, Chengqing Zong
Comments: 7 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[196] arXiv:2201.07198 [pdf, other]
Title: Klexikon: A German Dataset for Joint Summarization and Simplification
Dennis Aumiller, Michael Gertz
Comments: Code and data are available on Github: this https URL
Subjects: Computation and Language (cs.CL)
[197] arXiv:2201.07281 [pdf, other]
Title: Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Hang Jiang, Yining Hua, Doug Beeferman, Deb Roy
Comments: Accepted at LREC 2022 (Long Papers)
Subjects: Computation and Language (cs.CL)
[198] arXiv:2201.07288 [pdf, other]
Title: Extending the Vocabulary of Fictional Languages using Neural Networks
Thomas Zacharias, Ashutosh Taklikar, Raja Giryes
Comments: 10 pages, 1 figure, NeurIPS Workshop on Machine Learning for Creativity and Design 2021
Subjects: Computation and Language (cs.CL)
[199] arXiv:2201.07311 [pdf, other]
Title: Datasheet for the Pile
Stella Biderman, Kieran Bicheno, Leo Gao
Comments: Accompanies "The Pile: An 800GB Dataset of Diverse Text for Language Modeling" arXiv:2101.00027
Subjects: Computation and Language (cs.CL)
[200] arXiv:2201.07317 [pdf, other]
Title: A Privacy-Preserving Unsupervised Domain Adaptation Framework for Clinical Text Analysis
Qiyuan An, Ruijiang Li, Lin Gu, Hao Zhang, Qingyu Chen, Zhiyong Lu, Fei Wang, Yingying Zhu
Subjects: Computation and Language (cs.CL)
[201] arXiv:2201.07341 [pdf, other]
Title: Learning grammar with a divide-and-concur neural network
Sean Deyo, Veit Elser
Journal-ref: Phys. Rev. E 105, 064303 (2022)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Cellular Automata and Lattice Gases (nlin.CG)
[202] arXiv:2201.07365 [pdf, other]
Title: Improving Neural Machine Translation by Denoising Training
Liang Ding, Keqin Peng, Dacheng Tao
Comments: arXiv admin note: text overlap with arXiv:2109.07780
Subjects: Computation and Language (cs.CL)
[203] arXiv:2201.07406 [pdf, other]
Title: Fooling MOSS Detection with Pretrained Language Models
Stella Biderman, Edward Raff
Comments: To appear in the Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204] arXiv:2201.07423 [pdf, other]
Title: Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19
Yueyi Jiang, Yunfan Jiang, Liu Leqi, Piotr Winkielman
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[205] arXiv:2201.07434 [pdf, other]
Title: Interpreting Arabic Transformer Models
Ahmed Abdelali, Nadir Durrani, Fahim Dalvi, Hassan Sajjad
Comments: A new version of the paper was uploaded under a different reference: arXiv:2210.09990
Subjects: Computation and Language (cs.CL)
[206] arXiv:2201.07449 [pdf, other]
Title: TourBERT: A pretrained language model for the tourism industry
Veronika Arefieva, Roman Egger
Comments: Identified a mistake in our calculations. Will fix the problem within the next weeks and resubmit
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[207] arXiv:2201.07489 [pdf, other]
Title: Development of Fake News Model using Machine Learning through Natural Language Processing
Sajjad Ahmed, Knut Hinkelmann, Flavio Corradini
Journal-ref: International Journal of Computer and Information Engineering Vol:14, No:12, 2020
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[208] arXiv:2201.07520 [pdf, other]
Title: CM3: A Causal Masked Multimodal Model of the Internet
Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer
Subjects: Computation and Language (cs.CL)
[209] arXiv:2201.07614 [pdf, other]
Title: Uncovering More Shallow Heuristics: Probing the Natural Language Inference Capacities of Transformer-Based Pre-Trained Language Models Using Syllogistic Patterns
Reto Gubelmann, Siegfried Handschuh
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[210] arXiv:2201.07670 [pdf, other]
Title: Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts
Kilian Theil, Dirk Hovy, Heiner Stuckenschmidt
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[211] arXiv:2201.07725 [pdf, other]
Title: Data-to-Value: An Evaluation-First Methodology for Natural Language Projects
Jochen L. Leidner
Comments: 9 pages, 6 figures, 4 tables
Subjects: Computation and Language (cs.CL); Methodology (stat.ME)
[212] arXiv:2201.07899 [pdf, other]
Title: ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP)
Carol Neidle, Augustine Opoku, Dimitris Metaxas
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[213] arXiv:2201.07902 [pdf, other]
Title: Evaluating Machine Common Sense via Cloze Testing
Ehsan Qasemi, Lee Kezar, Jay Pujara, Pedro Szekely
Subjects: Computation and Language (cs.CL)
[214] arXiv:2201.07905 [pdf, other]
Title: CPTAM: Constituency Parse Tree Aggregation Method
Adithya Kulkarni, Nasim Sabetpour, Alexey Markin, Oliver Eulenstein, Qi Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215] arXiv:2201.08038 [pdf, other]
Title: Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Daisuke Suzuki, Yujin Takahashi, Ikumi Yamashita, Taichi Aida, Tosho Hirasawa, Michitaka Nakatsuji, Masato Mita, Mamoru Komachi
Comments: 8 pages (6 pages + references)
Subjects: Computation and Language (cs.CL)
[216] arXiv:2201.08054 [pdf, other]
Title: VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Yihang Li, Shuichiro Shimizu, Weiqi Gu, Chenhui Chu, Sadao Kurohashi
Comments: Accepted by LREC2022
Subjects: Computation and Language (cs.CL)
[217] arXiv:2201.08070 [pdf, other]
Title: Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi
Comments: An extension of work arXiv:2005.03361
Journal-ref: TALLIP Volume 21, Issue 4, July 2022
Subjects: Computation and Language (cs.CL)
[218] arXiv:2201.08081 [pdf, other]
Title: LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
Qi Shi, Qian Liu, Bei Chen, Yu Zhang, Ting Liu, Jian-Guang Lou
Comments: EMNLP 2022 Findings
Subjects: Computation and Language (cs.CL)
[219] arXiv:2201.08089 [pdf, other]
Title: Why Did You Not Compare With That? Identifying Papers for Use as Baselines
Manjot Bedi, Tanisha Pandey, Sumit Bhatia, Tanmoy Chakraborty
Comments: Preprint of upcoming paper at European Conference on Information Retrieval (ECIR) 2022
Subjects: Computation and Language (cs.CL)
[220] arXiv:2201.08174 [pdf, other]
Title: Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis
Aleksandr Perevalov, Xi Yan, Liubov Kovriguina, Longquan Jiang, Andreas Both, Ricardo Usbeck
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[221] arXiv:2201.08193 [pdf, other]
Title: TextHacker: Learning based Hybrid Local Search Algorithm for Text Hard-label Adversarial Attack
Zhen Yu, Xiaosen Wang, Wanxiang Che, Kun He
Comments: Accepted by EMNLP 2022 Findings, Code is available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222] arXiv:2201.08214 [pdf, html, other]
Title: A Latent-Variable Model for Intrinsic Probing
Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein
Subjects: Computation and Language (cs.CL)
[223] arXiv:2201.08239 [pdf, other]
Title: LaMDA: Language Models for Dialog Applications
Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-Ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-Hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-John, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-Arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[224] arXiv:2201.08277 [pdf, other]
Title: NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Salahudeen Abdullahi, Anuoluwapo Aremu, Alipio Jeorge, Pavel Brazdil
Comments: Submitted to LREC 2022, 13 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[225] arXiv:2201.08318 [pdf, other]
Title: Cheating Automatic Short Answer Grading: On the Adversarial Usage of Adjectives and Adverbs
Anna Filighera, Sebastian Ochs, Tim Steuer, Thomas Tregel
Subjects: Computation and Language (cs.CL)
[226] arXiv:2201.08340 [pdf, other]
Title: Signature Entrenchment and Conceptual Changes in Automated Theory Repair
Xue Li, Alan Bundy, Eugene Philalithis
Comments: Presented at The Ninth Advances in Cognitive Systems (ACS) Conference 2021 (arXiv:2201.06134)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[227] arXiv:2201.08451 [pdf, other]
Title: Regional Negative Bias in Word Embeddings Predicts Racial Animus--but only via Name Frequency
Austin van Loon, Salvatore Giorgi, Robb Willer, Johannes Eichstaedt
Comments: 5 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[228] arXiv:2201.08495 [pdf, other]
Title: SciBERTSUM: Extractive Summarization for Scientific Documents
Athar Sefid, C Lee Giles
Subjects: Computation and Language (cs.CL)
[229] arXiv:2201.08531 [pdf, other]
Title: Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao Zhou, Tong Zhang
Comments: To appear in the Transactions on Machine Learning Research (TMLR)
Subjects: Computation and Language (cs.CL)
[230] arXiv:2201.08542 [pdf, other]
Title: Can Model Compression Improve NLP Fairness
Guangxuan Xu, Qingyuan Hu
Subjects: Computation and Language (cs.CL)
[231] arXiv:2201.08555 [pdf, other]
Title: Identifying Adversarial Attacks on Text Classifiers
Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Sameer Singh, Daniel Lowd
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[232] arXiv:2201.08598 [pdf, other]
Title: Taxonomy Enrichment with Text and Graph Vector Representations
Irina Nikishina, Mikhail Tikhomirov, Varvara Logacheva, Yuriy Nazarov, Alexander Panchenko, Natalia Loukachevitch
Subjects: Computation and Language (cs.CL)
[233] arXiv:2201.08643 [pdf, other]
Title: Text Style Transfer for Bias Mitigation using Masked Language Modeling
Ewoenam Kwaku Tokpo, Toon Calders
Comments: 9 pages, 3 figures, 5 tables
Subjects: Computation and Language (cs.CL)
[234] arXiv:2201.08670 [pdf, other]
Title: Context-Tuning: Learning Contextualized Prompts for Natural Language Generation
Tianyi Tang, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
Comments: 15 pages, accepted by COLING 2022
Subjects: Computation and Language (cs.CL)
[235] arXiv:2201.08675 [pdf, other]
Title: Gender Bias in Text: Labeled Datasets and Lexicons
Jad Doughman, Wael Khreich
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[236] arXiv:2201.08687 [pdf, other]
Title: A Comparative Study on Language Models for Task-Oriented Dialogue Systems
Vinsen Marselino Andreas, Genta Indra Winata, Ayu Purwarianti
Comments: 5 pages, 1 figure
Journal-ref: 2021 8th International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA) (pp. 1-5). IEEE
Subjects: Computation and Language (cs.CL)
[237] arXiv:2201.08702 [pdf, other]
Title: Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation
Qianben Chen, Richong Zhang, Yaowei Zheng, Yongyi Mao
Comments: 8 pages, 4 figures, under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[238] arXiv:2201.08717 [pdf, other]
Title: Personality Type Based on Myers-Briggs Type Indicator with Text Posting Style by using Traditional and Deep Learning
Sakdipat Ontoum, Jonathan H. Chan
Comments: 10 pages, 14 figures, this work was presented at the 11th Joint Symposium on Computational Intelligence (JSCI11)
Subjects: Computation and Language (cs.CL)
[239] arXiv:2201.08860 [pdf, other]
Title: GreaseLM: Graph REASoning Enhanced Language Models for Question Answering
Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, Jure Leskovec
Comments: Published at ICLR 2022. All code, data, and pretrained models are available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[240] arXiv:2201.08904 [pdf, other]
Title: Description-Driven Task-Oriented Dialog Modeling
Jeffrey Zhao, Raghav Gupta, Yuan Cao, Dian Yu, Mingqiu Wang, Harrison Lee, Abhinav Rastogi, Izhak Shafran, Yonghui Wu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241] arXiv:2201.08919 [pdf, other]
Title: Recurrent Neural Networks with Mixed Hierarchical Structures and EM Algorithm for Natural Language Processing
Zhaoxin Luo, Michael Zhu
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
[242] arXiv:2201.08975 [pdf, other]
Title: Chinese Word Segmentation with Heterogeneous Graph Neural Network
Xuemei Tang, Jun Wang, Qi Su
Subjects: Computation and Language (cs.CL)
[243] arXiv:2201.09012 [pdf, other]
Title: Leaf: Multiple-Choice Question Generation
Kristiyan Vachev, Momchil Hardalov, Georgi Karadzhov, Georgi Georgiev, Ivan Koychev, Preslav Nakov
Comments: Accepted to ECIR 2022 (Demo)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[244] arXiv:2201.09060 [pdf, other]
Title: Solvability of orbit-finite systems of linear equations
Arka Ghosh, Piotr Hofman, Sławomir Lasota
Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Logic in Computer Science (cs.LO)
[245] arXiv:2201.09107 [pdf, other]
Title: Visual Information Guided Zero-Shot Paraphrase Generation
Zhe Lin, Xiaojun Wan
Comments: Accepted By COLING 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[246] arXiv:2201.09119 [pdf, other]
Title: A Causal Lens for Controllable Text Generation
Zhiting Hu, Li Erran Li
Comments: NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[247] arXiv:2201.09146 [pdf, other]
Title: Question rewriting? Assessing its importance for conversational question answering
Gonçalo Raposo, Rui Ribeiro, Bruno Martins, Luísa Coheur
Comments: Submitted manuscript (not anonymized) accepted to the 44th European Conference on Information Retrieval (ECIR) 2022. This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Information Retrieval, and is available online at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[248] arXiv:2201.09227 [pdf, other]
Title: A Large and Diverse Arabic Corpus for Language Modeling
Abbas Raza Ali, Muhammad Ajmal Siddiqui, Rema Algunaibet, Hasan Raza Ali
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[249] arXiv:2201.09282 [pdf, other]
Title: WIDAR -- Weighted Input Document Augmented ROUGE
Raghav Jain, Vaibhav Mavi, Anubhav Jangra, Sriparna Saha
Comments: Manuscript Accepted as full paper in ECIR 2022
Subjects: Computation and Language (cs.CL)
[250] arXiv:2201.09324 [pdf, other]
Title: Supervised Visual Attention for Simultaneous Multimodal Machine Translation
Veneta Haralampieva, Ozan Caglayan, Lucia Specia
Comments: Accepted to Journal of Artificial Intelligence Research (JAIR)
Journal-ref: Journal of Artificial Intelligence Research 74 (2022) 1059-1089
Subjects: Computation and Language (cs.CL)
[251] arXiv:2201.09377 [pdf, other]
Title: An Application of Pseudo-Log-Likelihoods to Natural Language Scoring
Darren Abramson, Ali Emami
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[252] arXiv:2201.09518 [pdf, other]
Title: Synthetic Books
Varvara Guljajeva
Comments: 7 pages, 5 figures
Journal-ref: ARTECH 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[253] arXiv:2201.09523 [pdf, other]
Title: BTPK-based interpretable method for NER tasks based on Talmudic Public Announcement Logic
Yulin Chen, Beishui Liao, Bruno Bentzen, Bo Yuan, Zelai Yao, Haixiao Chi, Dov Gabbay
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[254] arXiv:2201.09651 [pdf, other]
Title: Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar, Marius Mosbach, Debanjali Biswas, Dietrich Klakow
Comments: 11 pages of main content, 7 pages of appendix; presented at AKBC CSRR 2021
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[255] arXiv:2201.09680 [pdf, other]
Title: Relational Memory Augmented Language Models
Qi Liu, Dani Yogatama, Phil Blunsom
Comments: Accepted to TACL, pre MIT Press publication version
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[256] arXiv:2201.09696 [pdf, other]
Title: Unified Question Generation with Continual Lifelong Learning
Wei Yuan, Hongzhi Yin, Tieke He, Tong Chen, Qiufeng Wang, Lizhen Cui
Comments: Paper accepted in The Web Conference (WWW) 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[257] arXiv:2201.09745 [pdf, other]
Title: Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks
Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, Dongmei Zhang
Comments: Accepted by IJCAI'2022 survey track
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[258] arXiv:2201.09966 [pdf, other]
Title: Classification Of Fake News Headline Based On Neural Networks
Ke Yahan, Ruyi Qu, Lu Xiaoxia
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[259] arXiv:2201.09997 [pdf, other]
Title: Razmecheno: Named Entity Recognition from Digital Archive of Diaries "Prozhito"
Timofey Atnashev, Veronika Ganeeva, Roman Kazakov, Daria Matyash, Michael Sonkin, Ekaterina Voloshina, Oleg Serikov, Ekaterina Artemova
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[260] arXiv:2201.10005 [pdf, other]
Title: Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan, Tao Xu, Raul Puri, Alec Radford, Jesse Michael Han, Jerry Tworek, Qiming Yuan, Nikolas Tezak, Jong Wook Kim, Chris Hallacy, Johannes Heidecke, Pranav Shyam, Boris Power, Tyna Eloundou Nekoul, Girish Sastry, Gretchen Krueger, David Schnurr, Felipe Petroski Such, Kenny Hsu, Madeleine Thompson, Tabarak Khan, Toki Sherbakov, Joanne Jang, Peter Welinder, Lilian Weng
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[261] arXiv:2201.10066 [pdf, other]
Title: Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
Angelina McMillan-Major, Zaid Alyafeai, Stella Biderman, Kimbo Chen, Francesco De Toni, Gérard Dupont, Hady Elsahar, Chris Emezue, Alham Fikri Aji, Suzana Ilić, Nurulaqilla Khamis, Colin Leong, Maraim Masoud, Aitor Soroa, Pedro Ortiz Suarez, Zeerak Talat, Daniel van Strien, Yacine Jernite
Comments: 8 pages plus appendix and references
Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[262] arXiv:2201.10113 [pdf, other]
Title: Multimodal data matters: language model pre-training over structured and unstructured electronic health records
Sicen Liu, Xiaolong Wang, Yongshuai Hou, Ge Li, Hui Wang, Hui Xu, Yang Xiang, Buzhou Tang
Comments: 12 pages, 5 figures accepted for publication in the IEEE Journal of Biomedical and Health Informatics (J-BHI)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[263] arXiv:2201.10262 [pdf, other]
Title: Do Transformers Encode a Foundational Ontology? Probing Abstract Classes in Natural Language
Mael Jullien, Marco Valentino, Andre Freitas
Subjects: Computation and Language (cs.CL)
[264] arXiv:2201.10274 [pdf, other]
Title: Multi-channel Attentive Graph Convolutional Network With Sentiment Fusion For Multimodal Sentiment Analysis
Luwei Xiao, Xingjiao Wu, Wen Wu, Jing Yang, Liang He
Subjects: Computation and Language (cs.CL)
[265] arXiv:2201.10376 [pdf, other]
Title: Modeling Multi-level Context for Informational Bias Detection by Contrastive Learning and Sentential Graph Network
Shijia Guo, Kenny Q. Zhu
Comments: 10 pages including bibliography
Subjects: Computation and Language (cs.CL)
[266] arXiv:2201.10422 [pdf, other]
Title: Language Generation for Broad-Coverage, Explainable Cognitive Systems
Marjorie McShane, Ivan Leon
Comments: Presented at The Ninth Advances in Cognitive Systems (ACS) Conference 2021 (arXiv:2201.06134)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[267] arXiv:2201.10430 [pdf, other]
Title: A Quantitative and Qualitative Analysis of Schizophrenia Language
Amal Alqahtani, Efsun Sarioglu Kay, Sardar Hamidian, Michael Compton, Mona Diab
Subjects: Computation and Language (cs.CL)
[268] arXiv:2201.10463 [pdf, other]
Title: Distantly supervised end-to-end medical entity extraction from electronic health records with human-level quality
Alexander Nesterov, Dmitry Umerenkov
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[269] arXiv:2201.10474 [pdf, other]
Title: Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, Noah A. Smith
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[270] arXiv:2201.10515 [pdf, other]
Title: Suicidal Ideation Detection on Social Media: A Review of Machine Learning Methods
Asma Abdulsalam, Areej Alhothali
Comments: 14 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[271] arXiv:2201.10588 [pdf, other]
Title: Convex Polytope Modelling for Unsupervised Derivation of Semantic Structure for Data-efficient Natural Language Understanding
Jingyan Zhou, Xiaohan Feng, King Keung Wu, Helen Meng
Subjects: Computation and Language (cs.CL)
[272] arXiv:2201.10608 [pdf, other]
Title: DOM-LM: Learning Generalizable Representations for HTML Documents
Xiang Deng, Prashant Shiralkar, Colin Lockard, Binxuan Huang, Huan Sun
Subjects: Computation and Language (cs.CL)
[273] arXiv:2201.10618 [pdf, other]
Title: The ABBE Corpus: Animate Beings Being Emotional
Samira Zad, Joshuan Jimenez, Mark A. Finlayson
Comments: 9 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[274] arXiv:2201.10707 [pdf, other]
Title: A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model
Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[275] arXiv:2201.10716 [pdf, other]
Title: Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models
Lu Dong, Zhi-Qiang Guo, Chao-Hong Tan, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling
Comments: This paper is accepted by ICASSP2022
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[276] arXiv:2201.10792 [pdf, other]
Title: On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR
Zhao Yang, Dianwen Ng, Xiao Fu, Liping Han, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao
Comments: submitted to INTERSPEECH 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[277] arXiv:2201.10797 [pdf, other]
Title: An Automated Question-Answering Framework Based on Evolution Algorithm
Sinan Tan, Hui Xue, Qiyu Ren, Huaping Liu, Jing Bai
Comments: In Proceedings of the AAAI 2019 Workshop (WS13) on Reasoning and Complex Question-Answering (RCQA-19) this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[278] arXiv:2201.10866 [pdf, other]
Title: CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search
Xiaonan Li, Yeyun Gong, Yelong Shen, Xipeng Qiu, Hang Zhang, Bolun Yao, Weizhen Qi, Daxin Jiang, Weizhu Chen, Nan Duan
Comments: Accepted to EMNLP 2022 (main conference)
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[279] arXiv:2201.10881 [pdf, other]
Title: The Norwegian Parliamentary Speech Corpus
Per Erik Solberg, Pablo Ortiz
Comments: 6 pages, submitted to LREC 2022
Journal-ref: LREC 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[280] arXiv:2201.10927 [pdf, other]
Title: Pair-Level Supervised Contrastive Learning for Natural Language Inference
Shu'ang Li, Xuming Hu, Li Lin, Lijie Wen
Comments: Accepted at ICASSP 2022
Subjects: Computation and Language (cs.CL)
[281] arXiv:2201.10986 [pdf, other]
Title: Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data
Federico Bianchi, Vincenzo Cutrona, Dirk Hovy
Subjects: Computation and Language (cs.CL)
[282] arXiv:2201.11115 [pdf, other]
Title: CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Herbert Ullrich, Jan Drchal, Martin Rýpar, Hana Vincourová, Václav Moravec
Comments: submitted to LREV journal for review, resubmission, changed title according to reviewer suggestion
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[283] arXiv:2201.11153 [pdf, other]
Title: Addressing Issues of Cross-Linguality in Open-Retrieval Question Answering Systems For Emergent Domains
Alon Albalak, Sharon Levy, William Yang Wang
Comments: 6 pages, 8 figures
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[284] arXiv:2201.11155 [pdf, other]
Title: Explainable Patterns for Distinction and Prediction of Moral Judgement on Reddit
Ion Stagkos Efstathiadis, Guilherme Paulino-Passos, Francesca Toni
Comments: 1st Workshop on Human and Machine Decisions (WHMD 2021) at NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[285] arXiv:2201.11172 [pdf, other]
Title: Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Tu Anh Dinh, Danni Liu, Jan Niehues
Comments: 6 pages, 5 figures, accepted to IEEE ICASSP 2022. arXiv admin note: text overlap with arXiv:2107.06010
Journal-ref: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 6222-6226
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[286] arXiv:2201.11176 [pdf, other]
Title: DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
Wei Zhao, Michael Strube, Steffen Eger
Comments: EACL2023 Camera Ready
Subjects: Computation and Language (cs.CL)
[287] arXiv:2201.11258 [pdf, other]
Title: Learning How to Translate North Korean through South Korean
Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, Mamoru Komachi
Comments: 8 pages, 1 figure, 8 tables
Subjects: Computation and Language (cs.CL)
[288] arXiv:2201.11294 [pdf, other]
Title: Highly Generalizable Models for Multilingual Hate Speech Detection
Neha Deshpande, Nicholas Farris, Vidhur Kumar
Subjects: Computation and Language (cs.CL)
[289] arXiv:2201.11312 [pdf, other]
Title: A Higher-Order Semantic Dependency Parser
Bin Li, Yunlong Fan, Yikemaiti Sataer, Zhiqiang Gao
Subjects: Computation and Language (cs.CL)
[290] arXiv:2201.11313 [pdf, other]
Title: Learning Deep Semantic Model for Code Search using CodeSearchNet Corpus
Chen Wu, Ming Yan
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[291] arXiv:2201.11332 [pdf, other]
Title: Ontology-enhanced Prompt-tuning for Few-shot Learning
Hongbin Ye, Ningyu Zhang, Shumin Deng, Xiang Chen, Hui Chen, Feiyu Xiong, Xi Chen, Huajun Chen
Comments: Accepted by WWW2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[292] arXiv:2201.11367 [pdf, other]
Title: Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation
Yihe Wang, Yitong Li, Yasheng Wang, Fei Mi, Pingyi Zhou, Xin Wang, Jin Liu, Xin Jiang, Qun Liu
Comments: Accepted in COLING 2022
Subjects: Computation and Language (cs.CL)
[293] arXiv:2201.11374 [pdf, other]
Title: Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing
Jivnesh Sandhan, Laxmidhar Behera, Pawan Goyal
Comments: Accepted at EACL2023 to be held in Croatia Europe
Subjects: Computation and Language (cs.CL)
[294] arXiv:2201.11391 [pdf, other]
Title: Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal
Comments: The work is accepted at COLING22-SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Subjects: Computation and Language (cs.CL)
[295] arXiv:2201.11443 [pdf, other]
Title: Yes-Yes-Yes: Proactive Data Collection for ACL Rolling Review and Beyond
Nils Dycke, Ilia Kuznetsov, Iryna Gurevych
Comments: Accepted at Findings of EMNLP 2022
Subjects: Computation and Language (cs.CL)
[296] arXiv:2201.11473 [pdf, other]
Title: Reasoning Like Program Executors
Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen
Comments: To appear in EMNLP 2022 main conference. The first two authors contributed equally
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Symbolic Computation (cs.SC)
[297] arXiv:2201.11569 [pdf, other]
Title: Human Interpretation of Saliency-based Explanation Over Text
Hendrik Schuff, Alon Jacovi, Heike Adel, Yoav Goldberg, Ngoc Thang Vu
Comments: FAccT 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[298] arXiv:2201.11576 [pdf, other]
Title: Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation
Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno
Comments: NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[299] arXiv:2201.11582 [pdf, other]
Title: GUDN: A novel guide network with label reinforcement strategy for extreme multi-label text classification
Qing Wang, Jia Zhu, Hongji Shu, Kwame Omono Asamoah, Jianyang Shi, Cong Zhou
Comments: 12 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[300] arXiv:2201.11732 [pdf, other]
Title: IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan Vulić
Comments: ICML 2022
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2201.11766 [pdf, other]
Title: Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding
Matthew Setzler, Scott Howland, Lauren Phillips
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[302] arXiv:2201.11826 [pdf, other]
Title: Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition
Ayoub Ghriss, Bo Yang, Viktor Rozgic, Elizabeth Shriberg, Chao Wang
Comments: ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[303] arXiv:2201.11838 [pdf, other]
Title: Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences
Yikuan Li, Ramsey M. Wehbe, Faraz S. Ahmad, Hanyin Wang, Yuan Luo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[304] arXiv:2201.11867 [pdf, other]
Title: Neural-FST Class Language Model for End-to-End Speech Recognition
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer
Comments: Accepted for publication at ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[305] arXiv:2201.11870 [pdf, other]
Title: Multiple-Source Domain Adaptation via Coordinated Domain Encoders and Paired Classifiers
Payam Karisani
Comments: AAAI 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[306] arXiv:2201.11885 [pdf, other]
Title: Boosting Entity Mention Detection for Targetted Twitter Streams with Global Contextual Embeddings
Satadisha Saha Bhowmick, Eduard C. Dragut, Weiyi Meng
Subjects: Computation and Language (cs.CL)
[307] arXiv:2201.11903 [pdf, other]
Title: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[308] arXiv:2201.11990 [pdf, other]
Title: Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
Comments: Shaden Smith and Mostofa Patwary contributed equally
Subjects: Computation and Language (cs.CL)
[309] arXiv:2201.12093 [pdf, other]
Title: PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings
Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang
Comments: To appear at EMNLP 2022
Subjects: Computation and Language (cs.CL)
[310] arXiv:2201.12105 [pdf, other]
Title: Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon
Comments: ICASSP ©2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[311] arXiv:2201.12109 [pdf, other]
Title: Protum: A New Method For Prompt Tuning Based on "[MASK]"
Pan He, Yuxi Chen, Yan Wang, Yanru Zhang
Comments: under review in ICML
Subjects: Computation and Language (cs.CL)
[312] arXiv:2201.12155 [pdf, other]
Title: Reducing language context confusion for end-to-end code-switching automatic speech recognition
Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng
Comments: arXiv admin note: text overlap with arXiv:2010.14798, the paper has been accepted by Interspeech 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[313] arXiv:2201.12219 [pdf, other]
Title: Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages
Silvia Severini, Ayyoob Imani, Philipp Dufter, Hinrich Schütze
Comments: LREC 2022
Subjects: Computation and Language (cs.CL)
[314] arXiv:2201.12323 [pdf, other]
Title: Describing Differences between Text Distributions with Natural Language
Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt
Comments: International Conference on Machine Learning, 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[315] arXiv:2201.12407 [pdf, other]
Title: Schema-Free Dependency Parsing via Sequence Generation
Boda Lin, Zijun Yao, Jiaxin Shi, Shulin Cao, Binghao Tang, Si Li, Yong Luo, Juanzi Li, Lei Hou
Subjects: Computation and Language (cs.CL)
[316] arXiv:2201.12409 [pdf, other]
Title: A Unified Approach to Entity-Centric Context Tracking in Social Conversations
Ulrich Rückert, Srinivas Sunkara, Abhinav Rastogi, Sushant Prakash, Pranav Khaitan
Comments: Published at LREC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[317] arXiv:2201.12431 [pdf, other]
Title: Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
Uri Alon, Frank F. Xu, Junxian He, Sudipta Sengupta, Dan Roth, Graham Neubig
Comments: Accepted to ICML'2022. Code and models are available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[318] arXiv:2201.12438 [pdf, other]
Title: Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Prajjwal Bhargava, Vincent Ng
Comments: AAAI 2022
Subjects: Computation and Language (cs.CL)
[319] arXiv:2201.12501 [pdf, other]
Title: Does Transliteration Help Multilingual Language Modeling?
Ibraheem Muhammad Moosa, Mahmud Elahi Akhter, Ashfia Binte Habib
Comments: In Findings of the Association for Computational Linguistics: EACL 2023
Subjects: Computation and Language (cs.CL)
[320] arXiv:2201.12502 [pdf, other]
Title: Unsupervised Multi-Granularity Summarization
Ming Zhong, Yang Liu, Suyu Ge, Yuning Mao, Yizhu Jiao, Xingxing Zhang, Yichong Xu, Chenguang Zhu, Michael Zeng, Jiawei Han
Comments: EMNLP 2022 Findings
Subjects: Computation and Language (cs.CL)
[321] arXiv:2201.12507 [pdf, other]
Title: AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
Dongkuan Xu, Subhabrata Mukherjee, Xiaodong Liu, Debadeepta Dey, Wenhui Wang, Xiang Zhang, Ahmed Hassan Awadallah, Jianfeng Gao
Comments: 15 pages, 4 figures, 10 tables
Subjects: Computation and Language (cs.CL)
[322] arXiv:2201.12538 [pdf, other]
Title: Incorporating Commonsense Knowledge into Story Ending Generation via Heterogeneous Graph Networks
Jiaan Wang, Beiqi Zou, Zhixu Li, Jianfeng Qu, Pengpeng Zhao, An Liu, Lei Zhao
Comments: DASFAA 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[323] arXiv:2201.12546 [pdf, other]
Title: Progressive Continual Learning for Spoken Keyword Spotting
Yizheng Huang, Nana Hou, Nancy F. Chen
Comments: ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[324] arXiv:2201.12549 [pdf, other]
Title: A Simple Information-Based Approach to Unsupervised Domain-Adaptive Aspect-Based Sentiment Analysis
Xiang Chen, Xiaojun Wan
Comments: 11 pages, 3 figures, 10 tables
Subjects: Computation and Language (cs.CL)
[325] arXiv:2201.12568 [pdf, other]
Title: Le Processus Powered Dirichlet-Hawkes comme A Priori Flexible pour Clustering Temporel de Textes
Gaël Poux-Médard, Julien Velcin, Sabine Loudcher
Comments: in French
Subjects: Computation and Language (cs.CL)
[326] arXiv:2201.12664 [pdf, other]
Title: A Deep CNN Architecture with Novel Pooling Layer Applied to Two Sudanese Arabic Sentiment Datasets
Mustafa Mhamed, Richard Sutcliffe, Xia Sun, Jun Feng, Eiad Almekhlafi, Ephrem A. Retta
Comments: 19 pages, 11 tables, 11 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[327] arXiv:2201.12793 [pdf, other]
Title: Part of Speech Tagging (POST) of a Low-resource Language using another Language (Developing a POS-Tagged Lexicon for Kurdish (Sorani) using a Tagged Persian (Farsi) Corpus)
Hossein Hassani
Comments: 7 pages, 2 tables, 3 figures
Subjects: Computation and Language (cs.CL)
[328] arXiv:2201.12799 [pdf, other]
Title: Recognition of Implicit Geographic Movement in Text
Scott Pezanowski, Prasenjit Mitra
Journal-ref: Proceedings of The 12th Language Resources and Evaluation Conference, 2047-2056 (2020)
Subjects: Computation and Language (cs.CL)
[329] arXiv:2201.12806 [pdf, other]
Title: Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu
Comments: Accepted by ICASSP 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[330] arXiv:2201.12833 [pdf, other]
Title: Word Segmentation and Morphological Parsing for Sanskrit
Jingwen Li, Leander Girrbach
Comments: Code can be accessed from this https URL
Subjects: Computation and Language (cs.CL)
[331] arXiv:2201.12868 [pdf, other]
Title: Anticipation-Free Training for Simultaneous Machine Translation
Chih-Chiang Chang, Shun-Po Chuang, Hung-yi Lee
Comments: Accepted to IWSLT 2022
Subjects: Computation and Language (cs.CL)
[332] arXiv:2201.12911 [pdf, other]
Title: Grammatical cues to subjecthood are redundant in a majority of simple clauses across languages
Kyle Mahowald, Evgeniia Diachek, Edward Gibson, Evelina Fedorenko, Richard Futrell
Subjects: Computation and Language (cs.CL)
[333] arXiv:2201.12926 [pdf, other]
Title: Compositionality as Lexical Symmetry
Ekin Akyürek, Jacob Andreas
Comments: ACL2023 Final Version
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[334] arXiv:2201.13072 [pdf, other]
Title: Are Mutually Intelligible Languages Easier to Translate?
Avital Friedland, Jonathan Zeltser, Omer Levy
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[335] arXiv:2201.13125 [pdf, other]
Title: Corpus for Automatic Structuring of Legal Documents
Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi
Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[336] arXiv:2201.13230 [pdf, other]
Title: POTATO: exPlainable infOrmation exTrAcTion framewOrk
Ádám Kovács, Kinga Gémes, Eszter Iklódi, Gábor Recski
Comments: 4 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[337] arXiv:2201.13242 [pdf, other]
Title: Correcting diacritics and typos with a ByT5 transformer model
Lukas Stankevičius, Mantas Lukoševičius, Jurgita Kapočiūtė-Dzikienė, Monika Briedienė, Tomas Krilavičius
Journal-ref: Appl. Sci. 2022, 12(5), 2636
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[338] arXiv:2201.13405 [pdf, other]
Title: Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Olga Majewska, Evgeniia Razumovskaia, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen
Subjects: Computation and Language (cs.CL)
[339] arXiv:2201.13429 [pdf, other]
Title: Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations
Wei Zhao, Steffen Eger
Comments: ACML2022 Camera Ready
Subjects: Computation and Language (cs.CL)
[340] arXiv:2201.00195 (cross-list from q-bio.PE) [pdf, other]
Title: Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast
Jayden L. Macklin-Cordes, Erich R. Round
Comments: Accepted for publication in Linguistic Typology. Supplementary data at this https URL. 96 total pages (Main text: 41 pages, 6 figures, 3 tables. Supplementary S1: 34 pages, 1 figure. Supplementary S2: 21 pages)
Subjects: Populations and Evolution (q-bio.PE); Computation and Language (cs.CL)
[341] arXiv:2201.00304 (cross-list from cs.AI) [pdf, other]
Title: Informed Multi-context Entity Alignment
Kexuan Xin, Zequn Sun, Wen Hua, Wei Hu, Xiaofang Zhou
Comments: Accepted by WSDM 2022
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[342] arXiv:2201.00365 (cross-list from cs.IR) [pdf, other]
Title: Establishing Strong Baselines for TripClick Health Retrieval
Sebastian Hofstätter, Sophia Althammer, Mete Sertkan, Allan Hanbury
Comments: Accepted at ECIR 2022
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[343] arXiv:2201.00614 (cross-list from cs.SI) [pdf, other]
Title: Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Subhabrata Dutta, Samiya Caur, Soumen Chakrabarti, Tanmoy Chakraborty
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[344] arXiv:2201.00693 (cross-list from cs.IR) [pdf, other]
Title: Multimodal Entity Tagging with Multimodal Knowledge Base
Hao Peng, Hang Li, Lei Hou, Juanzi Li, Chao Qiao
Comments: 11 pages, 4 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2201.00855 (cross-list from cs.CY) [pdf, other]
Title: AI & Racial Equity: Understanding Sentiment Analysis Artificial Intelligence, Data Security, and Systemic Theory in Criminal Justice Systems
Alia Abbas
Comments: 25 pages
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[346] arXiv:2201.00969 (cross-list from cs.CV) [pdf, other]
Title: Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
Rajagopal A, Nirmala V, Arun Muthuraj Vedamanickam
Comments: In Springer Proceedings. International Conference On Big Data, Machine Learning and Applications 2021. this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[347] arXiv:2201.00971 (cross-list from cs.LG) [pdf, other]
Title: Submix: Practical Private Prediction for Large-Scale Language Models
Antonio Ginart, Laurens van der Maaten, James Zou, Chuan Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[348] arXiv:2201.00975 (cross-list from cs.CV) [pdf, other]
Title: StyleM: Stylized Metrics for Image Captioning Built with Contrastive N-grams
Chengxi Li, Brent Harrison
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[349] arXiv:2201.00985 (cross-list from cs.CV) [pdf, other]
Title: Variational Stacked Local Attention Networks for Diverse Video Captioning
Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M Ashraful Amin, A K M Mahbubur Rahman
Comments: To be published in Winter Conference on Applications of Computer Vision 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[350] arXiv:2201.01209 (cross-list from cs.DB) [pdf, other]
Title: Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[351] arXiv:2201.01490 (cross-list from cs.LG) [pdf, other]
Title: Debiased Learning from Naturally Imbalanced Pseudo-Labels
Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu
Comments: Accepted by CVPR 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2201.01609 (cross-list from cs.CV) [pdf, other]
Title: All You Need In Sign Language Production
Razieh Rastgoo, Kourosh Kiani, Sergio Escalera, Vassilis Athitsos, Mohammad Sabokrou
Comments: arXiv admin note: substantial text overlap with arXiv:2103.15910
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[353] arXiv:2201.01647 (cross-list from cs.AI) [pdf, other]
Title: Comparison of biomedical relationship extraction methods and models for knowledge graph creation
Nikola Milosevic, Wolfgang Thielemann
Comments: Paper submitted to Journal of Semantic Web
Journal-ref: Nikola Milosevic, Wolfgang Thielemann, Comparison of biomedical relationship extraction methods and models for knowledge graph creation, Journal of Web Semantics, 2022, 100756, ISSN 1570-8268,
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[354] arXiv:2201.01745 (cross-list from cs.IR) [pdf, other]
Title: Atomized Search Length: Beyond User Models
John Alex, Keith Hall, Donald Metzler
Comments: 13 pages, 6 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[355] arXiv:2201.01819 (cross-list from cs.LG) [pdf, other]
Title: Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language Models
Diana Kim, Ahmed Elgammal, Marian Mazzone
Comments: 23 pages, This paper is an extended version of a paper that will be published at the 36th AAAI Conference on Artificial Intelligence, to be held in Vancouver, BC, Canada, February 22 - March 1, 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[356] arXiv:2201.01901 (cross-list from cs.CV) [pdf, other]
Title: Incremental Object Grounding Using Scene Graphs
John Seon Keun Yi, Yoonwoo Kim, Sonia Chernova
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[357] arXiv:2201.01984 (cross-list from cs.CV) [pdf, html, other]
Title: Image Captioning via Compact Bidirectional Architecture
Zijie Song, Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Huixia Ben, Richang Hong, Meng Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[358] arXiv:2201.02010 (cross-list from cs.CV) [pdf, other]
Title: Self-Training Vision Language BERTs with a Unified Conditional Model
Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[359] arXiv:2201.02034 (cross-list from stat.AP) [pdf, other]
Title: Bayesian Regression Approach for Building and Stacking Predictive Models in Time Series Analytics
Bohdan M. Pavlyshenko
Subjects: Applications (stat.AP); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[360] arXiv:2201.02058 (cross-list from cs.LG) [pdf, other]
Title: Sales Time Series Analytics Using Deep Q-Learning
Bohdan M. Pavlyshenko
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[361] arXiv:2201.02065 (cross-list from cs.CV) [pdf, other]
Title: ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language
Cleison Correia de Amorim, Cleber Zanchettin
Journal-ref: The paper is under consideration at Pattern Recognition Letters (2022) (under the manuscript number PRLETTERS-D-22-00140)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[362] arXiv:2201.02119 (cross-list from cs.NE) [pdf, other]
Title: An Opinion Mining of Text in COVID-19 Issues along with Comparative Study in ML, BERT & RNN
Md. Mahadi Hasan Sany, Mumenunnesa Keya, Sharun Akter Khushbu, Akm Shahariar Azad Rabby, Abu Kaisar Mohammad Masum
Comments: 16 pages, 9 figures
Journal-ref: 3rd International Conference on Deep Learning, Artificial Intelligence and Robotics, (ICDLAIR) 2021
Subjects: Neural and Evolutionary Computing (cs.NE); Computation and Language (cs.CL)
[363] arXiv:2201.02127 (cross-list from cs.IR) [pdf, other]
Title: Sentiment Analysis and Sarcasm Detection of Indian General Election Tweets
Arpit Khare, Amisha Gangwar, Sudhakar Singh, Shiv Prakash
Comments: 17 pages, 9 figures, ANTIC-2021
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[364] arXiv:2201.02229 (cross-list from cs.LG) [pdf, other]
Title: Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT
Aparna Elangovan, Yuan Li, Douglas E. V. Pires, Melissa J. Davis, Karin Verspoor
Comments: BMC BioInformatics
Journal-ref: BMC Bioinformatics 23, 4 (2022)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[365] arXiv:2201.02280 (cross-list from cs.CV) [pdf, other]
Title: Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping
Nora Horanyi, Kedi Xia, Kwang Moo Yi, Abhishake Kumar Bojja, Ales Leonardis, Hyung Jin Chang
Journal-ref: Pattern Recognition, 2022, 108485, ISSN 0031-3203
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[366] arXiv:2201.02494 (cross-list from cs.CV) [pdf, other]
Title: Progressive Video Summarization via Multimodal Self-supervised Learning
Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[367] arXiv:2201.02495 (cross-list from cs.CV) [pdf, other]
Title: Sign Language Video Retrieval with Free-Form Textual Queries
Amanda Duarte, Samuel Albanie, Xavier Giró-i-Nieto, Gül Varol
Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[368] arXiv:2201.02639 (cross-list from cs.CV) [pdf, other]
Title: MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin Choi
Comments: CVPR 2022. Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[369] arXiv:2201.02772 (cross-list from cs.CV) [pdf, other]
Title: A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval
Zhixiong Zeng, Wenji Mao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[370] arXiv:2201.02857 (cross-list from cs.HC) [pdf, other]
Title: Effect of Toxic Review Content on Overall Product Sentiment
Mayukh Mukhopadhyay, Sangeeta Sahney
Comments: 43 pages, 30 figures, 2 tables
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); General Economics (econ.GN); Applications (stat.AP)
[371] arXiv:2201.03215 (cross-list from cs.LG) [pdf, other]
Title: Handwriting recognition and automatic scoring for descriptive answers in Japanese language tests
Hung Tuan Nguyen, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Masaki Nakagawa
Comments: Keywords: handwritten Japanese answers, handwriting recognition, automatic scoring, ensemble recognition, deep neural networks; Reported in IEICE technical report, PRMU2021-32, pp.45-50 (2021.12) Published after peer review and Presented in ICFHR2022, Lecture Notes in Computer Science, vol. 13639, pp. 274-284 (2022.11)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2201.03306 (cross-list from q-bio.NC) [pdf, other]
Title: A Planck Radiation and Quantization Scheme for Human Cognition and Language
Diederik Aerts, Lester Beltran
Comments: 7 figures
Journal-ref: Frontiers in Psychology 13, 850725 (2022)
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Quantum Physics (quant-ph)
[373] arXiv:2201.03546 (cross-list from cs.CV) [pdf, other]
Title: Language-driven Semantic Segmentation
Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl
Comments: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[374] arXiv:2201.03622 (cross-list from cs.IR) [pdf, other]
Title: Graph-Based Recommendation System Enhanced with Community Detection
Zeinab Shokrzadeh, Mohammad-Reza Feizi-Derakhshi, Mohammad-Ali Balafar, Jamshid Bagherzadeh-Mohasefi
Comments: This is a preprint of an article published in "Scientific Programming"
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[375] arXiv:2201.03967 (cross-list from cs.SD) [pdf, other]
Title: Emotion Intensity and its Control for Emotional Voice Conversion
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li
Comments: Accepted by IEEE Transactions on Affective Computing
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[376] arXiv:2201.03969 (cross-list from cs.LG) [pdf, other]
Title: Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis
Jiahao Zheng, Sen Zhang, Xiaoping Wang, Zhigang Zeng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[377] arXiv:2201.04026 (cross-list from cs.CV) [pdf, other]
Title: Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training
Yehao Li, Jiahao Fan, Yingwei Pan, Ting Yao, Weiyao Lin, Tao Mei
Comments: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[378] arXiv:2201.04419 (cross-list from cs.IR) [pdf, other]
Title: Topic Modeling on Podcast Short-Text Metadata
Francisco B. Valero, Marion Baranes, Elena V. Epure
Comments: Accepted for publication in the 44th European Conference on Information Retrieval (ECIR'22)
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[379] arXiv:2201.04458 (cross-list from cs.IR) [pdf, other]
Title: Diagnosing BERT with Retrieval Heuristics
Arthur Câmara, Claudia Hauff
Comments: Published at ECIR 2020
Journal-ref: Advances in Information Retrieval. 2020;12035:605-618. Published 2020 Mar 17
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[380] arXiv:2201.04672 (cross-list from cs.IR) [pdf, other]
Title: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation
Hejie Cui, Jiaying Lu, Yao Ge, Carl Yang
Comments: This paper has been accepted to the 44th European Conference on Information Retrieval (ECIR) 2022
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[381] arXiv:2201.04868 (cross-list from cs.HC) [pdf, other]
Title: Interactive Data Analysis with Next-step Natural Language Query Recommendation
Xingbo Wang, Furui Cheng, Yong Wang, Ke Xu, Jiang Long, Hong Lu, Huamin Qu
Comments: 14 pages, 6 figures
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Databases (cs.DB)
[382] arXiv:2201.05176 (cross-list from cs.IR) [pdf, other]
Title: Neural Approaches to Conversational Information Retrieval
Jianfeng Gao, Chenyan Xiong, Paul Bennett, Nick Craswell
Comments: Book Draft
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[383] arXiv:2201.05234 (cross-list from cs.IT) [pdf, other]
Title: Optimal alphabet for single text compression
Armen E. Allahverdyan, Andranik Khachatryan
Comments: 17 pages, 14 figures, 2 tables; enlarged and clarified version
Subjects: Information Theory (cs.IT); Computation and Language (cs.CL); Data Analysis, Statistics and Probability (physics.data-an)
[384] arXiv:2201.05299 (cross-list from cs.CV) [pdf, other]
Title: A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering
Feng Gao, Qing Ping, Govind Thattai, Aishwarya Reganti, Ying Nian Wu, Prem Natarajan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[385] arXiv:2201.05334 (cross-list from cs.SI) [pdf, other]
Title: This Must Be the Place: Predicting Engagement of Online Communities in a Large-scale Distributed Campaign
Abraham Israeli, Alexander Kremiansky, Oren Tsur
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL)
[386] arXiv:2201.05409 (cross-list from cs.IR) [pdf, other]
Title: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval
Shitao Xiao, Zheng Liu, Weihao Han, Jianjin Zhang, Yingxia Shao, Defu Lian, Chaozhuo Li, Hao Sun, Denvy Deng, Liangjie Zhang, Qi Zhang, Xing Xie
Comments: Accepted as a full paper in WWW 2022
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[387] arXiv:2201.05460 (cross-list from cs.IR) [pdf, other]
Title: Impact of Stop Sets on Stopping Active Learning for Text Classification
Luke Kurlandski, Michael Bloodgood
Comments: 8 pages, 3 tables, 1 figure; published in Proceedings of the IEEE 16th International Conference on Semantic Computing (ICSC), pages 25-32, January 2022. IEEE
Journal-ref: In Proceedings of the 2022 IEEE 16th International Conference on Semantic Computing (ICSC), pages 25-32, January 2022. IEEE
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[388] arXiv:2201.05648 (cross-list from cs.IR) [pdf, other]
Title: Towards Reducing Manual Workload in Technology-Assisted Reviews: Estimating Ranking Performance
Grace E. Lee, Aixin Sun
Comments: 9 pages, work done in 2019, revised in 2021
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[389] arXiv:2201.05658 (cross-list from cs.AI) [pdf, other]
Title: Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents
Ramon Pires, Fábio C. de Souza, Guilherme Rosa, Roberto A. Lotufo, Rodrigo Nogueira
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[390] arXiv:2201.05729 (cross-list from cs.CV) [pdf, other]
Title: CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks
Zhecan Wang, Noel Codella, Yen-Chun Chen, Luowei Zhou, Jianwei Yang, Xiyang Dai, Bin Xiao, Haoxuan You, Shih-Fu Chang, Lu Yuan
Comments: This paper is greatly modified and updated to be re-submitted to another conference. The new paper is under the name "Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks", this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[391] arXiv:2201.05771 (cross-list from eess.AS) [pdf, other]
Title: KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics
Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol
Comments: 8 pages, 2 figures, 5 tables, accepted to LREC 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[392] arXiv:2201.06224 (cross-list from cs.IR) [pdf, other]
Title: Unintended Bias in Language Model-driven Conversational Recommendation
Tianshu Shen, Jiaru Li, Mohamed Reda Bouadjenek, Zheda Mai, Scott Sanner
Comments: 12 pages, 7 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[393] arXiv:2201.06517 (cross-list from cs.SI) [pdf, other]
Title: Demographic Confounding Causes Extreme Instances of Lifestyle Politics on Facebook
Alexander Ruch, Yujia Zhang, Michael Macy
Comments: 29 pages (27 body, 2 supplemental material), 14 figures (12 body, 2 supplemental material), 2 tables
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Physics and Society (physics.soc-ph)
[394] arXiv:2201.06556 (cross-list from cs.SI) [pdf, other]
Title: Millions of Co-purchases and Reviews Reveal the Spread of Polarization and Lifestyle Politics across Online Markets
Alexander Ruch, Ari Decter-Frain, Raghav Batra
Comments: 25 pages (21 body, 4 supplemental material), 10 figures (4 body, 6 supplemental material), 5 tables (3 body, 2 supplemental material)
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Physics and Society (physics.soc-ph)
[395] arXiv:2201.06653 (cross-list from cs.LG) [pdf, other]
Title: Data-Centric Machine Learning in the Legal Domain
Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[396] arXiv:2201.06786 (cross-list from cs.AI) [pdf, other]
Title: Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues
Akira Taniguchi, Hiroaki Murakami, Ryo Ozaki, Tadahiro Taniguchi
Comments: Accepted to IEEE TRANSACTIONS ON COGNITIVE DEVELOPMENTAL SYSTEMS
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Robotics (cs.RO)
[397] arXiv:2201.06796 (cross-list from cs.HC) [pdf, other]
Title: CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities
Mina Lee, Percy Liang, Qian Yang
Comments: Published as a conference paper at CHI 2022
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL)
[398] arXiv:2201.06841 (cross-list from eess.AS) [pdf, other]
Title: Human and Automatic Speech Recognition Performance on German Oral History Interviews
Michael Gref, Nike Matthiesen, Christoph Schmidt, Sven Behnke, Joachim Köhler
Comments: Submitted to LREC 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[399] arXiv:2201.06868 (cross-list from eess.AS) [pdf, other]
Title: A Study on the Ambiguity in Human Annotation of German Oral History Interviews for Perceived Emotion Recognition and Sentiment Analysis
Michael Gref, Nike Matthiesen, Sreenivasa Hikkal Venugopala, Shalaka Satheesh, Aswinkumar Vijayananth, Duc Bach Ha, Sven Behnke, Joachim Köhler
Comments: Submitted to LREC 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[400] arXiv:2201.06910 (cross-list from cs.LG) [pdf, other]
Title: ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization
Hanwei Xu, Yujun Chen, Yulun Du, Nan Shao, Yanggang Wang, Haiyu Li, Zhilin Yang
Comments: 18 pages
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[401] arXiv:2201.06967 (cross-list from cs.CY) [pdf, other]
Title: Large Scale Analysis of Open MOOC Reviews to Support Learners' Course Selection
Manuel J. Gomez, Mario Calderón, Victor Sánchez, Félix J. García Clemente, José A. Ruipérez-Valiente
Comments: 36 pages, 8 figures
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[402] arXiv:2201.07207 (cross-list from cs.LG) [pdf, other]
Title: Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Wenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch
Comments: Project website at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[403] arXiv:2201.07538 (cross-list from cs.CY) [pdf, other]
Title: Writing about COVID-19 vaccines: Emotional profiling unravels how mainstream and alternative press framed AstraZeneca, Pfizer and vaccination campaigns
Alfonso Semeraro, Salvatore Vilella, Giancarlo Ruffo, Massimo Stella
Comments: 16 pages, 5 figures
Journal-ref: Scientific Reports volume 12, Article number: 14445 (2022)
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL); Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
[404] arXiv:2201.07604 (cross-list from cs.LG) [pdf, other]
Title: Semi-Supervised Clustering with Contrastive Learning for Discovering New Intents
Feng Wei, Zhenbo Chen, Zhenghong Hao, Fengxin Yang, Hua Wei, Bing Han, Sheng Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[405] arXiv:2201.07745 (cross-list from cs.IR) [pdf, other]
Title: Improving Biomedical Information Retrieval with Neural Retrievers
Man Luo, Arindam Mitra, Tejas Gokhale, Chitta Baral
Comments: Accepted at AAAI 2022
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[406] arXiv:2201.07876 (cross-list from cs.SD) [pdf, other]
Title: Unsupervised Personalization of an Emotion Recognition System: The Unique Properties of the Externalization of Valence in Speech
Kusha Sridhar, Carlos Busso
Comments: 8 Figures and 5 tables
Journal-ref: IEEE Transactions on Affective Computing, vol. 13, no. 4, pp. 1959-1972, October-December 2022
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS)
[407] arXiv:2201.07999 (cross-list from cs.LG) [pdf, other]
Title: Sentiment Analysis: Predicting Yelp Scores
Bhanu Prakash Reddy Guda, Mashrin Srivastava, Deep Karkhanis
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[408] arXiv:2201.08071 (cross-list from cs.CV) [pdf, other]
Title: Temporal Sentence Grounding in Videos: A Survey and Future Directions
Hao Zhang, Aixin Sun, Wei Jing, Joey Tianyi Zhou
Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[409] arXiv:2201.08264 (cross-list from cs.CV) [pdf, other]
Title: End-to-end Generative Pretraining for Multimodal Video Captioning
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid
Journal-ref: Proceedings of Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[410] arXiv:2201.08471 (cross-list from cs.IR) [pdf, other]
Title: Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models
Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard
Comments: Accepted at ECIR 2022 (Full paper)
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[411] arXiv:2201.08520 (cross-list from cs.LG) [pdf, other]
Title: Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning
Tongzhou Mu, Kaixiang Lin, Feiyang Niu, Govind Thattai
Comments: Transactions on Machine Learning Research (TMLR)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[412] arXiv:2201.08742 (cross-list from cs.IR) [pdf, other]
Title: Towards Building Economic Models of Conversational Search
Leif Azzopardi, Mohammad Aliannejadi, Evangelos Kanoulas
Comments: To appear in ECIR 2022
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[413] arXiv:2201.08808 (cross-list from cs.IR) [pdf, other]
Title: Conversational Information Seeking
Hamed Zamani, Johanne R. Trippas, Jeff Dalton, Filip Radlinski
Comments: Draft Version 1.2
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[414] arXiv:2201.08810 (cross-list from cs.PL) [pdf, other]
Title: GAP-Gen: Guided Automatic Python Code Generation
Junchen Zhao, Yurun Song, Junlin Wang, Ian G. Harris
Comments: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop
Subjects: Programming Languages (cs.PL); Computation and Language (cs.CL); Machine Learning (cs.LG); Software Engineering (cs.SE)
[415] arXiv:2201.08817 (cross-list from cs.LO) [pdf, other]
Title: Biochemical Space Language in Relation to Multiset Rewriting Systems
Matej Troják, David Šafránek, Luboš Brim
Comments: 9 pages, 8 figures
Subjects: Logic in Computer Science (cs.LO); Computation and Language (cs.CL)
[416] arXiv:2201.09451 (cross-list from cs.SI) [pdf, other]
Title: Emotion-based Modeling of Mental Disorders on Social Media
Xiaobo Guo, Yaojia Sun, Soroush Vosoughi
Comments: Proceedings of the 20th IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL)
[417] arXiv:2201.09486 (cross-list from cs.SD) [pdf, other]
Title: Bias in Automated Speaker Recognition
Wiebke Toussaint Hutiri, Aaron Ding
Journal-ref: 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[418] arXiv:2201.09494 (cross-list from eess.AS) [pdf, other]
Title: Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages
A. Madhavaraj, Ramakrishnan Angarai Ganesan
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[419] arXiv:2201.09555 (cross-list from cs.AI) [pdf, other]
Title: A Knowledge Graph Embeddings based Approach for Author Name Disambiguation using Literals
Cristian Santini, Genet Asefa Gesese, Silvio Peroni, Aldo Gangemi, Harald Sack, Mehwish Alam
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Digital Libraries (cs.DL)
[420] arXiv:2201.09586 (cross-list from eess.AS) [pdf, other]
Title: PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
Takuya Yoshioka, Xiaofei Wang, Dongmei Wang
Comments: 5 pages, 2 figures, 2 tables, accepted for presentation at ICASSP 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[421] arXiv:2201.09708 (cross-list from cs.AI) [pdf, other]
Title: Towards Collaborative Question Answering: A Preliminary Study
Xiangkun Hu, Hang Yan, Qipeng Guo, Xipeng Qiu, Weinan Zhang, Zheng Zhang
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[422] arXiv:2201.09992 (cross-list from cs.IR) [pdf, other]
Title: HC4: A New Suite of Test Collections for Ad Hoc CLIR
Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang
Comments: 16 pages, 2 figures, accepted at ECIR 2022
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[423] arXiv:2201.10207 (cross-list from eess.AS) [pdf, other]
Title: SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training
Wenyong Huang, Zhenhe Zhang, Yu Ting Yeung, Xin Jiang, Qun Liu
Comments: ICLR 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[424] arXiv:2201.10222 (cross-list from cs.LG) [pdf, other]
Title: Explanatory Learning: Beyond Empiricism in Neural Networks
Antonio Norelli, Giorgio Mariani, Luca Moschella, Andrea Santilli, Giambattista Parascandolo, Simone Melzi, Emanuele Rodolà
Comments: Main paper: 10 pages, References: 3 pages, Appendix: 7 pages
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); History and Philosophy of Physics (physics.hist-ph)
[425] arXiv:2201.10240 (cross-list from eess.AS) [pdf, other]
Title: Improving the fusion of acoustic and text representations in RNN-T
Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-yiin Chang
Comments: Paper to appear at ICASSP 2022
Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Sound (cs.SD)
[426] arXiv:2201.10375 (cross-list from eess.AS) [pdf, other]
Title: Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention
Artem Gorodetskii, Ivan Ozhiganov
Comments: Added article structure
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[427] arXiv:2201.10812 (cross-list from stat.AP) [pdf, other]
Title: Both the validity of the cultural tightness index and the association with creativity and order are spurious -- a comment on Jackson et al
Alexander Koplenig, Sascha Wolfer
Subjects: Applications (stat.AP); Computation and Language (cs.CL)
[428] arXiv:2201.10890 (cross-list from cs.LG) [pdf, other]
Title: One Student Knows All Experts Know: From Sparse to Dense
Fuzhao Xue, Xiaoxin He, Xiaozhe Ren, Yuxuan Lou, Yang You
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2201.10978 (cross-list from cs.IR) [pdf, other]
Title: Machine Learning for Food Review and Recommendation
Tan Khang Le, Siu Cheung Hui
Comments: Accepted paper to International Student Conference on Artificial Intelligence (STCAI) 2021
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[430] arXiv:2201.11014 (cross-list from cs.CV) [pdf, other]
Title: Language-biased image classification: evaluation based on semantic representations
Yoann Lemesle, Masataka Sawayama, Guillermo Valle-Perez, Maxime Adolphe, Hélène Sauzéon, Pierre-Yves Oudeyer
Comments: Accepted at ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[431] arXiv:2201.11094 (cross-list from cs.IR) [pdf, other]
Title: SCAI-QReCC Shared Task on Conversational Question Answering
Svitlana Vakulenko, Johannes Kiesel, Maik Fröbe
Comments: 10 pages
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[432] arXiv:2201.11114 (cross-list from cs.CV) [pdf, other]
Title: Natural Language Descriptions of Deep Visual Features
Evan Hernandez, Sarah Schwettmann, David Bau, Teona Bagashvili, Antonio Torralba, Jacob Andreas
Comments: To be published as a conference paper at ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[433] arXiv:2201.11147 (cross-list from q-bio.BM) [pdf, other]
Title: OntoProtein: Protein Pretraining With Gene Ontology Embedding
Ningyu Zhang, Zhen Bi, Xiaozhuan Liang, Siyuan Cheng, Haosen Hong, Shumin Deng, Jiazhang Lian, Qiang Zhang, Huajun Chen
Comments: Accepted by ICLR 2022
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[434] arXiv:2201.11207 (cross-list from cs.SD) [pdf, other]
Title: Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Piotr Żelasko, Siyuan Feng, Laureano Moro Velazquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak
Comments: Accepted for publication in Computer Speech and Language
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[435] arXiv:2201.11249 (cross-list from cs.LG) [pdf, other]
Title: Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment
Xinhang Li, Yong Zhang, Chunxiao Xing
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[436] arXiv:2201.11571 (cross-list from eess.AS) [pdf, other]
Title: Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition
Mohammad Soleymanpour, Michael T. Johnson, Rahim Soleymanpour, Jeffrey Berry
Comments: Accepted ICASSP 2022
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[437] arXiv:2201.11770 (cross-list from cs.SI) [pdf, other]
Title: Going Extreme: Comparative Analysis of Hate Speech in Parler and Gab
Abraham Israeli, Oren Tsur
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL)
[438] arXiv:2201.11794 (cross-list from cs.CV) [pdf, other]
Title: A Survey on Visual Transfer Learning using Knowledge Graphs
Sebastian Monka, Lavdim Halilaj, Achim Rettinger
Comments: Semantic Web Journal (SWJ)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[439] arXiv:2201.11895 (cross-list from cs.LG) [pdf, other]
Title: "That's so cute!": The CARE Dataset for Affective Response Detection
Jane Dwivedi-Yu, Alon Y. Halevy
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[440] arXiv:2201.11934 (cross-list from cs.CR) [pdf, other]
Title: A Secure and Efficient Federated Learning Framework for NLP
Jieren Deng, Chenghong Wang, Xianrui Meng, Yijue Wang, Ji Li, Sheng Lin, Shuo Han, Fei Miao, Sanguthevar Rajasekaran, Caiwen Ding
Comments: Accepted by EMNLP 2021
Subjects: Cryptography and Security (cs.CR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[441] arXiv:2201.11972 (cross-list from eess.AS) [pdf, other]
Title: DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Songxiang Liu, Dan Su, Dong Yu
Comments: Preprint. 16 pages
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[442] arXiv:2201.12091 (cross-list from cs.LG) [pdf, other]
Title: Linear Adversarial Concept Erasure
Shauli Ravfogel, Michael Twiton, Yoav Goldberg, Ryan Cotterell
Comments: Accepted in ICML 2022; a revised version
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[443] arXiv:2201.12114 (cross-list from cs.LG) [pdf, other]
Title: Rethinking Attention-Model Explainability through Faithfulness Violation Test
Yibing Liu, Haoliang Li, Yangyang Guo, Chenqi Kong, Jing Li, Shiqi Wang
Comments: Accepted to ICML 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[444] arXiv:2201.12122 (cross-list from cs.LG) [pdf, other]
Title: Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid, Yutaro Yamada, Shixiang Shane Gu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[445] arXiv:2201.12191 (cross-list from cs.LG) [pdf, html, other]
Title: Kernelized Concept Erasure
Shauli Ravfogel, Francisco Vargas, Yoav Goldberg, Ryan Cotterell
Comments: Accepted as a long paper in EMNLP22
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[446] arXiv:2201.12320 (cross-list from cs.LG) [pdf, other]
Title: Generative Cooperative Networks for Natural Language Generation
Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[447] arXiv:2201.12469 (cross-list from cs.LG) [pdf, other]
Title: ScaLA: Accelerating Adaptation of Pre-Trained Transformer-Based Language Models via Efficient Large-Batch Adversarial Noise
Minjia Zhang, Niranjan Uma Naresh, Yuxiong He
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[448] arXiv:2201.12622 (cross-list from cs.CV) [pdf, other]
Title: Hand Gesture Recognition of Dumb Person Using one Against All Neural Network
Muhammad Asim Khan, Lan Hong, Sajjad Ahmed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[449] arXiv:2201.12675 (cross-list from cs.LG) [pdf, other]
Title: Decepticons: Corrupted Transformers Breach Privacy in Federated Learning for Language Models
Liam Fowl, Jonas Geiping, Steven Reich, Yuxin Wen, Wojtek Czaja, Micah Goldblum, Tom Goldstein
Comments: First two authors contributed equally. Order chosen by coin flip. Published at ICLR 2023. Implementation available at this http URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[450] arXiv:2201.12723 (cross-list from cs.CV) [pdf, other]
Title: A Frustratingly Simple Approach for End-to-End Image Captioning
Ziyang Luo, Yadong Xi, Rongsheng Zhang, Jing Ma
Comments: Work in progress
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[451] arXiv:2201.12796 (cross-list from cs.LG) [pdf, other]
Title: Co-Regularized Adversarial Learning for Multi-Domain Text Classification
Yuan Wu, Diana Inkpen, Ahmed El-Roby
Comments: The paper will appear in AISTATS 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[452] arXiv:2201.12888 (cross-list from cs.CV) [pdf, other]
Title: A Dataset for Medical Instructional Video Classification and Question Answering
Deepak Gupta, Kush Attal, Dina Demner-Fushman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[453] arXiv:2201.12950 (cross-list from cs.SE) [pdf, other]
Title: Network Programming via Computable Products
Dennis Volpano
Comments: 6 pages, 7 tables
Subjects: Software Engineering (cs.SE); Computation and Language (cs.CL); Programming Languages (cs.PL)