Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for January 2022

Total of 453 entries : 126-375 251-453
Showing up to 250 entries per page: fewer | more | all
[126] arXiv:2201.05313 [pdf, other]
Title: ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization
Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki
Subjects: Computation and Language (cs.CL)
[127] arXiv:2201.05320 [pdf, other]
Title: CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor, Ori Yoran, Ronan Le Bras, Chandra Bhagavatula, Yoav Goldberg, Yejin Choi, Jonathan Berant
Comments: Presented as Oral at NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[128] arXiv:2201.05337 [pdf, other]
Title: A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang, Haolin Song, Shaoyu Li, Ming Zhou, Dawei Song
Comments: Accpeted by ACM Computing Surveys Journal
Subjects: Computation and Language (cs.CL)
[129] arXiv:2201.05363 [pdf, other]
Title: Polarity and Subjectivity Detection with Multitask Learning and BERT Embedding
Ranjan Satapathy, Shweta Pardeshi, Erik Cambria
Comments: 10 pages, 4 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[130] arXiv:2201.05382 [pdf, other]
Title: Mental Health Assessment for the Chatbots
Yong Shan, Jinchao Zhang, Zekang Li, Yang Feng, Jie Zhou
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
[131] arXiv:2201.05411 [pdf, other]
Title: Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
Yinyi Wei, Tong Mo, Yongtao Jiang, Weiping Li, Wen Zhao
Subjects: Computation and Language (cs.CL)
[132] arXiv:2201.05575 [pdf, other]
Title: Reasoning Through Memorization: Nearest Neighbor Knowledge Graph Embeddings
Peng Wang, Xin Xie, Xiaohan Wang, Ningyu Zhang
Comments: NLPCC 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[133] arXiv:2201.05590 [pdf, other]
Title: Czech Grammar Error Correction with a Large and Diverse Corpus
Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen
Comments: Published in TACL, MIT Press
Subjects: Computation and Language (cs.CL)
[134] arXiv:2201.05601 [pdf, other]
Title: A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
Vésteinn Snæbjarnarson, Haukur Barri Símonarson, Pétur Orri Ragnarsson, Svanhvít Lilja Ingólfsdóttir, Haukur Páll Jónsson, Vilhjálmur Þorsteinsson, Hafsteinn Einarsson
Subjects: Computation and Language (cs.CL)
[135] arXiv:2201.05609 [pdf, other]
Title: Multilingual Open Text Release 1: Public Domain News in 44 Languages
Chester Palen-Michel, June Kim, Constantine Lignos
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[136] arXiv:2201.05613 [pdf, other]
Title: The Dark Side of the Language: Pre-trained Transformers in the DarkNet
Leonardo Ranaldi, Aria Nourbakhsh, Arianna Patrizi, Elena Sofia Ruzzetti, Dario Onorati, Francesca Fallucchi, Fabio Massimo Zanzotto
Journal-ref: 2023.ranlp-1.102
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[137] arXiv:2201.05692 [pdf, other]
Title: Model Stability with Continuous Data Updates
Huiting Liu, Avinesh P.V.S., Siddharth Patwardhan, Peter Grasch, Sachin Agarwal
Subjects: Computation and Language (cs.CL)
[138] arXiv:2201.05700 [pdf, other]
Title: Cost-Effective Training in Low-Resource Neural Machine Translation
Sai Koneru, Danni Liu, Jan Niehues
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[139] arXiv:2201.05721 [pdf, other]
Title: Extracting Space Situational Awareness Events from News Text
Zhengnan Xie, Alice Saebom Kwak, Enfa George, Laura W. Dozal, Hoang Van, Moriba Jah, Roberto Furfaro, Peter Jansen
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[140] arXiv:2201.05742 [pdf, other]
Title: Kformer: Knowledge Injection in Transformer Feed-Forward Layers
Yunzhi Yao, Shaohan Huang, Li Dong, Furu Wei, Huajun Chen, Ningyu Zhang
Comments: Accepted by NLPCC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[141] arXiv:2201.05767 [pdf, other]
Title: Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems
Yoshitomo Matsubara, Luca Soldaini, Eric Lind, Alessandro Moschitti
Comments: Accepted to EMNLP 2022 as a long paper (Findings). Model code is available at this https URL
Journal-ref: Findings of the Association for Computational Linguistics: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[142] arXiv:2201.05780 [pdf, other]
Title: A Dual Prompt Learning Framework for Few-Shot Dialogue State Tracking
Yuting Yang, Wenqiang Lei, Pei Huang, Juan Cao, Jintao Li, Tat-Seng Chua
Subjects: Computation and Language (cs.CL)
[143] arXiv:2201.05793 [pdf, other]
Title: A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases
Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam
Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[144] arXiv:2201.05878 [pdf, other]
Title: Automatic Lexical Simplification for Turkish
Ahmet Yavuz Uluslu
Subjects: Computation and Language (cs.CL)
[145] arXiv:2201.05880 [pdf, other]
Title: Reasoning over Hybrid Chain for Table-and-Text Open Domain QA
Wanjun Zhong, Junjie Huang, Qian Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
Subjects: Computation and Language (cs.CL)
[146] arXiv:2201.05891 [pdf, other]
Title: Automatic Correction of Syntactic Dependency Annotation Differences
Andrew Zupon, Andrew Carnie, Michael Hammond, Mihai Surdeanu
Subjects: Computation and Language (cs.CL)
[147] arXiv:2201.05899 [pdf, other]
Title: Unobserved Local Structures Make Compositional Generalization Hard
Ben Bogin, Shivanshu Gupta, Jonathan Berant
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[148] arXiv:2201.05922 [pdf, other]
Title: Addressing the Challenges of Cross-Lingual Hate Speech Detection
Irina Bigoulaeva, Viktor Hangya, Iryna Gurevych, Alexander Fraser
Subjects: Computation and Language (cs.CL)
[149] arXiv:2201.05955 [pdf, other]
Title: WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation
Alisa Liu, Swabha Swayamdipta, Noah A. Smith, Yejin Choi
Comments: EMNLP Findings camera-ready
Subjects: Computation and Language (cs.CL)
[150] arXiv:2201.05966 [pdf, other]
Title: UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
Tianbao Xie, Chen Henry Wu, Peng Shi, Ruiqi Zhong, Torsten Scholak, Michihiro Yasunaga, Chien-Sheng Wu, Ming Zhong, Pengcheng Yin, Sida I. Wang, Victor Zhong, Bailin Wang, Chengzu Li, Connor Boyle, Ansong Ni, Ziyu Yao, Dragomir Radev, Caiming Xiong, Lingpeng Kong, Rui Zhang, Noah A. Smith, Luke Zettlemoyer, Tao Yu
Comments: EMNLP 2022
Subjects: Computation and Language (cs.CL)
[151] arXiv:2201.05979 [pdf, other]
Title: SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples
Hao Wang, Yangguang Li, Zhen Huang, Yong Dou, Lingpeng Kong, Jing Shao
Comments: 7 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[152] arXiv:2201.05981 [pdf, other]
Title: Double Retrieval and Ranking for Accurate Question Answering
Zeyu Zhang, Thuy Vu, Alessandro Moschitti
Subjects: Computation and Language (cs.CL)
[153] arXiv:2201.05984 [pdf, other]
Title: In Situ Answer Sentence Selection at Web-scale
Zeyu Zhang, Thuy Vu, Alessandro Moschitti
Subjects: Computation and Language (cs.CL)
[154] arXiv:2201.06009 [pdf, other]
Title: Memory-assisted prompt editing to improve GPT-3 after deployment
Aman Madaan, Niket Tandon, Peter Clark, Yiming Yang
Comments: EMNLP 2022. This version updates the title to be consistent with EMNLP camera ready
Subjects: Computation and Language (cs.CL)
[155] arXiv:2201.06025 [pdf, other]
Title: COLD: A Benchmark for Chinese Offensive Language Detection
Jiawen Deng, Jingyan Zhou, Hao Sun, Chujie Zheng, Fei Mi, Helen Meng, Minlie Huang
Comments: 19 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[156] arXiv:2201.06028 [pdf, other]
Title: Natural Language Deduction through Search over Statement Compositions
Kaj Bostrom, Zayne Sprague, Swarat Chaudhuri, Greg Durrett
Comments: Findings of EMNLP 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[157] arXiv:2201.06125 [pdf, other]
Title: Temporal Relation Extraction with a Graph-Based Deep Biaffine Attention Model
Bo-Ying Su, Shang-Ling Hsu, Kuan-Yin Lai, Amarnath Gupta
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[158] arXiv:2201.06134 [pdf, other]
Title: The Ninth Advances in Cognitive Systems (ACS) Conference
Mark Burstein, Mohan Sridharan, David McDonald
Subjects: Computation and Language (cs.CL)
[159] arXiv:2201.06170 [pdf, other]
Title: Evaluation of HTR models without Ground Truth Material
Phillip Benjamin Ströbel, Simon Clematide, Martin Volk, Raphael Schwitter, Tobias Hodel, David Schoch
Comments: Accepted at LREC 2022. Final version submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[160] arXiv:2201.06199 [pdf, other]
Title: Proficiency Matters Quality Estimation in Grammatical Error Correction
Yujin Takahashi, Masahiro Kaneko, Masato Mita, Mamoru Komachi
Comments: 6 pages (4 pages + references)
Subjects: Computation and Language (cs.CL)
[161] arXiv:2201.06206 [pdf, other]
Title: SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning
Yushi Bai, Xin Lv, Juanzi Li, Lei Hou, Yincen Qu, Zelin Dai, Feiyu Xiong
Comments: EMNLP 2022. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[162] arXiv:2201.06219 [pdf, other]
Title: An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets
Yuqiao Wen, Guoqing Luo, Lili Mou
Comments: Accepted by LREC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[163] arXiv:2201.06223 [pdf, other]
Title: Korean-Specific Dataset for Table Question Answering
Changwook Jun, Jooyoung Choi, Myoseop Sim, Hyun Kim, Hansol Jang, Kyungkoo Min
Comments: 7 pages including references and 4 figures
Subjects: Computation and Language (cs.CL)
[164] arXiv:2201.06225 [pdf, other]
Title: Interactive Contrastive Learning for Self-supervised Entity Alignment
Kaisheng Zeng, Zhenhao Dong, Lei Hou, Yixin Cao, Minghao Hu, Jifan Yu, Xin Lv, Juanzi Li, Ling Feng
Comments: Accepted by CIKM 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[165] arXiv:2201.06230 [pdf, other]
Title: Generalizable Neuro-symbolic Systems for Commonsense Question Answering
Alessandro Oltramari, Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee
Comments: In Pascal Hitzler, Md Kamruzzaman Sarker (eds.), Neuro-Symbolic Artificial Intelligence: The State of the Art. Frontiers in Artificial Intelligence and Applications Vol. 342, IOS Press, Amsterdam, 2022. arXiv admin note: text overlap with arXiv:2003.04707
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[166] arXiv:2201.06286 [pdf, other]
Title: MuLVE, A Multi-Language Vocabulary Evaluation Data Set
Anik Jacobsen, Salar Mohtaj, Sebastian Möller
Comments: Submitted to LREC 2022
Journal-ref: Proceedings of the Language Resources and Evaluation Conference. 2022; 673-679
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[167] arXiv:2201.06302 [pdf, other]
Title: On the Context-Free Ambiguity of Emoji
Justyna Czestochowska, Kristina Gligoric, Maxime Peyrard, Yann Mentha, Michal Bien, Andrea Grutter, Anita Auer, Aris Xanthos, Robert West
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
[168] arXiv:2201.06309 [pdf, other]
Title: Group Gated Fusion on Attention-based Bidirectional Alignment for Multimodal Emotion Recognition
Pengfei Liu, Kun Li, Helen Meng
Comments: Published in INTERSPEECH-2020
Journal-ref: INTERSPEECH 2020
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[169] arXiv:2201.06313 [pdf, other]
Title: A Deep Convolutional Neural Networks Based Multi-Task Ensemble Model for Aspect and Polarity Classification in Persian Reviews
Milad Vazan, Fatemeh Sadat Masoumi, Sepideh Saeedi Majd
Subjects: Computation and Language (cs.CL)
[170] arXiv:2201.06348 [pdf, other]
Title: Chatbot System Architecture
Moataz Mohammed, Mostafa M. Aref
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[171] arXiv:2201.06384 [pdf, other]
Title: Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbations
Chris Emmery, Ákos Kádár, Grzegorz Chrupała, Walter Daelemans
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[172] arXiv:2201.06469 [pdf, other]
Title: Handling Compounding in Mobile Keyboard Input
Andreas Kabel, Keith Hall, Tom Ouyang, David Rybach, Daan van Esch, Françoise Beaufays
Comments: 7 pages
Subjects: Computation and Language (cs.CL)
[173] arXiv:2201.06496 [pdf, other]
Title: ArCovidVac: Analyzing Arabic Tweets About COVID-19 Vaccination
Hamdy Mubarak, Sabit Hassan, Shammur Absar Chowdhury, Firoj Alam
Comments: 8 pages, 9 figures
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[174] arXiv:2201.06499 [pdf, other]
Title: RuMedBench: A Russian Medical Language Understanding Benchmark
Pavel Blinov, Arina Reshetnikova, Aleksandr Nesterov, Galina Zubkova, Vladimir Kokh
Comments: 11 pages, code available at this https URL; Published in the proceedings of 20th International Conference on Artificial Intelligence in Medicine, Halifax, Canada; code available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[175] arXiv:2201.06573 [pdf, other]
Title: PerPaDa: A Persian Paraphrase Dataset based on Implicit Crowdsourcing Data Collection
Salar Mohtaj, Fatemeh Tavakkoli, Habibollah Asghari
Comments: Submitted to LREC 2022
Journal-ref: Proceedings of the Language Resources and Evaluation Conference. 2022; 5090-5096
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[176] arXiv:2201.06642 [pdf, other]
Title: Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji, Pedro Ortiz Suarez, Laurent Romary, Benoît Sagot
Comments: 12 pages, 6 figures, 2 tables
Subjects: Computation and Language (cs.CL)
[177] arXiv:2201.06657 [pdf, other]
Title: A Literature Survey of Recent Advances in Chatbots
Guendalina Caldarini, Sardar Jaf, Kenneth McGarry
Journal-ref: Information 2022, 13(1), 41
Subjects: Computation and Language (cs.CL)
[178] arXiv:2201.06665 [pdf, other]
Title: Text characterization based on recurrence networks
Bárbara C. e Souza, Filipi N. Silva, Henrique F. de Arruda, Giovana D. da Silva, Luciano da F. Costa, Diego R. Amancio
Journal-ref: Information Sciences (2023)
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[179] arXiv:2201.06674 [pdf, other]
Title: TYPIC: A Corpus of Template-Based Diagnostic Comments on Argumentation
Shoichi Naito, Shintaro Sawada, Chihiro Nakagawa, Naoya Inoue, Kenshi Yamaguchi, Iori Shimizu, Farjana Sultana Mim, Keshav Singh, Kentaro Inui
Comments: LREC2022. The dataset is available at this https URL
Subjects: Computation and Language (cs.CL)
[180] arXiv:2201.06721 [pdf, other]
Title: Selecting and combining complementary feature representations and classifiers for hate speech detection
Rafael M. O. Cruz, Woshington V. de Sousa, George D. C. Cavalcanti
Comments: acceped for publication on the Online Social Networks and Media (OSNEM) journal
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Social and Information Networks (cs.SI)
[181] arXiv:2201.06723 [pdf, other]
Title: Emojis as Anchors to Detect Arabic Offensive Language and Hate Speech
Hamdy Mubarak, Sabit Hassan, Shammur Absar Chowdhury
Subjects: Computation and Language (cs.CL)
[182] arXiv:2201.06724 [pdf, other]
Title: Youling: an AI-Assisted Lyrics Creation System
Rongsheng Zhang, Xiaoxi Mao, Le Li, Lin Jiang, Lin Chen, Zhiwei Hu, Yadong Xi, Changjie Fan, Minlie Huang
Comments: accept by emnlp2020 demo track
Subjects: Computation and Language (cs.CL)
[183] arXiv:2201.06731 [pdf, other]
Title: Dialog Intent Induction via Density-based Deep Clustering Ensemble
Jiashu Pu, Guandan Chen, Yongzhu Chang, Xiaoxi Mao
Comments: accepted by AAAI-22 W16: Dialog System Technology Challenge (DSTC10)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[184] arXiv:2201.06741 [pdf, other]
Title: HashSet -- A Dataset For Hashtag Segmentation
Prashant Kodali, Akshala Bhatnagar, Naman Ahuja, Manish Shrivastava, Ponnurangam Kumaraguru
Subjects: Computation and Language (cs.CL)
[185] arXiv:2201.06757 [pdf, other]
Title: Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration
Bálint Csanády, András Lukács
Comments: 7 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[186] arXiv:2201.06774 [pdf, other]
Title: Hierarchical Neural Network Approaches for Long Document Classification
Snehal Khandve, Vedangi Wagh, Apurva Wani, Isha Joshi, Raviraj Joshi
Comments: Accepted at International Conference on Machine Learning and Computing (ICMLC) 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[187] arXiv:2201.06777 [pdf, other]
Title: COPA-SSE: Semi-structured Explanations for Commonsense Reasoning
Ana Brassard, Benjamin Heinzerling, Pride Kavumba, Kentaro Inui
Comments: 6 pages, 6 figures, LREC 2022. Data available at this https URL
Subjects: Computation and Language (cs.CL)
[188] arXiv:2201.06849 [pdf, other]
Title: Toward Self-learning End-to-End Task-Oriented Dialog Systems
Xiaoying Zhang, Baolin Peng, Jianfeng Gao, Helen Meng
Subjects: Computation and Language (cs.CL)
[189] arXiv:2201.06876 [pdf, other]
Title: Syntax-based data augmentation for Hungarian-English machine translation
Attila Nagy, Patrick Nanys, Balázs Frey Konrád, Bence Bial, Judit Ács
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[190] arXiv:2201.06885 [pdf, other]
Title: Evidence-aware Fake News Detection with Graph Neural Networks
Weizhi Xu, Junfei Wu, Qiang Liu, Shu Wu, Liang Wang
Comments: Accepted by TheWebConf 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[191] arXiv:2201.06907 [pdf, other]
Title: Improve Sentence Alignment by Divide-and-conquer
Wu Zhang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[192] arXiv:2201.07099 [pdf, other]
Title: What Makes the Story Forward? Inferring Commonsense Explanations as Prompts for Future Event Generation
Li Lin, Yixin Cao, Lifu Huang, Shu'ang Li, Xuming Hu, Lijie Wen, Jianmin Wang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[193] arXiv:2201.07105 [pdf, other]
Title: Beyond modeling: NLP Pipeline for efficient environmental policy analysis
Jordi Planas, Daniel Firebanks-Quevedo, Galina Naydenova, Ramansh Sharma, Cristina Taylor, Kathleen Buckingham, Rong Fang
Comments: Accepted at Fragile Earth workshop proceedings at KDD 2021
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[194] arXiv:2201.07112 [pdf, other]
Title: Sectioning of Biomedical Abstracts: A Sequence of Sequence Classification Task
Mehmet Efruz Karabulut, K. Vijay-Shanker
Comments: 9 pages, 2 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[195] arXiv:2201.07126 [pdf, other]
Title: Instance-aware Prompt Learning for Language Understanding and Generation
Feihu Jin, Jinliang Lu, Jiajun Zhang, Chengqing Zong
Comments: 7 pages, 5 figures
Subjects: Computation and Language (cs.CL)
[196] arXiv:2201.07198 [pdf, other]
Title: Klexikon: A German Dataset for Joint Summarization and Simplification
Dennis Aumiller, Michael Gertz
Comments: Code and data are available on Github: this https URL
Subjects: Computation and Language (cs.CL)
[197] arXiv:2201.07281 [pdf, other]
Title: Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Hang Jiang, Yining Hua, Doug Beeferman, Deb Roy
Comments: Accepted at LREC 2022 (Long Papers)
Subjects: Computation and Language (cs.CL)
[198] arXiv:2201.07288 [pdf, other]
Title: Extending the Vocabulary of Fictional Languages using Neural Networks
Thomas Zacharias, Ashutosh Taklikar, Raja Giryes
Comments: 10 pages, 1 figure, NeurIPS Workshop on Machine Learning for Creativity and Design 2021
Subjects: Computation and Language (cs.CL)
[199] arXiv:2201.07311 [pdf, other]
Title: Datasheet for the Pile
Stella Biderman, Kieran Bicheno, Leo Gao
Comments: Accompanies "The Pile: An 800GB Dataset of Diverse Text for Language Modeling" arXiv:2101.00027
Subjects: Computation and Language (cs.CL)
[200] arXiv:2201.07317 [pdf, other]
Title: A Privacy-Preserving Unsupervised Domain Adaptation Framework for Clinical Text Analysis
Qiyuan An, Ruijiang Li, Lin Gu, Hao Zhang, Qingyu Chen, Zhiyong Lu, Fei Wang, Yingying Zhu
Subjects: Computation and Language (cs.CL)
[201] arXiv:2201.07341 [pdf, other]
Title: Learning grammar with a divide-and-concur neural network
Sean Deyo, Veit Elser
Journal-ref: Phys. Rev. E 105, 064303 (2022)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Cellular Automata and Lattice Gases (nlin.CG)
[202] arXiv:2201.07365 [pdf, other]
Title: Improving Neural Machine Translation by Denoising Training
Liang Ding, Keqin Peng, Dacheng Tao
Comments: arXiv admin note: text overlap with arXiv:2109.07780
Subjects: Computation and Language (cs.CL)
[203] arXiv:2201.07406 [pdf, other]
Title: Fooling MOSS Detection with Pretrained Language Models
Stella Biderman, Edward Raff
Comments: To appear in the Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[204] arXiv:2201.07423 [pdf, other]
Title: Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19
Yueyi Jiang, Yunfan Jiang, Liu Leqi, Piotr Winkielman
Subjects: Computation and Language (cs.CL); Social and Information Networks (cs.SI)
[205] arXiv:2201.07434 [pdf, other]
Title: Interpreting Arabic Transformer Models
Ahmed Abdelali, Nadir Durrani, Fahim Dalvi, Hassan Sajjad
Comments: A new version of the paper was uploaded under a different reference: arXiv:2210.09990
Subjects: Computation and Language (cs.CL)
[206] arXiv:2201.07449 [pdf, other]
Title: TourBERT: A pretrained language model for the tourism industry
Veronika Arefieva, Roman Egger
Comments: Identified a mistake in our calculations. Will fix the problem within the next weeks and resubmit
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[207] arXiv:2201.07489 [pdf, other]
Title: Development of Fake News Model using Machine Learning through Natural Language Processing
Sajjad Ahmed, Knut Hinkelmann, Flavio Corradini
Journal-ref: International Journal of Computer and Information Engineering Vol:14, No:12, 2020
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[208] arXiv:2201.07520 [pdf, other]
Title: CM3: A Causal Masked Multimodal Model of the Internet
Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer
Subjects: Computation and Language (cs.CL)
[209] arXiv:2201.07614 [pdf, other]
Title: Uncovering More Shallow Heuristics: Probing the Natural Language Inference Capacities of Transformer-Based Pre-Trained Language Models Using Syllogistic Patterns
Reto Gubelmann, Siegfried Handschuh
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[210] arXiv:2201.07670 [pdf, other]
Title: Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts
Kilian Theil, Dirk Hovy, Heiner Stuckenschmidt
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[211] arXiv:2201.07725 [pdf, other]
Title: Data-to-Value: An Evaluation-First Methodology for Natural Language Projects
Jochen L. Leidner
Comments: 9 pages, 6 figures, 4 tables
Subjects: Computation and Language (cs.CL); Methodology (stat.ME)
[212] arXiv:2201.07899 [pdf, other]
Title: ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP)
Carol Neidle, Augustine Opoku, Dimitris Metaxas
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[213] arXiv:2201.07902 [pdf, other]
Title: Evaluating Machine Common Sense via Cloze Testing
Ehsan Qasemi, Lee Kezar, Jay Pujara, Pedro Szekely
Subjects: Computation and Language (cs.CL)
[214] arXiv:2201.07905 [pdf, other]
Title: CPTAM: Constituency Parse Tree Aggregation Method
Adithya Kulkarni, Nasim Sabetpour, Alexey Markin, Oliver Eulenstein, Qi Li
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[215] arXiv:2201.08038 [pdf, other]
Title: Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction
Daisuke Suzuki, Yujin Takahashi, Ikumi Yamashita, Taichi Aida, Tosho Hirasawa, Michitaka Nakatsuji, Masato Mita, Mamoru Komachi
Comments: 8 pages (6pages + references)
Subjects: Computation and Language (cs.CL)
[216] arXiv:2201.08054 [pdf, other]
Title: VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation
Yihang Li, Shuichiro Shimizu, Weiqi Gu, Chenhui Chu, Sadao Kurohashi
Comments: Accepted by LREC2022
Subjects: Computation and Language (cs.CL)
[217] arXiv:2201.08070 [pdf, other]
Title: Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi
Comments: An extension of work arXiv:2005.03361
Journal-ref: TALLIP Volume 21, Issue 4, July 2022
Subjects: Computation and Language (cs.CL)
[218] arXiv:2201.08081 [pdf, other]
Title: LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
Qi Shi, Qian Liu, Bei Chen, Yu Zhang, Ting Liu, Jian-Guang Lou
Comments: EMNLP 2022 Findings
Subjects: Computation and Language (cs.CL)
[219] arXiv:2201.08089 [pdf, other]
Title: Why Did You Not Compare With That? Identifying Papers for Use as Baselines
Manjot Bedi, Tanisha Pandey, Sumit Bhatia, Tanmoy Chakraborty
Comments: Preprint of upcoming paper at European Conference on Information Retrieval (ECIR) 2022
Subjects: Computation and Language (cs.CL)
[220] arXiv:2201.08174 [pdf, other]
Title: Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis
Aleksandr Perevalov, Xi Yan, Liubov Kovriguina, Longquan Jiang, Andreas Both, Ricardo Usbeck
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
[221] arXiv:2201.08193 [pdf, other]
Title: TextHacker: Learning based Hybrid Local Search Algorithm for Text Hard-label Adversarial Attack
Zhen Yu, Xiaosen Wang, Wanxiang Che, Kun He
Comments: Accepted by EMNLP 2022 Findings, Code is available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[222] arXiv:2201.08214 [pdf, html, other]
Title: A Latent-Variable Model for Intrinsic Probing
Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein
Subjects: Computation and Language (cs.CL)
[223] arXiv:2201.08239 [pdf, other]
Title: LaMDA: Language Models for Dialog Applications
Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia Jin, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yanping Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao, Yanqi Zhou, Chung-Ching Chang, Igor Krivokon, Will Rusch, Marc Pickett, Pranesh Srinivasan, Laichee Man, Kathleen Meier-Hellstern, Meredith Ringel Morris, Tulsee Doshi, Renelito Delos Santos, Toju Duke, Johnny Soraker, Ben Zevenbergen, Vinodkumar Prabhakaran, Mark Diaz, Ben Hutchinson, Kristen Olson, Alejandra Molina, Erin Hoffman-John, Josh Lee, Lora Aroyo, Ravi Rajakumar, Alena Butryna, Matthew Lamm, Viktoriya Kuzmina, Joe Fenton, Aaron Cohen, Rachel Bernstein, Ray Kurzweil, Blaise Aguera-Arcas, Claire Cui, Marian Croak, Ed Chi, Quoc Le
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[224] arXiv:2201.08277 [pdf, other]
Title: NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Salahudeen Abdullahi, Anuoluwapo Aremu, Alipio Jeorge, Pavel Brazdil
Comments: Submitted to LREC 2022, 13 pages, 2 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[225] arXiv:2201.08318 [pdf, other]
Title: Cheating Automatic Short Answer Grading: On the Adversarial Usage of Adjectives and Adverbs
Anna Filighera, Sebastian Ochs, Tim Steuer, Thomas Tregel
Subjects: Computation and Language (cs.CL)
[226] arXiv:2201.08340 [pdf, other]
Title: Signature Entrenchment and Conceptual Changes in Automated Theory Repair
Xue Li, Alan Bundy, Eugene Philalithis
Comments: Presented at The Ninth Advances in Cognitive Systems (ACS) Conference 2021 (arXiv:2201.06134)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[227] arXiv:2201.08451 [pdf, other]
Title: Regional Negative Bias in Word Embeddings Predicts Racial Animus--but only via Name Frequency
Austin van Loon, Salvatore Giorgi, Robb Willer, Johannes Eichstaedt
Comments: 5 pages, 1 figure
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[228] arXiv:2201.08495 [pdf, other]
Title: SciBERTSUM: Extractive Summarization for Scientific Documents
Athar Sefid, C Lee Giles
Subjects: Computation and Language (cs.CL)
[229] arXiv:2201.08531 [pdf, other]
Title: Black-box Prompt Learning for Pre-trained Language Models
Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao Zhou, Tong Zhang
Comments: To appear in the Transactions on Machine Learning Research (TMLR)
Subjects: Computation and Language (cs.CL)
[230] arXiv:2201.08542 [pdf, other]
Title: Can Model Compression Improve NLP Fairness
Guangxuan Xu, Qingyuan Hu
Subjects: Computation and Language (cs.CL)
[231] arXiv:2201.08555 [pdf, other]
Title: Identifying Adversarial Attacks on Text Classifiers
Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Sameer Singh, Daniel Lowd
Subjects: Computation and Language (cs.CL); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[232] arXiv:2201.08598 [pdf, other]
Title: Taxonomy Enrichment with Text and Graph Vector Representations
Irina Nikishina, Mikhail Tikhomirov, Varvara Logacheva, Yuriy Nazarov, Alexander Panchenko, Natalia Loukachevitch
Subjects: Computation and Language (cs.CL)
[233] arXiv:2201.08643 [pdf, other]
Title: Text Style Transfer for Bias Mitigation using Masked Language Modeling
Ewoenam Kwaku Tokpo, Toon Calders
Comments: 9 pages, 3 figures, 5 tables
Subjects: Computation and Language (cs.CL)
[234] arXiv:2201.08670 [pdf, other]
Title: Context-Tuning: Learning Contextualized Prompts for Natural Language Generation
Tianyi Tang, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen
Comments: 15 pages, accepted by COLING 2022
Subjects: Computation and Language (cs.CL)
[235] arXiv:2201.08675 [pdf, other]
Title: Gender Bias in Text: Labeled Datasets and Lexicons
Jad Doughman, Wael Khreich
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[236] arXiv:2201.08687 [pdf, other]
Title: A Comparative Study on Language Models for Task-Oriented Dialogue Systems
Vinsen Marselino Andreas, Genta Indra Winata, Ayu Purwarianti
Comments: 5 pages, 1 figure
Journal-ref: 2021 8th International Conference on Advanced Informatics: Concepts, Theory and Applications (ICAICTA) (pp. 1-5). IEEE
Subjects: Computation and Language (cs.CL)
[237] arXiv:2201.08702 [pdf, other]
Title: Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation
Qianben Chen, Richong Zhang, Yaowei Zheng, Yongyi Mao
Comments: 8 pages, 4 figures, under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[238] arXiv:2201.08717 [pdf, other]
Title: Personality Type Based on Myers-Briggs Type Indicator with Text Posting Style by using Traditional and Deep Learning
Sakdipat Ontoum, Jonathan H. Chan
Comments: 10 pages, 14 figures, this work was presented at the 11th Joint Symposium on Computational Intelligence (JSCI11)
Subjects: Computation and Language (cs.CL)
[239] arXiv:2201.08860 [pdf, other]
Title: GreaseLM: Graph REASoning Enhanced Language Models for Question Answering
Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, Jure Leskovec
Comments: Published at ICLR 2022. All code, data, and pretrained models are available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[240] arXiv:2201.08904 [pdf, other]
Title: Description-Driven Task-Oriented Dialog Modeling
Jeffrey Zhao, Raghav Gupta, Yuan Cao, Dian Yu, Mingqiu Wang, Harrison Lee, Abhinav Rastogi, Izhak Shafran, Yonghui Wu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[241] arXiv:2201.08919 [pdf, other]
Title: Recurrent Neural Networks with Mixed Hierarchical Structures and EM Algorithm for Natural Language Processing
Zhaoxin Luo, Michael Zhu
Comments: 9 pages, 5 figures
Subjects: Computation and Language (cs.CL); Machine Learning (stat.ML)
[242] arXiv:2201.08975 [pdf, other]
Title: Chinese Word Segmentation with Heterogeneous Graph Neural Network
Xuemei Tang, Jun Wang, Qi Su
Subjects: Computation and Language (cs.CL)
[243] arXiv:2201.09012 [pdf, other]
Title: Leaf: Multiple-Choice Question Generation
Kristiyan Vachev, Momchil Hardalov, Georgi Karadzhov, Georgi Georgiev, Ivan Koychev, Preslav Nakov
Comments: Accepted to ECIR 2022 (Demo)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[244] arXiv:2201.09060 [pdf, other]
Title: Solvability of orbit-finite systems of linear equations
Arka Ghosh, Piotr Hofman, Sławomir Lasota
Subjects: Computation and Language (cs.CL); Formal Languages and Automata Theory (cs.FL); Logic in Computer Science (cs.LO)
[245] arXiv:2201.09107 [pdf, other]
Title: Visual Information Guided Zero-Shot Paraphrase Generation
Zhe Lin, Xiaojun Wan
Comments: Accepted By COLING 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[246] arXiv:2201.09119 [pdf, other]
Title: A Causal Lens for Controllable Text Generation
Zhiting Hu, Li Erran Li
Comments: NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[247] arXiv:2201.09146 [pdf, other]
Title: Question rewriting? Assessing its importance for conversational question answering
Gonçalo Raposo, Rui Ribeiro, Bruno Martins, Luísa Coheur
Comments: Submitted manuscript (not anonymized) accepted to the 44th European Conference on Information Retrieval (ECIR) 2022. This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Information Retrieval, and is available online at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[248] arXiv:2201.09227 [pdf, other]
Title: A Large and Diverse Arabic Corpus for Language Modeling
Abbas Raza Ali, Muhammad Ajmal Siddiqui, Rema Algunaibet, Hasan Raza Ali
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[249] arXiv:2201.09282 [pdf, other]
Title: WIDAR -- Weighted Input Document Augmented ROUGE
Raghav Jain, Vaibhav Mavi, Anubhav Jangra, Sriparna Saha
Comments: Manuscript Accepted as full paper in ECIR 2022
Subjects: Computation and Language (cs.CL)
[250] arXiv:2201.09324 [pdf, other]
Title: Supervised Visual Attention for Simultaneous Multimodal Machine Translation
Veneta Haralampieva, Ozan Caglayan, Lucia Specia
Comments: Accepted to Journal of Artificial Intelligence Research (JAIR)
Journal-ref: Journal of Artificial Intelligence Research 74 (2022) 1059-1089
Subjects: Computation and Language (cs.CL)
[251] arXiv:2201.09377 [pdf, other]
Title: An Application of Pseudo-Log-Likelihoods to Natural Language Scoring
Darren Abramson, Ali Emami
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[252] arXiv:2201.09518 [pdf, other]
Title: Synthetic Books
Varvara Guljajeva
Comments: 7 pages, 5 figures
Journal-ref: ARTECH 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[253] arXiv:2201.09523 [pdf, other]
Title: BTPK-based interpretable method for NER tasks based on Talmudic Public Announcement Logic
Yulin Chen, Beishui Liao, Bruno Bentzen, Bo Yuan, Zelai Yao, Haixiao Chi, Dov Gabbay
Comments: 10 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[254] arXiv:2201.09651 [pdf, other]
Title: Artefact Retrieval: Overview of NLP Models with Knowledge Base Access
Vilém Zouhar, Marius Mosbach, Debanjali Biswas, Dietrich Klakow
Comments: 11 pages of main content, 7 pages of appendix; presented at AKBC CSRR 2021
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[255] arXiv:2201.09680 [pdf, other]
Title: Relational Memory Augmented Language Models
Qi Liu, Dani Yogatama, Phil Blunsom
Comments: Accepted to TACL, pre MIT Press publication version
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[256] arXiv:2201.09696 [pdf, other]
Title: Unified Question Generation with Continual Lifelong Learning
Wei Yuan, Hongzhi Yin, Tieke He, Tong Chen, Qiufeng Wang, Lizhen Cui
Comments: Paper accepted in The Web Conference (WWW) 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[257] arXiv:2201.09745 [pdf, other]
Title: Table Pre-training: A Survey on Model Architectures, Pre-training Objectives, and Downstream Tasks
Haoyu Dong, Zhoujun Cheng, Xinyi He, Mengyu Zhou, Anda Zhou, Fan Zhou, Ao Liu, Shi Han, Dongmei Zhang
Comments: Accepted by IJCAI'2022 survey track
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[258] arXiv:2201.09966 [pdf, other]
Title: Classification Of Fake News Headline Based On Neural Networks
Ke Yahan, Ruyi Qu, Lu Xiaoxia
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[259] arXiv:2201.09997 [pdf, other]
Title: Razmecheno: Named Entity Recognition from Digital Archive of Diaries "Prozhito"
Timofey Atnashev, Veronika Ganeeva, Roman Kazakov, Daria Matyash, Michael Sonkin, Ekaterina Voloshina, Oleg Serikov, Ekaterina Artemova
Comments: Submitted to LREC 2022
Subjects: Computation and Language (cs.CL)
[260] arXiv:2201.10005 [pdf, other]
Title: Text and Code Embeddings by Contrastive Pre-Training
Arvind Neelakantan, Tao Xu, Raul Puri, Alec Radford, Jesse Michael Han, Jerry Tworek, Qiming Yuan, Nikolas Tezak, Jong Wook Kim, Chris Hallacy, Johannes Heidecke, Pranav Shyam, Boris Power, Tyna Eloundou Nekoul, Girish Sastry, Gretchen Krueger, David Schnurr, Felipe Petroski Such, Kenny Hsu, Madeleine Thompson, Tabarak Khan, Toki Sherbakov, Joanne Jang, Peter Welinder, Lilian Weng
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[261] arXiv:2201.10066 [pdf, other]
Title: Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
Angelina McMillan-Major, Zaid Alyafeai, Stella Biderman, Kimbo Chen, Francesco De Toni, Gérard Dupont, Hady Elsahar, Chris Emezue, Alham Fikri Aji, Suzana Ilić, Nurulaqilla Khamis, Colin Leong, Maraim Masoud, Aitor Soroa, Pedro Ortiz Suarez, Zeerak Talat, Daniel van Strien, Yacine Jernite
Comments: 8 pages plus appendix and references
Subjects: Computation and Language (cs.CL); Databases (cs.DB)
[262] arXiv:2201.10113 [pdf, other]
Title: Multimodal data matters: language model pre-training over structured and unstructured electronic health records
Sicen Liu, Xiaolong Wang, Yongshuai Hou, Ge Li, Hui Wang, Hui Xu, Yang Xiang, Buzhou Tang
Comments: 12 pages, 5 figures accepted for publication in the IEEE Journal of Biomedical and Health Informatics (J-BHI)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[263] arXiv:2201.10262 [pdf, other]
Title: Do Transformers Encode a Foundational Ontology? Probing Abstract Classes in Natural Language
Mael Jullien, Marco Valentino, Andre Freitas
Subjects: Computation and Language (cs.CL)
[264] arXiv:2201.10274 [pdf, other]
Title: Multi-channel Attentive Graph Convolutional Network With Sentiment Fusion For Multimodal Sentiment Analysis
Luwei Xiao, Xingjiao Wu, Wen Wu, Jing Yang, Liang He
Subjects: Computation and Language (cs.CL)
[265] arXiv:2201.10376 [pdf, other]
Title: Modeling Multi-level Context for Informational Bias Detection by Contrastive Learning and Sentential Graph Network
Shijia Guo, Kenny Q. Zhu
Comments: 10 pages including bibliography
Subjects: Computation and Language (cs.CL)
[266] arXiv:2201.10422 [pdf, other]
Title: Language Generation for Broad-Coverage, Explainable Cognitive Systems
Marjorie McShane, Ivan Leon
Comments: Presented at The Ninth Advances in Cognitive Systems (ACS) Conference 2021 (arXiv:2201.06134)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[267] arXiv:2201.10430 [pdf, other]
Title: A Quantitative and Qualitative Analysis of Schizophrenia Language
Amal Alqahtani, Efsun Sarioglu Kay, Sardar Hamidian, Michael Compton, Mona Diab
Subjects: Computation and Language (cs.CL)
[268] arXiv:2201.10463 [pdf, other]
Title: Distantly supervised end-to-end medical entity extraction from electronic health records with human-level quality
Alexander Nesterov, Dmitry Umerenkov
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[269] arXiv:2201.10474 [pdf, other]
Title: Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, Noah A. Smith
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[270] arXiv:2201.10515 [pdf, other]
Title: Suicidal Ideation Detection on Social Media: A Review of Machine Learning Methods
Asma Abdulsalam, Areej Alhothali
Comments: 14 pages, 7 figures
Subjects: Computation and Language (cs.CL)
[271] arXiv:2201.10588 [pdf, other]
Title: Convex Polytope Modelling for Unsupervised Derivation of Semantic Structure for Data-efficient Natural Language Understanding
Jingyan Zhou, Xiaohan Feng, King Keung Wu, Helen Meng
Subjects: Computation and Language (cs.CL)
[272] arXiv:2201.10608 [pdf, other]
Title: DOM-LM: Learning Generalizable Representations for HTML Documents
Xiang Deng, Prashant Shiralkar, Colin Lockard, Binxuan Huang, Huan Sun
Subjects: Computation and Language (cs.CL)
[273] arXiv:2201.10618 [pdf, other]
Title: The ABBE Corpus: Animate Beings Being Emotional
Samira Zad, Joshuan Jimenez, Mark A. Finlayson
Comments: 9 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[274] arXiv:2201.10707 [pdf, other]
Title: A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model
Xin Sun, Tao Ge, Shuming Ma, Jingjing Li, Furu Wei, Houfeng Wang
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[275] arXiv:2201.10716 [pdf, other]
Title: Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models
Lu Dong, Zhi-Qiang Guo, Chao-Hong Tan, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling
Comments: This paper is accepted by ICASSP2022
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[276] arXiv:2201.10792 [pdf, other]
Title: On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR
Zhao Yang, Dianwen Ng, Xiao Fu, Liping Han, Wei Xi, Rui Wang, Rui Jiang, Jizhong Zhao
Comments: submitted to INTERSPEECH 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[277] arXiv:2201.10797 [pdf, other]
Title: An Automated Question-Answering Framework Based on Evolution Algorithm
Sinan Tan, Hui Xue, Qiyu Ren, Huaping Liu, Jing Bai
Comments: In Proceedings of the AAAI 2019 Workshop (WS13) on Reasoning and Complex Question-Answering (RCQA-19) this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[278] arXiv:2201.10866 [pdf, other]
Title: CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search
Xiaonan Li, Yeyun Gong, Yelong Shen, Xipeng Qiu, Hang Zhang, Bolun Yao, Weizhen Qi, Daxin Jiang, Weizhu Chen, Nan Duan
Comments: Accepted to EMNLP 2022 (main conference)
Subjects: Computation and Language (cs.CL); Software Engineering (cs.SE)
[279] arXiv:2201.10881 [pdf, other]
Title: The Norwegian Parliamentary Speech Corpus
Per Erik Solberg, Pablo Ortiz
Comments: 6 pages, submitted to LREC 2022
Journal-ref: LREC 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[280] arXiv:2201.10927 [pdf, other]
Title: Pair-Level Supervised Contrastive Learning for Natural Language Inference
Shu'ang Li, Xuming Hu, Li Lin, Lijie Wen
Comments: Accepted at ICASSP 2022
Subjects: Computation and Language (cs.CL)
[281] arXiv:2201.10986 [pdf, other]
Title: Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data
Federico Bianchi, Vincenzo Cutrona, Dirk Hovy
Subjects: Computation and Language (cs.CL)
[282] arXiv:2201.11115 [pdf, other]
Title: CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Herbert Ullrich, Jan Drchal, Martin Rýpar, Hana Vincourová, Václav Moravec
Comments: submitted to LREV journal for review, resubmission, changed title according to reviewer suggestion
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[283] arXiv:2201.11153 [pdf, other]
Title: Addressing Issues of Cross-Linguality in Open-Retrieval Question Answering Systems For Emergent Domains
Alon Albalak, Sharon Levy, William Yang Wang
Comments: 6 pages, 8 figures
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[284] arXiv:2201.11155 [pdf, other]
Title: Explainable Patterns for Distinction and Prediction of Moral Judgement on Reddit
Ion Stagkos Efstathiadis, Guilherme Paulino-Passos, Francesca Toni
Comments: 1st Workshop on Human and Machine Decisions (WHMD 2021) at NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[285] arXiv:2201.11172 [pdf, other]
Title: Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Tu Anh Dinh, Danni Liu, Jan Niehues
Comments: 6 pages, 5 figures, accepted to IEEE ICASSP 2022. arXiv admin note: text overlap with arXiv:2107.06010
Journal-ref: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 6222-6226
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[286] arXiv:2201.11176 [pdf, other]
Title: DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
Wei Zhao, Michael Strube, Steffen Eger
Comments: EACL2023 Camera Ready
Subjects: Computation and Language (cs.CL)
[287] arXiv:2201.11258 [pdf, other]
Title: Learning How to Translate North Korean through South Korean
Hwichan Kim, Sangwhan Moon, Naoaki Okazaki, Mamoru Komachi
Comments: 8 pages, 1 figures, 8 tables
Subjects: Computation and Language (cs.CL)
[288] arXiv:2201.11294 [pdf, other]
Title: Highly Generalizable Models for Multilingual Hate Speech Detection
Neha Deshpande, Nicholas Farris, Vidhur Kumar
Subjects: Computation and Language (cs.CL)
[289] arXiv:2201.11312 [pdf, other]
Title: A Higher-Order Semantic Dependency Parser
Bin Li, Yunlong Fan, Yikemaiti Sataer, Zhiqiang Gao
Subjects: Computation and Language (cs.CL)
[290] arXiv:2201.11313 [pdf, other]
Title: Learning Deep Semantic Model for Code Search using CodeSearchNet Corpus
Chen Wu, Ming Yan
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[291] arXiv:2201.11332 [pdf, other]
Title: Ontology-enhanced Prompt-tuning for Few-shot Learning
Hongbin Ye, Ningyu Zhang, Shumin Deng, Xiang Chen, Hui Chen, Feiyu Xiong, Xi Chen, Huajun Chen
Comments: Accepted by WWW2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[292] arXiv:2201.11367 [pdf, other]
Title: Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation
Yihe Wang, Yitong Li, Yasheng Wang, Fei Mi, Pingyi Zhou, Xin Wang, Jin Liu, Xin Jiang, Qun Liu
Comments: Accepted in COLING 2022
Subjects: Computation and Language (cs.CL)
[293] arXiv:2201.11374 [pdf, other]
Title: Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing
Jivnesh Sandhan, Laxmidhar Behera, Pawan Goyal
Comments: Accepted at EACL2023 to be held in Croatia Europe
Subjects: Computation and Language (cs.CL)
[294] arXiv:2201.11391 [pdf, other]
Title: Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages
Jivnesh Sandhan, Ayush Daksh, Om Adideva Paranjay, Laxmidhar Behera, Pawan Goyal
Comments: The work is accepted at COLING22-SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Subjects: Computation and Language (cs.CL)
[295] arXiv:2201.11443 [pdf, other]
Title: Yes-Yes-Yes: Proactive Data Collection for ACL Rolling Review and Beyond
Nils Dycke, Ilia Kuznetsov, Iryna Gurevych
Comments: Accepted at Findings of EMNLP 2022
Subjects: Computation and Language (cs.CL)
[296] arXiv:2201.11473 [pdf, other]
Title: Reasoning Like Program Executors
Xinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen
Comments: To appear in EMNLP 2022 main conference. The first two authors contributed equally
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Symbolic Computation (cs.SC)
[297] arXiv:2201.11569 [pdf, other]
Title: Human Interpretation of Saliency-based Explanation Over Text
Hendrik Schuff, Alon Jacovi, Heike Adel, Yoav Goldberg, Ngoc Thang Vu
Comments: FAccT 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[298] arXiv:2201.11576 [pdf, other]
Title: Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation
Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno
Comments: NeurIPS 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[299] arXiv:2201.11582 [pdf, other]
Title: GUDN: A novel guide network with label reinforcement strategy for extreme multi-label text classification
Qing Wang, Jia Zhu, Hongji Shu, Kwame Omono Asamoah, Jianyang Shi, Cong Zhou
Comments: 12 pages, 6 figures
Subjects: Computation and Language (cs.CL)
[300] arXiv:2201.11732 [pdf, other]
Title: IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan Vulić
Comments: ICML 2022
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2201.11766 [pdf, other]
Title: Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding
Matthew Setzler, Scott Howland, Lauren Phillips
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[302] arXiv:2201.11826 [pdf, other]
Title: Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition
Ayoub Ghriss, Bo Yang, Viktor Rozgic, Elizabeth Shriberg, Chao Wang
Comments: ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[303] arXiv:2201.11838 [pdf, other]
Title: Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences
Yikuan Li, Ramsey M. Wehbe, Faraz S. Ahmad, Hanyin Wang, Yuan Luo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[304] arXiv:2201.11867 [pdf, other]
Title: Neural-FST Class Language Model for End-to-End Speech Recognition
Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer
Comments: Accepted for publication at ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[305] arXiv:2201.11870 [pdf, other]
Title: Multiple-Source Domain Adaptation via Coordinated Domain Encoders and Paired Classifiers
Payam Karisani
Comments: AAAI 2022
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[306] arXiv:2201.11885 [pdf, other]
Title: Boosting Entity Mention Detection for Targetted Twitter Streams with Global Contextual Embeddings
Satadisha Saha Bhowmick, Eduard C. Dragut, Weiyi Meng
Subjects: Computation and Language (cs.CL)
[307] arXiv:2201.11903 [pdf, other]
Title: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[308] arXiv:2201.11990 [pdf, other]
Title: Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro
Comments: Shaden Smith and Mostofa Patwary contributed equally
Subjects: Computation and Language (cs.CL)
[309] arXiv:2201.12093 [pdf, other]
Title: PCL: Peer-Contrastive Learning with Diverse Augmentations for Unsupervised Sentence Embeddings
Qiyu Wu, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Daxin Jiang
Comments: To appear at EMNLP 2022
Subjects: Computation and Language (cs.CL)
[310] arXiv:2201.12105 [pdf, other]
Title: Improving End-to-End Models for Set Prediction in Spoken Language Understanding
Hong-Kwang J. Kuo, Zoltan Tuske, Samuel Thomas, Brian Kingsbury, George Saon
Comments: ICASSP \c{opyright}2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[311] arXiv:2201.12109 [pdf, other]
Title: Protum: A New Method For Prompt Tuning Based on "[MASK]"
Pan He, Yuxi Chen, Yan Wang, Yanru Zhang
Comments: under review in ICML
Subjects: Computation and Language (cs.CL)
[312] arXiv:2201.12155 [pdf, other]
Title: Reducing language context confusion for end-to-end code-switching automatic speech recognition
Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng
Comments: arXiv admin note: text overlap with arXiv:2010.14798,the paper has been accepted by Insterspeech 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[313] arXiv:2201.12219 [pdf, other]
Title: Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages
Silvia Severini, Ayyoob Imani, Philipp Dufter, Hinrich Schütze
Comments: LREC 2022
Subjects: Computation and Language (cs.CL)
[314] arXiv:2201.12323 [pdf, other]
Title: Describing Differences between Text Distributions with Natural Language
Ruiqi Zhong, Charlie Snell, Dan Klein, Jacob Steinhardt
Comments: International Conference on Machine Learning, 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[315] arXiv:2201.12407 [pdf, other]
Title: Schema-Free Dependency Parsing via Sequence Generation
Boda Lin, Zijun Yao, Jiaxin Shi, Shulin Cao, Binghao Tang, Si Li, Yong Luo, Juanzi Li, Lei Hou
Subjects: Computation and Language (cs.CL)
[316] arXiv:2201.12409 [pdf, other]
Title: A Unified Approach to Entity-Centric Context Tracking in Social Conversations
Ulrich Rückert, Srinivas Sunkara, Abhinav Rastogi, Sushant Prakash, Pranav Khaitan
Comments: Published at LREC 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[317] arXiv:2201.12431 [pdf, other]
Title: Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval
Uri Alon, Frank F. Xu, Junxian He, Sudipta Sengupta, Dan Roth, Graham Neubig
Comments: Accepted to ICML'2022. Code and models are available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[318] arXiv:2201.12438 [pdf, other]
Title: Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Prajjwal Bhargava, Vincent Ng
Comments: AAAI 2022
Subjects: Computation and Language (cs.CL)
[319] arXiv:2201.12501 [pdf, other]
Title: Does Transliteration Help Multilingual Language Modeling?
Ibraheem Muhammad Moosa, Mahmud Elahi Akhter, Ashfia Binte Habib
Comments: In Findings of the Association for Computational Linguistics: EACL 2023
Subjects: Computation and Language (cs.CL)
[320] arXiv:2201.12502 [pdf, other]
Title: Unsupervised Multi-Granularity Summarization
Ming Zhong, Yang Liu, Suyu Ge, Yuning Mao, Yizhu Jiao, Xingxing Zhang, Yichong Xu, Chenguang Zhu, Michael Zeng, Jiawei Han
Comments: EMNLP 2022 Findings
Subjects: Computation and Language (cs.CL)
[321] arXiv:2201.12507 [pdf, other]
Title: AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models
Dongkuan Xu, Subhabrata Mukherjee, Xiaodong Liu, Debadeepta Dey, Wenhui Wang, Xiang Zhang, Ahmed Hassan Awadallah, Jianfeng Gao
Comments: 15 pages, 4 figures, 10 tables
Subjects: Computation and Language (cs.CL)
[322] arXiv:2201.12538 [pdf, other]
Title: Incorporating Commonsense Knowledge into Story Ending Generation via Heterogeneous Graph Networks
Jiaan Wang, Beiqi Zou, Zhixu Li, Jianfeng Qu, Pengpeng Zhao, An Liu, Lei Zhao
Comments: DASFAA 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[323] arXiv:2201.12546 [pdf, other]
Title: Progressive Continual Learning for Spoken Keyword Spotting
Yizheng Huang, Nana Hou, Nancy F. Chen
Comments: ICASSP 2022
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[324] arXiv:2201.12549 [pdf, other]
Title: A Simple Information-Based Approach to Unsupervised Domain-Adaptive Aspect-Based Sentiment Analysis
Xiang Chen, Xiaojun Wan
Comments: 11 pages, 3 figures, 10 tables
Subjects: Computation and Language (cs.CL)
[325] arXiv:2201.12568 [pdf, other]
Title: Le Processus Powered Dirichlet-Hawkes comme A Priori Flexible pour Clustering Temporel de Textes
Gaël Poux-Médard, Julien Velcin, Sabine Loudcher
Comments: in French
Subjects: Computation and Language (cs.CL)
[326] arXiv:2201.12664 [pdf, other]
Title: A Deep CNN Architecture with Novel Pooling Layer Applied to Two Sudanese Arabic Sentiment Datasets
Mustafa Mhamed, Richard Sutcliffe, Xia Sun, Jun Feng, Eiad Almekhlafi, Ephrem A. Retta
Comments: 19 pages, 11 tables, 11 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[327] arXiv:2201.12793 [pdf, other]
Title: Part of Speech Tagging (POST) of a Low-resource Language using another Language (Developing a POS-Tagged Lexicon for Kurdish (Sorani) using a Tagged Persian (Farsi) Corpus)
Hossein Hassani
Comments: 7pages, 2 tables, 3 figures
Subjects: Computation and Language (cs.CL)
[328] arXiv:2201.12799 [pdf, other]
Title: Recognition of Implicit Geographic Movement in Text
Scott Pezanowski, Prasenjit Mitra
Journal-ref: Proceedings of The 12th Language Resources and Evaluation Conference, 2047-2056 (2020)
Subjects: Computation and Language (cs.CL)
[329] arXiv:2201.12806 [pdf, other]
Title: Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu
Comments: Accepted by ICASSP 2022
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[330] arXiv:2201.12833 [pdf, other]
Title: Word Segmentation and Morphological Parsing for Sanskrit
Jingwen Li, Leander Girrbach
Comments: Code can be accessed from this https URL
Subjects: Computation and Language (cs.CL)
[331] arXiv:2201.12868 [pdf, other]
Title: Anticipation-Free Training for Simultaneous Machine Translation
Chih-Chiang Chang, Shun-Po Chuang, Hung-yi Lee
Comments: Accepted to IWSLT 2022
Subjects: Computation and Language (cs.CL)
[332] arXiv:2201.12911 [pdf, other]
Title: Grammatical cues to subjecthood are redundant in a majority of simple clauses across languages
Kyle Mahowald, Evgeniia Diachek, Edward Gibson, Evelina Fedorenko, Richard Futrell
Subjects: Computation and Language (cs.CL)
[333] arXiv:2201.12926 [pdf, other]
Title: Compositionality as Lexical Symmetry
Ekin Akyürek, Jacob Andreas
Comments: ACL2023 Final Version
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[334] arXiv:2201.13072 [pdf, other]
Title: Are Mutually Intelligible Languages Easier to Translate?
Avital Friedland, Jonathan Zeltser, Omer Levy
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[335] arXiv:2201.13125 [pdf, other]
Title: Corpus for Automatic Structuring of Legal Documents
Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, Ashutosh Modi
Comments: Accepted at LREC 2022, 10 Pages (8 page main paper + 2 page references)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[336] arXiv:2201.13230 [pdf, other]
Title: POTATO: exPlainable infOrmation exTrAcTion framewOrk
Ádám Kovács, Kinga Gémes, Eszter Iklódi, Gábor Recski
Comments: 4 pages
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[337] arXiv:2201.13242 [pdf, other]
Title: Correcting diacritics and typos with a ByT5 transformer model
Lukas Stankevičius, Mantas Lukoševičius, Jurgita Kapočiūtė-Dzikienė, Monika Briedienė, Tomas Krilavičius
Journal-ref: Appl. Sci. 2022, 12(5), 2636
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG); Machine Learning (stat.ML)
[338] arXiv:2201.13405 [pdf, other]
Title: Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Olga Majewska, Evgeniia Razumovskaia, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen
Subjects: Computation and Language (cs.CL)
[339] arXiv:2201.13429 [pdf, other]
Title: Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations
Wei Zhao, Steffen Eger
Comments: ACML2022 Camera Ready
Subjects: Computation and Language (cs.CL)
[340] arXiv:2201.00195 (cross-list from q-bio.PE) [pdf, other]
Title: Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast
Jayden L. Macklin-Cordes, Erich R. Round
Comments: Accepted for publication in Linguistic Typology. Supplementary data at this https URL. 96 total pages (Main text: 41 pages, 6 figures, 3 tables. Supplementary S1: 34 pages, 1 figure. Supplementary S2: 21 pages)
Subjects: Populations and Evolution (q-bio.PE); Computation and Language (cs.CL)
[341] arXiv:2201.00304 (cross-list from cs.AI) [pdf, other]
Title: Informed Multi-context Entity Alignment
Kexuan Xin, Zequn Sun, Wen Hua, Wei Hu, Xiaofang Zhou
Comments: accepted by wsdm 2022
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[342] arXiv:2201.00365 (cross-list from cs.IR) [pdf, other]
Title: Establishing Strong Baselines for TripClick Health Retrieval
Sebastian Hofstätter, Sophia Althammer, Mete Sertkan, Allan Hanbury
Comments: Accepted at ECIR 2022
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[343] arXiv:2201.00614 (cross-list from cs.SI) [pdf, other]
Title: Semi-supervised Stance Detection of Tweets Via Distant Network Supervision
Subhabrata Dutta, Samiya Caur, Soumen Chakrabarti, Tanmoy Chakraborty
Subjects: Social and Information Networks (cs.SI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[344] arXiv:2201.00693 (cross-list from cs.IR) [pdf, other]
Title: Multimodal Entity Tagging with Multimodal Knowledge Base
Hao Peng, Hang Li, Lei Hou, Juanzi Li, Chao Qiao
Comments: 11 pages, 4 figures
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[345] arXiv:2201.00855 (cross-list from cs.CY) [pdf, other]
Title: AI & Racial Equity: Understanding Sentiment Analysis Artificial Intelligence, Data Security, and Systemic Theory in Criminal Justice Systems
Alia Abbas
Comments: 25 pages
Subjects: Computers and Society (cs.CY); Computation and Language (cs.CL)
[346] arXiv:2201.00969 (cross-list from cs.CV) [pdf, other]
Title: Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety
Rajagopal A, Nirmala V, Arun Muthuraj Vedamanickam
Comments: In Springer Proceedings. International Conference On Big Data, Machine Learning and Applications 2021. this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[347] arXiv:2201.00971 (cross-list from cs.LG) [pdf, other]
Title: Submix: Practical Private Prediction for Large-Scale Language Models
Antonio Ginart, Laurens van der Maaten, James Zou, Chuan Guo
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[348] arXiv:2201.00975 (cross-list from cs.CV) [pdf, other]
Title: StyleM: Stylized Metrics for Image Captioning Built with Contrastive N-grams
Chengxi Li, Brent Harrison
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[349] arXiv:2201.00985 (cross-list from cs.CV) [pdf, other]
Title: Variational Stacked Local Attention Networks for Diverse Video Captioning
Tonmoay Deb, Akib Sadmanee, Kishor Kumar Bhaumik, Amin Ahsan Ali, M Ashraful Amin, A K M Mahbubur Rahman
Comments: To be published in Winter Conference on Applications of Computer Vision 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[350] arXiv:2201.01209 (cross-list from cs.DB) [pdf, other]
Title: Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[351] arXiv:2201.01490 (cross-list from cs.LG) [pdf, other]
Title: Debiased Learning from Naturally Imbalanced Pseudo-Labels
Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu
Comments: Accepted by CVPR 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2201.01609 (cross-list from cs.CV) [pdf, other]
Title: All You Need In Sign Language Production
Razieh Rastgoo, Kourosh Kiani, Sergio Escalera, Vassilis Athitsos, Mohammad Sabokrou
Comments: arXiv admin note: substantial text overlap with arXiv:2103.15910
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[353] arXiv:2201.01647 (cross-list from cs.AI) [pdf, other]
Title: Comparison of biomedical relationship extraction methods and models for knowledge graph creation
Nikola Milosevic, Wolfgang Thielemann
Comments: Paper submitted to Journal of Semantic Web
Journal-ref: Nikola Milosevic, Wolfgang Thielemann, Comparison of biomedical relationship extraction methods and models for knowledge graph creation, Journal of Web Semantics, 2022, 100756, ISSN 1570-8268,
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[354] arXiv:2201.01745 (cross-list from cs.IR) [pdf, other]
Title: Atomized Search Length: Beyond User Models
John Alex, Keith Hall, Donald Metzler
Comments: 13 pages, 6 figures
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL)
[355] arXiv:2201.01819 (cross-list from cs.LG) [pdf, other]
Title: Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language Models
Diana Kim, Ahmed Elgammal, Marian Mazzone
Comments: 23 pages, This paper is an extended version of a paper that will be published at the 36th AAAI Conference on Artificial Intelligence, to beheld in Vancouver, BC, Canada, February 22 - March 1, 2022
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[356] arXiv:2201.01901 (cross-list from cs.CV) [pdf, other]
Title: Incremental Object Grounding Using Scene Graphs
John Seon Keun Yi, Yoonwoo Kim, Sonia Chernova
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[357] arXiv:2201.01984 (cross-list from cs.CV) [pdf, html, other]
Title: Image Captioning via Compact Bidirectional Architecture
Zijie Song, Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Huixia Ben, Richang Hong, Meng Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[358] arXiv:2201.02010 (cross-list from cs.CV) [pdf, other]
Title: Self-Training Vision Language BERTs with a Unified Conditional Model
Xiaofeng Yang, Fengmao Lv, Fayao Liu, Guosheng Lin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[359] arXiv:2201.02034 (cross-list from stat.AP) [pdf, other]
Title: Bayesian Regression Approach for Building and Stacking Predictive Models in Time Series Analytics
Bohdan M. Pavlyshenko
Subjects: Applications (stat.AP); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[360] arXiv:2201.02058 (cross-list from cs.LG) [pdf, other]
Title: Sales Time Series Analytics Using Deep Q-Learning
Bohdan M. Pavlyshenko
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[361] arXiv:2201.02065 (cross-list from cs.CV) [pdf, other]
Title: ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language
Cleison Correia de Amorim, Cleber Zanchettin
Journal-ref: The paper is under consideration at Pattern Recognition Letters (2022) (under the manuscript number PRLETTERS-D-22-00140)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[362] arXiv:2201.02119 (cross-list from cs.NE) [pdf, other]
Title: An Opinion Mining of Text in COVID-19 Issues along with Comparative Study in ML, BERT & RNN
Md. Mahadi Hasan Sany, Mumenunnesa Keya, Sharun Akter Khushbu, Akm Shahariar Azad Rabby, Abu Kaisar Mohammad Masum
Comments: 16 pages, 9 figures
Journal-ref: 3rd International Conference on Deep Learning, Artificial Intelligence and Robotics, (ICDLAIR) 2021
Subjects: Neural and Evolutionary Computing (cs.NE); Computation and Language (cs.CL)
[363] arXiv:2201.02127 (cross-list from cs.IR) [pdf, other]
Title: Sentiment Analysis and Sarcasm Detection of Indian General Election Tweets
Arpit Khare, Amisha Gangwar, Sudhakar Singh, Shiv Prakash
Comments: 17 pages, 9 figures, ANTIC-2021
Subjects: Information Retrieval (cs.IR); Computation and Language (cs.CL); Machine Learning (cs.LG)
[364] arXiv:2201.02229 (cross-list from cs.LG) [pdf, other]
Title: Large-scale protein-protein post-translational modification extraction with distant supervision and confidence calibrated BioBERT
Aparna Elangovan, Yuan Li, Douglas E. V. Pires, Melissa J. Davis, Karin Verspoor
Comments: BMC BioInformatics
Journal-ref: BMC Bioinformatics 23, 4 (2022)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
[365] arXiv:2201.02280 (cross-list from cs.CV) [pdf, other]
Title: Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping
Nora Horanyi, Kedi Xia, Kwang Moo Yi, Abhishake Kumar Bojja, Ales Leonardis, Hyung Jin Chang
Journal-ref: Pattern Recognition, 2022, 108485, ISSN 0031-3203
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[366] arXiv:2201.02494 (cross-list from cs.CV) [pdf, other]
Title: Progressive Video Summarization via Multimodal Self-supervised Learning
Li Haopeng, Ke Qiuhong, Gong Mingming, Tom Drummond
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[367] arXiv:2201.02495 (cross-list from cs.CV) [pdf, other]
Title: Sign Language Video Retrieval with Free-Form Textual Queries
Amanda Duarte, Samuel Albanie, Xavier Giró-i-Nieto, Gül Varol
Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[368] arXiv:2201.02639 (cross-list from cs.CV) [pdf, other]
Title: MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Rowan Zellers, Jiasen Lu, Ximing Lu, Youngjae Yu, Yanpeng Zhao, Mohammadreza Salehi, Aditya Kusupati, Jack Hessel, Ali Farhadi, Yejin Choi
Comments: CVPR 2022. Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[369] arXiv:2201.02772 (cross-list from cs.CV) [pdf, other]
Title: A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval
Zhixiong Zeng, Wenji Mao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[370] arXiv:2201.02857 (cross-list from cs.HC) [pdf, other]
Title: Effect of Toxic Review Content on Overall Product Sentiment
Mayukh Mukhopadhyay, Sangeeta Sahney
Comments: 43 pages,30 figures, 2 tables
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); General Economics (econ.GN); Applications (stat.AP)
[371] arXiv:2201.03215 (cross-list from cs.LG) [pdf, other]
Title: Handwriting recognition and automatic scoring for descriptive answers in Japanese language tests
Hung Tuan Nguyen, Cuong Tuan Nguyen, Haruki Oka, Tsunenori Ishioka, Masaki Nakagawa
Comments: Keywords: handwritten Japanese answers, handwriting recognition, automatic scoring, ensemble recognition, deep neural networks; Reported in IEICE technical report, PRMU2021-32, pp.45-50 (2021.12) Published after peer review and Presented in ICFHR2022, Lecture Notes in Computer Science, vol. 13639, pp. 274-284 (2022.11)
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2201.03306 (cross-list from q-bio.NC) [pdf, other]
Title: A Planck Radiation and Quantization Scheme for Human Cognition and Language
Diederik Aerts, Lester Beltran
Comments: 7 figures
Journal-ref: Frontiers in Psychology 13, 850725 (2022)
Subjects: Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Quantum Physics (quant-ph)
[373] arXiv:2201.03546 (cross-list from cs.CV) [pdf, other]
Title: Language-driven Semantic Segmentation
Boyi Li, Kilian Q. Weinberger, Serge Belongie, Vladlen Koltun, René Ranftl
Comments: ICLR 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[374] arXiv:2201.03622 (cross-list from cs.IR) [pdf, other]
Title: Graph-Based Recommendation System Enhanced with Community Detection
Zeinab Shokrzadeh, Mohammad-Reza Feizi-Derakhshi, Mohammad-Ali Balafar, Jamshid Bagherzadeh-Mohasefi
Comments: This is a preprint of an article published in "Scientific Programming"
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[375] arXiv:2201.03967 (cross-list from cs.SD) [pdf, other]
Title: Emotion Intensity and its Control for Emotional Voice Conversion
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li
Comments: Accepted by IEEE Transactions on Affective Computing
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Total of 453 entries : 126-375 251-453
Showing up to 250 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack