Showing 1–50 of 80 results for author: Giles, L

Searching in archive cs.
  1. arXiv:2505.19173  [pdf, other]

    cs.AI

    Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style

    Authors: Debdeep Sanyal, Agniva Maiti, Umakanta Maharana, Dhruv Kumar, Ankur Mali, C. Lee Giles, Murari Mandal

    Abstract: Effective teaching requires adapting instructional strategies to accommodate the diverse cognitive and behavioral profiles of students, a persistent challenge in education and teacher training. While Large Language Models (LLMs) offer promise as tools to simulate such complex pedagogical environments, current simulation frameworks are limited in two key respects: (1) they often reduce students to…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 38 Pages

  2. arXiv:2501.19353  [pdf, other]

    cs.CL cs.AI cs.CV

    Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SciCap Challenge 2023

    Authors: Ting-Yao E. Hsu, Yi-Li Hsu, Shaurya Rohatgi, Chieh-Yang Huang, Ho Yin Sam Ng, Ryan Rossi, Sungchul Kim, Tong Yu, Lun-Wei Ku, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Since the SciCap dataset's launch in 2021, the research community has made significant progress in generating captions for scientific figures in scholarly articles. In 2023, the first SciCap Challenge took place, inviting global teams to use an expanded SciCap dataset to develop models for captioning diverse figure types across various academic fields. At the same time, text generation models advan…

    Submitted 18 February, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Accepted to TACL 2025

  3. arXiv:2501.02552  [pdf, other]

    cs.CL cs.CV

    Multi-LLM Collaborative Caption Generation in Scientific Documents

    Authors: Jaeyoung Kim, Jongho Lee, Hong-Jun Choi, Ting-Yao Hsu, Chieh-Yang Huang, Sungchul Kim, Ryan Rossi, Tong Yu, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang, Sungchul Choi

    Abstract: Scientific figure captioning is a complex task that requires generating contextually appropriate descriptions of visual content. However, existing methods often fall short by utilizing incomplete information, treating the task solely as either an image-to-text or text summarization problem. This limitation hinders the generation of high-quality captions that fully capture the necessary details. Mo…

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: Accepted to AAAI 2025 AI4Research Workshop

  4. arXiv:2411.12649  [pdf, other]

    cs.IR

    PseudoSeer: a Search Engine for Pseudocode

    Authors: Levent Toksoz, Mukund Srinath, Gang Tan, C. Lee Giles

    Abstract: A novel pseudocode search engine is designed to facilitate efficient retrieval and search of academic papers containing pseudocode. By leveraging Elasticsearch, the system enables users to search across various facets of a paper, such as the title, abstract, author information, and LaTeX code snippets, while supporting advanced features like combined facet searches and exact-match queries for more…

    Submitted 19 November, 2024; originally announced November 2024.

  5. arXiv:2410.03118  [pdf, other]

    cs.CL

    Precision, Stability, and Generalization: A Comprehensive Assessment of RNNs learnability capability for Classifying Counter and Dyck Languages

    Authors: Neisarg Dave, Daniel Kifer, Lee Giles, Ankur Mali

    Abstract: This study investigates the learnability of Recurrent Neural Networks (RNNs) in classifying structured formal languages, focusing on counter and Dyck languages. Traditionally, both first-order (LSTM) and second-order (O2RNN) RNNs have been considered effective for such tasks, primarily based on their theoretical expressiveness within the Chomsky hierarchy. However, our research challenges this not…

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 21 pages, 5 figures, 5 tables

  6. arXiv:2408.09176  [pdf, other]

    cs.AI cs.CL cs.SC

    Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making

    Authors: Siyu Wu, Alessandro Oltramari, Jonathan Francis, C. Lee Giles, Frank E. Ritter

    Abstract: Resolving the dichotomy between the human-like yet constrained reasoning processes of Cognitive Architectures and the broad but often noisy inference behavior of Large Language Models (LLMs) remains a challenging but exciting pursuit, for enabling reliable machine reasoning capabilities in production systems. Because Cognitive Architectures are famously developed for the purpose of modeling the in…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures, 2 tables

  7. arXiv:2406.04635  [pdf, other]

    cs.IR cs.AI

    Scaling Automatic Extraction of Pseudocode

    Authors: Levent Toksoz, Gang Tan, C. Lee Giles

    Abstract: Pseudocode in a scholarly paper provides a concise way to express the algorithms implemented therein. Pseudocode can also be thought of as an intermediary representation that helps bridge the gap between programming languages and natural languages. Having access to a large collection of pseudocode can provide various benefits ranging from enhancing algorithmic understanding, facilitating further a…

    Submitted 7 June, 2024; originally announced June 2024.

  8. arXiv:2405.13209  [pdf, other]

    cs.CL cs.LG

    Investigating Symbolic Capabilities of Large Language Models

    Authors: Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

    Abstract: Prompting techniques have significantly enhanced the capabilities of Large Language Models (LLMs) across various complex tasks, including reasoning, planning, and solving math word problems. However, most research has predominantly focused on language-based reasoning and word problems, often overlooking the potential of LLMs in handling symbol-based calculations and reasoning. This study aims to b…

    Submitted 21 May, 2024; originally announced May 2024.

  9. SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Crafting effective captions for figures is important. Readers heavily depend on these captions to grasp the figure's message. However, despite a well-developed set of AI technologies for figures and captions, these have rarely been tested for usefulness in aiding caption writing. This paper introduces SciCapenter, an interactive system that puts together cutting-edge AI technologies for scientific…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems

  10. arXiv:2402.11006  [pdf, other]

    cs.CR cs.LG

    Automated Detection and Analysis of Data Practices Using A Real-World Corpus

    Authors: Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson

    Abstract: Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them. In this paper, we propose an automated approach to identify and visualize data practices within privacy policies at different levels of detail. Leveraging crowd-sourced annotations from the ToS;DR platform, we experiment with various methods to match policy ex…

    Submitted 16 February, 2024; originally announced February 2024.

  11. arXiv:2402.02627  [pdf, other]

    cs.LG

    Stability Analysis of Various Symbolic Rule Extraction Methods from Recurrent Neural Network

    Authors: Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

    Abstract: This paper analyzes two competing rule extraction methodologies: quantization and equivalence query. We trained $3600$ RNN models, extracting $18000$ DFA with a quantization approach (k-means and SOM) and $3600$ DFA by equivalence query ($L^{*}$) methods across $10$ initialization seeds. We sampled the datasets from $7$ Tomita and $4$ Dyck grammars and trained them on $4$ RNN cells: LSTM, GRU, O2RN…

    Submitted 4 February, 2024; originally announced February 2024.

  12. arXiv:2310.15405  [pdf, other]

    cs.CL

    GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

    Abstract: There is growing interest in systems that generate captions for scientific figures. However, assessing these systems' output poses a significant challenge. Human evaluation requires academic expertise and is costly, while automatic evaluation depends on often low-quality author-written captions. This paper investigates using large language models (LLMs) as a cost-effective, reference-free method fo…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To Appear in EMNLP 2023 Findings

  13. arXiv:2309.14691  [pdf, other]

    cs.LG cs.CC

    On the Computational Complexity and Formal Hierarchy of Second Order Recurrent Neural Networks

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Artificial neural networks (ANNs) with recurrence and self-attention have been shown to be Turing-complete (TC). However, existing work has shown that these ANNs require multiple turns or unbounded computation time, even with unbounded precision in weights, in order to recognize TC grammars. However, under constraints such as fixed or bounded precision neurons and time, ANNs without memory are sho…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 12 pages, 5 tables, 1 figure

  14. arXiv:2309.14690  [pdf, ps, other]

    cs.CC

    On the Tensor Representation and Algebraic Homomorphism of the Neural State Turing Machine

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recurrent neural networks (RNNs) and transformers have been shown to be Turing-complete, but this result assumes infinite precision in their hidden representations, positional encodings for transformers, and unbounded computation time in general. In practical applications, however, it is crucial to have real-time models that can recognize Turing complete grammars in a single pass. To address this…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 14 pages, 7 tables

  15. arXiv:2303.00866  [pdf, other]

    cs.HC cs.AI cs.LG

    A prototype hybrid prediction market for estimating replicability of published work

    Authors: Tatiana Chakravorti, Robert Fraleigh, Timothy Fritton, Michael McLaughlin, Vaibhav Singh, Christopher Griffin, Anthony Kwasnica, David Pennock, C. Lee Giles, Sarah Rajtmajer

    Abstract: We present a prototype hybrid prediction market and demonstrate the avenue it represents for meaningful human-AI collaboration. We build on prior work proposing artificial prediction markets as a novel machine-learning algorithm. In an artificial prediction market, trained AI agents buy and sell outcomes of future events. Classification decisions can be framed as outcomes of future events, and acc…

    Submitted 1 March, 2023; originally announced March 2023.

  16. arXiv:2302.12324  [pdf, other]

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be…

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  17. arXiv:2301.12293  [pdf, other]

    cs.AI cs.CV cs.DL

    ACL-Fig: A Dataset for Scientific Figure Classification

    Authors: Zeba Karishma, Shaurya Rohatgi, Kavya Shrinivas Puranik, Jian Wu, C. Lee Giles

    Abstract: Most existing large-scale academic search engines are built to retrieve text-based information. However, there are no large-scale retrieval services for scientific figures and tables. One challenge for such services is understanding scientific figures' semantics, such as their types and purposes. A key obstacle is the need for datasets containing annotated scientific figures and tables, which can…

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: 6 pages, 4 figures, accepted by the AAAI-23 Workshop on Scientific Document Understanding

  18. arXiv:2211.16590  [pdf, other]

    cs.IT

    Artificial prediction markets present a novel opportunity for human-AI collaboration

    Authors: Tatiana Chakravorti, Vaibhav Singh, Sarah Rajtmajer, Michael McLaughlin, Robert Fraleigh, Christopher Griffin, Anthony Kwasnica, David Pennock, C. Lee Giles

    Abstract: Despite high-profile successes in the field of Artificial Intelligence, machine-driven technologies still suffer important limitations, particularly for complex tasks where creativity, planning, common sense, intuition, or learning from limited data is required. These limitations motivate effective methods for human-machine collaboration. Our work makes two primary contributions. We thoroughly exp…

    Submitted 29 November, 2022; originally announced November 2022.

  19. arXiv:2201.11795  [pdf, other]

    eess.IV cs.CV

    Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG Encoder-Decoder

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recent advances in deep learning have led to superhuman performance across a variety of applications. Recently, these methods have been successfully employed to improve the rate-distortion performance in the task of image compression. However, current methods either use additional post-processing blocks on the decoder end to improve compression or propose an end-to-end compression scheme based on…

    Submitted 31 January, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted in DCC 2022, 11 pages

  20. arXiv:2201.11782  [pdf, other]

    cs.CV

    An Empirical Analysis of Recurrent Learning Algorithms In Neural Lossy Image Compression Systems

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recent advances in deep learning have resulted in image compression algorithms that outperform JPEG and JPEG 2000 on the standard Kodak benchmark. However, they are slow to train (due to backprop-through-time) and, to the best of our knowledge, have not been systematically evaluated on a large variety of datasets. In this paper, we perform the first large-scale comparison of recent state-of-the-ar…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted at DCC 2021, 15 pages

  21. arXiv:2201.08495  [pdf, other]

    cs.CL

    SciBERTSUM: Extractive Summarization for Scientific Documents

    Authors: Athar Sefid, C Lee Giles

    Abstract: The summarization literature focuses on the summarization of news articles. The news articles in the CNN-DailyMail are relatively short documents with about 30 sentences per document on average. We introduce SciBERTSUM, our summarization framework designed for the summarization of long documents like scientific papers with more than 500 sentences. SciBERTSUM extends BERTSUM to long documents by 1)…

    Submitted 20 January, 2022; originally announced January 2022.

  22. arXiv:2201.06924  [pdf, other]

    cs.CY cs.AI cs.IR cs.LG cs.MA

    A Synthetic Prediction Market for Estimating Confidence in Published Work

    Authors: Sarah Rajtmajer, Christopher Griffin, Jian Wu, Robert Fraleigh, Laxmaan Balaji, Anna Squicciarini, Anthony Kwasnica, David Pennock, Michael McLaughlin, Timothy Fritton, Nishanth Nakshatri, Arjun Menon, Sai Ajay Modukuri, Rajal Nivargi, Xin Wei, C. Lee Giles

    Abstract: Explainably estimating confidence in published scholarly work offers opportunity for faster and more robust scientific progress. We develop a synthetic prediction market to assess the credibility of published claims in the social and behavioral sciences literature. We demonstrate our system and detail our findings using a collection of known replication projects. We suggest that this work lays the…

    Submitted 23 December, 2021; originally announced January 2022.

  23. arXiv:2110.11624  [pdf, other]

    cs.CL cs.AI cs.CV

    SciCap: Generating Captions for Scientific Figures

    Authors: Ting-Yao Hsu, C. Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Researchers use figures to communicate rich, complex information in scientific papers. The captions of these figures are critical to conveying effective messages. However, low-quality figure captions commonly occur in scientific articles and may decrease understanding. In this paper, we propose an end-to-end neural framework to automatically generate informative, high-quality captions for scientif…

    Submitted 25 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To Appear in EMNLP 2021 Findings. The dataset is available at: https://github.com/tingyaohsu/SciCap

  24. arXiv:2106.03246  [pdf, other]

    cs.CL cs.AI

    Extractive Research Slide Generation Using Windowed Labeling Ranking

    Authors: Athar Sefid, Jian Wu, Prasenjit Mitra, Lee Giles

    Abstract: Presentation slides describing the content of scientific and technical papers are an efficient and effective way to present that work. However, manually generating presentation slides is labor intensive. We propose a method to automatically generate slides for scientific papers based on a corpus of 5000 paper-slide pairs compiled from conference proceedings websites. The sentence labeling module o…

    Submitted 6 June, 2021; originally announced June 2021.

    Journal ref: NAACL/Proceedings of the Second Workshop on Scholarly Document Processing 2021

  25. Document Domain Randomization for Deep Learning Document Layout Extraction

    Authors: Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles

    Abstract: We present document domain randomization (DDR), the first successful transfer of convolutional neural networks (CNNs) trained only on graphically rendered pseudo-paper pages to real-world document segmentation. DDR renders pseudo-document pages by modeling randomized textual and non-textual contents of interest, with user-defined layout and font styles to support joint learning of fine-grained cla…

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Main paper to appear in ICDAR 2021 (16th International Conference on Document Analysis and Recognition). This version contains additional materials. The associated test data is hosted on IEEE Data Port: http://doi.org/10.21227/326q-bf39

    Journal ref: International Conference on Document Analysis and Recognition (ICDAR), 2021

  26. arXiv:2104.09403  [pdf, other]

    cs.CV

    OmniLayout: Room Layout Reconstruction from Indoor Spherical Panoramas

    Authors: Shivansh Rao, Vikas Kumar, Daniel Kifer, Lee Giles, Ankur Mali

    Abstract: Given a single RGB panorama, the goal of 3D layout reconstruction is to estimate the room layout by predicting the corners, floor boundary, and ceiling boundary. A common approach has been to use standard convolutional networks to predict the corners and boundaries, followed by post-processing to generate the 3D layout. However, the space-varying distortions in panoramic images are not compatible…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR, OmniCV Workshop. 10 Pages, 9 Figures, 6 Tables

  27. arXiv:2104.04580  [pdf, other]

    cs.DL cs.AI cs.CL cs.LG

    Predicting the Reproducibility of Social and Behavioral Science Papers Using Supervised Learning Models

    Authors: Jian Wu, Rajal Nivargi, Sree Sai Teja Lanka, Arjun Manoj Menon, Sai Ajay Modukuri, Nishanth Nakshatri, Xin Wei, Zhuoer Wang, James Caverlee, Sarah M. Rajtmajer, C. Lee Giles

    Abstract: In recent years, significant effort has been invested in verifying the reproducibility and robustness of research claims in social and behavioral sciences (SBS), much of which has involved resource-intensive replication projects. In this paper, we investigate prediction of the reproducibility of SBS papers using machine learning methods based on a set of features. We propose a framework that extracts…

    Submitted 21 October, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 17 pages, 8 figures

  28. arXiv:2104.02899  [pdf, other]

    cs.LG

    Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, C. Lee Giles

    Abstract: Automated mathematical reasoning is a challenging problem that requires an agent to learn algebraic patterns that contain long-range dependencies. Two particular tasks that test this type of reasoning are (1) mathematical equation verification, which requires determining whether trigonometric and linear algebraic statements are valid identities or not, and (2) equation completion, which entails fi…

    Submitted 6 April, 2021; originally announced April 2021.

  29. arXiv:2101.01787  [pdf, other]

    cs.CE cs.LG

    Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets

    Authors: Nishanth Nakshatri, Arjun Menon, C. Lee Giles, Sarah Rajtmajer, Christopher Griffin

    Abstract: We present a synthetic prediction market whose agent purchase logic is defined using a sigmoid transformation of a convex semi-algebraic set defined in feature space. Asset prices are determined by a logarithmic scoring market rule. Time varying asset prices affect the structure of the semi-algebraic sets leading to time-varying agent purchase rules. We show that under certain assumptions on the u…

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 17 pages, 7 figures

  30. arXiv:2012.07565  [pdf, other]

    cs.CL cs.IR cs.LG

    Automating Document Classification with Distant Supervision to Increase the Efficiency of Systematic Reviews

    Authors: Xiaoxiao Li, Rabah Al-Zaidy, Amy Zhang, Stefan Baral, Le Bao, C. Lee Giles

    Abstract: Objective: Systematic reviews of scholarly documents often provide complete and exhaustive summaries of literature relevant to a research question. However, well-done systematic reviews are expensive, time-demanding, and labor-intensive. Here, we propose an automatic document classification approach to significantly reduce the effort in reviewing documents. Methods: We first describe a manual docu…

    Submitted 9 December, 2020; originally announced December 2020.

  31. Modeling Updates of Scholarly Webpages Using Archived Data

    Authors: Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles

    Abstract: The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources. Crawl frontiers thus need to be optimized to improve the coverage and freshness of crawled content. In this paper, we propose an approach for modeling the dynamics of change in the web using archived copies of webpages. To evaluate its utility, we conduct a preliminary study on the sch…

    Submitted 6 December, 2020; originally announced December 2020.

    Comments: 12 pages, 2 appendix pages, 18 figures, to be published in Proceedings of IEEE Big Data 2020 - 5th Computational Archival Science (CAS) Workshop

  32. arXiv:2008.11290  [pdf, other]

    cs.CL cs.IR cs.LG

    Extractive Summarizer for Scholarly Articles

    Authors: Athar Sefid, Clyde Lee Giles, Prasenjit Mitra

    Abstract: We introduce an extractive method that will summarize long scientific papers. Our model uses presentation slides provided by the authors of the papers as the gold summary standard to label the sentences. The sentences are ranked based on their novelty and their importance as estimated by deep neural networks. Our window-based extractive labeling of sentences results in the improvement of at least…

    Submitted 25 August, 2020; originally announced August 2020.

  33. arXiv:2007.13826  [pdf, other]

    cs.CL cs.DL

    Large Scale Subject Category Classification of Scholarly Papers with Deep Attentive Neural Networks

    Authors: Bharath Kandimalla, Shaurya Rohatgi, Jian Wu, C Lee Giles

    Abstract: Subject categories of scholarly papers generally refer to the knowledge domain(s) to which the papers belong, examples being computer science or physics. Subject category information can be used for building faceted search for digital library search engines. This can significantly assist users in narrowing down their search space of relevant documents. Unfortunately, many academic papers do not ha…

    Submitted 27 July, 2020; originally announced July 2020.

    Comments: submitted to "Frontiers Mining Scientific Papers Volume II: Knowledge Discovery and Data Exploitation"

  34. arXiv:2006.03651  [pdf, other]

    cs.LG cs.FL stat.ML

    A provably stable neural network Turing Machine

    Authors: John Stogin, Ankur Mali, C Lee Giles

    Abstract: We introduce a neural stack architecture, including a differentiable parametrized stack operator that approximates stack push and pop operations for suitable choices of parameters that explicitly represents a stack. We prove the stability of this stack architecture: after arbitrarily many stack operations, the state of the neural stack still closely resembles the state of the discrete stack. Using…

    Submitted 18 September, 2022; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 28 pages, 2 figures

  35. arXiv:2005.02367  [pdf, other]

    cs.CL cs.HC

    CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset

    Authors: Ting-Hao 'Kenneth' Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, C. Lee Giles

    Abstract: This paper introduces CODA-19, a human-annotated dataset that codes the Background, Purpose, Method, Finding/Contribution, and Other sections of 10,966 English abstracts in the COVID-19 Open Research Dataset. CODA-19 was created by 248 crowd workers from Amazon Mechanical Turk within 10 days, and achieved labeling quality comparable to that of experts. Each abstract was annotated by nine different…

    Submitted 17 September, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: Accepted by the NLP COVID-19 Workshop at ACL 2020. (The data, code, and model are available at: https://github.com/windx0303/CODA-19)

  36. Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies

    Authors: Mukund Srinath, Shomir Wilson, C. Lee Giles

    Abstract: Organisations disclose their privacy practices by posting privacy policies on their website. Even though users often care about their digital privacy, they often don't read privacy policies since they require a significant investment in time and effort. Although natural language processing can help in privacy policy understanding, there has been a lack of large scale privacy policy corpora that co…

    Submitted 30 March, 2024; v1 submitted 23 April, 2020; originally announced April 2020.

    Journal ref: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021

  37. arXiv:2004.07623  [pdf, other]

    cs.CL cs.LG

    Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented With An External Differentiable Stack

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Clyde Lee Giles

    Abstract: Recurrent neural networks (RNNs) are a widely used deep architecture for sequence modeling, generation, and prediction. Despite success in applications such as machine translation and voice recognition, these stateful models have several critical shortcomings. Specifically, RNNs generalize poorly over very long sequences, which limits their applicability to many important temporal processing and t…

    Submitted 22 April, 2020; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: 14 pages, 10 tables

  38. arXiv:2002.03911  [pdf, other]

    cs.LG cs.NE stat.ML

    Large-Scale Gradient-Free Deep Learning with Recursive Local Representation Alignment

    Authors: Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

    Abstract: Training deep neural networks on large-scale datasets requires significant hardware resources whose costs (even on cloud platforms) put them out of reach of smaller organizations, groups, and individuals. Backpropagation, the workhorse for training these networks, is an inherently sequential process that is difficult to parallelize. Furthermore, it requires researchers to continually develop vario…

    Submitted 18 September, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Further revised submission -- main description of rec-LRA revamped and architecture-agnostic pseudo-code moved to appendix with additional results/derivation updates

  39. arXiv:1912.04115  [pdf, other]

    cs.IR

    Query Auto Completion for Math Formula Search

    Authors: Shaurya Rohatgi, Wei Zhong, Richard Zanibbi, Jian Wu, C. Lee Giles

    Abstract: Query Auto Completion (QAC) is among the most appealing features of a web search engine. It helps users formulate queries quickly with less effort. Although there has been much effort in this area for text, to the best of our knowledge there is little work on mathematical formula auto completion. In this paper, we implement 5 existing QAC methods on mathematical formula and evaluate them on the NTCIR…

    Submitted 9 December, 2019; originally announced December 2019.

  40. arXiv:1912.00839  [pdf, other]

    cs.CL

    Automatic Generation of Headlines for Online Math Questions

    Authors: Ke Yuan, Dafang He, Zhuoren Jiang, Liangcai Gao, Zhi Tang, C. Lee Giles

    Abstract: Mathematical equations are an important part of dissemination and communication of scientific information. Students, however, often feel challenged in reading and understanding math content and equations. With the development of the Web, students are posting their math questions online. Nevertheless, constructing a concise math headline that gives a good description of the posted detailed math que…

    Submitted 27 November, 2019; originally announced December 2019.

    Journal ref: AAAI 2020

  41. arXiv:1911.08478  [pdf, other]

    cs.CV cs.LG

    Sibling Neural Estimators: Improving Iterative Image Decoding with Gradient Communication

    Authors: Ankur Mali, Alexander G. Ororbia, Clyde Lee Giles

    Abstract: For lossy image compression, we develop a neural-based system which learns a nonlinear estimator for decoding from quantized representations. The system links two recurrent networks that "help" each other reconstruct same target image patches using complementary portions of spatial context that communicate via gradient signals. This dual agent system builds upon prior work that proposed the iterat… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: 11 Pages, 2 figures, 1 Table

  42. arXiv:1911.04644  [pdf, other

    cs.LG stat.ML

    Connecting First and Second Order Recurrent Networks with Deterministic Finite Automata

    Authors: Qinglong Wang, Kaixuan Zhang, Xue Liu, C. Lee Giles

    Abstract: We propose an approach that connects recurrent networks with different orders of hidden interaction with regular grammars of different levels of complexity. We argue that the correspondence between recurrent networks and formal computational models gives understanding to the analysis of the complicated behaviors of recurrent networks. We introduce an entropy value that categorizes all regular gram… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

  43. arXiv:1910.06509  [pdf, other

    cs.LG math.AT stat.ML

    Shapley Homology: Topological Analysis of Sample Influence for Neural Networks

    Authors: Kaixuan Zhang, Qinglong Wang, Xue Liu, C. Lee Giles

    Abstract: Data samples collected for training machine learning models are typically assumed to be independent and identically distributed (iid). Recent research has demonstrated that this assumption can be problematic as it simplifies the manifold of structured data. This has motivated different research areas such as data poisoning, model improvement, and explanation of machine learning models. In this wor… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  44. arXiv:1909.05233  [pdf, other

    cs.NE cs.CL cs.LG

    The Neural State Pushdown Automata

    Authors: Ankur Mali, Alexander Ororbia, C. Lee Giles

    Abstract: In order to learn complex grammars, recurrent neural networks (RNNs) require sufficient computational resources to ensure correct grammar recognition. A widely-used approach to expand model capacity would be to couple an RNN to an external memory stack. Here, we introduce a "neural state" pushdown automaton (NSPDA), which consists of a digital stack, instead of an analog one, that is coupled to a… ▽ More

    Submitted 19 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: 10 pages, 7 tables, 1 figure

  45. arXiv:1906.08470  [pdf, other

    cs.DL cs.IR

    Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big Datasets

    Authors: Athar Sefid, Jian Wu, Allen C. Ge, Jing Zhao, Lu Liu, Cornelia Caragea, Prasenjit Mitra, C. Lee Giles

    Abstract: Automatically extracted metadata from scholarly documents in PDF formats is usually noisy and heterogeneous, often containing incomplete fields and erroneous values. One common way of cleaning metadata is to use a bibliographic reference dataset. The challenge is to match records between corpora with high precision. The existing solution which is based on information retrieval and string similarit… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

  46. arXiv:1905.10696  [pdf, other

    cs.LG cs.NE stat.ML

    Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting

    Authors: Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

    Abstract: In lifelong learning systems based on artificial neural networks, one of the biggest obstacles is the inability to retain old knowledge as new information is encountered. This phenomenon is known as catastrophic forgetting. In this paper, we propose a new kind of connectionist architecture, the Sequential Neural Coding Network, that is robust to forgetting when learning from streams of data points… ▽ More

    Submitted 14 August, 2022; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Updated revision, additional baseline results, and expanded appendix (includes derivation from total discrepancy/variational free energy)

  47. arXiv:1811.06029  [pdf, other

    cs.LG stat.ML

    Verification of Recurrent Neural Networks Through Rule Extraction

    Authors: Qinglong Wang, Kaixuan Zhang, Xue Liu, C. Lee Giles

    Abstract: The verification problem for neural networks is verifying whether a neural network will suffer from adversarial samples, or approximating the maximal allowed scale of adversarial perturbation that can be endured. While most prior work contributes to verifying feed-forward networks, little has been explored for verifying recurrent networks. This is due to the existence of a more rigorous constraint… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  48. arXiv:1810.07411  [pdf, other

    cs.NE cs.LG

    Continual Learning of Recurrent Neural Networks by Locally Aligning Distributed Representations

    Authors: Alexander Ororbia, Ankur Mali, C. Lee Giles, Daniel Kifer

    Abstract: Temporal models based on recurrent neural networks have proven to be quite powerful in a wide variety of applications. However, training these models often relies on back-propagation through time, which entails unfolding the network over many time steps, making the process of conducting credit assignment considerably more challenging. Furthermore, the nature of back-propagation itself does not per… ▽ More

    Submitted 10 August, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

    Comments: Important revisions made throughout (additional items/results added, including a complexity analysis)

  49. arXiv:1809.03050  [pdf, other

    cs.CV

    TextContourNet: a Flexible and Effective Framework for Improving Scene Text Detection Architecture with a Multi-task Cascade

    Authors: Dafang He, Xiao Yang, Daniel Kifer, C. Lee Giles

    Abstract: We study the problem of extracting text instance contour information from images and use it to assist scene text detection. We propose a novel and effective framework for this and experimentally demonstrate that: (1) a CNN can be effectively used to extract instance-level text contours from natural images, and (2) the extracted contour information can be used for better scene text detection. We pr… ▽ More

    Submitted 2 December, 2018; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: 9 pages (including references); WACV 2019

  50. arXiv:1809.03036  [pdf, ps, other

    cs.CV

    A Neural Temporal Model for Human Motion Prediction

    Authors: Anand Gopalakrishnan, Ankur Mali, Dan Kifer, C. Lee Giles, Alexander G. Ororbia

    Abstract: We propose novel neural temporal models for predicting and synthesizing human motion, achieving state-of-the-art in modeling long-term motion trajectories while being competitive with prior work in short-term prediction and requiring significantly less computation. Key aspects of our proposed system include: 1) a novel, two-level processing architecture that aids in generating planned trajectories… ▽ More

    Submitted 22 November, 2019; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: Accepted to CVPR 2019

    Journal ref: In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 12116-12125. 2019