Skip to main content

Showing 1–50 of 129 results for author: Giles, L

.
  1. arXiv:2505.19173  [pdf, other

    cs.AI

    Investigating Pedagogical Teacher and Student LLM Agents: Genetic Adaptation Meets Retrieval Augmented Generation Across Learning Style

    Authors: Debdeep Sanyal, Agniva Maiti, Umakanta Maharana, Dhruv Kumar, Ankur Mali, C. Lee Giles, Murari Mandal

    Abstract: Effective teaching requires adapting instructional strategies to accommodate the diverse cognitive and behavioral profiles of students, a persistent challenge in education and teacher training. While Large Language Models (LLMs) offer promise as tools to simulate such complex pedagogical environments, current simulation frameworks are limited in two key respects: (1) they often reduce students to… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 38 Pages

  2. arXiv:2501.19353  [pdf, other

    cs.CL cs.AI cs.CV

    Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SciCap Challenge 2023

    Authors: Ting-Yao E. Hsu, Yi-Li Hsu, Shaurya Rohatgi, Chieh-Yang Huang, Ho Yin Sam Ng, Ryan Rossi, Sungchul Kim, Tong Yu, Lun-Wei Ku, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Since the SciCap datasets launch in 2021, the research community has made significant progress in generating captions for scientific figures in scholarly articles. In 2023, the first SciCap Challenge took place, inviting global teams to use an expanded SciCap dataset to develop models for captioning diverse figure types across various academic fields. At the same time, text generation models advan… ▽ More

    Submitted 18 February, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Accepted to TACL 2025

  3. arXiv:2501.02552  [pdf, other

    cs.CL cs.CV

    Multi-LLM Collaborative Caption Generation in Scientific Documents

    Authors: Jaeyoung Kim, Jongho Lee, Hong-Jun Choi, Ting-Yao Hsu, Chieh-Yang Huang, Sungchul Kim, Ryan Rossi, Tong Yu, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang, Sungchul Choi

    Abstract: Scientific figure captioning is a complex task that requires generating contextually appropriate descriptions of visual content. However, existing methods often fall short by utilizing incomplete information, treating the task solely as either an image-to-text or text summarization problem. This limitation hinders the generation of high-quality captions that fully capture the necessary details. Mo… ▽ More

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: Accepted to AAAI 2025 AI4Research Workshop

  4. arXiv:2411.12649  [pdf, other

    cs.IR

    PseudoSeer: a Search Engine for Pseudocode

    Authors: Levent Toksoz, Mukund Srinath, Gang Tan, C. Lee Giles

    Abstract: A novel pseudocode search engine is designed to facilitate efficient retrieval and search of academic papers containing pseudocode. By leveraging Elasticsearch, the system enables users to search across various facets of a paper, such as the title, abstract, author information, and LaTeX code snippets, while supporting advanced features like combined facet searches and exact-match queries for more… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

  5. arXiv:2410.03118  [pdf, other

    cs.CL

    Precision, Stability, and Generalization: A Comprehensive Assessment of RNNs learnability capability for Classifying Counter and Dyck Languages

    Authors: Neisarg Dave, Daniel Kifer, Lee Giles, Ankur Mali

    Abstract: This study investigates the learnability of Recurrent Neural Networks (RNNs) in classifying structured formal languages, focusing on counter and Dyck languages. Traditionally, both first-order (LSTM) and second-order (O2RNN) RNNs have been considered effective for such tasks, primarily based on their theoretical expressiveness within the Chomsky hierarchy. However, our research challenges this not… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 21 pages, 5 figures, 5 tables

  6. arXiv:2408.09176  [pdf, other

    cs.AI cs.CL cs.SC

    Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making

    Authors: Siyu Wu, Alessandro Oltramari, Jonathan Francis, C. Lee Giles, Frank E. Ritter

    Abstract: Resolving the dichotomy between the human-like yet constrained reasoning processes of Cognitive Architectures and the broad but often noisy inference behavior of Large Language Models (LLMs) remains a challenging but exciting pursuit, for enabling reliable machine reasoning capabilities in production systems. Because Cognitive Architectures are famously developed for the purpose of modeling the in… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures, 2 tables

  7. arXiv:2406.04635  [pdf, other

    cs.IR cs.AI

    Scaling Automatic Extraction of Pseudocode

    Authors: Levent Toksoz, Gang Tan, C. Lee Giles

    Abstract: Pseudocode in a scholarly paper provides a concise way to express the algorithms implemented therein. Pseudocode can also be thought of as an intermediary representation that helps bridge the gap between programming languages and natural languages. Having access to a large collection of pseudocode can provide various benefits ranging from enhancing algorithmic understanding, facilitating further a… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  8. arXiv:2405.13209  [pdf, other

    cs.CL cs.LG

    Investigating Symbolic Capabilities of Large Language Models

    Authors: Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

    Abstract: Prompting techniques have significantly enhanced the capabilities of Large Language Models (LLMs) across various complex tasks, including reasoning, planning, and solving math word problems. However, most research has predominantly focused on language-based reasoning and word problems, often overlooking the potential of LLMs in handling symbol-based calculations and reasoning. This study aims to b… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  9. arXiv:2405.11003  [pdf, other

    physics.space-ph

    Out-of-plane Parallel Current in the Diffusion Regions: The Interaction Between Diffusion Region Systems and their Impact on the Outer EDR

    Authors: Jason M. H. Beedle, Daniel J. Gershman, Vadim M. Uritsky, Jason R. Shuster, Tai D. Phan, Barbara L. Giles, Kevin J. Genestreti, Roy B. Torbert

    Abstract: Dayside magnetic reconnection allows for the transfer of the solar wind's energy into Earth's magnetosphere. This process takes place in electron diffusion regions (EDRs) embedded in ion diffusion regions (IDRs), which form in the magnetopause boundary's current sheet. A significant out-of-plane parallel current contribution in the diffusion regions was reported in Beedle et al. 2023. In order to… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  10. SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Crafting effective captions for figures is important. Readers heavily depend on these captions to grasp the figure's message. However, despite a well-developed set of AI technologies for figures and captions, these have rarely been tested for usefulness in aiding caption writing. This paper introduces SciCapenter, an interactive system that puts together cutting-edge AI technologies for scientific… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems

  11. arXiv:2402.11006  [pdf, other

    cs.CR cs.LG

    Automated Detection and Analysis of Data Practices Using A Real-World Corpus

    Authors: Mukund Srinath, Pranav Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson

    Abstract: Privacy policies are crucial for informing users about data practices, yet their length and complexity often deter users from reading them. In this paper, we propose an automated approach to identify and visualize data practices within privacy policies at different levels of detail. Leveraging crowd-sourced annotations from the ToS;DR platform, we experiment with various methods to match policy ex… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.02627  [pdf, other

    cs.LG

    Stability Analysis of Various Symbolic Rule Extraction Methods from Recurrent Neural Network

    Authors: Neisarg Dave, Daniel Kifer, C. Lee Giles, Ankur Mali

    Abstract: This paper analyzes two competing rule extraction methodologies: quantization and equivalence query. We trained $3600$ RNN models, extracting $18000$ DFA with a quantization approach (k-means and SOM) and $3600$ DFA by equivalence query($L^{*}$) methods across $10$ initialization seeds. We sampled the datasets from $7$ Tomita and $4$ Dyck grammars and trained them on $4$ RNN cells: LSTM, GRU, O2RN… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  13. arXiv:2312.15627  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Ultrahigh electrostrain in Pb-free piezoceramics: Effect of bending

    Authors: Gobinda Das Adhikary, John Daniels, Luke Giles, Rajeev Ranjan

    Abstract: Recently several reports showing ultra-high electrostrain (> 1 %) have appeared in Pb-free piezoceramics. However, there is lack of clarity on the nature of the ultrahigh strain. Here, we demonsrate that the ultrahigh strain is a consequence of bending of the disc. We show that the propensity for bending arises from the difference in the response magnitude of the grains at the positive and negativ… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 8 pages 4 figures

  14. arXiv:2311.05411  [pdf, other

    physics.space-ph physics.plasm-ph

    Multi-scale observation of magnetotail reconnection onset: 2. microscopic dynamics

    Authors: K. J. Genestreti, C. Farrugia, S. Lu, S. K. Vines, P. H. Reiff, T. -D. Phan, D. N. Baker, T. W. Leonard, J. L. Burch, S. T. Bingham, I. J. Cohen, J. R. Shuster, D. J. Gershman, C. G. Mouikis, A. T. Rogers, R. B. Torbert, K. J. Trattner, J. M. Webster, L. -J. Chen, B. L. Giles, N. Ahmadi, R. E. Ergun, C. T. Russell, R. J. Strangeway, R. Nakamura , et al. (1 additional authors not shown)

    Abstract: We analyze the local dynamics of magnetotail reconnection onset using Magnetospheric Multiscale (MMS) data. In conjunction with MMS, the macroscopic dynamics of this event were captured by a number of other ground and space-based observatories, as is reported in a companion paper. We find that the local dynamics of the onset were characterized by the rapid thinning of the cross-tail current sheet… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: In press, JGR Space Physics, JGRA58162

  15. arXiv:2311.05405  [pdf, other

    physics.space-ph

    Multi-scale observation of magnetotail reconnection onset: 1. macroscopic dynamics

    Authors: K. J. Genestreti, C. Farrugia, S. Lu, S. K. Vines, P. H. Reiff, T. -D. Phan, D. N. Baker, T. W. Leonard, J. L. Burch, S. T. Bingham, I. J. Cohen, J. R. Shuster, D. J. Gershman, C. G. Mouikis, A. T. Rogers, R. B. Torbert, K. J. Trattner, J. M. Webster, L. -J. Chen, B. L. Giles, N. Ahmadi, R. E. Ergun, C. T. Russell, R. J. Strangeway, R. Nakamura

    Abstract: We analyze a magnetotail reconnection onset event on 3 July 2017 that was observed under otherwise quiescent magnetospheric conditions by a fortuitous conjunction of six space and ground-based observatories. The study investigates the large-scale coupling of the solar wind - magnetosphere system that precipitated the onset of the magnetotail reconnection, focusing on the processes that thinned and… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: In press, JGR space physics, JGRA58161

  16. arXiv:2310.15405  [pdf, other

    cs.CL

    GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

    Abstract: There is growing interest in systems that generate captions for scientific figures. However, assessing these systems output poses a significant challenge. Human evaluation requires academic expertise and is costly, while automatic evaluation depends on often low-quality author-written captions. This paper investigates using large language models (LLMs) as a cost-effective, reference-free method fo… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To Appear in EMNLP 2023 Findings

  17. arXiv:2309.14691  [pdf, other

    cs.LG cs.CC

    On the Computational Complexity and Formal Hierarchy of Second Order Recurrent Neural Networks

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Artificial neural networks (ANNs) with recurrence and self-attention have been shown to be Turing-complete (TC). However, existing work has shown that these ANNs require multiple turns or unbounded computation time, even with unbounded precision in weights, in order to recognize TC grammars. However, under constraints such as fixed or bounded precision neurons and time, ANNs without memory are sho… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 12 pages, 5 tables, 1 figure

  18. arXiv:2309.14690  [pdf, ps, other

    cs.CC

    On the Tensor Representation and Algebraic Homomorphism of the Neural State Turing Machine

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recurrent neural networks (RNNs) and transformers have been shown to be Turing-complete, but this result assumes infinite precision in their hidden representations, positional encodings for transformers, and unbounded computation time in general. In practical applications, however, it is crucial to have real-time models that can recognize Turing complete grammars in a single pass. To address this… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 14 pages, 7 tables

  19. arXiv:2306.09370  [pdf, other

    physics.space-ph

    Differentiating EDRs from the Background Magnetopause Current Sheet: A Statistical Study

    Authors: Jason M. H. Beedle, Daniel J. Gershman, Vadim M. Uritsky, Tai D. Phan, Barbara L. Giles

    Abstract: The solar wind is a continuous outflow of charged particles from the Sun's atmosphere into the solar system. At Earth, the solar wind's outward pressure is balanced by the Earth's magnetic field in a boundary layer known as the magnetopause. Plasma density and temperature differences across the boundary layer generate the Chapman-Ferraro current which supports the magnetopause. Along the dayside m… ▽ More

    Submitted 18 September, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  20. arXiv:2305.14520  [pdf, other

    physics.space-ph astro-ph.IM astro-ph.SR physics.plasm-ph

    Three-dimensional energy transfer in space plasma turbulence from multipoint measurement

    Authors: Francesco Pecora, Sergio Servidio, Yan Yang, William H. Matthaeus, Alexandros Chasapis, Antonella Greco, Daniel J. Gershman, Barbara L. Giles, James L. Burch

    Abstract: A novel multispacecraft technique applied to Magnetospheric Multiscale (MMS) mission data collected in the Earth's magnetosheath enables evaluation of the energy cascade rate solving the full Yaglom's equation in a turbulent space plasma. The method differs from existing approaches in that (i) it is inherently three-dimensional; (ii) it provides a statistically significant number of estimates from… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  21. arXiv:2303.00866  [pdf, other

    cs.HC cs.AI cs.LG

    A prototype hybrid prediction market for estimating replicability of published work

    Authors: Tatiana Chakravorti, Robert Fraleigh, Timothy Fritton, Michael McLaughlin, Vaibhav Singh, Christopher Griffin, Anthony Kwasnica, David Pennock, C. Lee Giles, Sarah Rajtmajer

    Abstract: We present a prototype hybrid prediction market and demonstrate the avenue it represents for meaningful human-AI collaboration. We build on prior work proposing artificial prediction markets as a novel machine-learning algorithm. In an artificial prediction market, trained AI agents buy and sell outcomes of future events. Classification decisions can be framed as outcomes of future events, and acc… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  22. arXiv:2302.12324  [pdf, other

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be… ▽ More

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  23. arXiv:2302.00634  [pdf, other

    physics.flu-dyn astro-ph.SR physics.plasm-ph physics.space-ph

    Relaxation of the turbulent magnetosheath

    Authors: Francesco Pecora, Yan Yang, Alexandros Chasapis, Sergio Servidio, Manuel Cuesta, Sohom Roy, Rohit Chhiber, Riddhi Bandyopadhyay, D. J. Gershman, B. L. Giles, J. L. Burch, William H. Matthaeus

    Abstract: In turbulence, nonlinear terms drive energy transfer from large-scale eddies into small scales through the so-called energy cascade. Turbulence often relaxes toward states that minimize energy; typically these states are considered globally. However, turbulence can also relax toward local quasi-equilibrium states, creating patches or cells where the magnitude of nonlinearity is reduced and energy… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  24. arXiv:2301.12293  [pdf, other

    cs.AI cs.CV cs.DL

    ACL-Fig: A Dataset for Scientific Figure Classification

    Authors: Zeba Karishma, Shaurya Rohatgi, Kavya Shrinivas Puranik, Jian Wu, C. Lee Giles

    Abstract: Most existing large-scale academic search engines are built to retrieve text-based information. However, there are no large-scale retrieval services for scientific figures and tables. One challenge for such services is understanding scientific figures' semantics, such as their types and purposes. A key obstacle is the need for datasets containing annotated scientific figures and tables, which can… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: 6 pages, 4 figures, accepted by the AAAI-23 Workshop on Scientific Document Understanding

  25. arXiv:2211.16590  [pdf, other

    cs.IT

    Artificial prediction markets present a novel opportunity for human-AI collaboration

    Authors: Tatiana Chakravorti, Vaibhav Singh, Sarah Rajtmajer, Michael McLaughlin, Robert Fraleigh, Christopher Griffin, Anthony Kwasnica, David Pennock, C. Lee Giles

    Abstract: Despite high-profile successes in the field of Artificial Intelligence, machine-driven technologies still suffer important limitations, particularly for complex tasks where creativity, planning, common sense, intuition, or learning from limited data is required. These limitations motivate effective methods for human-machine collaboration. Our work makes two primary contributions. We thoroughly exp… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  26. arXiv:2208.12671  [pdf

    astro-ph.EP physics.space-ph

    Thin current sheet behind the dipolarization front

    Authors: Nakamura, R., Baumjohann, W., Nakamura, T. K. M., Panov, E., V., Schmid, D., Varsani, A., S. Apatenkov, V. A. Sergeev, J. Birn, T. Nagai, C. Gabrielse, M. Andre, J. L. Burch, C. Carr, I. S Dandouras, C. P. Escoubet, A, N. Fazakerley , et al. (4 additional authors not shown)

    Abstract: We report a unique conjugate observation of fast flows and associated current sheet disturbances in the near-Earth magnetotail by MMS (Magnetospheric Multiscale) and Cluster preceding a positive bay onset of a small substorm at ~14:10 UT, Sep. 8, 2018. MMS and Cluster were located both at X ~-14 RE. A dipolarization front (DF) of a localized fast flow was detected by Cluster and MMS, separated in… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Journal ref: Journal of Geophysical Research: Space Physics, 126, e2021JA029518, 2021

  27. arXiv:2207.09029  [pdf, other

    physics.space-ph physics.flu-dyn physics.plasm-ph

    Tens to hundreds of keV electron precipitation driven by kinetic Alfvén waves during an electron injection

    Authors: Y. Shen, A. V. Artemyev, X. -J. Zhang, V. Angelopoulos, I. Vasko, D. Turner, E. Tsai, C. Wilkins, J. Weygand, C. T. Russell, R. E. Ergun, B. L. Giles

    Abstract: Electron injections are critical processes associated with magnetospheric substorms, which deposit significant electron energy into the ionosphere. Although wave scattering of $<$10 keV electrons during injections has been well studied, the link between magnetotail electron injections and energetic ($\geq$100 keV) electron precipitation remains elusive. Using conjugate observations between the ELF… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: 25 pages, 5 figures, with supporting information, the manuscript has been accepted for publication by JGR space physics

  28. arXiv:2203.13879  [pdf, other

    physics.space-ph

    On the origin of "patchy" energy conversion in electron diffusion regions

    Authors: Kevin J. Genestreti, Xiaocan Li, Yi-Hsin Liu, James L. Burch, Roy B. Torbert, Stephen A. Fuselier, Takuma Nakamura, Barbara L. Giles, Daniel J. Gershman, Robert E. Ergun, Christopher T. Russell, Robert J. Strangeway

    Abstract: During magnetic reconnection, field lines interconnect in electron diffusion regions (EDRs). In some EDRs the reconnection and energy conversion rates are controlled by a steady out-of-plane electric field. In other EDRs the energy conversion rate $\vec{J}\cdot\vec{E}'$ is "patchy", with electron-scale large-amplitude positive and negative peaks. We investigate 22 EDRs observed by NASA's Magnetosp… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 31 pages, 6 figures, submitted to Physics of Plasmas

  29. arXiv:2201.11795  [pdf, other

    eess.IV cs.CV

    Neural JPEG: End-to-End Image Compression Leveraging a Standard JPEG Encoder-Decoder

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recent advances in deep learning have led to superhuman performance across a variety of applications. Recently, these methods have been successfully employed to improve the rate-distortion performance in the task of image compression. However, current methods either use additional post-processing blocks on the decoder end to improve compression or propose an end-to-end compression scheme based on… ▽ More

    Submitted 31 January, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted in DCC 2022, 11 pages

  30. arXiv:2201.11782  [pdf, other

    cs.CV

    An Empirical Analysis of Recurrent Learning Algorithms In Neural Lossy Image Compression Systems

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, Lee Giles

    Abstract: Recent advances in deep learning have resulted in image compression algorithms that outperform JPEG and JPEG 2000 on the standard Kodak benchmark. However, they are slow to train (due to backprop-through-time) and, to the best of our knowledge, have not been systematically evaluated on a large variety of datasets. In this paper, we perform the first large-scale comparison of recent state-of-the-ar… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted at DCC 2021, 15 pages

  31. arXiv:2201.08495  [pdf, other

    cs.CL

    SciBERTSUM: Extractive Summarization for Scientific Documents

    Authors: Athar Sefid, C Lee Giles

    Abstract: The summarization literature focuses on the summarization of news articles. The news articles in the CNN-DailyMail are relatively short documents with about 30 sentences per document on average. We introduce SciBERTSUM, our summarization framework designed for the summarization of long documents like scientific papers with more than 500 sentences. SciBERTSUM extends BERTSUM to long documents by 1)… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  32. arXiv:2201.06924  [pdf, other

    cs.CY cs.AI cs.IR cs.LG cs.MA

    A Synthetic Prediction Market for Estimating Confidence in Published Work

    Authors: Sarah Rajtmajer, Christopher Griffin, Jian Wu, Robert Fraleigh, Laxmaan Balaji, Anna Squicciarini, Anthony Kwasnica, David Pennock, Michael McLaughlin, Timothy Fritton, Nishanth Nakshatri, Arjun Menon, Sai Ajay Modukuri, Rajal Nivargi, Xin Wei, C. Lee Giles

    Abstract: Explainably estimating confidence in published scholarly work offers opportunity for faster and more robust scientific progress. We develop a synthetic prediction market to assess the credibility of published claims in the social and behavioral sciences literature. We demonstrate our system and detail our findings using a collection of known replication projects. We suggest that this work lays the… ▽ More

    Submitted 23 December, 2021; originally announced January 2022.

  33. arXiv:2201.06091  [pdf, other

    physics.med-ph

    Parallel transmit PUlse design for Saturation Homogeneity (PUSH) for Magnetization Transfer imaging at 7T

    Authors: David Leitão, Raphael Tomi-Tricot, Pip Bridgen, Tom Wilkinson, Patrick Liebig, Rene Gumbrecht, Dieter Ritter, Sharon L. Giles, Ana Baburamani, Jan Sedlacik, Joseph V. Hajnal, Shaihan J. Malik

    Abstract: Purpose: This work proposes a novel RF pulse design for parallel transmit (pTx) systems to obtain uniform saturation of semisolid magnetization for Magnetization Transfer (MT) contrast in the presence of transmit field ($B_1^+$) inhomogeneities. The semisolid magnetization is usually modeled as being purely longitudinal, with the applied $B_1^+$ field saturating but not rotating its magnetization,… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

    Comments: 18 pages, 9 figures. Code available at: https://github.com/mriphysics/PUSH

  34. arXiv:2112.00215  [pdf, other

    physics.space-ph astro-ph.EP astro-ph.SR

    Impact angle control of local intense d$B$/d$t$ variations during shock-induced substorms

    Authors: Denny M. Oliveira, James M. Weygand, Eftyhia Zesta, Chigomezyo M. Ngwira, Michael D. Hartinger, Zhonghua Xu, Barbara L. Giles, Dan J. Gershman, Marcos V. D. Silveira, Vitor M. Souza

    Abstract: The impact of interplanetary shocks on the magnetosphere can trigger magnetic substorms that intensify auroral electrojet currents. These currents enhance ground magnetic field perturbations (d$B$/d$t$), which in turn generate geomagnetically induced currents (GICs) that can be detrimental to power transmission infrastructure. We perform a comparative study of d$B$/d$t$ variations in response to t… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 44 pages, 18 figures, 3 tables

    Journal ref: Published in Space Weather, 2021

  35. arXiv:2111.06329  [pdf, other

    physics.space-ph astro-ph.EP

    A Systematic Look at the Temperature Gradient Contribution to the Dayside Magnetopause Current

    Authors: Jason M. H. Beedle, David J. Gershman, Vadim M. Uritsky, Tai D. Phan, Barbara L. Giles

    Abstract: Magnetopause diamagnetic currents arise from density and temperature driven pressure gradients across the boundary layer. While theoretically recognized, the temperature contributions to the magnetopause current system have not yet been systematically studied. To bridge this gap, we used a database of Magnetospheric Multiscale (MMS) magnetopause crossings to analyze diamagnetic current densities a… ▽ More

    Submitted 15 February, 2022; v1 submitted 3 October, 2021; originally announced November 2021.

  36. arXiv:2111.03118  [pdf, other

    physics.plasm-ph physics.space-ph

    Energy Dissipation in Turbulent Reconnection

    Authors: R. Bandyopadhyay, A. Chasapis, W. H. Matthaeus, T. N. Parashar, C. C. Haggerty, M. A. Shay, D. J. Gershman, B. L. Giles, J. L. Burch

    Abstract: We study the nature of pressure-strain interaction at reconnection sites, detected by NASA's Magnetospheric Multiscale (MMS) Mission. We employ data from a series of published case studies, including a large-scale reconnection event at the magnetopause, three small-scale reconnection events at the magnetosheath current sheets, and one example of the recently discovered electron-only reconnection.… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: The following article has been accepted by Physics of Plasmas

  37. arXiv:2110.11624  [pdf, other

    cs.CL cs.AI cs.CV

    SciCap: Generating Captions for Scientific Figures

    Authors: Ting-Yao Hsu, C. Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Researchers use figures to communicate rich, complex information in scientific papers. The captions of these figures are critical to conveying effective messages. However, low-quality figure captions commonly occur in scientific articles and may decrease understanding. In this paper, we propose an end-to-end neural framework to automatically generate informative, high-quality captions for scientif… ▽ More

    Submitted 25 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To Appear in EMNLP 2021 Findings. The dataset is available at: https://github.com/tingyaohsu/SciCap

  38. arXiv:2106.03246  [pdf, other

    cs.CL cs.AI

    Extractive Research Slide Generation Using Windowed Labeling Ranking

    Authors: Athar Sefid, Jian Wu, Prasenjit Mitra, Lee Giles

    Abstract: Presentation slides describing the content of scientific and technical papers are an efficient and effective way to present that work. However, manually generating presentation slides is labor intensive. We propose a method to automatically generate slides for scientific papers based on a corpus of 5000 paper-slide pairs compiled from conference proceedings websites. The sentence labeling module o… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

    Journal ref: NAACL/Proceedings of the Second Workshop on Scholarly Document Processing 2021

  39. Document Domain Randomization for Deep Learning Document Layout Extraction

    Authors: Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles

    Abstract: We present document domain randomization (DDR), the first successful transfer of convolutional neural networks (CNNs) trained only on graphically rendered pseudo-paper pages to real-world document segmentation. DDR renders pseudo-document pages by modeling randomized textual and non-textual contents of interest, with user-defined layout and font styles to support joint learning of fine-grained cla… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Main paper to appear in ICDAR 2021 (16th International Conference on Document Analysis and Recognition). This version contains additional materials. The associated test data is hosted on IEEE Data Port: http://doi.org/10.21227/326q-bf39

    Journal ref: International Conference on Document Analysis and Recognition (ICDAR), 2021

  40. arXiv:2104.09403  [pdf, other

    cs.CV

    OmniLayout: Room Layout Reconstruction from Indoor Spherical Panoramas

    Authors: Shivansh Rao, Vikas Kumar, Daniel Kifer, Lee Giles, Ankur Mali

    Abstract: Given a single RGB panorama, the goal of 3D layout reconstruction is to estimate the room layout by predicting the corners, floor boundary, and ceiling boundary. A common approach has been to use standard convolutional networks to predict the corners and boundaries, followed by post-processing to generate the 3D layout. However, the space-varying distortions in panoramic images are not compatible… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR, OmniCV Workshop. 10 Pages, 9 Figures, 6 Tables

  41. arXiv:2104.04580  [pdf, other

    cs.DL cs.AI cs.CL cs.LG

    Predicting the Reproducibility of Social and Behavioral Science Papers Using Supervised Learning Models

    Authors: Jian Wu, Rajal Nivargi, Sree Sai Teja Lanka, Arjun Manoj Menon, Sai Ajay Modukuri, Nishanth Nakshatri, Xin Wei, Zhuoer Wang, James Caverlee, Sarah M. Rajtmajer, C. Lee Giles

    Abstract: In recent years, significant effort has been invested verifying the reproducibility and robustness of research claims in social and behavioral sciences (SBS), much of which has involved resource-intensive replication projects. In this paper, we investigate prediction of the reproducibility of SBS papers using machine learning methods based on a set of features. We propose a framework that extracts… ▽ More

    Submitted 21 October, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 17 pages, 8 figures

  42. arXiv:2104.02899  [pdf, other

    cs.LG

    Recognizing and Verifying Mathematical Equations using Multiplicative Differential Neural Units

    Authors: Ankur Mali, Alexander Ororbia, Daniel Kifer, C. Lee Giles

    Abstract: Automated mathematical reasoning is a challenging problem that requires an agent to learn algebraic patterns that contain long-range dependencies. Two particular tasks that test this type of reasoning are (1) mathematical equation verification, which requires determining whether trigonometric and linear algebraic statements are valid identities or not, and (2) equation completion, which entails fi… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  43. Understanding the onset of hot streaks across artistic, cultural, and scientific careers

    Authors: Lu Liu, Nima Dehmamy, Jillian Chown, C. Lee Giles, Dashun Wang

    Abstract: Hot streaks dominate the main impact of creative careers. Despite their ubiquitous nature across a wide range of creative domains, it remains unclear if there is any regularity underlying the beginning of hot streaks. Here, we develop computational methods using deep learning and network science and apply them to novel, large-scale datasets tracing the career outputs of artists, film directors, an… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

  44. arXiv:2101.01787  [pdf, other

    cs.CE cs.LG

    Design and Analysis of a Synthetic Prediction Market using Dynamic Convex Sets

    Authors: Nishanth Nakshatri, Arjun Menon, C. Lee Giles, Sarah Rajtmajer, Christopher Griffin

    Abstract: We present a synthetic prediction market whose agent purchase logic is defined using a sigmoid transformation of a convex semi-algebraic set defined in feature space. Asset prices are determined by a logarithmic scoring market rule. Time varying asset prices affect the structure of the semi-algebraic sets leading to time-varying agent purchase rules. We show that under certain assumptions on the u… ▽ More

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 17 pages, 7 figures

  45. arXiv:2012.07565  [pdf, other

    cs.CL cs.IR cs.LG

    Automating Document Classification with Distant Supervision to Increase the Efficiency of Systematic Reviews

    Authors: Xiaoxiao Li, Rabah Al-Zaidy, Amy Zhang, Stefan Baral, Le Bao, C. Lee Giles

    Abstract: Objective: Systematic reviews of scholarly documents often provide complete and exhaustive summaries of literature relevant to a research question. However, well-done systematic reviews are expensive, time-demanding, and labor-intensive. Here, we propose an automatic document classification approach to significantly reduce the effort in reviewing documents. Methods: We first describe a manual docu… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

  46. Modeling Updates of Scholarly Webpages Using Archived Data

    Authors: Yasith Jayawardana, Alexander C. Nwala, Gavindya Jayawardena, Jian Wu, Sampath Jayarathna, Michael L. Nelson, C. Lee Giles

    Abstract: The vastness of the web imposes a prohibitive cost on building large-scale search engines with limited resources. Crawl frontiers thus need to be optimized to improve the coverage and freshness of crawled content. In this paper, we propose an approach for modeling the dynamics of change in the web using archived copies of webpages. To evaluate its utility, we conduct a preliminary study on the sch… ▽ More

    Submitted 6 December, 2020; originally announced December 2020.

    Comments: 12 pages, 2 appendix pages, 18 figures, to be published in Proceedings of IEEE Big Data 2020 - 5th Computational Archival Science (CAS) Workshop

  47. arXiv:2012.02641  [pdf, other

    physics.plasm-ph astro-ph.EP astro-ph.HE astro-ph.SR physics.space-ph

    In situ evidence of ion acceleration between consecutive reconnection jet fronts

    Authors: Filomena Catapano, Alessandro Retino, Gaetano Zimbardo, Alexandra Alexandrova, Ian J. Cohen, Drew L. Turner, Olivier Le Contel, Giulia Cozzani, Silvia Perri, Antonella Greco, Hugo Breuillard, Dominique Delcourt, Laurent Mirioni, Yuri Khotyaintsev, Andris Vaivads, Barbara L. Giles, Barry H. Mauk, Stephen A. Fuselier, Roy B. Torbert, Christopher T. Russell, Per A. Lindqvist, Robert E. Ergun, Thomas Moore, James L. Burch

    Abstract: Processes driven by unsteady reconnection can efficiently accelerate particles in many astrophysical plasmas. An example are the reconnection jet fronts in an outflow region. We present evidence of suprathermal ion acceleration between two consecutive reconnection jet fronts observed by the Magnetospheric Multiscale mission in the terrestrial magnetotail. An earthward propagating jet is approached… ▽ More

    Submitted 30 November, 2020; originally announced December 2020.

  48. arXiv:2010.01782  [pdf, other

    physics.space-ph physics.flu-dyn physics.plasm-ph

    Observation of Inertial-range Energy Cascade within a Reconnection Jet in Earth's Magnetotail

    Authors: Riddhi Bandyopadhyay, Alexandros Chasapis, D. J. Gershman, B. L. Giles, C. T. Russell, R. J. Strangeway, O. Le Contel, M. R. Argall, J. L. Burch

    Abstract: Earth's magnetotail region provides a unique environment to study plasma turbulence. We investigate the turbulence developed in an exhaust produced by magnetic reconnection at the terrestrial magnetotail region. Magnetic and velocity spectra show broad-band fluctuations corresponding to the inertial range, with Kolmorogov $-5/3$ scaling, indicative of a well developed turbulent cascade. We examine… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted for publication in MNRAS

  49. arXiv:2009.03079  [pdf, ps, other

    physics.plasm-ph

    Estimation of the electron density from spacecraft potential during high frequency electric field fluctuations

    Authors: O. W. Roberts, R. Nakamura, K. Torkar, D. B. Graham, D. J. Gershman, J. C. Holmes, A. Varsani, C. P. Escoubet, Z. Vörös, S. Wellenzohn, Y. Khotyaintsev, R. E. Ergun, B. L. Giles

    Abstract: Spacecraft potential has often been used to infer electron density with much higher time resolution than is typically possible with plasma instruments. However, recently two studies by Torkar et al. 2017 and Graham et al. 2018 have shown that external electric fields can also have an effect on the spacecraft potential by enhancing photoelectron escape from the surface. Consequently, should the ele… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: Published in JGR Space Physics

    Journal ref: Journal of Geophysical Research: Space Physics, 125, e2020JA027854

  50. arXiv:2008.11290  [pdf, other

    cs.CL cs.IR cs.LG

    Extractive Summarizer for Scholarly Articles

    Authors: Athar Sefid, Clyde Lee Giles, Prasenjit Mitra

    Abstract: We introduce an extractive method that will summarize long scientific papers. Our model uses presentation slides provided by the authors of the papers as the gold summary standard to label the sentences. The sentences are ranked based on their novelty and their importance as estimated by deep neural networks. Our window-based extractive labeling of sentences results in the improvement of at least… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.