Databases

Authors and titles for July 2024

Total of 117 entries : 1-50 51-100 101-117
[1] arXiv:2407.00017 [pdf, html, other]
Title: Streaming CityJSON datasets
Hugo Ledoux, Gina Stavropoulou, Balázs Dukai
Comments: Presented at the 3DGeoInfo 2024 conference: this https URL
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[2] arXiv:2407.00036 [pdf, html, other]
Title: LiveData -- A Worldwide Data Mesh for Stratified Data
Simone Bocca, Amarsanaa Ganbold, Tsolmon Zundui
Comments: Accepted to MMT-2024 Mongolian conference and ICTfocus journal (this https URL)
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[3] arXiv:2407.00064 [pdf, html, other]
Title: Constraint based Modeling according to Reference Design
Erik Heiland, Peter Hillmann, Andreas Karcher
Journal-ref: Conference on Perspectives in Business Informatics Research (BIR 2023)
Subjects: Databases (cs.DB); Information Retrieval (cs.IR); Information Theory (cs.IT); Software Engineering (cs.SE)
[4] arXiv:2407.00590 [pdf, html, other]
Title: Evaluating Learned Indexes for External-Memory Joins
Yuvaraj Chesetti, Prashant Pandey
Subjects: Databases (cs.DB)
[5] arXiv:2407.00998 [pdf, html, other]
Title: Opportunities for Shape-based Optimization of Link Traversal Queries
Bryan-Elliott Tam, Ruben Taelman, Pieter Colpaert, Ruben Verborgh
Comments: 6 pages, 2 figures
Subjects: Databases (cs.DB)
[6] arXiv:2407.01127 [pdf, html, other]
Title: Tractable Circuits in Database Theory
Antoine Amarilli, Florent Capelli
Comments: 15 pages including 12 pages of main text
Subjects: Databases (cs.DB)
[7] arXiv:2407.01183 [pdf, html, other]
Title: TCSR-SQL: Towards Table Content-aware Text-to-SQL with Self-retrieval
Wenbo Xu, Liang Yan, Peiyi Han, Haifeng Zhu, Chuanyi Liu, Shaoming Duan, Cuiyun Gao, Yingwei Liang
Subjects: Databases (cs.DB)
[8] arXiv:2407.02475 [pdf, other]
Title: Database Systems Course: Service Learning Project
Sherri WeitlHarms
Comments: Presented at and published in the Proceedings of 2012 Midwest Instructional Computing Symposium, Cedar Falls, Iowa, April 14, 2012 (MICS 2012). 15 pages; 6 figures; 3 appendices
Subjects: Databases (cs.DB); Computers and Society (cs.CY)
[9] arXiv:2407.02626 [pdf, other]
Title: The text2term tool to map free-text descriptions of biomedical terms to ontologies
Rafael S. Gonçalves, Jason Payne, Amelia Tan, Carmen Benitez, Jamie Haddock, Robert Gentleman
Subjects: Databases (cs.DB)
[10] arXiv:2407.02803 [pdf, html, other]
Title: KnobCF: Uncertainty-aware Knob Tuning
Yu Yan, Junfang Huang, Hongzhi Wang, Jian Geng, Kaixin Zhang, Tao Yu
Subjects: Databases (cs.DB)
[11] arXiv:2407.02862 [pdf, html, other]
Title: HybEA: Hybrid Models for Entity Alignment
Nikolaos Fanourakis, Fatia Lekbour, Guillaume Renton, Vasilis Efthymiou, Vassilis Christophides
Subjects: Databases (cs.DB)
[12] arXiv:2407.02994 [pdf, html, other]
Title: MedPix 2.0: A Comprehensive Multimodal Biomedical Data set for Advanced AI Applications
Irene Siragusa, Salvatore Contino, Massimo La Ciura, Rosario Alicata, Roberto Pirrone
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[13] arXiv:2407.03112 [pdf, html, other]
Title: A Data Model and Predicate Logic for Trajectory Data (Extended Version)
Johann Bornholdt, Theodoros Chondrogiannis, Michael Grossniklaus
Comments: Extended version of the ADBIS 2024 paper with the same title
Subjects: Databases (cs.DB)
[14] arXiv:2407.03286 [pdf, other]
Title: Large Language Models for JSON Schema Discovery
Michael J. Mior
Subjects: Databases (cs.DB)
[15] arXiv:2407.03750 [pdf, html, other]
Title: GriDB: Scaling Blockchain Database via Sharding and Off-Chain Cross-Shard Mechanism
Zicong Hong, Song Guo, Enyuan Zhou, Wuhui Chen, Huawei Huang, Albert Zomaya
Subjects: Databases (cs.DB)
[16] arXiv:2407.03954 [pdf, other]
Title: Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs
Yanping Wu, Renjie Sun, Xiaoyang Wang, Dong Wen, Ying Zhang, Lu Qin, Xuemin Lin
Subjects: Databases (cs.DB)
[17] arXiv:2407.04217 [pdf, html, other]
Title: An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models
Mengzhao Wang, Haotian Wu, Xiangyu Ke, Yunjun Gao, Xiaoliang Xu, Lu Chen
Comments: This demo paper has been accepted by VLDB 2024
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[18] arXiv:2407.04823 [pdf, html, other]
Title: Path-based Algebraic Foundations of Graph Query Languages
Renzo Angles, Angela Bonifati, Roberto García, Domagoj Vrgoč
Comments: Under review
Subjects: Databases (cs.DB)
[19] arXiv:2407.05096 [pdf, other]
Title: Database Technology Evolution III: Knowledge Graphs and Linked Data
Malcolm Crowe, Fritz Laux
Comments: 10 pages, 2 figures
Journal-ref: IARIA, 2024, ISBN: 978-1-68558-180-0
Subjects: Databases (cs.DB)
[20] arXiv:2407.05952 [pdf, html, other]
Title: H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
Nikhil Abhyankar, Vivek Gupta, Dan Roth, Chandan K. Reddy
Comments: NAACL 2025 Main Conference
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[21] arXiv:2407.06199 [pdf, other]
Title: Data Governance and Data Management in Operations and Supply Chain: A Literature Review
Xuejiao Li, Yang Cheng, Xiaoning Xia, Charles Møller
Subjects: Databases (cs.DB)
[22] arXiv:2407.06228 [pdf, other]
Title: Implementing the Typed Graph Data Model Using Relational Database Technology
Malcolm Crowe, Fritz Laux
Comments: 12 pages, 8 figures, 2 tables. arXiv admin note: text overlap with arXiv:2303.12376
Journal-ref: IARIA, 2023, ISSN: 1942-2628
Subjects: Databases (cs.DB)
[23] arXiv:2407.06766 [pdf, other]
Title: Relational Perspective on Graph Query Languages
Diego Figueira, Anthony W. Lin, Liat Peterfreund
Subjects: Databases (cs.DB)
[24] arXiv:2407.07502 [pdf, html, other]
Title: Understanding the Semantic SQL Transducer
Théo Abgrall, Enrico Franconi
Subjects: Databases (cs.DB)
[25] arXiv:2407.07560 [pdf, html, other]
Title: Instrumentation and Analysis of Native ML Pipelines via Logical Query Plans
Stefan Grafberger
Journal-ref: VLDB 2024 Workshop: VLDB Ph.D. Workshop
Subjects: Databases (cs.DB); Machine Learning (cs.LG); Software Engineering (cs.SE)
[26] arXiv:2407.08082 [pdf, html, other]
Title: Maritime Tracking Data Analysis and Integration with AISdb
Gabriel Spadon, Jay Kumar, Jinkun Chen, Matthew Smith, Casey Hilliard, Sarah Vela, Romina Gehrmann, Claudio DiBacco, Stan Matwin, Ronald Pelot
Subjects: Databases (cs.DB)
[27] arXiv:2407.08874 [pdf, other]
Title: Implications of mappings between ICD clinical diagnosis codes and Human Phenotype Ontology terms
Amelia LM Tan, Rafael S Gonçalves, William Yuan, Gabriel A Brat, The Consortium for Clinical Characterization of COVID-19 by EHR (4CE), Robert Gentleman, Isaac S Kohane
Subjects: Databases (cs.DB)
[28] arXiv:2407.09023 [pdf, html, other]
Title: Challenges of Anomaly Detection in the Object-Centric Setting: Dimensions and the Role of Domain Knowledge
Alessandro Berti, Urszula Jessen, Wil M.P. van der Aalst, Dirk Fahland
Subjects: Databases (cs.DB)
[29] arXiv:2407.09409 [pdf, other]
Title: Thunderbolt: Concurrent Smart Contract Execution with Nonblocking Reconfiguration for Sharded DAGs
Junchao Chen, Alberto Sonnino, Lefteris Kokoris-Kogias, Mohammad Sadoghi
Comments: 15 pages
Subjects: Databases (cs.DB)
[30] arXiv:2407.09522 [pdf, html, other]
Title: UQE: A Query Engine for Unstructured Databases
Hanjun Dai, Bethany Yixin Wang, Xingchen Wan, Bo Dai, Sherry Yang, Azade Nova, Pengcheng Yin, Phitchaya Mangpo Phothilimthana, Charles Sutton, Dale Schuurmans
Journal-ref: NeurIPS 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
[31] arXiv:2407.09566 [pdf, other]
Title: Implementing the draft Graph Query Language Standard
Malcolm Crowe, Fritz Laux
Comments: 5 pages, 4 figures
Journal-ref: IARIA, 2024. ISBN: 978-1-68558-138-1
Subjects: Databases (cs.DB)
[32] arXiv:2407.09885 [pdf, html, other]
Title: Statistical Validation of Column Matching in the Database Schema Evolution of the Brazilian Public School Census
Muriki G. Yamanaka, Diogo H. de Almeida, Paulo R. Lisboa de Almeida, Simone Dominico, Leticia M. Peres, Marcos S. Sunye, Eduardo C. de Almeida
Comments: Accepted for presentation at the Simposio Brasileiro de Bancos de Dados (SBBD) 2024
Subjects: Databases (cs.DB)
[33] arXiv:2407.10440 [pdf, html, other]
Title: A novel multi-threaded web crawling model
Weijie Jiang
Subjects: Databases (cs.DB)
[34] arXiv:2407.10539 [pdf, html, other]
Title: Intelligent Urban Traffic Management via Semantic Interoperability across Multiple Heterogeneous Mobility Data Sources
Mario Scrocca, Marco Grassi, Marco Comerio, Valentina Anita Carriero, Tiago Delgado Dias, Ana Vieira Da Silva, Irene Celino
Comments: In Use paper accepted for publication at the 23rd International Semantic Web Conference (ISWC) 2024. This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution will be published in the conference proceedings
Subjects: Databases (cs.DB)
[35] arXiv:2407.10720 [pdf, other]
Title: Semantic Units: Increasing Expressivity and Simplicity of Formal Representations of Data and Knowledge in Knowledge Graphs
Lars Vogt
Comments: arXiv admin note: text overlap with arXiv:2301.01227
Subjects: Databases (cs.DB)
[36] arXiv:2407.11418 [pdf, html, other]
Title: Semantic Operators: A Declarative Model for Rich, AI-based Data Processing
Liana Patel, Siddharth Jha, Melissa Pan, Harshit Gupta, Parth Asawa, Carlos Guestrin, Matei Zaharia
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[37] arXiv:2407.11425 [pdf, other]
Title: Incremental high average-utility itemset mining: survey and challenges
Jing Chen, Shengyi Yang, Weiping Ding, Peng Li, Aijun Liu, Hongjun Zhang, Tian Li
Comments: 25 pages, 23 figures
Subjects: Databases (cs.DB)
[38] arXiv:2407.11556 [pdf, html, other]
Title: LITS: An Optimized Learned Index for Strings (An Extended Version)
Yifan Yang, Shimin Chen
Subjects: Databases (cs.DB)
[39] arXiv:2407.11616 [pdf, html, other]
Title: PyTond: Efficient Python Data Science on the Shoulders of Databases
Hesam Shahrokhi, Amirali Kaboli, Mahdi Ghorbani, Amir Shaikhha
Comments: Extended version of ICDE 2024
Subjects: Databases (cs.DB); Programming Languages (cs.PL)
[40] arXiv:2407.11852 [pdf, html, other]
Title: Schema Matching with Large Language Models: an Experimental Study
Marcel Parciak, Brecht Vandevoort, Frank Neven, Liesbet M. Peeters, Stijn Vansummeren
Comments: Accepted at the 2nd International Workshop on Tabular Data Analysis (TaDA24), collocated with the 50th International Conference on Very Large Data Bases (VLDB 2024) Guangzhou, China - August 29, 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[41] arXiv:2407.12793 [pdf, html, other]
Title: Data Collection and Labeling Techniques for Machine Learning
Qianyu Huang, Tongfang Zhao
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[42] arXiv:2407.12794 [pdf, html, other]
Title: Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond
George-Octavian Bărbulescu, Taiyi Wang, Zak Singh, Eiko Yoneki
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[43] arXiv:2407.12802 [pdf, html, other]
Title: SimClone: Detecting Tabular Data Clones using Value Similarity
Xu Yang, Gopi Krishnan Rajbahadur, Dayi Lin, Shaowei Wang, Zhen Ming (Jack) Jiang
Comments: 24 pages, 9 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Software Engineering (cs.SE)
[44] arXiv:2407.13294 [pdf, html, other]
Title: Griffin: Fast Transactional Database Index with Hash and B+-Tree
Sho Nakazono, Yutaro Bessho, Hideyuki Kawashima, Tatsuhiro Nakamori
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Databases (cs.DB)
[45] arXiv:2407.14098 [pdf, html, other]
Title: Top-k Representative Search for Comparative Tree Summarization
Yuqi Chen, Xin Huang, Bilian Chen
Subjects: Databases (cs.DB)
[46] arXiv:2407.14384 [pdf, html, other]
Title: The Sticky Path to Expressive Querying: Decidability of Navigational Queries under Existential Rules
Piotr Ostropolski-Nalewaja, Sebastian Rudolph
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[47] arXiv:2407.14530 [pdf, html, other]
Title: FuncEvalGMN: Evaluating Functional Correctness of SQL via Graph Matching Network
Yi Zhan, Yang Sun, Han Weng, Longjie Cui, Guifeng Wang, Jiajun Xie, Yu Tian, Xiaoming Yin, Boyi Liu, Dongchi Huang
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[48] arXiv:2407.14907 [pdf, html, other]
Title: Monotone Rewritability and the Analysis of Queries, Views, and Rules
Michael Benedikt, Stanislav Kikot, Johannes Marti, Piotr Ostropolski-Nalewaja
Subjects: Databases (cs.DB)
[49] arXiv:2407.14953 [pdf, html, other]
Title: AgileDART: An Agile and Scalable Edge Stream Processing Engine
Cheng-Wei Ching, Xin Chen, Chaeeun Kim, Tongze Wang, Dong Chen, Dilma Da Silva, Liting Hu
Comments: To appear in IEEE Transactions on Mobile Computing (TMC); 18 pages for the main paper and 5 pages for the appendices
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[50] arXiv:2407.15071 [pdf, html, other]
Title: Relational Database Augmented Large Language Model
Zongyue Qin, Chen Luo, Zhengyang Wang, Haoming Jiang, Yizhou Sun
Subjects: Databases (cs.DB); Computation and Language (cs.CL)