Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.DB

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Databases

Authors and titles for May 2024

Total of 92 entries : 26-75 51-92
Showing up to 50 entries per page: fewer | more | all
[26] arXiv:2405.07792 [pdf, html, other]
Title: Optimal Matrix Sketching over Sliding Windows
Hanyan Yin, Dongxie Wen, Jiajun Li, Zhewei Wei, Xiao Zhang, Zengfeng Huang, Feifei Li
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG)
[27] arXiv:2405.08315 [pdf, html, other]
Title: Independent Range Sampling on Interval Data (Longer Version)
Daichi Amagata
Comments: Ful version of our ICDE2024 paper
Subjects: Databases (cs.DB)
[28] arXiv:2405.08839 [pdf, html, other]
Title: PromptMind Team at EHRSQL-2024: Improving Reliability of SQL Generation using Ensemble LLMs
Satya K Gundabathula, Sriram R Kolar
Comments: Accepted as a poster for Clinical NLP workshop at NAACL 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[29] arXiv:2405.09593 [pdf, html, other]
Title: SQL-to-Schema Enhances Schema Linking in Text-to-SQL
Sun Yang, Qiong Su, Zhishuai Li, Ziyue Li, Hangyu Mao, Chenxi Liu, Rui Zhao
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[30] arXiv:2405.10045 [pdf, html, other]
Title: Global Benchmark Database
Markus Iser, Christoph Jabs
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO)
[31] arXiv:2405.10235 [pdf, other]
Title: Novel Data Models for Inter-operable LCA Frameworks
Kourosh Malek, Max Dreger, Zirui Tang, Qingshi Tu
Subjects: Databases (cs.DB); Data Analysis, Statistics and Probability (physics.data-an)
[32] arXiv:2405.11191 [pdf, html, other]
Title: Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines
Chaokun Chang, Eric Lo, Chunxiao Ye
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[33] arXiv:2405.11299 [pdf, html, other]
Title: The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving
Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[34] arXiv:2405.11419 [pdf, html, other]
Title: Sketches-based join size estimation under local differential privacy
Meifan Zhang, Xin Liu, Lihua Yin
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[35] arXiv:2405.11529 [pdf, html, other]
Title: Benchmarking Data Management Systems for Microservices
Rodrigo Laigner, Yongluan Zhou
Comments: Manuscript part of the accepted ICDE 2024 Lightning Talk
Subjects: Databases (cs.DB); Software Engineering (cs.SE)
[36] arXiv:2405.11988 [pdf, html, other]
Title: DuckDB-SGX2: The Good, The Bad and The Ugly within Confidential Analytical Query Processing
Ilaria Battiston, Lotte Felius, Sam Ansmink, Laurens Kuiper, Peter Boncz
Subjects: Databases (cs.DB)
[37] arXiv:2405.12350 [pdf, html, other]
Title: A framework for extraction and transformation of documents
Cristian Riveros, Markus L. Schmid, Nicole Schweikardt
Subjects: Databases (cs.DB); Formal Languages and Automata Theory (cs.FL)
[38] arXiv:2405.12358 [pdf, html, other]
Title: Using Color Refinement to Boost Enumeration and Counting for Acyclic CQs of Binary Schemas
Cristian Riveros, Benjamin Scheidt, Nicole Schweikardt
Subjects: Databases (cs.DB); Logic in Computer Science (cs.LO)
[39] arXiv:2405.12497 [pdf, html, other]
Title: RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search
Jianyang Gao, Cheng Long
Comments: The paper has been accepted by SIGMOD 2024
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR)
[40] arXiv:2405.12511 [pdf, html, other]
Title: Quantum Computing for Databases: Overview and Challenges
Gongsheng Yuan, Yuxing Chen, Jiaheng Lu, Sai Wu, Zhiwei Ye, Ling Qian, Gang Chen
Subjects: Databases (cs.DB)
[41] arXiv:2405.12709 [pdf, html, other]
Title: Object-Centric Event Logs: Specifications, Comparative Analysis and Refinement
Alexandre Goossens, Johannes De Smedt, Jan Vanthienen
Subjects: Databases (cs.DB)
[42] arXiv:2405.12871 [pdf, html, other]
Title: Efficient Influence Minimization via Node Blocking
Jinghao Wang, Yanping Wu, Xiaoyang Wang, Ying Zhang, Lu Qin, Wenjie Zhang, Xuemin Lin
Subjects: Databases (cs.DB)
[43] arXiv:2405.12881 [pdf, html, other]
Title: Explaining Expert Search and Team Formation Systems with ExES
Kiarash Golzadeh, Lukasz Golab, Jaroslaw Szlichta
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[44] arXiv:2405.14435 [pdf, html, other]
Title: High-Level Event Mining: Overview and Future Work
Bianka Bakullari, Wil M.P. van der Aalst
Subjects: Databases (cs.DB)
[45] arXiv:2405.14502 [pdf, html, other]
Title: DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]
Baotong Lu, Kaisong Huang, Chieh-Jan Mike Liang, Tianzheng Wang, Eric Lo
Comments: 16 pages; To appear at VLDB 2024
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[46] arXiv:2405.15193 [pdf, html, other]
Title: CuckooGraph: A Scalable and Space-Time Efficient Data Structure for Large-Scale Dynamic Graphs
Zhuochen Fan, Yalun Cai, Zirui Liu, Jiarui Guo, Xin Fan, Tong Yang, Bin Cui
Comments: 2025 IEEE International Conference on Data Engineering (ICDE)
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[47] arXiv:2405.16033 [pdf, other]
Title: Wrangling Data Issues to be Wrangled: Literature Review, Taxonomy, and Industry Case Study
Qiaolin Qin, Heng Li, Ettore Merlo
Subjects: Databases (cs.DB); Information Theory (cs.IT)
[48] arXiv:2405.16345 [pdf, other]
Title: Cypher4BIM: Releasing the Power of Graph for Building Knowledge Discovery
Junxiang Zhu, Nicholas Nisbet, Mengtian Yin, Ran Wei, Ioannis Brilakis
Journal-ref: Automation in Construction, 2025
Subjects: Databases (cs.DB); Emerging Technologies (cs.ET); Information Retrieval (cs.IR)
[49] arXiv:2405.17138 [pdf, html, other]
Title: CMOSS: A Reliable, Motif-based Columnar Molecular Storage System
Eugenio Marinelli, Yiqing Yan, Virginie Magnone, Pascal Barbry, Raja Appuswamy
Subjects: Databases (cs.DB)
[50] arXiv:2405.17434 [pdf, other]
Title: Efficient Search in Graph Edit Distance: Metric Search Trees vs. Brute Force Verification
Wenqi Marshall Guo, Jeffrey Uhlmann
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[51] arXiv:2405.17701 [pdf, html, other]
Title: Compression and In-Situ Query Processing for Fine-Grained Array Lineage
Jinjin Zhao, Sanjay Krishnan
Subjects: Databases (cs.DB)
[52] arXiv:2405.17723 [pdf, html, other]
Title: TableDC: Deep Clustering for Tabular Data
Hafiz Tayyab Rauf, Andre Freitas, Norman W. Paton
Subjects: Databases (cs.DB)
[53] arXiv:2405.17731 [pdf, html, other]
Title: Evaluating NoSQL Databases for OLAP Workloads: A Benchmarking Study of MongoDB, Redis, Kudu and ArangoDB
Rishi Kesav Mohan, Risheek Rakshit Sukumar Kanmani, Krishna Anandan Ganesan, Nisha Ramasubramanian
Subjects: Databases (cs.DB)
[54] arXiv:2405.18181 [pdf, html, other]
Title: Towards Practicable Algorithms for Rewriting Graph Queries beyond DL-Lite
Bianca Löhnert, Nikolaus Augsten, Cem Okulmus, Magdalena Ortiz
Subjects: Databases (cs.DB); Logic in Computer Science (cs.LO)
[55] arXiv:2405.18334 [pdf, html, other]
Title: SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches
Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong
Journal-ref: Published on International Conference on Very Large Databases 2024
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[56] arXiv:2405.18393 [pdf, other]
Title: A Critique of Snapshot Isolation
Daniel Gómez Ferro, Maysam Yabandeh
Journal-ref: EuroSys 2012
Subjects: Databases (cs.DB)
[57] arXiv:2405.18450 [pdf, html, other]
Title: Distance based prefetching algorithms for mining of the sporadic requests associations
Vadim Voevodkin, Andrey Sokolov
Subjects: Databases (cs.DB)
[58] arXiv:2405.19784 [pdf, other]
Title: PixelsDB: Serverless and NL-Aided Data Analytics with Flexible Service Levels and Prices
Haoqiong Bian, Dongyang Geng, Haoyang Li, Yunpeng Chai, Anastasia Ailamaki
Comments: 4 pages, 4 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[59] arXiv:2405.20416 [pdf, html, other]
Title: First Tree-like Quantum Data Structure: Quantum B+ Tree
Hao Liu, Xiaotian You, Raymond Chi-Wing Wong
Subjects: Databases (cs.DB)
[60] arXiv:2405.20429 [pdf, html, other]
Title: Quantum Preference Query
Hao Liu, Xiaotian You, Raymond Chi-Wing Wong
Subjects: Databases (cs.DB)
[61] arXiv:2405.00186 (cross-list from cs.AI) [pdf, other]
Title: Credentials in the Occupation Ontology
John Beverley, Robin McGill, Sam Smith, Jie Zheng, Giacomo De Colle, Finn Wilson, Matthew Diller, William D. Duncan, William R. Hogan, Yongqun He
Comments: 11
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[62] arXiv:2405.00197 (cross-list from cs.AI) [pdf, other]
Title: Grounding Realizable Entities
Michael Rabenberg, Carter Benson, Federico Donato, Yongqun He, Anthony Huffman, Shane Babcock, John Beverley
Comments: 13
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[63] arXiv:2405.00960 (cross-list from cs.AI) [pdf, other]
Title: Foundations for Digital Twins
Finn Wilson, Regina Hurley, Dan Maxwell, Jon McLellan, John Beverley
Comments: 14
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Theory (cs.IT)
[64] arXiv:2405.01510 (cross-list from cs.SI) [pdf, html, other]
Title: Reverse Influential Community Search Over Social Networks (Technical Report)
Qi Wen, Nan Zhang, Yutong Ye, Xiang Lian, Mingsong Chen
Subjects: Social and Information Networks (cs.SI); Databases (cs.DB)
[65] arXiv:2405.03267 (cross-list from cs.DC) [pdf, html, other]
Title: Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
Rongxin Cheng, Yifan Peng, Xingda Wei, Hongrui Xie, Rong Chen, Sijie Shen, Haibo Chen
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Information Retrieval (cs.IR)
[66] arXiv:2405.03579 (cross-list from stat.AP) [pdf, html, other]
Title: Some Statistical and Data Challenges When Building Early-Stage Digital Experimentation and Measurement Capabilities
C. H. Bryan Liu
Comments: PhD thesis. Imperial College London. Official library version available on: this https URL
Subjects: Applications (stat.AP); Databases (cs.DB); Methodology (stat.ME)
[67] arXiv:2405.03708 (cross-list from cs.DC) [pdf, other]
Title: Delta Tensor: Efficient Vector and Tensor Storage in Delta Lake
Zhiwei Bao, Liu Liao-Liao, Zhiyu Wu, Yifan Zhou, Dan Fan, Michal Aibin, Yvonne Coady, Andrew Brownsword
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Machine Learning (cs.LG)
[68] arXiv:2405.03870 (cross-list from cs.AI) [pdf, other]
Title: AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration
Widad Elouataoui
Comments: Doctoral thesis
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[69] arXiv:2405.03883 (cross-list from cs.SE) [pdf, html, other]
Title: sqlelf: a SQL-centric Approach to ELF Analysis
Farid Zakaria, Zheyuan Chen, Andrew Quinn, Thomas R. W. Scogland
Subjects: Software Engineering (cs.SE); Databases (cs.DB); Operating Systems (cs.OS)
[70] arXiv:2405.07022 (cross-list from cs.LG) [pdf, html, other]
Title: DTMamba : Dual Twin Mamba for Time Series Forecasting
Zexue Wu, Yifeng Gong, Aoqian Zhang
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[71] arXiv:2405.07460 (cross-list from cs.LG) [pdf, html, other]
Title: HoneyBee: A Scalable Modular Framework for Creating Multimodal Oncology Datasets with Foundational Embedding Models
Aakash Tripathi, Asim Waqas, Matthew B. Schabath, Yasin Yilmaz, Ghulam Rasool
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
[72] arXiv:2405.07601 (cross-list from cs.LG) [pdf, html, other]
Title: On-device Online Learning and Semantic Management of TinyML Systems
Haoyu Ren, Xue Li, Darko Anicic, Thomas A. Runkler
Comments: Accepted by Journal Transactions on Embedded Computing Systems (TECS)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[73] arXiv:2405.07770 (cross-list from quant-ph) [pdf, html, other]
Title: Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation
Maja Franz, Tobias Winker, Sven Groppe, Wolfgang Mauerer
Journal-ref: Proceedings of the 2024 IEEE International Conference on Quantum Computing and Engineering
Subjects: Quantum Physics (quant-ph); Databases (cs.DB); Machine Learning (cs.LG)
[74] arXiv:2405.09529 (cross-list from cs.CY) [pdf, other]
Title: Artificial Intelligence for the Internal Democracy of Political Parties
Claudio Novelli, Giuliano Formisano, Prathm Juneja, Giulia Sandri, Luciano Floridi
Journal-ref: Minds & Machines 34, 36 (2024)
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
[75] arXiv:2405.11706 (cross-list from cs.AI) [pdf, html, other]
Title: Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!
Dean Allemang, Juan Sequeda
Comments: 16 pages
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR); Logic in Computer Science (cs.LO)
Total of 92 entries : 26-75 51-92
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack