Databases

Authors and titles for June 2024

Total of 112 entries : 1-50 51-100 101-112
[51] arXiv:2406.14163 [pdf, html, other]
Title: A Unified Statistical And Computational Framework For Ex-Post Harmonisation Of Aggregate Statistics
Cynthia A. Huang
Subjects: Databases (cs.DB); Methodology (stat.ME)
[52] arXiv:2406.14935 [pdf, html, other]
Title: Modelling Legislative Systems into Property Graphs to Enable Advanced Pattern Detection
Andrea Colombo, Anna Bernasconi, Stefano Ceri
Subjects: Databases (cs.DB)
[53] arXiv:2406.15015 [pdf, html, other]
Title: GraLMatch: Matching Groups of Entities with Graphs and Language Models
Fernando De Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger
Comments: 12 pages, 4 figures, accepted as research paper at EDBT 2025
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[54] arXiv:2406.15655 [pdf, html, other]
Title: ProBE: Proportioning Privacy Budget for Complex Exploratory Decision Support
Nada Lahjouji, Sameera Ghayyur, Xi He, Sharad Mehrotra
Subjects: Databases (cs.DB)
[55] arXiv:2406.16082 [pdf, other]
Title: On enforcing function diagram commutativity and anti-commutativity constraints in MatBase
Christian Mancas, Diana Christina Mancas
Comments: This article was submitted on June 24, 2024 to the Open Access Journal of Computer Science and Engineering, Aytin Publications, this https URL
Journal-ref: Open Access Journal of Computer Science and Engineering, Volume 1, Issue 1, 2024, PP:01-13
Subjects: Databases (cs.DB)
[56] arXiv:2406.16268 [pdf, html, other]
Title: Efficient Antagonistic k-plex Enumeration in Signed Graphs
Lantian Xu, Rong-Hua Li, Dong Wen, Qiangqiang Dai, Guoren Wang, Lu Qin
Subjects: Databases (cs.DB)
[57] arXiv:2406.16412 [pdf, html, other]
Title: Not All RDF is Created Equal: Investigating RDF Load Times on Resource-Constrained Devices
Piotr Sowinski, Anh Le-Tuan, Pawel Szmeja, Maria Ganzha
Subjects: Databases (cs.DB)
[58] arXiv:2406.16880 [pdf, html, other]
Title: DataDock: An Open Source Data Hub for Research
Lexington Whalen (1), Homayoun Valafar (1) ((1) University of South Carolina)
Comments: 7 pages, 6 figures, submitted and in review at The 2024 World Congress in Computer Science, Computer Engineering, And Applied Computing (CSCE)
Subjects: Databases (cs.DB)
[59] arXiv:2406.17076 [pdf, html, other]
Title: Avoiding Materialisation for Guarded Aggregate Queries
Matthias Lanzinger, Reinhard Pichler, Alexander Selzer
Subjects: Databases (cs.DB)
[60] arXiv:2406.17871 [pdf, html, other]
Title: Revisiting the Expressiveness Landscape of Data Graph Queries
Michael Benedikt, Anthony Widjaja Lin, Di-De Yen
Subjects: Databases (cs.DB)
[61] arXiv:2406.18099 [pdf, html, other]
Title: CompassDB: Pioneering High-Performance Key-Value Store with Perfect Hash
Jin Jiang, Dongsheng He, Yu Hu, Dong Liu, Chenfan Xiao, Hongxiao Bi, Yusong Zhang, Chaoqu Jiang, Zhijun Fu
Subjects: Databases (cs.DB)
[62] arXiv:2406.18892 [pdf, html, other]
Title: LearnedKV: Integrating LSM and Learned Index for Superior Performance on Storage
Wenlong Wang, David Hung-Chang Du
Comments: 14 pages, 15 figures
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[63] arXiv:2406.19039 [pdf, html, other]
Title: Constructing and Analyzing Different Density Graphs for Path Extrapolation in Wikipedia
Martha Sotiroudi, Anastasia-Sotiria Toufa, Constantine Kotropoulos
Comments: The Sixteenth International Conference on Advances in Databases, Knowledge, and Data Applications (DBKDA 2024)
Subjects: Databases (cs.DB)
[64] arXiv:2406.19106 [pdf, html, other]
Title: MINE GRAPH RULE: A New Cypher-like Operator for Mining Association Rules on Property Graphs
Francesco Cambria, Francesco Invernici, Anna Bernasconi, Stefano Ceri
Subjects: Databases (cs.DB)
[65] arXiv:2406.19143 [pdf, html, other]
Title: QSketch: An Efficient Sketch for Weighted Cardinality Estimation in Streams
Yiyan Qi, Rundong Li, Pinghui Wang, Yufang Sun, Rui Xing
Comments: 12 pages, 10 figures, accepted by KDD 2024
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[66] arXiv:2406.19509 [pdf, other]
Title: Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and copper applications
Yoav Nahshon, Lukas Morand, Matthias Büschelberger, Dirk Helm, Kiran Kumaraswamy, Paul Zierep, Matthias Weber, Pablo de Andrés
Subjects: Databases (cs.DB)
[67] arXiv:2406.19651 [pdf, html, other]
Title: CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion
Xianzhi Zeng, Zhuoyan Wu, Xinjing Hu, Xuanhua Shi, Shixuan Sun, Shuhao Zhang
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[68] arXiv:2406.19732 [pdf, other]
Title: French wine: Combination of multiple open data sources to mapping the expected harvest value
Martial Phélippé-Guinvarc'h (GAINS, UM)
Subjects: Databases (cs.DB)
[69] arXiv:2406.00019 (cross-list from cs.CL) [pdf, html, other]
Title: EHR-SeqSQL: A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
Jaehee Ryu, Seonhee Cho, Gyubok Lee, Edward Choi
Comments: ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[70] arXiv:2406.00344 (cross-list from cs.SI) [pdf, other]
Title: Efficient Historical Butterfly Counting in Large Temporal Bipartite Networks via Graph Structure-aware Index
Qiuyang Mang, Jingbang Chen, Hangrui Zhou, Yu Gao, Yingli Zhou, Qingyu Shi, Richard Peng, Yixiang Fang, Chenhao Ma
Subjects: Social and Information Networks (cs.SI); Databases (cs.DB)
[71] arXiv:2406.00376 (cross-list from cs.DS) [pdf, html, other]
Title: Approaching 100% Confidence in Stream Summary through ReliableSketch
Yuhan Wu, Hanbo Wu, Xilai Liu, Yikai Zhao, Tong Yang, Kaicheng Yang, Sha Wang, Lihua Miao, Gaogang Xie
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB)
[72] arXiv:2406.01598 (cross-list from cs.CV) [pdf, other]
Title: D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation
Zehong Ke, Yanbo Jiang, Yuning Wang, Hao Cheng, Jinhao Li, Jianqiang Wang
Comments: Submitted to ITSC 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Robotics (cs.RO)
[73] arXiv:2406.01964 (cross-list from cs.CR) [pdf, html, other]
Title: Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis
Priyanka Nanayakkara, Hyeok Kim, Yifan Wu, Ali Sarvghad, Narges Mahyar, Gerome Miklau, Jessica Hullman
Comments: Published in IEEE Symposium on Security and Privacy (SP) 2024
Journal-ref: in 2024 IEEE Symposium on Security and Privacy (SP), San Francisco, CA, USA, 2024 pp. 231-231
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB); Human-Computer Interaction (cs.HC)
[74] arXiv:2406.02318 (cross-list from cs.LG) [pdf, html, other]
Title: PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection
Ronghui Xu, Hao Miao, Senzhang Wang, Philip S. Yu, Jianxin Wang
Comments: Accepted by SIGKDD 2024 (Research Track)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[75] arXiv:2406.03559 (cross-list from cs.CR) [pdf, html, other]
Title: Stateless and Non-Interactive Order-Preserving Encryption for Outsourced Databases through Subtractive Homomorphism
Dongfang Zhao
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[76] arXiv:2406.04148 (cross-list from cs.LG) [pdf, html, other]
Title: Fast Redescription Mining Using Locality-Sensitive Hashing
Maiju Karjalainen, Esther Galbrun, Pauli Miettinen
Comments: 20 pages, 4 figures, to appear at ECML-PKDD 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[77] arXiv:2406.05439 (cross-list from cs.AI) [pdf, html, other]
Title: A Scalable and Near-Optimal Conformance Checking Approach for Long Traces
Eli Bogdanov, Izack Cohen, Avigdor Gal
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[78] arXiv:2406.05962 (cross-list from cs.DC) [pdf, html, other]
Title: Data Caching for Enterprise-Grade Petabyte-Scale OLAP
Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian (James) Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen
Comments: Accepted to the USENIX Annual Technical Conference (USENIX ATC) 2024
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB)
[79] arXiv:2406.06596 (cross-list from cs.CL) [pdf, html, other]
Title: Are Large Language Models the New Interface for Data Pipelines?
Sylvio Barbon Junior, Paolo Ceravolo, Sven Groppe, Mustafa Jarrar, Samira Maghool, Florence Sèdes, Soror Sahri, Maurice Van Keulen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[80] arXiv:2406.06761 (cross-list from cs.CR) [pdf, html, other]
Title: Scalable Private Search with Wally
Hilal Asi, Fabian Boemer, Nicholas Genise, Muhammad Haris Mughees, Tabitha Ogilvie, Rehan Rishi, Kunal Talwar, Karl Tarbe, Akshay Wadia, Ruiyu Zhu, Marco Zuliani
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[81] arXiv:2406.06977 (cross-list from cs.LG) [pdf, html, other]
Title: Cross-domain-aware Worker Selection with Training for Crowdsourced Annotation
Yushi Sun, Jiachuan Wang, Peng Cheng, Libin Zheng, Lei Chen, Jian Yin
Comments: Accepted by ICDE 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[82] arXiv:2406.07098 (cross-list from cs.IR) [pdf, html, other]
Title: Guiding Catalogue Enrichment with User Queries
Yupei Du, Jacek Golebiowski, Philipp Schmidt, Ziawasch Abedjan
Comments: ECML PKDD 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
[83] arXiv:2406.07769 (cross-list from cs.LG) [pdf, html, other]
Title: Personalized Product Assortment with Real-time 3D Perception and Bayesian Payoff Estimation
Porter Jenkins, Michael Selander, J. Stockton Jenkins, Andrew Merrill, Kyle Armstrong
Comments: Accepted to KDD 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[84] arXiv:2406.08335 (cross-list from cs.LG) [pdf, html, other]
Title: A Survey of Pipeline Tools for Data Engineering
Anthony Mbata, Yaji Sripada, Mingjun Zhong
Comments: 18 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Computation (stat.CO)
[85] arXiv:2406.08426 (cross-list from cs.CL) [pdf, html, other]
Title: Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong, Zheng Yuan, Qinggang Zhang, Hao Chen, Junnan Dong, Feiran Huang, Xiao Huang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[86] arXiv:2406.08461 (cross-list from cs.CY) [pdf, other]
Title: Bridging the Gap: Unravelling Local Government Data Sharing Barriers in Estonia and Beyond
Katrin Rajamäe Soosaar, Anastasija Nikiforova
Subjects: Computers and Society (cs.CY); Databases (cs.DB); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
[87] arXiv:2406.09046 (cross-list from cs.LG) [pdf, html, other]
Title: ExioML: Eco-economic dataset for Machine Learning in Global Sectoral Sustainability
Yanming Guo, Charles Guan, Jin Ma
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[88] arXiv:2406.10593 (cross-list from cs.AI) [pdf, other]
Title: QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL
Yinggang Sun, Ziming Guo, Haining Yu, Chuanyi Liu, Xiang Li, Bingxuan Wang, Xiangzhan Yu, Tiancheng Zhao
Comments: 10 pages, 9 figures
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[89] arXiv:2406.10635 (cross-list from cs.RO) [pdf, html, other]
Title: ROSfs: A User-Level File System for ROS
Zijun Xu, Xuanjun Wen, Yanjie Song, Shu Yin
Subjects: Robotics (cs.RO); Databases (cs.DB); Operating Systems (cs.OS)
[90] arXiv:2406.10690 (cross-list from cs.AI) [pdf, html, other]
Title: Automating Pharmacovigilance Evidence Generation: Using Large Language Models to Produce Context-Aware SQL
Jeffery L. Painter, Venkateswara Rao Chalamalasetti, Raymond Kassekert, Andrew Bate
Comments: 15 pages, 3 tables, 5 figures
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[91] arXiv:2406.10708 (cross-list from cs.CV) [pdf, html, other]
Title: MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception
M. Mahbubur Rahman, Ryoma Yataka, Sorachi Kato, Pu Perry Wang, Peizhao Li, Adriano Cardace, Petros Boufounos
Comments: 26 pages, 25 figures, 10 tables; See this https URL to access the MMVR dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Signal Processing (eess.SP)
[92] arXiv:2406.10922 (cross-list from cs.CL) [pdf, html, other]
Title: Generating Tables from the Parametric Knowledge of Language Models
Yevgeni Berkovitch, Oren Glickman, Amit Somech, Tomer Wolfson
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[93] arXiv:2406.11131 (cross-list from cs.CL) [pdf, html, other]
Title: Are Large Language Models a Good Replacement of Taxonomies?
Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen
Comments: Accepted by VLDB 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[94] arXiv:2406.11143 (cross-list from cs.AI) [pdf, other]
Title: Scorecards for Synthetic Medical Data Evaluation and Reporting
Ghada Zamzmi, Adarsh Subbaswamy, Elena Sizikova, Edward Margerrison, Jana Delfino, Aldo Badano
Comments: 7 pages, 2 figures
Subjects: Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Databases (cs.DB)
[95] arXiv:2406.11803 (cross-list from cs.LG) [pdf, html, other]
Title: Efficient Discovery of Significant Patterns with Few-Shot Resampling
Leonardo Pellegrina, Fabio Vandin
Comments: Accepted to VLDB 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Machine Learning (stat.ML)
[96] arXiv:2406.12104 (cross-list from cs.CL) [pdf, html, other]
Title: End-to-end Text-to-SQL Generation within an Analytics Insight Engine
Karime Maamari, Amine Mhedhbi
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG)
[97] arXiv:2406.12692 (cross-list from cs.CL) [pdf, html, other]
Title: MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL
Arian Askari, Christian Poelitz, Xinye Tang
Comments: Accepted at Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Human-Computer Interaction (cs.HC)
[98] arXiv:2406.12938 (cross-list from cs.CR) [pdf, other]
Title: Security in IS and social engineering -- an overview and state of the art
Florence Sèdes (UT3, IRIT, CNRS)
Comments: in French language, INFORSID 2024, May 2024, Nancy, France
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[99] arXiv:2406.13213 (cross-list from cs.CL) [pdf, html, other]
Title: Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata
Mykhailo Poliakov, Nadiya Shvai
Comments: Accepted to ICTERI 2024 Posters Track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[100] arXiv:2406.13844 (cross-list from cs.CV) [pdf, html, other]
Title: A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations
Lidia Garrucho, Kaisar Kushibar, Claire-Anne Reidel, Smriti Joshi, Richard Osuala, Apostolia Tsirikoglou, Maciej Bobowicz, Javier del Riego, Alessandro Catanese, Katarzyna Gwoździewicz, Maria-Laura Cosaka, Pasant M. Abo-Elhoda, Sara W. Tantawy, Shorouq S. Sakrana, Norhan O. Shawky-Abdelfatah, Amr Muhammad Abdo-Salem, Androniki Kozana, Eugen Divjak, Gordana Ivanac, Katerina Nikiforaki, Michail E. Klontzas, Rosa García-Dosdá, Meltem Gulsun-Akpinar, Oğuz Lafcı, Ritse Mann, Carlos Martín-Isla, Fred Prior, Kostas Marias, Martijn P.A. Starmans, Fredrik Strand, Oliver Díaz, Laura Igual, Karim Lekadir
Comments: 15 pages, 7 figures, 3 tables
Journal-ref: Sci Data 12, 453 (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB)