Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.DB

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Databases

Authors and titles for June 2024

Total of 112 entries : 1-100 101-112
Showing up to 100 entries per page: fewer | more | all
[1] arXiv:2406.00014 [pdf, html, other]
Title: KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR
Hajung Kim, Chanhwi Kim, Hoonick Lee, Kyochul Jang, Jiwoo Lee, Kyungjae Lee, Gangwoo Kim, Jaewoo Kang
Comments: Published at ClinicalNLP workshop @ NAACL 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR)
[2] arXiv:2406.00063 [pdf, other]
Title: Methods for Linking Data to Online Resources and Ontologies with Applications to Neurophysiology
Matthew Avaylon, Ryan Ly, Andrew Tritt, Benjamin Dichter, Kristofer E. Bouchard, Christopher J. Mungall, Oliver Ruebel
Subjects: Databases (cs.DB)
[3] arXiv:2406.00251 [pdf, html, other]
Title: Measures in SQL
Julian Hyde, John Fremlin
Comments: To be published in SIGMOD-Companion 24, June 9-15, 2024, Santiago, AA, Chile; 10 pages; updated with corrections as of 2024/05/31, and for formatting as of 2025/01/10
Subjects: Databases (cs.DB)
[4] arXiv:2406.00550 [pdf, html, other]
Title: Demystifying Object-based Big Data Storage Systems
Anindita Sarkar Mondal, Madhupa Sanyal, Ari Kusumastuti, Hrishav Bakul Barua, Kartick Chandra Mondal
Comments: 32 Pages
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[5] arXiv:2406.00583 [pdf, html, other]
Title: CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
Yanlin Feng, Sajjadur Rahman, Aaron Feng, Vincent Chen, Eser Kandogan
Comments: Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI '24), June 14, 2024, Santiago, AA, Chile
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[6] arXiv:2406.00584 [pdf, html, other]
Title: A Blueprint Architecture of Compound AI Systems for Enterprise
Eser Kandogan, Sajjadur Rahman, Nikita Bhutani, Dan Zhang, Rafael Li Chen, Kushan Mitra, Sairam Gurajada, Pouya Pezeshkpour, Hayate Iso, Yanlin Feng, Hannah Kim, Chen Shen, Jin Wang, Estevam Hruschka
Comments: Compound AI Systems Workshop at the Data+AI Summit 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[7] arXiv:2406.00616 [pdf, html, other]
Title: EMIT: Micro-Invasive Database Configuration Tuning
Jian Geng, Hongzhi Wang, Yu Yan
Subjects: Databases (cs.DB)
[8] arXiv:2406.00617 [pdf, html, other]
Title: Maximum $k$-Plex Search: An Alternated Reduction-and-Bound Method
Shuohao Gao, Kaiqiang Yu, Shengxin Liu, Cheng Long
Subjects: Databases (cs.DB); Social and Information Networks (cs.SI)
[9] arXiv:2406.01027 [pdf, html, other]
Title: PRICE: A Pretrained Model for Cross-Database Cardinality Estimation
Tianjing Zeng, Junwei Lan, Jiahong Ma, Wenqing Wei, Rong Zhu, Pengfei Li, Bolin Ding, Defu Lian, Zhewei Wei, Jingren Zhou
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[10] arXiv:2406.01250 [pdf, html, other]
Title: DumpKV: Learning based lifetime aware garbage collection for key value separation in LSM-tree
Zhutao Zhuang, Xinqi Zeng, Zhiguang Chen
Comments: Hi
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[11] arXiv:2406.01265 [pdf, html, other]
Title: The Dawn of Natural Language to SQL: Are We Fully Ready?
Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang
Comments: VLDB 2024
Subjects: Databases (cs.DB)
[12] arXiv:2406.01526 [pdf, html, other]
Title: PARQO: Penalty-Aware Robust Plan Selection in Query Optimization
Haibo Xiu, Pankaj K. Agarwal, Jun Yang
Comments: This paper has been accepted with shepherding by VLDB 2024 (Vol 17)
Subjects: Databases (cs.DB)
[13] arXiv:2406.01786 [pdf, html, other]
Title: Recent Advances in Data-Driven Business Process Management
Lars Ackermann, Martin Käppel, Laura Marcus, Linda Moder, Sebastian Dunzer, Markus Hornsteiner, Annina Liessmann, Yorck Zisgen, Philip Empl, Lukas-Valentin Herm, Nicolas Neis, Julian Neuberger, Leo Poss, Myriam Schaschek, Sven Weinzierl, Niklas Wördehoff, Stefan Jablonski, Agnes Koschmider, Wolfgang Kratsch, Martin Matzner, Stefanie Rinderle-Ma, Maximilian Röglinger, Stefan Schönig, Axel Winkelmann
Comments: position paper, 34 pages, 10 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[14] arXiv:2406.01876 [pdf, html, other]
Title: GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
Xuanqing Liu, Luyang Kong, Runhui Wang, Patrick Song, Austin Nevins, Henrik Johnson, Nimish Amlathe, Davor Golac
Comments: KDD 2024 Camera Ready; 11 pages, 8 figures
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[15] arXiv:2406.03965 [pdf, other]
Title: More Bang For Your Buck(et): Fast and Space-efficient Hardware-accelerated Coarse-granular Indexing on GPUs
Justus Henneberg, Felix Schuhknecht, Rosina Kharal, Trevor Brown
Subjects: Databases (cs.DB); Graphics (cs.GR)
[16] arXiv:2406.04738 [pdf, other]
Title: In-depth Analysis of Densest Subgraph Discovery in a Unified Framework
Yingli Zhou, Qingshuo Guo, Yi Yang, Yixiang Fang, Chenhao Ma, Laks Lakshmanan
Comments: 19pages, 27 figures
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[17] arXiv:2406.04995 [pdf, html, other]
Title: Data2Neo - A Tool for Complex Neo4j Data Integration
Julian Minder, Laurence Brandenberger, Luis Salamanca, Frank Schweitzer
Subjects: Databases (cs.DB)
[18] arXiv:2406.05070 [pdf, html, other]
Title: Targeted Mining Precise-positioning Episode Rules
Jian Zhu, Xiaoye Chen, Wensheng Gan, Zefeng Chen, Philip S. Yu
Comments: IEEE TETCI, 14 pages
Subjects: Databases (cs.DB)
[19] arXiv:2406.05107 [pdf, html, other]
Title: LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration
Tavor Lipman, Tova Milo, Amit Somech, Tomer Wolfson, Oz Zafar
Subjects: Databases (cs.DB)
[20] arXiv:2406.05327 [pdf, html, other]
Title: Multi-Entry Generalized Search Trees for Indexing Trajectories
Maxime Schoemans, Walid G. Aref, Esteban Zimányi, Mahmoud Sakr
Subjects: Databases (cs.DB)
[21] arXiv:2406.05417 [pdf, html, other]
Title: Optimizing Navigational Graph Queries
Thomas Mulder, George Fletcher, Nikolay Yakovets
Subjects: Databases (cs.DB)
[22] arXiv:2406.05462 [pdf, html, other]
Title: MatrixGate: A High-performance Data Ingestion Tool for Time-series Databases
Shuhui Wang, Zihan Sun, Chaochen Hu, Chao Li, Yong Zhang, Yandong Yao, Hao Wang, Chunxiao Xing
Subjects: Databases (cs.DB)
[23] arXiv:2406.05536 [pdf, html, other]
Title: Output-Optimal Algorithms for Join-Aggregate Queries
Xiao Hu
Subjects: Databases (cs.DB)
[24] arXiv:2406.05817 [pdf, html, other]
Title: Convex-area-wise Linear Regression and Algorithms for Data Analysis
Bohan Lyu, Jianzhong Li
Subjects: Databases (cs.DB)
[25] arXiv:2406.06754 [pdf, html, other]
Title: Incremental Sliding Window Connectivity over Streaming Graphs
Chao Zhang, Angela Bonifati, M. Tamer Özsu
Comments: To appear in VLDB 2024
Subjects: Databases (cs.DB)
[26] arXiv:2406.06886 [pdf, html, other]
Title: Enabling Data Dependency-based Query Optimization
Daniel Lindner, Daniel Ritter, Felix Naumann
Subjects: Databases (cs.DB)
[27] arXiv:2406.07596 [pdf, html, other]
Title: Transforming Object-Centric Event Logs to Temporal Event Knowledge Graphs (Extended Version)
Shahrzad Khayatbashi, Olaf Hartig, Amin Jalali
Comments: 14 pages (incl. appendix)
Subjects: Databases (cs.DB)
[28] arXiv:2406.07847 [pdf, html, other]
Title: Output-sensitive Conjunctive Query Evaluation
Shaleen Deep, Hangdong Zhao, Austen Z. Fan, Paraschos Koutris
Comments: 24 pages, accepted to PODS'2025
Subjects: Databases (cs.DB)
[29] arXiv:2406.08530 [pdf, html, other]
Title: Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models
Nesma M. Zaki, Iman M. A. Helal, Ehab E. Hassanein, Ahmed Awad
Subjects: Databases (cs.DB)
[30] arXiv:2406.08746 [pdf, html, other]
Title: The AHA-Tree: An Adaptive Index for HTAP Workloads
Lu Xing, Walid G. Aref
Subjects: Databases (cs.DB)
[31] arXiv:2406.09372 [pdf, html, other]
Title: An Adaptive Hotspot-Aware Index for Oscillating Write-Heavy and Read-Heavy Workloads
Lu Xing, Ruihong Wang, Walid G. Aref
Subjects: Databases (cs.DB)
[32] arXiv:2406.09469 [pdf, other]
Title: Conformance Testing of Relational DBMS Against SQL Specifications
Shuang Liu, Chenglin Tian, Jun Sun, Ruifeng Wang, Wei Lu, Yongxin Zhao, Yinxing Xue, Junjie Wang, Xiaoyong Du
Subjects: Databases (cs.DB)
[33] arXiv:2406.09534 [pdf, html, other]
Title: FeatNavigator: Automatic Feature Augmentation on Tabular Data
Jiaming Liang, Chuan Lei, Xiao Qin, Jiani Zhang, Asterios Katsifodimos, Christos Faloutsos, Huzefa Rangwala
Comments: 15 pages, 41 figures
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[34] arXiv:2406.09986 [pdf, html, other]
Title: DLHT: A Non-blocking Resizable Hashtable with Fast Deletes and Memory-awareness
Antonios Katsarakis, Vasilis Gavrielatos, Nikos Ntarmos
Comments: Originally appeared in 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC'24)
Subjects: Databases (cs.DB)
[35] arXiv:2406.10069 [pdf, other]
Title: CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment
Meihui Wang, James Haworth, Ilya Ilyankou, Nicola Christie
Comments: Accepted to the 2nd ACM SIGSPATIAL Workshop on Sustainable Urban Mobility (SUMob 2024)
Subjects: Databases (cs.DB)
[36] arXiv:2406.10158 [pdf, html, other]
Title: Harnessing GPU Power for Enhanced OLTP: A Study in Concurrency Control Schemes
Zihan Sun, Yong Zhang, Chao Li, Chunxiao Xing
Subjects: Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[37] arXiv:2406.10817 [pdf, html, other]
Title: A framework for optimisation based stochastic process discovery
Pierre Cry, András Horváth, Paolo Ballarini, Pascal Le Gall
Subjects: Databases (cs.DB)
[38] arXiv:2406.10938 [pdf, html, other]
Title: DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search
Jiuqi Wei, Botao Peng, Xiaodong Lee, Themis Palpanas
Journal-ref: PVLDB, 17(9): 2241 - 2254, 2024
Subjects: Databases (cs.DB)
[39] arXiv:2406.10940 [pdf, other]
Title: Towards AI-Augmented Data Quality Management: From Data Quality for AI to AI for Data Quality Management
Heidi Carolina Tamm, Anastasija Nikiforova
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Emerging Technologies (cs.ET)
[40] arXiv:2406.11033 [pdf, html, other]
Title: HAIChart: Human and AI Paired Visualization System
Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang
Comments: VLDB 2024
Journal-ref: Proceedings of the VLDB Endowment, vol. 17, no. 11, 2024, pp. 3178-3191
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[41] arXiv:2406.11227 [pdf, html, other]
Title: Compound Schema Registry
Silvery D. Fu, Xuewei Chen
Comments: 2 pages, compound ai system workshop 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[42] arXiv:2406.11255 [pdf, html, other]
Title: Liberal Entity Matching as a Compound AI Toolchain
Silvery D. Fu, David Wang, Wen Zhang, Kathleen Ge
Comments: 2 pages, compound ai systems 2024
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
[43] arXiv:2406.11421 [pdf, html, other]
Title: Private Approximate Query over Horizontal Data Federation
Ala Eddine Laouir, Abdessamad Imine
Comments: To appear in EDBT 2025
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[44] arXiv:2406.11434 [pdf, html, other]
Title: DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models
Fan Zhou, Siqiao Xue, Danrui Qi, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang, Caigai Jiang, Gangwei Jiang, Zhixuan Chu, Faqiang Chen
Subjects: Databases (cs.DB)
[45] arXiv:2406.11797 [pdf, html, other]
Title: Synthesizing Scoring Functions for Rankings Using Symbolic Gradient Descent
Zixuan Chen, Panagiotis Manolios, Mirek Riedewald
Subjects: Databases (cs.DB)
[46] arXiv:2406.12313 [pdf, other]
Title: A framework for developing a knowledge management platform
Marie Lisandra Zepeda Mendoza, Sonali Agarwal, James A. Blackshaw, Vanesa Bol, Audrey Fazzi, Filippo Fiorini, Amy Louise Foreman, Nancy George, Brett R. Johnson, Brian Martin, Dave McComb, Euphemia Mutasa-Gottgens, Helen Parkinson, Martin Romacker, Rolf Russell, Valérien Ségard, Shawn Zheng Kai Tan, Wei Kheng Teh, F. P. Winstanley, Benedict Wong, Adrian M. Smith
Comments: 18 pages, 1 figure
Subjects: Databases (cs.DB)
[47] arXiv:2406.13062 [pdf, html, other]
Title: Transforming Property Graphs
Angela Bonifati, Filip Murlak, Yann Ramusat
Comments: To appear in VLDB 2024
Subjects: Databases (cs.DB)
[48] arXiv:2406.13107 [pdf, html, other]
Title: Blitzcrank: Fast Semantic Compression for In-memory Online Transaction Processing
Yiming Qiao, Yihan Gao, Huanchen Zhang
Comments: 18 pages, 19 figures
Journal-ref: PVLDB, 17(10): 2528 - 2540, 2024
Subjects: Databases (cs.DB)
[49] arXiv:2406.13831 [pdf, html, other]
Title: A Comprehensive Overview of GPU Accelerated Databases
Harshit Sharma, Anmol Sharma
Subjects: Databases (cs.DB)
[50] arXiv:2406.13856 [pdf, html, other]
Title: Kishu: Time-Traveling for Computational Notebooks
Zhaoheng Li, Supawit Chockchowwat, Ribhav Sahu, Areet Sheth, Yongjoo Park
Journal-ref: PVLDB, 18(4): 970 - 985, 2024
Subjects: Databases (cs.DB)
[51] arXiv:2406.14163 [pdf, html, other]
Title: A Unified Statistical And Computational Framework For Ex-Post Harmonisation Of Aggregate Statistics
Cynthia A. Huang
Subjects: Databases (cs.DB); Methodology (stat.ME)
[52] arXiv:2406.14935 [pdf, html, other]
Title: Modelling Legislative Systems into Property Graphs to Enable Advanced Pattern Detection
Andrea Colombo, Anna Bernasconi, Stefano Ceri
Subjects: Databases (cs.DB)
[53] arXiv:2406.15015 [pdf, html, other]
Title: GraLMatch: Matching Groups of Entities with Graphs and Language Models
Fernando De Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger
Comments: 12 pages, 4 figures, accepted as research paper at EDBT 2025
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[54] arXiv:2406.15655 [pdf, html, other]
Title: ProBE: Proportioning Privacy Budget for Complex Exploratory Decision Support
Nada Lahjouji, Sameera Ghayyur, Xi He, Sharad Mehrotra
Subjects: Databases (cs.DB)
[55] arXiv:2406.16082 [pdf, other]
Title: On enforcing function diagram commutativity and anti-commutativity constraints in MatBase
Christian Mancas, Diana Christina Mancas
Comments: This article was submitted on June 24, 2024 to the Open Access Journal of Computer Science and Engineering, Aytin Publications, this https URL
Journal-ref: Open Access Journal of Computer Science and Engineering, Volume 1, Issue 1, 2024, PP:01-13
Subjects: Databases (cs.DB)
[56] arXiv:2406.16268 [pdf, html, other]
Title: Efficient Antagonistic k-plex Enumeration in Signed Graphs
Lantian Xu, Rong-Hua Li, Dong Wen, Qiangqiang Dai, Guoren Wang, Lu Qin
Subjects: Databases (cs.DB)
[57] arXiv:2406.16412 [pdf, html, other]
Title: Not All RDF is Created Equal: Investigating RDF Load Times on Resource-Constrained Devices
Piotr Sowinski, Anh Le-Tuan, Pawel Szmeja, Maria Ganzha
Subjects: Databases (cs.DB)
[58] arXiv:2406.16880 [pdf, html, other]
Title: DataDock: An Open Source Data Hub for Research
Lexington Whalen (1), Homayoun Valafar (1) ((1) University of South Carolina)
Comments: 7 pages, 6 figures, submitted and in review at The 2024 World Congress in Computer Science, Computer Engineering, And Applied Computing (CSCE)
Subjects: Databases (cs.DB)
[59] arXiv:2406.17076 [pdf, html, other]
Title: Avoiding Materialisation for Guarded Aggregate Queries
Matthias Lanzinger, Reinhard Pichler, Alexander Selzer
Subjects: Databases (cs.DB)
[60] arXiv:2406.17871 [pdf, html, other]
Title: Revisiting the Expressiveness Landscape of Data Graph Queries
Michael Benedikt, Anthony Widjaja Lin, Di-De Yen
Subjects: Databases (cs.DB)
[61] arXiv:2406.18099 [pdf, html, other]
Title: CompassDB: Pioneering High-Performance Key-Value Store with Perfect Hash
Jin Jiang, Dongsheng He, Yu Hu, Dong Liu, Chenfan Xiao, Hongxiao Bi, Yusong Zhang, Chaoqu Jiang, Zhijun Fu
Subjects: Databases (cs.DB)
[62] arXiv:2406.18892 [pdf, html, other]
Title: LearnedKV: Integrating LSM and Learned Index for Superior Performance on Storage
Wenlong Wang, David Hung-Chang Du
Comments: 14 pages, 15 figures
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[63] arXiv:2406.19039 [pdf, html, other]
Title: Constructing and Analyzing Different Density Graphs for Path Extrapolation in Wikipedia
Martha Sotiroudi, Anastasia-Sotiria Toufa, Constantine Kotropoulos
Comments: The Sixteenth International Conference on Advances in Databases, Knowledge, and Data Applications (DBKDA 2024)
Subjects: Databases (cs.DB)
[64] arXiv:2406.19106 [pdf, html, other]
Title: MINE GRAPH RULE: A New Cypher-like Operator for Mining Association Rules on Property Graphs
Francesco Cambria, Francesco Invernici, Anna Bernasconi, Stefano Ceri
Subjects: Databases (cs.DB)
[65] arXiv:2406.19143 [pdf, html, other]
Title: QSketch: An Efficient Sketch for Weighted Cardinality Estimation in Streams
Yiyan Qi, Rundong Li, Pinghui Wang, Yufang Sun, Rui Xing
Comments: 12 pages, 10 figures, accepted by KDD 2024
Subjects: Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[66] arXiv:2406.19509 [pdf, other]
Title: Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and copper applications
Yoav Nahshon, Lukas Morand, Matthias Büschelberger, Dirk Helm, Kiran Kumaraswamy, Paul Zierep, Matthias Weber, Pablo de Andrés
Subjects: Databases (cs.DB)
[67] arXiv:2406.19651 [pdf, html, other]
Title: CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion
Xianzhi Zeng, Zhuoyan Wu, Xinjing Hu, Xuanhua Shi, Shixuan Sun, Shuhao Zhang
Subjects: Databases (cs.DB); Artificial Intelligence (cs.AI)
[68] arXiv:2406.19732 [pdf, other]
Title: French wine: Combination of multiple open data sources to mapping the expected harvest value
Martial Phélippé-Guinvarc'h (GAINS, UM)
Subjects: Databases (cs.DB)
[69] arXiv:2406.00019 (cross-list from cs.CL) [pdf, html, other]
Title: EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
Jaehee Ryu, Seonhee Cho, Gyubok Lee, Edward Choi
Comments: ACL 2024 (Findings)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[70] arXiv:2406.00344 (cross-list from cs.SI) [pdf, other]
Title: Efficient Historical Butterfly Counting in Large Temporal Bipartite Networks via Graph Structure-aware Index
Qiuyang Mang, Jingbang Chen, Hangrui Zhou, Yu Gao, Yingli Zhou, Qingyu Shi, Richard Peng, Yixiang Fang, Chenhao Ma
Subjects: Social and Information Networks (cs.SI); Databases (cs.DB)
[71] arXiv:2406.00376 (cross-list from cs.DS) [pdf, html, other]
Title: Approaching 100% Confidence in Stream Summary through ReliableSketch
Yuhan Wu, Hanbo Wu, Xilai Liu, Yikai Zhao, Tong Yang, Kaicheng Yang, Sha Wang, Lihua Miao, Gaogang Xie
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB)
[72] arXiv:2406.01598 (cross-list from cs.CV) [pdf, other]
Title: D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation
Zehong Ke, Yanbo Jiang, Yuning Wang, Hao Cheng, Jinhao Li, Jianqiang Wang
Comments: Submit for ITSC 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Robotics (cs.RO)
[73] arXiv:2406.01964 (cross-list from cs.CR) [pdf, html, other]
Title: Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis
Priyanka Nanayakkara, Hyeok Kim, Yifan Wu, Ali Sarvghad, Narges Mahyar, Gerome Miklau, Jessica Hullman
Comments: Published in IEEE Symposium on Security and Privacy (SP) 2024
Journal-ref: in 2024 IEEE Symposium on Security and Privacy (SP), San Francisco, CA, USA, 2024 pp. 231-231
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB); Human-Computer Interaction (cs.HC)
[74] arXiv:2406.02318 (cross-list from cs.LG) [pdf, html, other]
Title: PeFAD: A Parameter-Efficient Federated Framework for Time Series Anomaly Detection
Ronghui Xu, Hao Miao, Senzhang Wang, Philip S. Yu, Jianxin Wang
Comments: Accepted by SIGKDD 2024 (Research Track)
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[75] arXiv:2406.03559 (cross-list from cs.CR) [pdf, html, other]
Title: Stateless and Non-Interactive Order-Preserving Encryption for Outsourced Databases through Subtractive Homomorphism
Dongfang Zhao
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[76] arXiv:2406.04148 (cross-list from cs.LG) [pdf, html, other]
Title: Fast Redescription Mining Using Locality-Sensitive Hashing
Maiju Karjalainen, Esther Galbrun, Pauli Miettinen
Comments: 20 pages, 4 figures, to appear at ECML-PKDD 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[77] arXiv:2406.05439 (cross-list from cs.AI) [pdf, html, other]
Title: A Scalable and Near-Optimal Conformance Checking Approach for Long Traces
Eli Bogdanov, Izack Cohen, Avigdor Gal
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[78] arXiv:2406.05962 (cross-list from cs.DC) [pdf, html, other]
Title: Data Caching for Enterprise-Grade Petabyte-Scale OLAP
Chunxu Tang, Bin Fan, Jing Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian (James)Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen
Comments: Accepted to the USENIX Annual Technical Conference (USENIX ATC) 2024
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB)
[79] arXiv:2406.06596 (cross-list from cs.CL) [pdf, html, other]
Title: Are Large Language Models the New Interface for Data Pipelines?
Sylvio Barbon Junior, Paolo Ceravolo, Sven Groppe, Mustafa Jarrar, Samira Maghool, Florence Sèdes, Soror Sahri, Maurice Van Keulen
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[80] arXiv:2406.06761 (cross-list from cs.CR) [pdf, html, other]
Title: Scalable Private Search with Wally
Hilal Asi, Fabian Boemer, Nicholas Genise, Muhammad Haris Mughees, Tabitha Ogilvie, Rehan Rishi, Kunal Talwar, Karl Tarbe, Akshay Wadia, Ruiyu Zhu, Marco Zuliani
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[81] arXiv:2406.06977 (cross-list from cs.LG) [pdf, html, other]
Title: Cross-domain-aware Worker Selection with Training for Crowdsourced Annotation
Yushi Sun, Jiachuan Wang, Peng Cheng, Libin Zheng, Lei Chen, Jian Yin
Comments: Accepted by ICDE 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[82] arXiv:2406.07098 (cross-list from cs.IR) [pdf, html, other]
Title: Guiding Catalogue Enrichment with User Queries
Yupei Du, Jacek Golebiowski, Philipp Schmidt, Ziawasch Abedjan
Comments: ECML PKDD 2024
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
[83] arXiv:2406.07769 (cross-list from cs.LG) [pdf, html, other]
Title: Personalized Product Assortment with Real-time 3D Perception and Bayesian Payoff Estimation
Porter Jenkins, Michael Selander, J. Stockton Jenkins, Andrew Merrill, Kyle Armstrong
Comments: Accepted to KDD 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[84] arXiv:2406.08335 (cross-list from cs.LG) [pdf, html, other]
Title: A Survey of Pipeline Tools for Data Engineering
Anthony Mbata, Yaji Sripada, Mingjun Zhong
Comments: 18 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB); Computation (stat.CO)
[85] arXiv:2406.08426 (cross-list from cs.CL) [pdf, html, other]
Title: Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong, Zheng Yuan, Qinggang Zhang, Hao Chen, Junnan Dong, Feiran Huang, Xiao Huang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[86] arXiv:2406.08461 (cross-list from cs.CY) [pdf, other]
Title: Bridging the Gap: Unravelling Local Government Data Sharing Barriers in Estonia and Beyond
Katrin Rajamäe Soosaar, Anastasija Nikiforova
Subjects: Computers and Society (cs.CY); Databases (cs.DB); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
[87] arXiv:2406.09046 (cross-list from cs.LG) [pdf, html, other]
Title: ExioML: Eco-economic dataset for Machine Learning in Global Sectoral Sustainability
Yanming Guo, Charles Guan, Jin Ma
Subjects: Machine Learning (cs.LG); Databases (cs.DB)
[88] arXiv:2406.10593 (cross-list from cs.AI) [pdf, other]
Title: QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL
Yinggang Sun, Ziming Guo, Haining Yu, Chuanyi Liu, Xiang Li, Bingxuan Wang, Xiangzhan Yu, Tiancheng Zhao
Comments: 10 pages, 9 figures
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
[89] arXiv:2406.10635 (cross-list from cs.RO) [pdf, html, other]
Title: ROSfs: A User-Level File System for ROS
Zijun Xu, Xuanjun Wen, Yanjie Song, Shu Yin
Subjects: Robotics (cs.RO); Databases (cs.DB); Operating Systems (cs.OS)
[90] arXiv:2406.10690 (cross-list from cs.AI) [pdf, html, other]
Title: Automating Pharmacovigilance Evidence Generation: Using Large Language Models to Produce Context-Aware SQL
Jeffery L. Painter, Venkateswara Rao Chalamalasetti, Raymond Kassekert, Andrew Bate
Comments: 15 pages, 3 tables, 5 figures
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[91] arXiv:2406.10708 (cross-list from cs.CV) [pdf, html, other]
Title: MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception
M. Mahbubur Rahman, Ryoma Yataka, Sorachi Kato, Pu Perry Wang, Peizhao Li, Adriano Cardace, Petros Boufounos
Comments: 26 pages, 25 figures, 10 tables; See this https URL to access the MMVR dataset
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB); Signal Processing (eess.SP)
[92] arXiv:2406.10922 (cross-list from cs.CL) [pdf, html, other]
Title: Generating Tables from the Parametric Knowledge of Language Models
Yevgeni Berkovitch, Oren Glickman, Amit Somech, Tomer Wolfson
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[93] arXiv:2406.11131 (cross-list from cs.CL) [pdf, html, other]
Title: Are Large Language Models a Good Replacement of Taxonomies?
Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen
Comments: Accepted by VLDB 2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[94] arXiv:2406.11143 (cross-list from cs.AI) [pdf, other]
Title: Scorecards for Synthetic Medical Data Evaluation and Reporting
Ghada Zamzmi, Adarsh Subbaswamy, Elena Sizikova, Edward Margerrison, Jana Delfino, Aldo Badano
Comments: 7 pages, 2 figures
Subjects: Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Databases (cs.DB)
[95] arXiv:2406.11803 (cross-list from cs.LG) [pdf, html, other]
Title: Efficient Discovery of Significant Patterns with Few-Shot Resampling
Leonardo Pellegrina, Fabio Vandin
Comments: Accepted to VLDB 2024
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Machine Learning (stat.ML)
[96] arXiv:2406.12104 (cross-list from cs.CL) [pdf, html, other]
Title: End-to-end Text-to-SQL Generation within an Analytics Insight Engine
Karime Maamari, Amine Mhedhbi
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG)
[97] arXiv:2406.12692 (cross-list from cs.CL) [pdf, html, other]
Title: MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL
Arian Askari, Christian Poelitz, Xinye Tang
Comments: Accepted at Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI 2025)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Human-Computer Interaction (cs.HC)
[98] arXiv:2406.12938 (cross-list from cs.CR) [pdf, other]
Title: Security in IS and social engineering -- an overview and state of the art
Florence Sèdes (UT3, IRIT, CNRS)
Comments: in French language, INFORSID 2024, May 2024, Nancy, France
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB)
[99] arXiv:2406.13213 (cross-list from cs.CL) [pdf, html, other]
Title: Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata
Mykhailo Poliakov, Nadiya Shvai
Comments: Accepted to ICTERI 2024 Posters Track
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
[100] arXiv:2406.13844 (cross-list from cs.CV) [pdf, html, other]
Title: A large-scale multicenter breast cancer DCE-MRI benchmark dataset with expert segmentations
Lidia Garrucho, Kaisar Kushibar, Claire-Anne Reidel, Smriti Joshi, Richard Osuala, Apostolia Tsirikoglou, Maciej Bobowicz, Javier del Riego, Alessandro Catanese, Katarzyna Gwoździewicz, Maria-Laura Cosaka, Pasant M. Abo-Elhoda, Sara W. Tantawy, Shorouq S. Sakrana, Norhan O. Shawky-Abdelfatah, Amr Muhammad Abdo-Salem, Androniki Kozana, Eugen Divjak, Gordana Ivanac, Katerina Nikiforaki, Michail E. Klontzas, Rosa García-Dosdá, Meltem Gulsun-Akpinar, Oğuz Lafcı, Ritse Mann, Carlos Martín-Isla, Fred Prior, Kostas Marias, Martijn P.A. Starmans, Fredrik Strand, Oliver Díaz, Laura Igual, Karim Lekadir
Comments: 15 paes, 7 figures, 3 tables
Journal-ref: Sci Data 12, 453 (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB)
Total of 112 entries : 1-100 101-112
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack