Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.DB

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Databases

Authors and titles for October 2017

Total of 47 entries
Showing up to 50 entries per page: fewer | more | all
[1] arXiv:1710.00027 [pdf, other]
Title: Toward a System Building Agenda for Data Integration
AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Sanjib Das, Yash Govind, Pradap Konda, Han Li, Erik Paulson, Paul Suganthan G.C., Haojun Zhang
Subjects: Databases (cs.DB)
[2] arXiv:1710.00204 [pdf, other]
Title: Enabling Quality Control for Entity Resolution: A Human and Machine Cooperation Framework
Zhaoqiang Chen, Qun Chen, Fengfeng Fan, Yanyan Wang, Zhuo Wang, Youcef Nafa, Zhanhuai Li, Hailong Liu, Wei Pan
Comments: 12 pages, 11 figures. Camera-ready version of the paper submitted to ICDE 2018, In Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018)
Subjects: Databases (cs.DB)
[3] arXiv:1710.00560 [pdf, other]
Title: KV-match: A Subsequence Matching Approach Supporting Normalization and Time Warping [Extended Version]
Jiaye Wu, Peng Wang, Ningting Pan, Chen Wang, Wei Wang, Jianmin Wang
Comments: 13 pages
Journal-ref: 2019 IEEE 35th International Conference on Data Engineering (ICDE)
Subjects: Databases (cs.DB)
[4] arXiv:1710.00597 [pdf, other]
Title: DeepER -- Deep Entity Resolution
Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq Joty, Mourad Ouzzani, Nan Tang
Comments: Accepted to PVLDB 2018 as "Distributed Representations of Tuples for Entity Resolution". This version corrects a minor issue in Example 4 pointed out by Andrew Borthwick and Matthias Boehm
Subjects: Databases (cs.DB)
[5] arXiv:1710.00608 [pdf, other]
Title: Constrained Differential Privacy for Count Data
Graham Cormode, Tejas Kulkarni, Divesh Srivastava
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[6] arXiv:1710.00763 [pdf, other]
Title: You can't always sketch what you want: Understanding Sensemaking in Visual Query Systems
Doris Jung-Lin Lee, John Lee, Tarique Siddiqui, Jaewoo Kim, Karrie Karahalios, Aditya Parameswaran
Comments: Accepted for presentation at IEEE VAST 2019, to be held October 20-25 in Vancouver, Canada. Paper will also be published in a special issue of IEEE Transactions on Visualization and Computer Graphics (TVCG) IEEE VIS (InfoVis/VAST/SciVis) 2019 ACM 2012 CCS - Human-centered computing, Visualization, Visualization design and evaluation methods
Subjects: Databases (cs.DB); Human-Computer Interaction (cs.HC)
[7] arXiv:1710.00813 [pdf, other]
Title: A Practical Python API for Querying AFLOWLIB
Conred W. Rosenbrock
Comments: 7 pages, 3 code listings
Subjects: Databases (cs.DB)
[8] arXiv:1710.00867 [pdf, other]
Title: Clustering Stream Data by Exploring the Evolution of Density Mountain
Shufeng Gong, Yanfeng Zhang, Ge Yu
Subjects: Databases (cs.DB); Information Retrieval (cs.IR)
[9] arXiv:1710.01077 [pdf, other]
Title: Time Series Management Systems: A Survey
Søren Kejser Jensen, Torben Bach Pedersen, Christian Thomsen
Comments: 20 Pages, 15 Figures, 2 Tables, Accepted for publication in IEEE TKDE
Journal-ref: TKDE, 29, 11, 2017, 2581-2600
Subjects: Databases (cs.DB)
[10] arXiv:1710.01420 [pdf, other]
Title: Usable & Scalable Learning Over Relational Data With Automatic Language Bias
Jose Picado, Arash Termehchy, Sudhanshu Pathak, Alan Fern, Praveen Ilango, Yunqiao Cai
Subjects: Databases (cs.DB); Machine Learning (cs.LG)
[11] arXiv:1710.01792 [pdf, other]
Title: A Comparative Analysis of Materialized Views Selection and Concurrency Control Mechanisms in NoSQL Databases
Ashish Tapdiya, Yuan Xue, Daniel Fabbri (Vanderbilt University)
Subjects: Databases (cs.DB)
[12] arXiv:1710.01854 [pdf, other]
Title: InfiniViz: Interactive Visual Exploration using Progressive Bin Refinement
Niranjan Kamat, Arnab Nandi
Subjects: Databases (cs.DB)
[13] arXiv:1710.02317 [pdf, other]
Title: Enumeration Problems for Regular Path Queries
Wim Martens, Tina Trautner
Subjects: Databases (cs.DB); Formal Languages and Automata Theory (cs.FL)
[14] arXiv:1710.02817 [pdf, other]
Title: Discovery of Paradigm Dependencies
Jizhou Sun, Jianzhong Li, Hong Gao
Comments: This paper is submitted to 34th IEEE International Conference on Data Engineering (ICDE2018) On October 1, 2017
Subjects: Databases (cs.DB)
[15] arXiv:1710.03289 [pdf, other]
Title: Efficient mining of maximal biclusters in mixed-attribute datasets
Rosana Veroneze, Fernando J. Von Zuben
Subjects: Databases (cs.DB)
[16] arXiv:1710.04419 [pdf, other]
Title: Querying Best Paths in Graph Databases
Jakub Michaliszyn, Jan Otop, Piotr Wieczorek
Comments: A conference version fo this paper has been accepted to FSTTCS 2017
Subjects: Databases (cs.DB)
[17] arXiv:1710.04470 [pdf, other]
Title: V1: A Visual Query Language for Property Graphs
Lior Kogan
Comments: 193 pages, 502 figures
Subjects: Databases (cs.DB)
[18] arXiv:1710.07411 [pdf, other]
Title: STREAK: An Efficient Engine for Processing Top-k SPARQL Queries with Spatial Filters
Jyoti Leeka, Srikanta Bedathur, Debajyoti Bera, Sriram Lakshminarasimhan
Subjects: Databases (cs.DB)
[19] arXiv:1710.07736 [pdf, other]
Title: BigSparse: High-performance external graph analytics
Sang-Woo Jun, Andy Wright, Sizhuo Zhang, Shuotao Xu, Arvind
Subjects: Databases (cs.DB)
[20] arXiv:1710.07891 [pdf, other]
Title: Natural Language Aggregate Query over RDF Data
Xin Hu, Yingting Yao, Luting Ye, Depeng Dang
Subjects: Databases (cs.DB)
[21] arXiv:1710.08023 [pdf, other]
Title: A Brief Comparison of Two Enterprise-Class RDBMSs
Andrew Figueroa, Steven Rollo, Sean Murthy
Comments: 14 pages, 16 figures, 2 tables
Subjects: Databases (cs.DB)
[22] arXiv:1710.08748 [pdf, other]
Title: Bottom-up automata on data trees and vertical XPath
Diego Figueira, Luc Segoufin
Journal-ref: Logical Methods in Computer Science, Volume 13, Issue 4 (November 6, 2017) lmcs:4044
Subjects: Databases (cs.DB)
[23] arXiv:1710.09420 [pdf, other]
Title: SOPE: A Spatial Order Preserving Encryption Model for Multi-dimensional Data
Eirini Molla, Theodoros Tzouramanis, Stefanos Gritzalis
Comments: 24 pages, 37 figures, 2 tables, 60 references
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
[24] arXiv:1710.10555 [pdf, other]
Title: Complexity Analysis Approach for Prefabricated Construction Products Using Uncertain Data Clustering
Wenying Ji, Simaan M. AbouRizk, Osmar R. Zaiane, Yitong Li
Subjects: Databases (cs.DB); Machine Learning (cs.LG); Applications (stat.AP)
[25] arXiv:1710.11528 [pdf, other]
Title: Extracting Syntactic Patterns from Databases
Andrew Ilyas, Joana M. F. da Trindade, Raul Castro Fernandez, Samuel Madden
Subjects: Databases (cs.DB)
[26] arXiv:1710.00454 (cross-list from cs.IR) [pdf, other]
Title: Building a Structured Query Engine
Amanpreet Singh, Karthik Venkatesan, Simranjyot Singh Gill
Subjects: Information Retrieval (cs.IR); Databases (cs.DB)
[27] arXiv:1710.01431 (cross-list from cs.DS) [pdf, other]
Title: Massively Parallel Algorithms and Hardness for Single-Linkage Clustering Under $\ell_p$-Distances
Grigory Yaroslavtsev, Adithya Vadapalli
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC)
[28] arXiv:1710.01615 (cross-list from cs.CR) [pdf, other]
Title: ($k$,$ε$)-Anonymity: $k$-Anonymity with $ε$-Differential Privacy
Naoise Holohan, Spiros Antonatos, Stefano Braghin, Pól Mac Aonghusa
Subjects: Cryptography and Security (cs.CR); Databases (cs.DB); Probability (math.PR)
[29] arXiv:1710.02030 (cross-list from stat.ML) [pdf, other]
Title: McDiarmid Drift Detection Methods for Evolving Data Streams
Ali Pesaranghader, Herna Viktor, Eric Paquet
Comments: 9 pages, 3 figures, 3 tables
Subjects: Machine Learning (stat.ML); Databases (cs.DB); Machine Learning (cs.LG)
[30] arXiv:1710.02035 (cross-list from cs.NI) [pdf, other]
Title: HANDY: A Hybrid Association Rules Mining Approach for Network Layer Discovery of Services for Mobile Ad hoc Network
Noman Islam, Zubair A. Shaikh, Aqeel-ur-Rehman, Muhammad Shahab Siddiqui
Subjects: Networking and Internet Architecture (cs.NI); Artificial Intelligence (cs.AI); Databases (cs.DB)
[31] arXiv:1710.02261 (cross-list from cs.NA) [pdf, other]
Title: Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries
Sejoon Oh, Namyong Park, Lee Sael, U Kang
Comments: IEEE International Conference on Data Engineering (ICDE 2018)
Subjects: Numerical Analysis (math.NA); Databases (cs.DB); Information Retrieval (cs.IR)
[32] arXiv:1710.02690 (cross-list from stat.AP) [pdf, other]
Title: Unique Entity Estimation with Application to the Syrian Conflict
Beidi Chen, Anshumali Shrivastava, Rebecca C. Steorts
Comments: 35 pages, 6 figures, 2 tables
Subjects: Applications (stat.AP); Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[33] arXiv:1710.02823 (cross-list from cs.LG) [pdf, other]
Title: Structural Feature Selection for Event Logs
Markku Hinkka, Teemu Lehto, Keijo Heljanko, Alexander Jung
Comments: Extended version of a paper published in the proceedings of the BPM 2017 workshops
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Software Engineering (cs.SE); Machine Learning (stat.ML)
[34] arXiv:1710.03222 (cross-list from cs.LG) [pdf, other]
Title: Forecasting Across Time Series Databases using Recurrent Neural Networks on Groups of Similar Series: A Clustering Approach
Kasun Bandara, Christoph Bergmeir, Slawek Smyl
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Econometrics (econ.EM); Applications (stat.AP); Machine Learning (stat.ML)
[35] arXiv:1710.03439 (cross-list from cs.PF) [pdf, other]
Title: BestConfig: Tapping the Performance Potential of Systems via Automatic Configuration Tuning
Yuqing Zhu, Jianxun Liu, Mengying Guo, Yungang Bao, Wenlong Ma, Zhuoyue Liu, Kunpeng Song, Yingchun Yang
Journal-ref: ACM SoCC 2017
Subjects: Performance (cs.PF); Databases (cs.DB); Distributed, Parallel, and Cluster Computing (cs.DC); Software Engineering (cs.SE)
[36] arXiv:1710.03852 (cross-list from cs.SI) [pdf, other]
Title: Top-k Route Search through Submodularity Modeling of Recurrent POI Features
Hongwei Liang, Ke Wang
Comments: 11 pages, 7 figures, 2 tables
Journal-ref: Hongwei Liang and Ke Wang. 2018. Top-k Route Search through Submodularity Modeling of Recurrent POI Features. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR '18). ACM, 545-554
Subjects: Social and Information Networks (cs.SI); Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[37] arXiv:1710.04031 (cross-list from cs.DL) [pdf, other]
Title: The number of linked references of publications in Microsoft Academic in comparison with the Web of Science
Robin Haunschild, Sven E. Hug, Martin P. Brändle, Lutz Bornmann
Comments: 6 pages
Subjects: Digital Libraries (cs.DL); Databases (cs.DB); Information Retrieval (cs.IR)
[38] arXiv:1710.04144 (cross-list from cs.CY) [pdf, other]
Title: GUIDES - Geospatial Urban Infrastructure Data Engineering Solutions
Booma Sowkarthiga Balasubramani, Omar Belingheri, Eric S. Boria, Isabel F. Cruz, Sybil Derrible, Michael D. Siciliano
Comments: 4 pages, SIGSPATIAL'17, November 7-10, 2017, Los Angeles Area, CA, USA
Subjects: Computers and Society (cs.CY); Databases (cs.DB)
[39] arXiv:1710.04469 (cross-list from cs.DC) [pdf, other]
Title: Pure Operation-Based Replicated Data Types
Carlos Baquero, Paulo Sergio Almeida, Ali Shoker
Comments: 30 pages
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Databases (cs.DB); Data Structures and Algorithms (cs.DS)
[40] arXiv:1710.05091 (cross-list from cs.LG) [pdf, other]
Title: A simple data discretizer
Gourab Mitra, Shashidhar Sundareisan, Bikash Kanti Sarkar
Subjects: Machine Learning (cs.LG); Databases (cs.DB); Machine Learning (stat.ML)
[41] arXiv:1710.05693 (cross-list from cs.AI) [pdf, other]
Title: Mining Frequent Patterns in Process Models
David Chapela-Campa, Manuel Mucientes, Manuel Lama
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[42] arXiv:1710.06590 (cross-list from cs.DL) [pdf, other]
Title: MEDOC: a Python wrapper to load MEDLINE into a local MySQL database
Emeric Dynomant, Mathilde Gorieu, Helene Perrin, Marion Denorme, Fabien Pichon, Arnaud Desfeux
Comments: 4 pages, 1 figure
Subjects: Digital Libraries (cs.DL); Databases (cs.DB)
[43] arXiv:1710.07114 (cross-list from cs.AI) [pdf, other]
Title: Swift Linked Data Miner: Mining OWL 2 EL class expressions directly from online RDF datasets
Jedrzej Potoniec, Piotr Jakubowski, Agnieszka Ławrynowicz
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
[44] arXiv:1710.07660 (cross-list from cs.LO) [pdf, other]
Title: Verifying Equivalence of Database-Driven Applications
Yuepeng Wang, Isil Dillig, Shuvendu K. Lahiri, William R. Cook
Subjects: Logic in Computer Science (cs.LO); Databases (cs.DB)
[45] arXiv:1710.08436 (cross-list from cs.DS) [pdf, other]
Title: HyperMinHash: MinHash in LogLog space
Yun William Yu, Griffin M. Weber
Comments: 12 pages, 6 figures
Subjects: Data Structures and Algorithms (cs.DS); Databases (cs.DB)
[46] arXiv:1710.10088 (cross-list from cs.CV) [pdf, other]
Title: Fine-grained Pattern Matching Over Streaming Time Series
Rong Kang, Chen Wang, Peng Wang, Yuting Ding, Jianmin Wang
Comments: 14 pages, 14 figures, 29 conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[47] arXiv:1710.11531 (cross-list from cs.AI) [pdf, other]
Title: SemTK: An Ontology-first, Open Source Semantic Toolkit for Managing and Querying Knowledge Graphs
Paul Cuddihy, Justin McHugh, Jenny Weisenberg Williams, Varish Mulwad, Kareem S. Aggour
Subjects: Artificial Intelligence (cs.AI); Databases (cs.DB)
Total of 47 entries
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack