ACM SIGMOD Conference 2021: Virtual Event, China
- Guoliang Li, Zhanhuai Li, Stratos Idreos, Divesh Srivastava:
SIGMOD '21: International Conference on Management of Data, Virtual Event, China, June 20-25, 2021. ACM 2021, ISBN 978-1-4503-8343-1
Keynote Talks
- Kenneth A. Ross:
Utilizing (and Designing) Modern Hardware for Data-Intensive Computations: The Role of Abstraction. 1
- Wang-Chiew Tan:
Deep Data Integration. 2
Award Talks
- Natacha Crooks:
A Client-centric Approach to Transactional Datastores. 3-5
- Maximilian Schleich:
Structure-Aware Machine Learning over Multi-Relational Databases. 6-7
- Erfan Zamanian:
Scalable Distributed Transaction Processing on Modern RDMA-enabled Networks. 8
- Huanchen Zhang:
Memory-Efficient Search Trees for Database Management Systems. 9
Research Data Management Track Papers
- Samira Akili, Matthias Weidlich:
MuSE Graphs for Flexible Distribution of Event Stream Processing in Networks. 10-22
- Rana Alotaibi, Bogdan Cautis, Alin Deutsch, Ioana Manolescu:
HADAD: A Lightweight Approach for Optimizing Hybrid Complex Analytics Queries. 23-35
- Daichi Amagata, Makoto Onizuka, Takahiro Hara:
Fast and Exact Outlier Detection in Metric Spaces: A Proximity Graph-based Approach. 36-48
- Daichi Amagata, Takahiro Hara:
Fast Density-Peaks Clustering: Multicore-based Parallelization Approach. 49-61
- Sihem Amer-Yahia, Tova Milo, Brit Youngmann:
Exploring Ratings in Subjective Databases. 62-75
- Mohammad Javad Amiri, Divyakant Agrawal, Amr El Abbadi:
SharPer: Sharding Permissioned Blockchains Over Network Clusters. 76-88
- Arvind Arasu, Badrish Chandramouli, Johannes Gehrke, Esha Ghosh, Donald Kossmann, Jonathan Protzenko, Ravi Ramamurthy, Tahina Ramananandro, Aseem Rastogi, Srinath T. V. Setty, Nikhil Swamy, Alexander van Renen, Min Xu:
FastVer: Making Data Integrity a Commodity. 89-101
- Diego Arroyuelo, Aidan Hogan, Gonzalo Navarro, Juan L. Reutter, Javiel Rojas-Ledesma, Adrián Soto:
Worst-Case Optimal Graph Joins in Almost No Space. 102-114
- Vadim Arzamasov, Klemens Böhm:
REDS: Rule Extraction for Discovering Scenarios. 115-128
- Abolfazl Asudeh, Nima Shahbazi, Zhongjun Jin, H. V. Jagadish:
Identifying Insufficient Data Coverage for Ordinal Continuous-Valued Attributes. 129-141
- Jees Augustine, Suraj Shetiya, Mohammadreza Esfandiari, Senjuti Basu Roy, Gautam Das:
A Generalized Approach for Reducing Expensive Distance Calls for A Broad Class of Proximity Problems. 142-154
- Darshana Balakrishnan, Carl Nuessle, Oliver Kennedy, Lukasz Ziarek:
TreeToaster: Towards an IVM-Optimized Compiler. 155-167
- Maximilian Bandle, Jana Giceva, Thomas Neumann:
To Partition, or Not to Partition, That is the Join Question in a Real System. 168-180
- Kaustubh Beedkar, Jorge-Arnulfo Quiané-Ruiz, Volker Markl:
Compliant Geo-distributed Query Processing. 181-193
- Souvik Bhattacherjee, Gang Liao, Michael Hicks, Daniel J. Abadi:
BullFrog: Online Schema Evolution via Lazy Evaluation. 194-206
- Shaofeng Cai, Kaiping Zheng, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Meihui Zhang:
ARM-Net: Adaptive Relation Modeling Network for Structured Data. 207-220
- Jeeta Ann Chacko, Ruben Mayer, Hans-Arno Jacobsen:
Why Do My Blockchain Transactions Fail?: A Study of Hyperledger Fabric. 221-234
- Aleksey Charapko, Ailidani Ailijiang, Murat Demirbas:
PigPaxos: Devouring the Communication Bottlenecks in Distributed Consensus. 235-247
- Lu Chen, Chengfei Liu, Rui Zhou, Jiajie Xu, Jianxin Li:
Efficient Exact Algorithms for Maximum Balanced Biclique Search in Bipartite Graphs. 248-260
- Peiqing Chen, Dong Chen, Lingxiao Zheng, Jizhou Li, Tong Yang:
Out of Many We are One: Measuring Item Batch with Clock-Sketch. 261-273
- Xingguang Chen, Sibo Wang:
Efficient Approximate Algorithms for Empirical Entropy and Mutual Information. 274-286
- Yueting Chen, Xiaohui Yu, Nick Koudas, Ziqiang Yu:
Evaluating Temporal Queries Over Video Feeds. 287-299
- Zihao Chen, Chen Xu, Juan Soto, Volker Markl, Weining Qian, Aoying Zhou:
Hybrid Evaluation for Distributed Iterative Matrix Computation. 300-312
- Zitong Chen, Ada Wai-Chee Fu, Minhao Jiang, Eric Lo, Pengfei Zhang:
P2H: Efficient Distance Querying on Road Networks by Projected Vertex Separators. 313-325
- Yodsawalai Chodpathumwan, Arash Termehchy, Stephen A. Ramsey, Aayam Shrestha, Amy Glen, Zheng Liu:
Structural Generalizability: The Case of Similarity Search. 326-338
- Björn Daase, Lars Jonas Bollmeier, Lawrence Benson, Tilmann Rabl:
Maximizing Persistent Memory Bandwidth Utilization for OLAP Workloads. 339-351
- Zhenwei Dai, Aditya Desai, Reinhard Heckel, Anshumali Shrivastava:
Active Sampling Count Sketch (ASCS) for Online Sparse Estimation of a Trillion Scale Covariance Matrix. 352-364
- Niv Dayan, Moshe Twitto:
Chucky: A Succinct Cuckoo Filter for LSM-Tree. 365-378
- Daniel Deutch, Ariel Frankenthal, Amir Gilad, Yuval Moskovitch:
On Optimizing the Trade-off between Privacy and Utility in Data Provenance. 379-391
- Yanlei Diao, Pawel Guzewicz, Ioana Manolescu, Mirjana Mazuran:
Efficient Exploration of Interesting Aggregates in RDF Graphs. 392-404
- Ralf Diestelkämper, Seokki Lee, Melanie Herschel, Boris Glavic:
To Not Miss the Forest for the Trees - A Holistic Approach for Explaining Missing Answers over Nested Data. 405-417
- Jialin Ding, Umar Farooq Minhas, Badrish Chandramouli, Chi Wang, Yinan Li, Ying Li, Donald Kossmann, Johannes Gehrke, Tim Kraska:
Instance-Optimized Data Layouts for Cloud Analytics Workloads. 418-431
- Wei Dong, Ke Yi:
Residual Sensitivity for Differentially Private Multi-Way Joins. 432-444
- Dominik Durner, Viktor Leis, Thomas Neumann:
JSON Tiles: Fast Analytics on Semi-Structured Data. 445-458
- Wenfei Fan, Chao Tian, Ruiqi Xu, Qiang Yin, Wenyuan Yu, Jingren Zhou:
Incrementalizing Graph Algorithms. 459-471
- Wenfei Fan, Yuanhao Li, Muyang Liu, Can Lu:
Making Graphs Compact by Lossless Contraction. 472-484
- Omar Farhat, Khuzaima Daudjee, Leonardo Querzoni:
Klink: Progress-Aware Scheduling for Streaming Data Systems. 485-498
- Anna Fariha, Ashish Tiwari, Arjun Radhakrishna, Sumit Gulwani, Alexandra Meliou:
Conformance Constraint Discovery: Measuring Trust in Data-Driven Systems. 499-512
- Guanyu Feng, Zixuan Ma, Daixuan Li, Shengqi Chen, Xiaowei Zhu, Wentao Han, Wenguang Chen:
RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s. 513-527
- Su Feng, Boris Glavic, Aaron Huber, Oliver A. Kennedy:
Efficient Uncertainty Tracking for Complex Queries with Attribute-level Bounds. 528-540
- Weiqi Feng, Dong Deng:
Allign: Aligning All-Pair Near-Duplicate Passages in Long Texts. 541-553
- Yannis Foufoulas, Lefteris Sidirourgos, Elefterios Stamatogiannakis, Yannis E. Ioannidis:
Adaptive Compression for Fast Scans on String Columns. 554-562
- Fangcheng Fu, Yingxia Shao, Lele Yu, Jiawei Jiang, Huanran Xue, Yangyu Tao, Bin Cui:
VF2Boost: Very Fast Vertical Federated Gradient Boosting for Cross-Enterprise Learning. 563-576
- Sainyam Galhotra, Romila Pradhan, Babak Salimi:
Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals. 577-590
- Junyang Gao, Yifan Xu, Pankaj K. Agarwal, Jun Yang:
Efficiently Answering Durability Prediction Queries. 591-604
- Gábor E. Gévay, Jorge-Arnulfo Quiané-Ruiz, Volker Markl:
The Power of Nested Parallelism in Big Data Processing - Hitting Three Flies with One Slap -. 605-618
- Amir Gilad, Shweta Patwa, Ashwin Machanavajjhala:
Synthesizing Linked Data Under Cardinality and Integrity Constraints. 619-631
- Orest Gkini, Theofilos Belmpas, Georgia Koutrika, Yannis E. Ioannidis:
An In-Depth Benchmarking of Text-to-SQL Systems. 632-644
- Xiangyang Gou, Lei Zou:
Sliding Window-based Approximate Triangle Counting over Streaming Graphs with Duplicate Edges. 645-657
- Zhihan Guo, Kan Wu, Cong Yan, Xiangyao Yu:
Releasing Locks As Early As You Can: Reducing Contention of Hotspots by Violating Two-Phase Locking. 658-670
- Kai Han, Benwei Wu, Jing Tang, Shuang Cui, Çigdem Aslay, Laks V. S. Lakshmanan:
Efficient and Effective Algorithms for Revenue Maximization in Social Advertising. 671-684
- Brandon Haynes, Maureen Daum, Dong He, Amrita Mazumdar, Magdalena Balazinska, Alvin Cheung, Luis Ceze:
VSS: A Storage System for Video Analytics. 685-696
- Axel Hertzschuch, Guido Moerkotte, Wolfgang Lehner, Norman May, Florian Wolf, Lars Fricke:
Small Selectivities Matter: Lifting the Burden of Empty Samples. 697-709
- Benjamin Hilprecht, Carsten Binnig:
ReStore - Neural Data Completion for Relational Databases. 710-722
- Denis Hirn, Torsten Grust:
One WITH RECURSIVE is Worth Many GOTOs. 723-735
- Lin Hu, Lei Zou, Yu Liu:
Accelerating Triangle Counting on GPU. 736-748
- Haoyu Huang, Shahram Ghandeharizadeh:
Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store. 749-763
- Kai Huang, Huey-Eng Chua, Sourav S. Bhowmick, Byron Choi, Shuigeng Zhou:
MIDAS: Towards Efficient and Effective Maintenance of Canned Patterns in Visual Graph Query Interfaces. 764-776
- Qiang Huang, Yifan Lei, Anthony K. H. Tung:
Point-to-Hyperplane Nearest Neighbor Search Beyond the Unit Hypersphere. 777-789
- Yuming Huang, Jing Tang, Qianhao Cong, Andrew Lim, Jianliang Xu:
Do the Rich Get Richer? Fairness Analysis for Blockchain Incentives. 790-803
- Yesdaulet Izenov, Asoke Datta, Florin Rusu, Jun Hyung Shin:
COMPASS: Online Sketch-based Query Optimization for In-Memory Databases. 804-816
- Shuping Ji, Hans-Arno Jacobsen:
A-Tree: A Dynamic Data Structure for Efficiently Indexing Arbitrary Boolean Expressions. 817-829
- Peng Jia, Pinghui Wang, Junzhou Zhao, Shuo Zhang, Yiyan Qi, Min Hu, Chao Deng, Xiaohong Guan:
Bidirectionally Densifying LSH Sketches with Empty Bins. 830-842
- Hao Jiang, Chunwei Liu, John Paparrizos, Andrew A. Chien, Jihong Ma, Aaron J. Elmore:
Good to the Last Bit: Data-Driven Encoding with CodecDB. 843-856
- Jiawei Jiang, Shaoduo Gan, Yue Liu, Fanlin Wang, Gustavo Alonso, Ana Klimovic, Ankit Singla, Wentao Wu, Ce Zhang:
Towards Demystifying Serverless Machine Learning Training. 857-871
- Wen Jin, Weining Qian, Aoying Zhou:
Efficient String Sort with Multi-Character Encoding and Adaptive Sampling. 872-884
- Georgios Kalamatianos, Georgios John Fakas, Nikos Mamoulis:
Proportionality in Spatial Keyword Search. 885-897
- Donghe Kang, Ruochen Jiang, Spyros Blanas:
Jigsaw: A Data Storage and Query Processing Engine for Irregular Table Partitioning. 898-911
- Kapil Khurana, Jayant R. Haritsa:
Shedding Light on Opaque Application Queries. 912-924
- Hyunjoon Kim, Yunyoung Choi, Kunsoo Park, Xuemin Lin, Seok-Hee Hong, Wook-Shin Han:
Versatile Equivalences: Speeding up Subgraph Query Processing and Subgraph Matching. 925-937
- Jong-Bin Kim, Kihwang Kim, Hyunsoo Cho, Jaeseon Yu, Sooyong Kang, Hyungsoo Jung:
Rethink the Scan in MVCC Databases. 938-950
- Jongik Kim:
Boosting Graph Similarity Search through Pre-Computation. 951-963
- Kyoungmin Kim, Hyeonji Kim, George Fletcher, Wook-Shin Han:
Combining Sampling and Synopses with Worst-Case Optimal Runtime and Quality Guarantees for Graph Pattern Cardinality Estimation. 964-976
- Seongyun Ko, Taesung Lee, Kijae Hong, Wonseok Lee, In Seo, Jiwon Seo, Wook-Shin Han:
iTurboGraph: Scaling and Automating Incremental Graph Analytics. 977-990
- André Kohn, Viktor Leis, Thomas Neumann:
Building Advanced SQL Analytics From Low-Level Plan Operators. 1001-1013
- Johan Kok Zhi Kang, Gaurav, Sien Yi Tan, Feng Cheng, Shixuan Sun, Bingsheng He:
Efficient Deep Learning Pipelines for Accurate Cost Estimations Over Large Scale Query Workload. 1014-1022
- Michael Körber, Nikolaus Glombiewski, Bernhard Seeger:
Index-Accelerated Pattern Matching in Event Stores. 1023-1036
- Ziliang Lai, Chenxia Han, Chris Liu, Pengfei Zhang, Eric Lo, Ben Kao:
Top-K Deep Video Analytics: A Probabilistic Approach. 1037-1050
- Chenjie Li, Zhengjie Miao, Qitian Zeng, Boris Glavic, Sudeepa Roy:
Putting Things into Context: Rich Explanations for Query Answers using Join Graphs. 1051-1063
- Peng Li, Xiang Cheng, Xu Chu, Yeye He, Surajit Chaudhuri:
Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples. 1064-1076
- Rundong Li, Pinghui Wang, Jiongli Zhu, Junzhou Zhao, Jia Di, Xiaofei Yang, Kai Ye:
Building Fast and Compact Sketches for Approximately Multi-Set Multi-Membership Querying. 1077-1089
- Tianyu Li, Badrish Chandramouli, Jose M. Faleiro, Samuel Madden, Donald Kossmann:
Asynchronous Prefix Recoverability for Fast Distributed Stores. 1090-1102
- Yan Li, Tingjian Ge:
Imminence Monitoring of Critical Events: A Representation Learning Approach. 1103-1115
- Yin Li, Dhrubajyoti Ghosh, Peeyush Gupta, Sharad Mehrotra, Nisha Panwar, Shantanu Sharma:
PRISM: Private Verifiable Set Computation over Multi-Owner Outsourced Databases. 1116-1128
- Xi Liang, Stavros Sintos, Zechao Shang, Sanjay Krishnan:
Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing. 1129-1141
- Xueling Lin, Lei Chen, Chaorui Zhang:
TENET: Joint Entity and Relation Linking with Coherence Relaxation. 1142-1155
- Yu-Shan Lin, Ching Tsai, Tz-Yu Lin, Yun-Sheng Chang, Shan-Hung Wu:
Don't Look Back, Look into the Future: Prescient Data Partitioning and Migration for Deterministic Database Systems. 1156-1168
- Sebastian Link, Ziheng Wei:
Logical Schema Design that Quantifies Update Inefficiency and Join Efficiency. 1169-1181
- Ester Livshits, Rina Kochirgan, Segev Tsur, Ihab F. Ilyas, Benny Kimelfeld, Sudeepa Roy:
Properties of Inconsistency Measures for Databases. 1182-1194
- Can Lu, Jeffrey Xu Yu, Zhiwei Zhang, Hong Cheng:
Graph Iso/Auto-morphism: A Divide-&-Conquer Approach. 1195-1207
- Shengliang Lu, Shixuan Sun, Johns Paul, Yuchen Li, Bingsheng He:
Cache-Efficient Fork-Processing Patterns on Large Graphs. 1208-1221
- Shangyu Luo, Dimitrije Jankov, Binhang Yuan, Chris Jermaine:
Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra. 1222-1234
- Yuyu Luo, Nan Tang, Guoliang Li, Chengliang Chai, Wenbo Li, Xuedi Qin:
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks. 1235-1247
- Lin Ma, William Zhang, Jie Jiao, Wuwen Wang, Matthew Butrovich, Wan Shen Lim, Prashanth Menon, Andrew Pavlo:
MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems. 1248-1261
- Pingchuan Ma, Rui Ding, Shi Han, Dongmei Zhang:
MetaInsight: Automatic Discovery of Structured Knowledge for Exploratory Data Analysis. 1262-1274
- Ryan Marcus, Parimarjan Negi, Hongzi Mao, Nesime Tatbul, Mohammad Alizadeh, Tim Kraska:
Bao: Making Learned Query Optimization Practical. 1275-1288
- Ruben Mayer, Hans-Arno Jacobsen:
Hybrid Edge Partitioner: Partitioning Large Power-Law Graphs under Memory Constraints. 1289-1302
- Zhengjie Miao, Yuliang Li, Xiaolan Wang:
Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond. 1303-1316
- Kyriakos Mouratidis, Keming Li, Bo Tang:
Marrying Top-k with Skyline Queries: Relaxing the Preference Input while Producing Output of Controllable Size. 1317-1330
- Jan Mühlig, Jens Teubner:
MxTasks: How to Make Efficient Synchronization and Prefetching Easy. 1331-1344
- Felix Neutatz, Felix Biessmann, Ziawasch Abedjan:
Enforcing Constraints for Machine Learning Systems via Declarative Feature Selection: An Experimental Study. 1345-1358
- Wangze Ni, Peng Cheng, Lei Chen, Xuemin Lin:
When the Recursive Diversity Anonymity Meets the Ring Signature. 1359-1371
- Prashant Pandey, Brian Wheatman, Helen Xu, Aydin Buluç:
Terrace: A Hierarchical Graph Container for Skewed Dynamic Graphs. 1372-1385
- Prashant Pandey, Alex Conway, Joe Durie, Michael A. Bender, Martin Farach-Colton, Rob Johnson:
Vector Quotient Filters: Overcoming the Time/Space Trade-Off in Filter Design. 1386-1399
- Eliana Pastor, Luca de Alfaro, Elena Baralis:
Looking for Trouble: Analyzing Classifier Behavior via Pattern Divergence. 1400-1412
- Johns Paul, Shengliang Lu, Bingsheng He, Chiew Tong Lau:
MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures. 1413-1425
- Arnab Phani, Benjamin Rath, Matthias Boehm:
LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems. 1426-1439
- Jose Picado, Arash Termehchy, Alan Fern, Sudhanshu Pathak, Praveen Ilango, John Davis:
Scalable and Usable Relational Learning With Automatic Language Bias. 1440-1451
- Olga Poppe, Chuan Lei, Lei Ma, Allison Rozet, Elke A. Rundensteiner:
To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams. 1452-1464
- Yuan Qiu, Yilei Wang, Ke Yi, Feifei Li, Bin Wu, Chaoqun Zhan:
Weighted Distinct Sampling: Cardinality Estimation for SPJ Queries. 1465-1477