ACM SIGMOD Conference 2013: New York, NY, USA
- Kenneth A. Ross, Divesh Srivastava, Dimitris Papadias:
Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York, NY, USA, June 22-27, 2013. ACM 2013, ISBN 978-1-4503-2037-5
Research session 1: data analytics
- Botong Huang, Shivnath Babu, Jun Yang:
Cumulon: optimizing statistical data analysis in the cloud. 1-12
- Reynold S. Xin, Josh Rosen, Matei Zaharia, Michael J. Franklin, Scott Shenker, Ion Stoica:
Shark: SQL and rich analytics at scale. 13-24
- Petrie Wong, Zhian He, Eric Lo:
Parallel analytics as a service. 25-36
Research session 2: XML
- Ba Quan Truong, Sourav S. Bhowmick, Curtis E. Dyreson, Aixin Sun:
MESSIAH: missing element-conscious SLCA nodes search in XML data. 37-48
- Sara Cohen:
Indexing for subtree similarity-search using edit distance. 49-60
- Marcelo Arenas, Jonny Daenen, Frank Neven, Martín Ugarte, Jan Van den Bussche, Stijn Vansummeren:
Discovering XSD keys from XML data. 61-72
Research session 3: transactions
- Hyungsoo Jung, Hyuck Han, Alan D. Fekete, Gernot Heiser, Heon Young Yeom:
A scalable lock manager for multicores. 73-84
- Goetz Graefe, Mark Lillibridge, Harumi A. Kuno, Joseph A. Tucek, Alistair C. Veitch:
Controlled lock violation. 85-96
- Woon-Hak Kang, Sang-Won Lee, Bongki Moon, Gi-Hwan Oh, Changwoo Min:
X-FTL: transactional FTL for SQLite databases. 97-108
Research session 4: data storage
- Wangchao Le, Feifei Li, Yufei Tao, Robert Christensen:
Optimal splitters for temporal and multi-version databases. 109-120
- Mihaela A. Bornea, Julian Dolby, Anastasios Kementsietsidis, Kavitha Srinivas, Patrick Dantressangle, Octavian Udrea, Bishwaranjan Bhattacharjee:
Building an efficient RDF store over a relational database. 121-132
- Yannis Klonatos, Andres Nötzli, Andrej Spielmann, Christoph Koch, Viktor Kuncak:
Automatic synthesis of out-of-core algorithms. 133-144
Research session 5: schema matching and spatial databases I
- Meihui Zhang, Kaushik Chakrabarti:
InfoGather+: semantic matching and annotation of numeric and time-varying attributes in web tables. 145-156
- Patricia C. Arocena, Boris Glavic, Renée J. Miller:
Value invention in data exchange. 157-168
- Hanan Samet, Jagan Sankaranarayanan, Michael Auerbach:
Indexing methods for moving object databases: games and other applications. 169-180
Research session 6: graph connectivity
- Zhiwei Zhang, Jeffrey Xu Yu, Lu Qin, Lijun Chang, Xuemin Lin:
I/O efficient: computing SCCs in massive graphs. 181-192
- James Cheng, Silu Huang, Huanhuan Wu, Ada Wai-Chee Fu:
TF-Label: a topological-folding labeling scheme for reachability querying in a large graph. 193-204
- Lijun Chang, Jeffrey Xu Yu, Lu Qin, Xuemin Lin, Chengfei Liu, Weifa Liang:
Efficiently computing k-edge connected components via graph decomposition. 205-216
Research session 7: crowdsourcing
- Jinyang Gao, Xuan Liu, Beng Chin Ooi, Haixun Wang, Gang Chen:
An online cost sensitive decision-making method in crowdsourcing systems. 217-228
- Jiannan Wang, Guoliang Li, Tim Kraska, Michael J. Franklin, Jianhua Feng:
Leveraging transitive relations for crowdsourced joins. 229-240
- Yael Amsterdamer, Yael Grossman, Tova Milo, Pierre Senellart:
Crowd mining. 241-252
Research session 8: social media
- Mikalai Tsytsarau, Sihem Amer-Yahia, Themis Palpanas:
Efficient sentiment correlation for large-scale demographics. 253-264
- Huy Pham, Cyrus Shahabi, Yan Liu:
EBM: an entropy-based model to infer social strength from spatiotemporal data. 265-276
- Wanyun Cui, Yanghua Xiao, Haixun Wang, Yiqi Lu, Wei Wang:
Online search of overlapping communities. 277-288
Research session 9: systems, performance I
- Yinan Li, Jignesh M. Patel:
BitWeaving: fast scans for main memory data processing. 289-300
- Barzan Mozafari, Carlo Curino, Alekh Jindal, Samuel Madden:
Performance and resource modeling in highly-concurrent OLTP workloads. 301-312
- Kyu-Young Whang, Tae-Seob Yun, Yeon-Mi Yeo, Il-Yeol Song, Hyuk-Yoon Kwon, In-Joong Kim:
ODYS: an approach to building a massively-parallel search engine using a DB-IR tightly-integrated parallel DBMS for higher-level functionality. 313-324
Research session 10: graph management
- Xiaocheng Hu, Yufei Tao, Chin-Wan Chung:
Massive graph triangulation. 325-336
- Wook-Shin Han, Jinsoo Lee, Jeong-Hoon Lee:
Turboiso: towards ultrafast and robust subgraph isomorphism search in large graph databases. 337-348
- Takuya Akiba, Yoichi Iwata, Yuichi Yoshida:
Fast exact shortest-path distance queries on large networks by pruned landmark labeling. 349-360
Research session 11: text databases
- Xiaochun Yang, Bin Wang, Tao Qiu, Yaoshu Wang, Chen Li:
Improving regular-expression matching on strings using negative factors. 361-372
- Jiaheng Lu, Chunbin Lin, Wei Wang, Chen Li, Haiyong Wang:
String similarity measures and joins with synonyms. 373-384
- Younghoon Kim, Kyuseok Shim:
Efficient top-k algorithms for approximate substring matching. 385-396
Research session 12: systems, performance II
- Ce Zhang, Christopher Ré:
Towards high-throughput Gibbs sampling at scale: a study across storage managers. 397-408
- Takashi Horikawa:
Latch-free data structures for DBMS: design, implementation, and evaluation. 409-420
- Sabah Currim, Richard T. Snodgrass, Young-Kyoon Suh, Rui Zhang, Matthew Wong Johnson, Cheng Yi:
DBMS metrology: measuring query time. 421-432
Research session 13: information extraction
- Yan Zheng, Jeffrey Jestes, Jeff M. Phillips, Feifei Li:
Quality and efficiency for kernel density estimates in large data. 433-444
- Yasuhiro Fujiwara, Makoto Nakatsuji, Hiroaki Shiokawa, Takeshi Mishima, Makoto Onizuka:
Efficient ad-hoc search for personalized PageRank. 445-456
- Sudeepa Roy, Laura Chiticariu, Vitaly Feldman, Frederick Reiss, Huaiyu Zhu:
Provenance-based dictionary refinement in information extraction. 457-468
Research session 14: query processing and optimization
- Feng Yu, Wen-Chi Hou, Cheng Luo, Dunren Che, Mengxia Zhu:
CS2: a new database synopsis for query estimation. 469-480
- Akrivi Vlachou, Christos Doulkeridis, Kjetil Nørvåg, Yannis Kotidis:
Branch-and-bound algorithm for reverse top-k queries. 481-492
- Guido Moerkotte, Pit Fender, Marius Eich:
On the correct and complete enumeration of the core search space. 493-504
Research session 15: cloud computing
- Bin Shao, Haixun Wang, Yatao Li:
Trinity: a distributed graph engine on a memory cloud. 505-516
- Aaron J. Elmore, Sudipto Das, Alexander Pucher, Divyakant Agrawal, Amr El Abbadi, Xifeng Yan:
Characterizing tenant behavior for placement and crisis mitigation in multitenant DBMSs. 517-528
- Yufei Tao, Wenqing Lin, Xiaokui Xiao:
Minimal MapReduce algorithms. 529-540
Research session 16: data cleaning
- Michele Dallachiesa, Amr Ebaid, Ahmed Eldawy, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Nan Tang:
NADEEF: a commodity data cleaning system. 541-552
- Mohamed Yakout, Laure Berti-Équille, Ahmed K. Elmagarmid:
Don't be SCAREd: use SCalable Automatic REpairing with maximal likelihood and bounded changes. 553-564
- Yang Cao, Wenfei Fan, Wenyuan Yu:
Determining the relative accuracy of attributes. 565-576
Research session 17: complex event processing
- Rajagopal Ananthanarayanan, Venkatesh Basker, Sumit Das, Ashish Gupta, Haifeng Jiang, Tianhao Qiu, Alexey Reznichenko, Deomid Ryabkov, Manpreet Singh, Shivakumar Venkataraman:
Photon: fault-tolerant and scalable joining of continuous data streams. 577-588
- Di Wang, Yeye He, Elke A. Rundensteiner, Jeffrey F. Naughton:
Utility-maximizing event stream suppression. 589-600
- Zheng Li, Tingjian Ge, Cindy X. Chen:
ε-Matching: event processing over noisy sequences in real time. 601-612
Research session 18: systems, performance III
- Paraschos Koutris, Prasang Upadhyaya, Magdalena Balazinska, Bill Howe, Dan Suciu:
Toward practical query pricing with QueryMarket. 613-624
- Michael Armbrust, Eric Liang, Tim Kraska, Armando Fox, Michael J. Franklin, David A. Patterson:
Generalized scale independence through incremental precomputation. 625-636
- Zhuhua Cai, Zografoula Vagena, Luis Leopoldo Perez, Subramanian Arumugam, Peter J. Haas, Christopher M. Jermaine:
Simulation of database-valued Markov chains using SimSQL. 637-648
Research session 19: privacy
- Shixi Chen, Shuigeng Zhou:
Recursive mechanism: towards node differential privacy and unrestricted joins. 653-664
- Jun Zhang, Xiaokui Xiao, Yin Yang, Zhenjie Zhang, Marianne Winslett:
PrivGene: differentially private model fitting using genetic algorithms. 665-676
- Bing-Rong Lin, Daniel Kifer:
Information preservation in statistical privacy and Bayesian estimation of unattributed histograms. 677-688
Research session 20: spatial databases II
- Cheng Long, Raymond Chi-Wing Wong, Ke Wang, Ada Wai-Chee Fu:
Collective spatial keyword queries: a distance owner-driven approach. 689-700
- Sadegh Nobari, Farhan Tauheed, Thomas Heinis, Panagiotis Karras, Stéphane Bressan, Anastasia Ailamaki:
TOUCH: in-memory spatial join by hierarchical data-oriented partitioning. 701-712
- Wuman Luo, Haoyu Tan, Lei Chen, Lionel M. Ni:
Finding time period-based most frequent path in big trajectory data. 713-724
Research session 21: data streams
- Raul Castro Fernandez, Matteo Migliavacca, Evangelia Kalyvianaki, Peter R. Pietzuch:
Integrating scale out and fault tolerance in stream processing using operator state management. 725-736
- Lu Wang, Ge Luo, Ke Yi, Graham Cormode:
Quantiles over data streams: an experimental study. 737-748
- Lisi Chen, Gao Cong, Xin Cao:
An efficient query indexing mechanism for filtering geo-textual data. 749-760
Research session 22: distributed systems
- Peter Bailis, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica:
Bolt-on causal consistency. 761-772
- Jan Schaffner, Tim Januschowski, Megan Kercher, Tim Kraska, Hasso Plattner, Michael J. Franklin, Dean Jacobs:
RTP: robust tenant placement for elastic in-memory database clusters. 773-784
- Jingkuan Song, Yang Yang, Yi Yang, Zi Huang, Heng Tao Shen:
Inter-media hashing for large-scale retrieval from heterogeneous data sources. 785-796
Research session 23: data mining
- Iris Miliaraki, Klaus Berberich, Rainer Gemulla, Spyros Zoupanos:
Mind the gap: large-scale frequent sequence mining. 797-808
- Meihui Zhang, Hazem Elmeleegy, Cecilia M. Procopiuc, Divesh Srivastava:
Reverse engineering complex join queries. 809-820
- Feida Zhu, Zequn Zhang, Qiang Qu:
A direct mining approach to efficient constrained graph pattern discovery. 821-832
Research session 24: road networks and trajectories
- Han Su, Kai Zheng, Haozhou Wang, Jiamin Huang, Xiaofang Zhou:
Calibrating trajectory data for similarity-based analysis. 833-844
- Cheng Long, Raymond Chi-Wing Wong, Philip S. Yu, Minhao Jiang:
On optimal worst-case matching. 845-856
- Andy Diwen Zhu, Hui Ma, Xiaokui Xiao, Siqiang Luo, Youze Tang, Shuigeng Zhou:
Shortest path and distance queries on road networks: towards bridging theory and practice. 857-868
Research session 25: security
- Gabriel Bender, Lucja Kot, Johannes Gehrke, Christoph Koch:
Fine-grained disclosure control for app ecosystems. 869-880
- Stavros Papadopoulos, Graham Cormode, Antonios Deligiannakis, Minos N. Garofalakis:
Lightweight authentication of linear algebraic queries on data streams. 881-892
Research session 26: indexing
- Lefteris Sidirourgos, Martin L. Kersten:
Column imprints: a secondary index structure. 893-904
- Jan Finis, Robert Brunel, Alfons Kemper, Thomas Neumann, Franz Färber, Norman May:
DeltaNI: an efficient labeling scheme for versioned hierarchical data. 905-916
Keynote addresses
- Alex Nazaruk, Michael Rauchman:
Big data in capital markets. 917-918
- Paul Yaron:
Managing database technology at enterprise scale. 919-920
Panel
- David J. DeWitt, Ihab F. Ilyas, Jeffrey F. Naughton, Michael Stonebraker:
We are drowning in a sea of least publishable units (LPUs). 921-922
Tutorials
- Philip A. Bernstein, Sudipto Das:
Rethinking eventual consistency. 923-928
- Ashraf Aboulnaga, Shivnath Babu:
Workload management for big data analytics. 929-932
- Fabian M. Suchanek, Gerhard Weikum:
Knowledge harvesting in the big-data era. 933-938
- Tyson Condie, Paul Mineiro, Neoklis Polyzotis, Markus Weimer:
Machine learning for big data. 939-942
- Richard Hull, Jianwen Su, Roman Vaculín:
Data management perspectives on business process management: tutorial overview. 943-948
- Lukasz Golab, Theodore Johnson:
Data stream warehousing. 949-952
Demo session 1: data intensive applications
- Alexandros Stougiannis, Mirjana Pavlovic, Farhan Tauheed, Thomas Heinis, Anastasia Ailamaki:
Data-driven neuroscience: enabling breakthroughs via innovative data management. 953-956
- Guoliang Li, Nan Zhang, Ruicheng Zhong, Sitong Liu, Weihuang Huang, Ju Fan, Kian-Lee Tan, Lizhu Zhou, Jianhua Feng:
TsingNUS: a location-based service system towards live city. 957-960
- René Müller, Tim Kaldewey, Guy M. Lohman, John McPherson:
WOW: what the world of (data) warehousing can learn from the World of Warcraft. 961-964
- Serge Abiteboul, Émilien Antoine, Gerome Miklau, Julia Stoyanovich, Jules Testard:
Rule-based application development using Webdamlog. 965-968
- Alvin Cheung, Owen Arden, Samuel Madden, Andrew C. Myers:
Speeding up database applications with Pyxis. 969-972
- Alex Cheng, Nilesh Bansal, Nick Koudas:
Peckalytics: analyzing experts and interests on Twitter. 973-976
- Fernando Seabra Chirigati, Dennis E. Shasha, Juliana Freire:
Packing experiments for sharing and publication. 977-980
- Manasi Vartak, Samuel Madden:
CHIC: a combination-based recommendation system. 981-984
- Charles Tian, Yan Huang, Zhi Liu, Favyen Bastani, Ruoming Jin:
Noah: a dynamic ridesharing system. 985-988
- Siarhei Bykau, Flavio Rizzolo, Yannis Velegrakis:
A query answering system for data with evolution relationships. 989-992
Demo session 2: data analysis and mining; privacy; security
- Ce Zhang, Vidhya Govindaraju, Jackson Borchardt, Tim Foltz, Christopher Ré, Shanan Peters:
GeoDeepDive: statistical inference using familiar data-processing languages. 993-996
- François Goasdoué, Konstantinos Karanasos, Yannis Katsis, Julien Leblay, Ioana Manolescu, Stamatis Zampetakis:
Fact checking and analyzing the web. 997-1000
- Carlos Ordonez, Javier García-García, Carlos Garcia-Alvarado, Wellington Cabrera, Veerabhadran Baladandayuthapani, Mohammed S. Quraishi:
Data mining algorithms as a service in the cloud exploiting relational database systems. 1001-1004
- Adrien Guille, Cécile Favre, Hakim Hacid, Djamel A. Zighed:
SONDY: an open source platform for social dynamics mining and analysis. 1005-1008
- Elke Achtert, Hans-Peter Kriegel, Erich Schubert, Arthur Zimek:
Interactive data mining with 3D-parallel-coordinate-trees. 1009-1012
- Mike Barnett, Badrish Chandramouli, Robert DeLine, Steven Mark Drucker, Danyel Fisher, Jonathan Goldstein, Patrick Morrison, John C. Platt:
Stat!: an interactive analytics environment for big data. 1013-1016
- Abhishek Mukherji, Xika Lin, Christopher R. Botaish, Jason Whitehouse, Elke A. Rundensteiner, Matthew O. Ward, Carolina Ruiz:
PARAS: interactive parameter space exploration for association rule mining. 1017-1020
- Theodoros Lappas, Marcos R. Vieira, Dimitrios Gunopulos, Vassilis J. Tsotras:
STEM: a spatio-temporal miner for bursty activity. 1021-1024
- Craig P. Sayers, Alkis Simitsis, Georgia Koutrika, Alejandro Guerrero Gonzalez, David Tamez Cantu, Meichun Hsu:
The farm: where Pig scripts are bred and raised. 1025-1028
- Luca Bonomi, Li Xiong, James J. Lu:
LinkIT: privacy preserving record linkage and integration via transformations. 1029-1032
- Arvind Arasu, Spyros Blanas, Ken Eguro, Manas Joglekar, Raghav Kaushik, Donald Kossmann, Ravishankar Ramamurthy, Prasang Upadhyaya, Ramarathnam Venkatesan:
Secure database-as-a-service with Cipherbase. 1033-1036
Demo session 3: database optimization; performance
- Ioannis Konstantinou, Dimitrios Tsoumakos, Ioannis Mytilinis, Nectarios Koziris:
DBalancer: distributed load balancing for NoSQL data-stores. 1037-1040
- Ioannis Konstantinou, Verena Kantere, Dimitrios Tsoumakos, Nectarios Koziris:
COCCUS: self-configured cost-based query services in the cloud. 1041-1044
- Georgios Giannikis, Darko Makreshanski, Gustavo Alonso, Donald Kossmann:
Workload optimization using SharedDB. 1045-1048
- Ying Zhang, Martin L. Kersten, Stefan Manegold:
SciQL: array data processing inside an RDBMS. 1049-1052
- Stephan Ewen, Sebastian Schelter, Kostas Tzoumas, Daniel Warneke, Volker Markl:
Iterative parallel data processing with Stratosphere: an inside look. 1053-1056
- Alekh Jindal, Jorge-Arnulfo Quiané-Ruiz, Samuel Madden:
CARTILAGE: adding flexibility to the Hadoop skeleton. 1057-1060
- Dimitrios Georgiadis, Maria Kontaki, Anastasios Gounaris, Apostolos N. Papadopoulos, Kostas Tsichlas, Yannis Manolopoulos:
Continuous outlier detection in data streams: an extensible framework and state-of-the-art algorithms. 1061-1064
- Liyue Fan, Li Xiong, Vaidy S. Sunderam:
FAST: differentially private real-time aggregate monitor with filtering and adaptive sampling. 1065-1068
- Harold Lim, Shivnath Babu:
Execution and optimization of continuous queries with Cyclops. 1069-1072
- Louis Woods, Jens Teubner, Gustavo Alonso:
Less watts, more performance: an intelligent storage engine for data appliances. 1073-1076
- Vivek R. Narasayya, Sudipto Das, Manoj Syamala, Surajit Chaudhuri, Feng Li,