default search action
BigData Congress 2015: New York City, NY, USA
- Barbara Carminati, Latifur Khan:
2015 IEEE International Congress on Big Data, New York City, NY, USA, June 27 - July 2, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-7278-7
Research Track
Research Session 1: Mining I
- N. Denizcan Vanli, Muhammed O. Sayin, Ibrahim Delibalta, Suleyman Serdar Kozat:
A Scalable Approach for Online Hierarchical Big Data Mining. 1-8 - Aris-Kyriakos Koliopoulos, Paraskevas Yiapanis, Firat Tekiner, Goran Nenadic, John A. Keane:
A Parallel Distributed Weka Framework for Big Data Mining Using Spark. 9-16 - José I. Rodrigues, Mauro J. G. Figueiredo, Ivo Silvestre, Cristina Veiga-Pires:
Geometrical and Topological Modelling: A Fast Computation of Spatial 3D TLS Data Selections. 17-24
Research Session 2: Mining II
- Elena Baralis, Luca Cagliero, Paolo Garza, Luigi Grimaudo:
PaWI: Parallel Weighted Itemset Mining by Means of MapReduce. 25-32 - Mulugeta Mammo, Srividya K. Bansal:
Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce. 33-40 - Bo Yan, Yitian Ren, Zijiang Yang:
A GPU Based SVM Method with Accelerated Kernel Matrix Calculation. 41-46
Research Session 3: Big Data and Social Network
- Paolo Suppa, Eugenio Zimeo:
A Clustered Approach for Fast Computation of Betweenness Centrality in Social Networks. 47-54 - Stefano Faralli, Giovanni Stilo, Paola Velardi:
A Semantic Recommender for Micro-blog Users. 55-62 - Youliang Zhong, Jian Yang, Robertus Nugroho:
Incorporating Tie Strength in Robust Social Recommendation. 63-70
Research Session 4: Big Data and Social Network
- Robertus Nugroho, Youliang Zhong, Jian Yang, Cécile Paris, Surya Nepal:
Matrix Inter-joint Factorization - A New Approach for Topic Derivation in Twitter. 79-86 - Robertus Nugroho, Jian Yang, Youliang Zhong, Cécile Paris, Surya Nepal:
Deriving Topics in Twitter by Exploiting Tweet Interactions. 87-94
Research Session 5: Privacy
- Jingquan Li, Xueying Li:
Privacy Preserving Data Analysis in Mental Health Research. 95-101 - Christian Schaefer, P. M. Manoj:
Enabling Privacy Mechanisms in Apache Storm. 102-109 - Chao Han, Ke Wang:
Sensitive Disclosures under Differential Privacy Guarantees. 110-117
Research Session 6: Big Data and Learning
- Hussein Mohsen, Hasan Kurban, Kurt Zimmer, Mark Jenne, Mehmet M. Dalkiliç:
Red-RF: Reduced Random Forest for Big Data Using Priority Voting & Dynamic Data Reduction. 118-125 - Muhammed O. Sayin, N. Denizcan Vanli, Ibrahim Delibalta, Suleyman Serdar Kozat:
Optimal and Efficient Distributed Online Learning for Big Data. 126-133 - Huaming Chen, Hong Zhao, Jun Shen, Rui Zhou, Qingguo Zhou:
Supervised Machine Learning Model for High Dimensional Gene Data in Colon Cancer Detection. 134-141
Research Session 7: Query
- Phani Rohit Mullangi, Lakshmish Ramaswamy:
CoUPE: Continuous Query Processing Engine for Evolving Graphs. 142-149 - Jianting Zhang, Simin You, Le Gruenwald:
Lightweight Distributed Execution Engine for Large-Scale Spatial Join Query Processing. 150-157 - Duy-Hung Phan, Quang-Nhat Hoang-Xuan, Matteo Dell'Amico, Pietro Michiardi:
Efficient and Self-Balanced ROLLUP Aggregates for Large-Scale Data Summarization. 158-165
Research Session 8: Big Data Processing
- Parijat Shukla, Arun K. Somani:
Tree Matching Using Data Shaping. 166-173 - Yehia Elshater, Patrick Martin, Dan Rope, Mike McRoberts, Craig Statchuk:
A Study of Data Locality in YARN. 174-181 - Wilson A. Higashino, Miriam A. M. Capretz, Luiz F. Bittencourt:
CEPSim: A Simulator for Cloud-Based Complex Event Processing. 182-190
Research Session 9: Big Data Quality
- Ikbal Taleb, Rachida Dssouli, Mohamed Adel Serhani:
Big Data Pre-processing: A Quality Framework. 191-198 - Marianela García Lozano, Ulrik Franke, Magnus Rosell, Vladimir Vlassov:
Towards Automatic Veracity Assessment of Open Source Information. 199-206 - Daniel Joseph, Nikolay Mehandjiev, Babis Theodoulidis, John Davies, Ian Thurlow:
Identifying Relevant Formal Concepts through the Collapse Index. 207-214
Research Session 10: Big Data Platform/Framework
- Hong Liu, Ashwin Kumar T. K, Johnson P. Thomas:
Cleaning Framework for Big Data - Object Identification and Linkage. 215-221 - Chaochao Zhou, Saurabh Kumar Garg:
Performance Analysis of Scheduling Algorithms for Dynamic Workflow Applications. 222-229 - Rong Zhang, Yuanchao Shu, Zequ Yang, Peng Cheng, Jiming Chen:
Hybrid Traffic Speed Modeling and Prediction Using Real-World Data. 230-237
Research Session 11: Big Data Semantics
- Artem Chebotko, Andrey Kashlev, Shiyong Lu:
A Big Data Modeling Methodology for Apache Cassandra. 238-245 - Avrilia Floratou, Jignesh M. Patel:
Replica Placement in Multi-tenant Database Environments. 246-253 - Mustafa V. Nural, Michael E. Cotterell, John A. Miller:
Using Semantics in Predictive Big Data Analytics. 254-261
Research Session 12: Analysis on Big Data Research and Platforms
- Alan L. Porter, Ying Huang, Jannik Schuehle, Jan L. Youtie:
Meta Data: Big Data Research Evolving across Disciplines, Players, and Topics. 262-267 - Pedro Daniel Coimbra de Almeida, Jorge Bernardino:
Big Data Open Source Platforms. 268-275
Data Science Special Track
DS Session 1
- T. H. A. S. Siriweera, Incheon Paik, Banage T. G. S. Kumara, Koswatte R. C. Koswatta:
Intelligent Big Data Analysis Architecture Based on Automatic Service Composition. 276-280
DS Session 2
- Mohammed Aledhari, Fahad Saeed:
Design and Implementation of Network Transfer Protocol for Big Genomic Data. 281-288 - Ana Cristina Oliveira, Christof Fetzer, André Martin, Marco Spohn:
Optimizing Query Prices for Data-as-a-Service. 289-296 - Alp Oral, Bedir Tekinerdogan:
Supporting Performance Isolation in Software as a Service Systems with Rich Clients. 297-304
Big Data Research in Healthcare Special Track
BDRH Session 1
- Benjamin Yip, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Samuel Y. S. Wong, Kelvin Kam-fai Tsoi:
Blood Pressure Management with Data Capturing in the Cloud among Hypertensive Patients: A Monitoring Platform for Hypertensive Patients. 305-308 - Kin Fai Ho, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Kelvin Kam-fai Tsoi:
Indoor Air Monitoring Platform and Personal Health Reporting System: Big Data Analytics for Public Health Research. 309-312 - Yong-Hong Kuo, Janny M. Y. Leung, Kelvin Kam-fai Tsoi, Helen M. Meng, Colin A. Graham:
Embracing Big Data for Simulation Modelling of Emergency Department Processes and Activities. 313-316
BDRH Session 2
- Xin Lai, Liu Liu, Paul B. S. Lai, Kelvin Kam-fai Tsoi, Haitian Wang, Ka Chun Chong, Benny Zee:
Risk-Adjusted Monitoring Method for Surgical Data: Methodology for Data Analytics (Work in Progress). 317-319 - Marc Chong, Maggie Haitian Wang, Xin Lai, Benny Zee, Fung Hong, Ek Yeoh, Eliza Wong, Carrie Yam, Patsy Chau, Kelvin Kam-fai Tsoi, Colin A. Graham:
Patient Flow Evaluation with System Dynamic Model in an Emergency Department: Data Analytics on Daily Hospital Records. 320-323 - Maggie Haitian Wang, Kelvin Kam-fai Tsoi, Xin Lai, Marc Chong, Benny Zee, Tian Zheng, Shaw-Hwa Lo, Inchi Hu:
Two Screening Methods for Genetic Association Study with Application to Psoriasis Microarray Data Sets. 324-326
Shenzhen Satellite Track
Shenzhen Satellite Session 1
- Chao Ma, Yinda Wang, Haowen Liu, Hao Gui, Weiping Zhu, Xiaochuan Shi, Xuhui Li:
An Approach to Social Relationship Ranking on Internet-Based Social Platforms by Tempo-spatial Data Mining Using Location Prediction Technique. 327-334 - Xiaolu Zhu, Jinglin Li, Zhihan Liu, Fangchun Yang:
Optimization Approach to Depot Location in Car Sharing Systems with Big Data. 335-342 - Dingsheng Wan, Yan Xiao, Pengcheng Zhang, Hareton Leung:
Hydrological Big Data Prediction Based on Similarity Search and Improved BP Neural Network. 343-350 - Liqin Yang, Guosheng Kang, Weigang Cai, Qiang Zhou:
An Effective Process Mining Approach against Diverse Logs Based on Case Classification. 351-358
Taipei Satellite Track
Taipei Satellite Session 1
- Victor W. Chu, Raymond K. Wong, Fang Chen, Chi-Hung Chi:
Web Service Recommendations Based on Time-Aware Bayesian Networks. 359-366 - Wei-Feng Tung, Guillaume Jordann:
Crowdsourcing Service Design for Social Enterprise Insight Innovation. 367-373 - Chuen-Min Huang, Cheng-Yi Wu:
Effects of Word Assignment in LDA for News Topic Discovery. 374-380 - Chieh-Hsin Liao, Yu-Heng Lei, Kai-Yu Liou, Jian-Shing Lin, Hsiao-Feng Yeh:
Using Big Data for Profiling Heavy Users in Top Video Apps. 381-385 - Chi-Ou Chen, Ye-Qi Zhuo, Chao-Chun Yeh, Che-Min Lin, Shih-Wei Liao:
Machine Learning-Based Configuration Parameter Tuning on Hadoop System. 386-392 - Yen-Hui Liang, Shiow-Yang Wu:
Sequence-Growth: A Scalable and Effective Frequent Itemset Mining Algorithm for Big Data Based on MapReduce Framework. 393-400
Application Track
Applications Session 1: Big Data and Health
- Brian Xu, Sathish Alampalayam Kumar:
Big Data Analytics Framework for System Health Monitoring. 401-408 - Muhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M. Keenan, Diana J. Wilkie, Ashfaq A. Khokhar:
Predictive Modeling for Comfortable Death Outcome Using Electronic Health Records. 409-415
Applications Session 2: Big Data and Network Management
- Hongyan Cui, Yuchen Zhang, Chenhang Ma, Wei Lai, Norman C. Beaulieu, Stanislav Sobolevsky, Yunjie Liu:
Design and Realization of Cognitive Routing Resources Using Big Data Analysis in SDN. 424-429 - MingXue Wang, Robin Grindrod, Jimmy O'Meara, Mikel Zuzuarregui, Eloy Martinez, Enda Fallon:
Enterprise Search with Development for Network Management System. 430-437 - José R. Ortiz-Ubarri, Humberto Ortiz-Zuazaga, Albert Maldonado, Eric Santos, Jhensen Grullon:
Toa: A Web Based Network Flow Data Monitoring System at Scale. 438-443
Applications Session 3: Distributed Processing
- Jeyhun Karimov, A. Murat Ozbayoglu, Erdogan Dogdu:
k-Means Performance Improvements with Centroid Calculation Heuristics Both for Serial and Parallel Environments. 444-451 - Daniel Presser, Lau Cheuk Lung, Miguel Correia:
Greft: Arbitrary Fault-Tolerant Distributed Graph Processing. 452-459
Applications Session 4: Social Network
- Bilal Abu-Salih, Pornpit Wongthongtham, Amin Beheshti, Dengya Zhu:
A Preliminary Approach to Domain-Based Evaluation of Users' Trustworthiness in Online Social Networks. 460-466 - Deepak Puthal, Surya Nepal, Cécile Paris, Rajiv Ranjan, Jinjun Chen:
Efficient Algorithms for Social Network Coverage and Reach. 467-474 - Rohit Parimi, Toma Trepka, Doina Caragea, Cody Bennett:
How to Choose a Recommender System: Insights and Experiences for Large-Scale User Personalization. 475-482
Applications Session 5: Social Media
- Fenno F. Terry Heath III, Richard Hull, Elham Khabiri, Matthew Riemer, Noi Sukaviriya, Roman Vaculín:
Alexandria: Extensible Framework for Rapid Exploration of Social Media. 483-490 - Roberto Saia, Ludovico Boratto, Salvatore Carta:
A Latent Semantic Pattern Recognition Strategy for an Untrivial Targeted Advertising. 491-498
Applications Session 6: Image Processing
- Fatema Rashid, Ali Miri, Isaac Woungang:
Proof of Storage for Video Deduplication in the Cloud. 499-505 - Sridhar Vemula, Christopher Crick:
Hadoop Image Processing Framework. 506-513 - Ranga Raju Vatsavai:
A Scalable Complex Pattern Mining Framework for Global Settlement Mapping. 514-521
Applications Session 7: Big Data Application
- Hyejung Moon, Jangho Park, Sung-Kyung Kim:
Study on Corporate Governance of Stock Market in Korea: Network Analysis with Relationship of Major Shareholders. 522-525 - John Klein, Ian Gorton, Neil A. Ernst, Patrick Donohoe, Kim Pham, Chrisjan Matser:
Application-Specific Evaluation of No SQL Databases. 526-534 - Dennis Wei, Kush R. Varshney, Marcy Wagman:
Optigrow: People Analytics for Job Transfers. 535-542
Applications Session 8: Optimization
- Andrea Acquaviva, Daniele Apiletti, Antonio Attanasio, Elena Baralis, Lorenzo Bottaccioli, Federico Boni Castagnetti, Tania Cerquitelli, Silvia Chiusano, Enrico Macii, Dario Martellacci, Edoardo Patti:
Energy Signature Analysis: Knowledge at Your Fingertips. 543-550 - Kai-Fung Hong, Chien-Chih Chen, Yu-Ting Chiu, Kuo-Sen Chou:
Ctracer: Uncover C&C in Advanced Persistent Threats Based on Scalable Framework for Enterprise Log Data. 551-558 - Unekwu Idachaba, Frank Wang:
A Community-Based Cloud Computing Caching Service. 559-566
Applications Session 9: Evaluation
- Vinod Hegde, Milovan Krnjajic, Alexei Pozdnoukhov:
Unsupervised Event Detection with Infinite Poisson Mixture Model. 567-575 - Apostolos Papageorgiou, Bin Cheng, Ernö Kovacs:
Reconstructability-Aware Filtering and Forwarding of Time Series Data in Internet-of-Things Architectures. 576-583 - João Ricardo Lourenço, Veronika Abramova, Bruno Cabral, Jorge Bernardino, Paulo Carreiro, Marco Vieira:
No SQL in Practice: A Write-Heavy Enterprise Application. 584-591
Applications Session 10: Big Data Framework
- Bin Cheng, Salvatore Longo, Flavio Cirillo, Martin Bauer, Ernö Kovacs:
Building a Big Data Platform for Smart Cities: Experience and Lessons from Santander. 592-599 - Stanislav Sobolevsky, Iva Bojic, Alexander Belyi, Izabela Sitko, Bartosz Hawelka, Juan Murillo Arias, Carlo Ratti:
Scaling of City Attractiveness for Foreign Visitors through Big Data of Human Economical and Social Media Activity. 600-607 - R. Bruce Wallace, Rafik A. Goubran, Frank Knoefel, Shawn Marshall, Michelle Porter, Madelaine Harlow, Akshay Puli:
Automation of the Validation, Anonymization, and Augmentation of Big Data from a Multi-year Driving Study. 608-614
Applications Session 11: Big Data Use Cases
- Vladimir Hahanov, Wajeb Gharibi, Eugenia Litvinova, Svetlana Chumachenko:
Big Data Driven Cyber Analytic System. 615-622 - Chien-An Lai, Jim Donahue, Aibek Musaev, Calton Pu:
Nimbus: Tuning Filters Service on Tweet Streams. 623-630
Short Paper Track
Session 1
- Hoi Ting Poon, Ali Miri:
Computation and Search over Encrypted XML Documents. 631-634 - Mohammed Nazim Feroz, Susan A. Mengel:
Phishing URL Detection Using URL Ranking. 635-638 - Yong-Hong Kuo, Janny M. Y. Leung, Helen M. Meng, Kelvin Kam-fai Tsoi:
A Real-Time Decision Support Tool for Disaster Response: A Mathematical Programming Approach. 639-642
Session 2
- Keren Ouaknine, Michael J. Carey, Scott Kirkpatrick:
The PigMix Benchmark on Pig, MapReduce, and HPCC Systems. 643-648 - Yi Shan, Yi Chen:
Scalable Query Optimization for Efficient Data Processing Using MapReduce. 649-652 - U. S. N. Raju, Irlanki Sandeep, Nattam Sai Karthik, Rayapudi Siva Praveen, Mayank Singh Sachan:
Weighted Finite Automata Based Image Compression on Hadoop MapReduce Framework. 653-656 - Sangwhan Cha, Monica Wachowicz:
Developing a Real-Time Data Analytics Framework Using Hadoop. 657-660 - U. S. N. Raju, Shibin George, V. Sairam Praneeth, Ranjeet Deo, Priyanka Jain:
Content Based Image Retrieval on Hadoop Framework. 661-664
Session 3
- Longzhuang Li, Douglas Boulware:
High-Order Tensor Decomposition for Large-Scale Data Analysis. 665-668 - Johann A. Bengua, Ho N. Phien, Hoang Duong Tuan:
Optimal Feature Extraction and Classification of Tensors via Matrix Product State Decomposition. 669-672 - Verena Kantere, Maxim Filatov:
A Workflow Model for Adaptive Analytics on Big Data. 673-676 - Muhammad Raza Khan, Joshua Manoj, Anikate Singh, Joshua Blumenstock:
Behavioral Modeling for Churn Prediction: Early Indicators and Accurate Predictors of Custom Defection and Loyalty. 677-680 - Mariusz Kamola:
Analytics of Industrial Operational Data Inspired by Natural Language Processing. 681-684 - N. Denizcan Vanli, Huseyin Ozkan, Ibrahim Delibalta, Suleyman Serdar Kozat:
Online Nonlinear Classification for High-Dimensional Data. 685-688
Session 4
- Yun Tian, Bojian Xu, Yanqing Ji, Jesse Scholer:
Cloud Tree: A Library to Extend Cloud Services for Trees. 689-693 - Soo-Hyong Kim, Yoon-Joon Lee, Jaehwan John Lee:
Matrix-Based XML Stream Processing Using a GPU. 694-697 - Kadjo Kouame, Naser Ezzati-Jivan, Michel R. Dagenais:
A Flexible Data-Driven Approach for Execution Trace Filtering. 698-703 - Weijia Xu, Wei Luo, Nicholas Woodward, Yan Zhang:
Supporting Data Driven Access through Automatic Keyword Extraction and Summarization. 704-707 - Ardi Imawan, Titus Irma Damaiyanti, Joonho Kwon:
Road Traffic Analytic Query Processing Based on a Timeline Modeling. 708-711 - Verena Kantere:
Approximate Queries on Big Heterogeneous Data. 712-715
Session 5
- Abdul Wasay, Manos Athanassoulis, Stratos Idreos:
Queriosity: Automated Data Exploration. 716-719 - Feng Yu, Eric S. Jones, Wen-Chi Hou:
Write Optimization Using Asynchronous Update on Out-of-Core Column-Store Databases in Map-Reduce. 720-723 - Hai Nguyen, Matthew S. Weber:
Internet Archives as a Tool for Research: Decay in Large Scale Archival Records. 724-727 - Armel Jacques Nzekon Nzeko'o, Matthieu Latapy, Maurice Tchuenté:
Social Network Analysis of Developers' and Users' Mailing Lists of Some Free Open Source Software. 728-732 - Purva Pruthi, Anu Yadav, Farheen Abbasi, Durga Toshniwal:
How Has Twitter Changed the Event Discussion Scenario? A Spatio-temporal Diffusion Analysis. 733-736
Session 6
- Kelvin Kam-fai Tsoi, Yong-Hong Kuo, Helen M. Meng:
A Data Capturing Platform in the Cloud for Behavioral Analysis among Smokers: An Application Platform for Public Health Research. 737-740 - Minh-Son Dao, Koji Zettsu:
Discovering Environmental Impacts on Public Health Using Heterogeneous Big Sensory Data. 741-744