Fifth International Workshop on

High Performance Big Graph Data

Management, Analysis, and Mining

December 10, 2018

To be held in conjunction with the

2018 IEEE International Conference on Big Data (IEEE BigData 2018)
The Westin Seattle, Seattle, WA, USA

Workshop Description

Modern Big Data increasingly appears in the form of complex graphs and networks. Examples include the physical Internet, the world wide web, online social networks, phone networks, and biological networks. In addition to their massive sizes, these graphs are dynamic, noisy, and sometimes transient. They also conform to all five Vs (Volume, Velocity, Variety, Value and Veracity) that define Big Data. However, many graph-related problems are computationally difficult, and thus big graph data brings unique challenges, as well as numerous opportunities for researchers, to solve various problems that are significant to our communities.

Big graph problems are currently solved using several complementary paradigms. The most popular approach is perhaps by exploiting parallelism, through specialized algorithms for supercomputers, shared-memory multicore and manycore systems, and heterogeneous CPU-GPU systems. However, since real-world graphs are sparse and highly irregular, there are very few parallel implementations that can actually deliver high performance. The major challenges to scaling and efficiency include irregular data dependencies, poor locality, and high synchronization costs of current approaches. In addition to parallelism, researchers are developing approximation algorithms that use sampling for compressing and summarizing graph data. Streaming algorithms are also being considered for scenarios where the rate of updates is too fast to process the entire graph in a single pass. Further, out-of-core algorithms are necessary for massive graphs that do not fit in the main memory of a typical system. Researchers can use graph-based solutions for solving problems from many diverse disciplines, including routing and transportation, social networks, bioinformatics, computational science, health care, security and intelligence analysis.

This workshop aims to bring together researchers from different paradigms solving big graph problems under a unified platform for sharing their work and exchanging ideas. We are soliciting novel and original research contributions related to big graph data management, analysis, and mining (algorithms, software systems, applications, best practices, performance). Significant work-in-progress papers are also encouraged. Papers can be from any of the following areas, including but not limited to:

Regular paper submissions must be at most 10 pages long, including all figures, tables, and references. They must be formatted according to the style files used by the IEEE BigData 2018 conference proceedings. Additionally, we encourage short paper submissions (at most 6 pages) describing new work in progress.

Past Workshops

BigGraphs 2014
BigGraphs 2015
BigGraphs 2016
BigGraphs 2017

Important Dates

Keynote

Jennifer Neville
Jennifer Neville
Purdue University

Talk title: Deep Learning for Relational Networks

Abstract: Although deep learning methods have been successfully applied in structured domains comprised of images and natural language, it is difficult to apply the methods directly to graph and network domains due to issues of heterogeneity and long range dependence. In this talk, I will discuss some of our recent work developing neural network methods for complex network domains, including node classification, knowledge graph embeddings, and motif prediction. The key insights include incorporating network dependencies in both the input features and the model architectures, using randomization to transform heterogeneous sets into sequences, and using network-aware data augmentation to offset sparsity. Our experiments on real work social network data shows that our methods produces significant gains compared to other state-of-the-art methods.

Bio: Jennifer Neville is the Miller Family Chair Associate Professor of Computer Science and Statistics at Purdue University. She received her PhD from the University of Massachusetts Amherst in 2006. She is currently an elected member of the AAAI Executive Council. She was PC chair of the 9th ACM International Conference on Web Search and Data in 2016 and is PC chair for the SIAM International Conference on Data Mining in 2019. In 2012, she was awarded an NSF Career Award, in 2008 she was chosen by IEEE as one of "AI's 10 to watch", and in 2007 was selected as a member of the DARPA Computer Science Study Group. Her work, which includes over 100 peer-reviewed publications with more than 5000 citations, focuses on developing data mining and machine learning techniques for complex relational and network domains, including social, information, and physical networks.

Keynote talk slides

Workshop Program

December 10, 2018
8 am -- 12.15 pm

Location: Blakely, third floor, The Westin Seattle, Seattle, WA, USA

20-minute oral presentation of papers (17-minute talk and 3 minutes for Q&A)

8.05 am Opening remarks
Shaikh Arifuzzaman and Nesreen Ahmed

8:15 am Scalability Analysis of Cluster-based Betweenness Computation in Large Weighted Graphs
Andrea Castiello, Gianmarco Fucci, Angelo Furno, and Eugenio Zimeo

8:35 am PatBinQL, a compact, inference-enabled query language dedicated for RDF stream processing
Jeremy Lhez, Badre Belabbess, and Olivier Cure

8:55 am Single-Source Shortest Path Tree For Big Dynamic Graphs
Sara Riazi, Sriram Srinivasan, Sajal K. Das, Sanjukta Bhowmick, and Boyana Norris

9:15 am RECUT: RE-Compressing partially Unordered Trees
Stefan Bottcher and Rita Hartel

9:35 am Mobility Optimization on Cyber Physical System via Multiple Object Tracking and Mathematical Programming
Nozomi Hata, Takashi Nakayama, Akira Tanaka, Takashi Wakamatsu, Akihiro Yoshida, Nariaki Tateiwa, Yuri Nishikawa, Jun Ozawa, and Katsuki Fujisawa

10:00 am Coffee Break

10:20 am Keynote by Prof. Jennifer Neville

11:10 am BiasedWalk: Biased Sampling for Representation Learning on Graphs
Duong Nguyen, Fragkiskos Malliaros

11:30 am Social-Based Classification of Multiple Interactions in Dynamic Attributed Networks
Thiago H. P. Silva, Pedro O. S. Vaz de Melo, Alberto H. F. Laender

11:50 pm Automatic Hierarchical Clustering of Static Call Graphs for Program Comprehension
Gharib Gharibi, Rakan Alanazi, and Yugyung Lee

12:10 pm Closing Remarks
Nesreen Ahmed and Shaikh Arifuzzaman

Program Committee

Mansurul Bhuiyan, Walmart Labs
Jose Cadena, Lawrence Livermore National Laboratory
Vachik Dave, Indiana University Purdue University Indianapolis
Tim La Fond, Lawrence Livermore National Laboratory
Ananda Mondal, Florida International University
Ryan Rossi, Adobe Research
Shaden Smith, Intel Labs
Tianxing Wu, Nanyang Technological University
Baichuan Zhang, Facebook

Workshop Organizers

Nesreen Ahmed
Nesreen Ahmed
Intel Labs
Santa Clara, CA 95054

Mohammad Al Hasan
Mohammad Al Hasan
Department of Computer and Information Science
Indiana University - Purdue University
Indianapolis, IN 46202

Shaikh Arifuzzaman
Shaikh Arifuzzaman
Department of Computer Science
The University of New Orleans
New Orleans, LA 70148

Kamesh Madduri
Kamesh Madduri
Department of Computer Science and Engineering
The Pennsylvania State University
University Park, PA 16802

Contact

Please send email to one of the workshop organizers.