#### Seventh International Workshop on ## High Performance Big Graph Data ## Management, Analysis, and Mining **December 12, 2020** To be held in conjunction with the [2020 IEEE International Conference on Big Data **(IEEE BigData 2020)**](http://bigdataieee.org/BigData2020/) ### Workshop Description Modern Big Data increasingly appears in the form of complex graphs and networks. Examples include the physical Internet, the world wide web, online social networks, phone networks, and biological networks. In addition to their massive sizes, these graphs are dynamic, noisy, and sometimes transient. They also conform to all five Vs (Volume, Velocity, Variety, Value and Veracity) that define Big Data. However, many graph-related problems are computationally difficult, and thus big graph data brings unique challenges, as well as numerous opportunities for researchers, to solve various problems that are significant to our communities. Big graph problems are currently solved using several complementary paradigms. The most popular approach is perhaps by exploiting parallelism, through specialized algorithms for supercomputers, shared-memory multicore and manycore systems, and heterogeneous CPU-GPU systems. However, since real-world graphs are sparse and highly irregular, there are very few parallel implementations that can actually deliver high performance. The major challenges to scaling and efficiency include irregular data dependencies, poor locality, and high synchronization costs of current approaches. In addition to parallelism, researchers are developing approximation algorithms that use sampling for compressing and summarizing graph data. Streaming algorithms are also being considered for scenarios where the rate of updates is too fast to process the entire graph in a single pass. Further, out-of-core algorithms are necessary for massive graphs that do not fit in the main memory of a typical system. Researchers can use graph-based solutions for solving problems from many diverse disciplines, including routing and transportation, social networks, bioinformatics, computational science, health care, security and intelligence analysis. This workshop aims to bring together researchers from different paradigms solving big graph problems under a unified platform for sharing their work and exchanging ideas. We are soliciting novel and original research contributions related to big graph data management, analysis, and mining (algorithms, software systems, applications, best practices, performance). Significant work-in-progress papers are also encouraged. Papers can be from any of the following areas, including but not limited to: * Graph machine learning, graph embeddings, graph neural networks * Representation learning for graph data * Deep Learning-based models for learning on graph data * Extreme-scale computing for large tensor, network, and graph problems * Parallel algorithms for big graph analysis on HPC systems * Heterogeneous CPU-GPU solutions to solve big graph problems * Sampling and summarization of large graphs * Graph algorithms for large-scale scientific computing problems * Graph clustering, partitioning, and classification methods * Scalable graph topology measurement: diameter approximation, eigenvalues, triangle and graphlet counting * Parallel algorithms for computing graph kernels * Inference on large graph data * Graph evolution and dynamic graph models * Graph streams * Computational methods for visualization of large-scale graphs * Graph databases, novel querying and indexing strategies for RDF data * Novel applications of big graph problems in bioinformatics, health care, security, and social networks * New software systems and runtime systems for big graph data mining **Regular paper** submissions must be at most 10 pages long, including all figures, tables, and references. They must be formatted according to the paper submission formatting guidelines provided in the [IEEE BigData 2020 Call for Papers](http://bigdataieee.org/BigData2020/CallPapers.html). Additionally, we encourage **short paper** submissions (at most 6 pages) describing new work in progress. ### Past Workshops [BigGraphs 2014](workshop2014.html) [BigGraphs 2015](workshop2015.html) [BigGraphs 2016](workshop2016.html) [BigGraphs 2017](workshop2017.html) [BigGraphs 2018](workshop2018.html) [BigGraphs 2019](workshop2019.html) ### Important Dates * Oct 8, 2020 (11.59 pm Anywhere on Earth time): [Submission](https://wi-lab.com/cyberchair/2020/bigdata20/index.php) deadline * Nov 5, 2020: Notification of paper acceptance to authors * Nov 15, 2020: Camera-ready submissions due * Dec 12, 2020: Workshop to be held virtually ### Keynote **[Madhav Marathe](https://engineering.virginia.edu/faculty/madhav-marathe)** ![Madhav Marathe](marathe.jpg "Madhav Marathe") Networks, Simulation Science and Advanced Computing Biocomplexity Institute and Initiative & Dept. of Computer Science University of Virginia *Talk title*: **Computational aspects of real-world co-evolving networks** *Abstract*: Developing practical informatics tools and decision support environments for reasoning about real-world social habitats often represented as networks is complicated and scientifically challenging due to their size, co-evolutionary nature and the need for representing multiple dynamical processes simultaneously. The COVID-19 pandemic, 2014 Ebola epidemic, global migration, societal impacts of natural and human initiated disasters and the effect of climate change provide examples of the many challenges faced when developing such environments. Recent quantitative changes in computing and information sciences have created new opportunities to create innovative tools and technologies to support these applications. The talk will focus on science and engineering of socially coupled networks with the aim of developing scalable and practical decision support systems. I will draw on our work in urban transport planning, national security and public health epidemiology to guide the discussion. Understanding these systems is of interest but they serve as excellent "model organisms" for developing data driven computational foundations of co-evolving complex networks. I will discuss this possibility in my concluding remarks. ### Workshop Program December 12, 2020 9:00 am -- 2:30 pm U.S. Eastern Standard Time Location: Virtual, to be announced by conference organizers. 25-minute presentations (20-minute talk video will be broadcast and 5 minutes for live Q&A) 9:00 am     Opening remarks Workshop organizers 9:05 am     Approximately and Efficiently Estimating Dynamic Point-to-Point Shortest Path Alok Tripathy and Oded Green 9:30 am     Agent-Navigable Dynamic Graph Construction and Visualization over Distributed Memory Justin Gilroy, Satine Paronyan, Jonathan Acoltzi, and Munehiro Fukuda 9:55 am     Graph Filtering to Remove the "Middle Ground" for Anomaly Detection William Eberle and Lawrence Holder 10:25 am     Computational aspects of real-world co-evolving networks (keynote) Madhav Marathe 11:20 am     Massively Parallel Processing Database for Sequence and Graph Data Structures Applied to Rapid-Response Drug Repurposing Chris Rickett, Kristyn Maschoff, and Sreenivas Sukumar 11:45 am     Retrieving Entities from Knowledge Graphs without Knowing Much: On Learning Generalizable Patterns and Indexing Yuting Xie and Tingjian Ge 12:10 pm     Graph Force Learning Ke Sun, Jiaying Liu, Shuo Yu, Bo Xu, and Feng Xia 12:35 pm     Memory Efficient Graph Convolutional Network based Distributed Link Prediction Damitha Senevirathne, Isuru Wijesiri, Suchitha Dehigaspitiya, Miyuru Dayarathna, Sanath Jayasena, and Toyotaro Suzumura 1:00 pm     Break 1:10 pm     Understanding Coarsening for Embedding Large-Scale Graphs Taha Atahan Akyıldız, Amro Alabsi Alundi, and Kamer Kaya 1:35 pm     Learning Embeddings of Directed Networks with Text-Associated Nodes---with Application in Software Package Dependency Networks Kexuan Sun, Shudan Zhong, and Hong Xu 2:00 pm     Relational Graph Embeddings for Table Retrieval Mohamed Trabelsi, Zhiyu Chen, Brian D. Davison, and Jeff Heflin 2:25 pm     Closing remarks Workshop organizers ### Workshop Organizers [Nesreen Ahmed](http://nesreenahmed.com/) ![Nesreen Ahmed](ahmed.jpg "Nesreen Ahmed") Intel Labs Santa Clara, CA 95054 [Mohammad Al Hasan](http://cs.iupui.edu/~alhasan/) ![Mohammad Al Hasan](hasan.jpg "Mohammad Al Hasan") Department of Computer and Information Science Indiana University - Purdue University Indianapolis, IN 46202 [Shaikh Arifuzzaman](https://www.cs.uno.edu/~arif/) ![Shaikh Arifuzzaman](arif.jpg "Shaikh Arifuzzaman") Department of Computer Science The University of New Orleans New Orleans, LA 70148 [Kamesh Madduri](https://madduri.org/) ![Kamesh Madduri](madduri.jpg "Kamesh Madduri") Department of Computer Science and Engineering The Pennsylvania State University University Park, PA 16802 ### Contact Please send email to one of the workshop organizers.