The first scalable motif-based graph local clustering algorithm for real-world massive graphs
en-GBde-DEes-ESfr-FR

The first scalable motif-based graph local clustering algorithm for real-world massive graphs

21/06/2024 Frontiers Journals

Motif-based graph local clustering is a popular method for graph mining tasks due to its various applications, such as community detection, network optimization and graph learning. However, the traditional two-phase approach of precomputing motif weights before performing graph local clustering loses locality and thus is impractical for real-world massive graphs.

To solve the problems, a research team led by Zhewei WEI published their new research on 15 June 2024 in Frontiers of Computer Science co-published by Higher Education Press and Springer Nature.

The team proposed an index-free and purely local motif-based graph local clustering algorithm with developing effectiveness and efficient Monte-Carlo sampling techniques. The correctness and efficiency of the algorithm are proved with a series of theoretical results and demonstrated by extensive experiments on seven widely used real-world graph benchmarks. Compared with the existing research results, the proposed algorithm is the first one to be applicable and scalable on massive graphs.

In the research, they focused on the scalability challenges for motif-based graph local clustering and attributed the core problem to the motif-based graph weighting process. In order to avoid the time-consuming pre-computing operation to improve the scalability while keep the algorithm correct, they followed the widely used graph local clustering framework Nibble and devised novel triangle-based adaptive rejection sampling technique to outcome the proximity score vector towards the seed node in Monte-Carlo manner. With the score vector, they conduct standard sweep process to get the final clustering result. The practicality, scalability and consistency of the proposed algorithm is demonstrated by extensive experiments on seven massive graphs under both topology-based metric (conductance) and information-based metric (F1-score). Besides, they devise a novel visualization layout to better demonstrate the empirical comparisons by defining the performance distribution and its density. The layout can reveal the characters of graph local clustering algorithms with their visual patterns and being more informative and convincing.

Future work can focus on developing methods to choosing the optimal proximity metrics rather than just the Personalized PageRank (PPR) and generalizing to various motifs or even arbitrary high-order structures rather than just the triangle.

DOI: 10.1007/s11704-023-2768-7

Attached files
  • The framework of the Index-free Triangle-based Graph Local Clustering (TGLC*).
  • The Performance Distribution Profile (PDF) layouts of TGLC* with baselines on two typical graphs.
21/06/2024 Frontiers Journals
Regions: Asia, China
Keywords: Applied science, Computing

Testimonials

For well over a decade, in my capacity as a researcher, broadcaster, and producer, I have relied heavily on Alphagalileo.
All of my work trips have been planned around stories that I've found on this site.
The under embargo section allows us to plan ahead and the news releases enable us to find key experts.
Going through the tailored daily updates is the best way to start the day. It's such a critical service for me and many of my colleagues.
Koula Bouloukos, Senior manager, Editorial & Production Underknown
We have used AlphaGalileo since its foundation but frankly we need it more than ever now to ensure our research news is heard across Europe, Asia and North America. As one of the UK’s leading research universities we want to continue to work with other outstanding researchers in Europe. AlphaGalileo helps us to continue to bring our research story to them and the rest of the world.
Peter Dunn, Director of Press and Media Relations at the University of Warwick
AlphaGalileo has helped us more than double our reach at SciDev.Net. The service has enabled our journalists around the world to reach the mainstream media with articles about the impact of science on people in low- and middle-income countries, leading to big increases in the number of SciDev.Net articles that have been republished.
Ben Deighton, SciDevNet

We Work Closely With...


  • BBC
  • The Times
  • National Geographic
  • The University of Edinburgh
  • University of Cambridge
Copyright 2024 by AlphaGalileo Terms Of Use Privacy Statement