Constrained Clustering with Weak Label Prior

18/06/2024 Frontiers Journals

Clustering is widely used in data mining. Embedding weak label priors into clustering has been shown to improve its performance. Previous research has mainly focused on a single type of prior. However, in many real scenarios, two kinds of weak label prior information, e.g., pairwise constraints and cluster ratio, are easily obtained or already available. How to incorporate both to improve clustering performance is important but rarely studied.

To address this problem, a research team led by Chenping Hou published new research on 15 June 2024 in Frontiers of Computer Science, co-published by Higher Education Press and Springer Nature.

The team proposed constrained Clustering with Weak Label Prior (CWLP), which considers compound weak label priors in an integrated framework. Within a unified spectral clustering model, the pairwise constraints are employed as a regularizer in spectral embedding, and the label proportion (the cluster ratio) is added as a constraint in spectral rotation. In addition to theoretical convergence and computational complexity analyses, the experimental evaluation illustrates the superiority of the proposed approach.
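The idea of using pairwise constraints as a regularizer in spectral embedding can be illustrated with a minimal sketch. This is not the CWLP formulation itself, which the release does not spell out. It assumes a common textbook form in which must-link pairs are rewarded and cannot-link pairs are penalised through a signed constraint matrix Q. The function name, the weight `lam` and the matrix Q are all hypothetical.

```python
import numpy as np

def constrained_spectral_embedding(W, must_link, cannot_link, k, lam=1.0):
    """Return an n x k embedding from a symmetric similarity matrix W.

    Assumed objective (illustrative only, not taken from the paper):
        minimise  Tr(F^T L F) - lam * Tr(F^T Q F)   subject to  F^T F = I
    where L is the graph Laplacian of W and Q encodes the constraints.
    """
    n = W.shape[0]
    # Signed constraint matrix: +1 for must-link pairs, -1 for cannot-link pairs.
    Q = np.zeros((n, n))
    for i, j in must_link:
        Q[i, j] = Q[j, i] = 1.0
    for i, j in cannot_link:
        Q[i, j] = Q[j, i] = -1.0
    # Unnormalised graph Laplacian L = D - W.
    L = np.diag(W.sum(axis=1)) - W
    # The constraint term acts as a regulariser on the usual spectral objective.
    M = L - lam * Q
    # eigh returns eigenvalues in ascending order; the k smallest
    # eigenvectors minimise the trace objective under F^T F = I.
    _, vecs = np.linalg.eigh(M)
    return vecs[:, :k]
```

A larger `lam` makes the embedding follow the constraints more closely, while `lam=0` gives the plain unnormalised spectral embedding.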

In the research, both pairwise constraint information and cluster ratio information help to improve the confidence of the clustering result. The team therefore establishes a unified model that integrates the two simultaneously, which can effectively improve clustering performance.

Specifically, the pairwise constraint information is used as a regularization term in the spectral clustering model. The cluster ratio is added as a constraint on the indicator matrix. To approximate a variant of the embedding matrix more precisely, the cluster indicator matrix is replaced with a scaled cluster indicator matrix. Instead of fixing an initial similarity matrix in the integrated model, the method learns a new similarity matrix that is more suitable for deriving the final clustering results. These ideas help to reduce information loss and obtain a globally optimized clustering result. Extensive experiments on ten benchmark data sets validate the effectiveness of the proposed method for constrained clustering with weak label prior.
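The two ingredients named above, a cluster-ratio constraint on the indicator matrix and a scaled cluster indicator matrix, can be sketched as follows. This is only an illustration under stated assumptions. The greedy assignment is a simple stand-in for the constrained spectral rotation step, not the authors' optimisation. The scaled indicator uses the standard form Y(Y^T Y)^(-1/2), which may differ from the paper's exact definition. All function names are hypothetical.

```python
import numpy as np

def target_sizes(ratios, n):
    """Turn cluster ratios (assumed to sum to 1) into integer sizes summing to n."""
    raw = np.asarray(ratios, dtype=float) * n
    sizes = np.floor(raw).astype(int)
    # Give leftover points to the clusters with the largest fractional part.
    for j in np.argsort(raw - sizes)[::-1][: n - sizes.sum()]:
        sizes[j] += 1
    return sizes

def assign_with_ratio(scores, ratios):
    """Greedy hard assignment that respects the prescribed cluster sizes.

    scores is an n x c matrix (larger means point i fits cluster j better).
    """
    n, c = scores.shape
    capacity = target_sizes(ratios, n)
    labels = np.full(n, -1)
    # Visit (point, cluster) pairs from the highest score downwards.
    for flat in np.argsort(scores, axis=None)[::-1]:
        i, j = divmod(int(flat), c)
        if labels[i] == -1 and capacity[j] > 0:
            labels[i] = j
            capacity[j] -= 1
    return labels

def scaled_indicator(labels, c):
    """Scaled indicator G = Y (Y^T Y)^(-1/2); needs every cluster non-empty."""
    Y = np.eye(c)[labels]           # n x c one-hot indicator matrix
    sizes = Y.sum(axis=0)           # diagonal of Y^T Y
    return Y / np.sqrt(sizes)       # columns become orthonormal

# Every point prefers cluster 0, but the ratio prior forces a 50/50 split.
scores = np.array([[0.9, 0.1], [0.8, 0.2], [0.7, 0.3], [0.6, 0.4]])
labels = assign_with_ratio(scores, [0.5, 0.5])
G = scaled_indicator(labels, 2)
```

Because the columns of the scaled indicator are orthonormal, it satisfies the same orthogonality condition as a spectral embedding, which is one reason it is a closer discrete counterpart than the raw 0/1 indicator.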

In future work, the team notes that methods for reducing the computational complexity of the proposed method are worth studying, so that it runs more efficiently and can be applied to large-scale datasets.

DOI: 10.1007/s11704-023-3355-7

Research Article, Published: 15 June 2024
Jing ZHANG, Ruidong FAN, Hong TAO, Jiacheng JIANG, Chenping HOU. Constrained clustering with weak label prior. Front. Comput. Sci., 2024, 18(3): 183338, https://doi.org/10.1007/s11704-023-3355-7
Attached file: Compound weak label prior plays an essential role in increasing clustering confidence.
Regions: Asia, China
Keywords: Applied science, Computing
