Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell

Zhu, Xiaoshu and Li, Jian and Li, Hong-Dong and Xie, Miao and Wang, Jianxin (2020) Sc-GPE: A Graph Partitioning-Based Cluster Ensemble Method for Single-Cell. Frontiers in Genetics, 11. ISSN 1664-8021


Abstract

Clustering is an efficient way to analyze single-cell RNA sequencing data. It is commonly used to identify cell types, which can help in understanding cell differentiation processes. However, different single-cell clustering methods can produce different, sometimes conflicting, results, which makes it hard for biologists to obtain a reliable clustering and interpret its biological significance. A cluster ensemble strategy can be an effective solution to this problem. Because graph partitioning-based clustering methods perform well on single-cell data, we developed Sc-GPE, a novel cluster ensemble method that combines five single-cell graph partitioning-based clustering methods: SNN-cliq, PhenoGraph, SC3, SSNN-Louvain, and MPGS-Louvain. In Sc-GPE, a consensus matrix is constructed from the five clustering solutions by calculating the probability that each pair of cells is assigned to the same cluster. This addresses a problem of the hypergraph-based ensemble approach: each individual clustering method assigns its own cluster labels, and it is difficult to find the corresponding cluster labels across all methods. Then, to account for the different importance of each method in the ensemble, a weighted consensus matrix is constructed using an importance score strategy. Finally, hierarchical clustering is performed on the weighted consensus matrix to cluster the cells. To evaluate performance, we compared Sc-GPE with the individual clustering methods and the state-of-the-art SAME-clustering on 12 single-cell RNA-seq datasets. The results show that Sc-GPE obtained the best average performance and achieved the highest NMI and ARI values on five datasets.
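
The consensus-matrix idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names `weighted_consensus` and `average_linkage` are hypothetical, the toy label vectors are invented for illustration, and the per-method weights are simply supplied by the caller because the abstract does not describe how the importance score is computed. Average linkage on the distance 1 - consensus is also an assumption; the abstract only says that hierarchical clustering is used.

```python
from itertools import combinations


def weighted_consensus(labelings, weights=None):
    """Weighted fraction of methods that place each cell pair in the same cluster."""
    n_cells = len(labelings[0])
    if weights is None:
        weights = [1.0] * len(labelings)  # assumption: every method equally important
    total = sum(weights)
    matrix = [[0.0] * n_cells for _ in range(n_cells)]
    for labels, weight in zip(labelings, weights):
        for i in range(n_cells):
            for j in range(n_cells):
                # Only co-membership matters, so label names need not match across methods.
                if labels[i] == labels[j]:
                    matrix[i][j] += weight / total
    return matrix


def average_linkage(consensus, n_clusters):
    """Agglomerative clustering on distance 1 - consensus (average linkage is assumed)."""
    clusters = [[i] for i in range(len(consensus))]

    def distance(a, b):
        pairs = [(i, j) for i in a for j in b]
        return sum(1.0 - consensus[i][j] for i, j in pairs) / len(pairs)

    while len(clusters) > n_clusters:
        # Merge the two clusters with the smallest average distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: distance(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] += clusters[b]
        del clusters[b]

    labels = [0] * len(consensus)
    for cluster_id, members in enumerate(clusters):
        for cell in members:
            labels[cell] = cluster_id
    return labels


# Toy input: three hypothetical methods clustering six cells.
# The first two agree on the partition but use swapped label names.
labelings = [
    [0, 0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 1, 1, 2, 2],
]
consensus = weighted_consensus(labelings)
labels = average_linkage(consensus, n_clusters=2)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Because the matrix records only whether two cells share a cluster, the swapped labels in the first two toy solutions cause no difficulty, which is the label-correspondence issue the abstract attributes to hypergraph-based ensembles. For real datasets, a library routine such as SciPy's hierarchical clustering would be a more practical choice than this quadratic pure-Python loop.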

Item Type: Article
Subjects: GO for STM > Medical Science
Depositing User: Unnamed user with email support@goforstm.com
Date Deposited: 16 Feb 2023 10:46
Last Modified: 29 Aug 2023 04:22
URI: http://archive.article4submit.com/id/eprint/149
