pegasus.spectral_leiden¶

pegasus.spectral_leiden(data, rep='pca', resolution=1.3, rep_kmeans='diffmap', n_clusters=30, n_clusters2=50, n_init=10, n_jobs=- 1, random_state=0, class_label='spectral_leiden_labels')[source]¶

Cluster the data using Spectral Leiden algorithm. [Li20]

Parameters

data (pegasusio.MultimodalData) – Annotated data matrix with rows for cells and columns for genes.
rep (str, optional, default: "pca") – The embedding representation used for clustering. Keyword 'X_' + rep must exist in data.obsm. By default, use PCA coordinates.
resolution (int, optional, default: 1.3) – Resolution factor. Higher resolution tends to find more clusters.
rep_kmeans (str, optional, default: "diffmap") – The embedding representation on which the KMeans runs. Keyword must exist in data.obsm. By default, use Diffusion Map coordinates. If diffmap is not calculated, use PCA coordinates instead.
n_clusters (int, optional, default: 30) – The number of first level clusters.
n_clusters2 (int, optional, default: 50) – The number of second level clusters.
n_init (int, optional, default: 10) – Number of kmeans tries for the first level clustering. Default is set to be the same as scikit-learn Kmeans function.
n_jobs (int, optional (default: -1)) – Number of threads to use for the KMeans step. -1 refers to using all physical CPU cores.
random_state (int, optional, default: 0) – Random seed for reproducing results.
class_label (str, optional, default: "spectral_leiden_labels") – Key name for storing cluster labels in data.obs.

Return type

None

Returns

None
Update data.obs –
- data.obs[class_label]: Cluster labels for cells as categorical data.

Examples

>>> pg.spectral_leiden(data)