This activity performs hierarchical clustering of rows and/or columns and produces a visual heat map representation of the clustered matrix. Clustering can be performed with a choice of distances and linkages. This activity can also be used just to display your data in a heat map without performing clustering by deselecting row and column clustering.
If checked rows will be clustered and a tree (dendrogram) is generated (default: checked).
Selected distance that will be used for the clustering process (default: Euclidean). The distance can be selected from a predefined list:
Selected clustering method that will be applied (default: Average). It can be selected from a predefined list:
Selected constraint that should be preserved from the input data (default: None). The used constraint can be selected from a predefined list of constraints:
Specifies, whether the data should be preprocessed using k-means before applying clustering and generating a heatmap (default: checked).
This parameter is just relevant, if the parameter “Preprocess with k-means ” is checked. Defines the number of clusters that will be created by the k-means algorithm (default: 300).
If checked columns will be clustered and a tree (dendrogram) is generated (default: checked).
Selected distance that will be used for the clustering process (default: Euclidean). The distance can be selected from a predefined list:
Selected clustering method that will be applied (default: Average). It can be selected from a predefined list:
Selected constraint that should be preserved from the input data (default: None). The used constraint can be selected from a predefined list of constraints:
Specifies, whether the data should be preprocessed using k-means before applying clustering and generating a hetamap (default: checked).
This parameter is just relevant, if the parameter “Preprocess with k-means” is checked. Defines the number of clusters that will be created by the k-means algorithm (default: 300).
List of all expression/numerical columns in the data set (default: all numerical columns; the expression columns are selected see parameter “Use for clustering”).
Selected expression/numerical columns that should be used for the clustering (default: all expression columns are selected).
Selected expression/numerical columns that should be displayed in the output heat map, but are not used for the clustering (default: empty).