Published using Google Docs
Selecting diverse compounds using jklustor
Updated automatically every 5 minutes

Selecting diverse compounds

Examples - Online demo

Examples - Command line

Maximum of minimal dissimilarity selection (MMDS)

Clustering using MMDS

Using sphere exclusion clustering

JKlustor can be used to select diverse compounds for a set by using cluster centroid/representant structures.

Examples - Online demo

Examples - Command line

Maximum of minimal dissimilarity selection (MMDS)

This selection algorithm yields a diverse subset which size (k) is specified. The selection algorithm:

Note that this algorithm typically tends to select the outliers (apart from the first centrum) from the input set.

Clustering using MMDS

A clustering algorithm (accessible with “-c mmds:<k>” in jklustor command line) is defined which used the MMDS algorithm described above:

Using sphere exclusion clustering

Cluster centroids identified by sphere exclusion clustering algorithm can be considered as a diverse subset.. The clustering algorithm currently implemented:

Note that any two centroids have a higher dissimilarity than the given radius. The proper dissimilarity radius depends on the input set and the fingerprint method (CFP/ECFP) used; determining it requires an iterative refinement.

Forum links

Tracker topic:

https://www.chemaxon.com/forum/ftopic8015.html (JKlustor related documents)

References to this documents are in the following topics:

https://www.chemaxon.com/forum/ftopic7912.html (Diverse compound selector)