r - Where can I find a good set of benchmark clustering datasets with ground truth labels? -


i looking clustering dataset "ground truth" labels known natural clustering, preferably high dimensionality.

i found candidates here (http://cs.joensuu.fi/sipu/datasets/), glass , iris data-sets have labels points. found code generate gaussian datasets (syndeca). main reason want compare distance metrics clustering methods. it's difficult use external (extrinsic) evaluation criteria many of biased towards euclidean distances; , there many choose from.

thanks!

there many data sets @ uci machine learning repository.


Comments

Popular posts from this blog

user interface - How to replace the Python logo in a Tkinter-based Python GUI app? -

objective c - Greedy NSProgressIndicator Allocation -

how to set an OCR language in Google Drive -