Description
The Clustering Benchmark Datasets contain 3 synthetic datasets conforming a structured domain with intuitively separable clusters in different forms as shown below.
Blobs Dataset
![notion image](https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F5c2acab6-92b0-4589-b753-1f7f6bc541e0%2FUntitled.png?table=block&id=7acb0555-6361-4e31-920e-ac6ca243b6a0&cache=v2)
Circles Dataset
![notion image](https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F5ffce280-479b-4a25-a602-cf74cd5e20c3%2FUntitled.png?table=block&id=3103e465-7bd2-43db-b40f-0919c9f5171e&cache=v2)
Moons Dataset
![notion image](https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F4de45d60-eb14-4765-a126-12aa4286879c%2FUntitled.png?table=block&id=3642388f-bfd6-4a7c-b9cb-ec987cef3982&cache=v2)
All three datasets include 500 data points in 2-dimensional space.
This dataset is often used for experimenting with clustering techniques and exploring their performance visually.
Data Location
Storage → Samples → segmentation_blobs.csv
Storage → Samples → segmentation_circles.csv
Storage → Samples → segmentation_moons.csv
Data Description