epsilon- The farthest apart two points can be while still being considered connected or related.epsilonmust be a positive double float.minPoints- The minimum number of neighbor points (as determined by epsilon). A point needsminPointsneighbors to be considered a core point.minPointsmust be a positive integer.
Example overview
In this example, we will generate some random data and use DBSCAN to cluster that data. Then, we’ll visualize the clusters using a scatter plot. This demo is derived from the scikit-learn DBSCAN demo.Installation Required
Install the required package:scikit-learnDefine Sedona Context
Data Generation
In the following code section, we’ll generate some data using sklearn’smake_blobs function. We’ve set the data to consist of 750 points with 3 clusters. After clustering the data, we’ll visualize it in pyplot.
Clustering
In the following section, we’ll use the DBSCAN implementation in Wherobots to cluster the data in a dataframe, settingepsilon to 0.3 and minPoints to 10.
Wherobots’ DBSCAN returns outliers by default.

