Core products
Wherobots has the following core products, which when used together provide a comprehensive platform for spatial data processing, analytics, and AI:WherobotsDB
Spatial analytics engineCloud-native, serverless engine for querying and processing geospatial data. Supports Spatial SQL, Python, and Scala with 300+ vector and raster functions. Apache Sedona-compatible.
- Process vector and raster data at planetary scale
- Up to 20x faster than alternatives
- Open formats: Parquet, GeoParquet, GeoTIFF
GeoStats & Map Matching
Spatial algorithmsDetect spatial patterns with statistical algorithms and match GPS traces to road networks.
- GeoStats: DBSCAN, Getis-Ord Gi*, Local Outlier Factor
- Map Matching: GPS-to-road-network alignment
RasterFlow
Raster processing & inference engine Private PreviewEnd-to-end pipeline for large-scale raster workflows. Build mosaics from satellite imagery, run ML model inference, and vectorize results.
- Pre-configured models for land cover, roads, canopy height
- Bring your own data and models
- Simple, high-level API
Havasu
Spatial table formatBuilt on Apache Iceberg, Havasu adds native support for geometry and raster data types. ACID transactions, schema evolution, time travel, and partition pruning for spatial workloads.
- Geometry and raster column types
- Spatial filter push-down
- Iceberg-compatible tooling
WherobotsDB in depth
WherobotsDB is a cloud-native, serverless analytics engine optimized for geospatial workloads.Key capabilities
Key capabilities
- Modern, performant, and affordable: Run small to planetary-scale geospatial queries up to 20x faster, at a fraction of the cost. Only pay for what you use.
- Unified vector and raster: Derive insights through spatial relationships between vector and raster data types in a single engine.
- Apache Sedona compatible: Lift-and-shift Sedona workloads with confidence.
- Popular languages: Spatial SQL, Python, and Scala. No esoteric tools required.
- No vendor lock-in: Open formats (Parquet, GeoParquet), Apache Iceberg, and your own cloud storage.
- Feature complete: 300+ vector and raster functions, notebooks for Exploratory Data Analysis (EDA), production-ready job runs, and a geospatial dataset catalog.
- Serverless and secure: No infrastructure management. 99.5% runtime SLA with isolation by default for Enterprise Edition Organizations.
Common use cases
Common use cases
- Automating spatial data processing: Schedule and automate ETL with Job Runs. Orchestrate pipelines with the Wherobots Airflow Provider.
- Modernizing Spark + Sedona workloads: Eliminate cluster management, performance tuning, and indexing overhead. Lift-and-shift existing Sedona-on-Spark workloads.
- Extracting insights from aerial imagery: RasterFlow integrates with WherobotsDB for end-to-end raster processing, ML inference, and analysis workflows.
Getting started
Getting started
The fastest way to start is to run an example notebook:
- Create a free Community Edition account.
- Start a runtime and open
examples/Getting_Started/Part_1_Loading_Data.ipynb.
WherobotsRunOperator to run them as Job Runs from Airflow.GeoStats and Map Matching in depth
GeoStats and Map Matching make it easy to extract insights from geospatial data without needing to be a geospatial or infrastructure expert.GeoStats
GeoStats
Distributed ML clustering algorithms for detecting hotspots, density patterns, and local outliers in vector data.Use cases: pedestrian activity hotspots, public health outbreak detection, strategic retail placement.Algorithms: DBSCAN, Getis-Ord Gi*, Local Outlier Factor.GeoStats is available in all Organization Editions, including Community (free).
Map Matching
Map Matching
Offline distributed map matching — precisely align GPS or location-tracking coordinates to digital road networks.Use cases: traffic pattern analysis, optimized routing.Map Matching is available in all Organization Editions. Community Edition may experience latency with large datasets.
Edition availability
Edition availability
| Capability | Community | Professional | Enterprise |
|---|---|---|---|
| GeoStats (DBSCAN, LOF, Gi*) | Yes | Yes | Yes |
| Map Matching | Yes | Yes | Yes |
| RasterFlow | No | Yes Private Preview | Yes Private Preview |
| Capability | Notebook Path |
|---|---|
| DBSCAN | examples/Analyzing_Data/Clustering_DBSCAN.ipynb |
| Local Outlier Factor | examples/Analyzing_Data/Local_Outlier_Factor.ipynb |
| Getis-Ord Gi* | examples/Analyzing_Data/Getis_Ord_Gi*.ipynb |
| Map Matching | examples/Analyzing_Data/GPS_Map_Matching.ipynb |
How the products work together
Wherobots products can be used independently but are most effective when combined. Here are some common patterns for how customers use them together to build spatial data products:Spatial ETL Pipeline
Spatial ETL Pipeline
WherobotsDB + Havasu + AirflowIngest raw spatial data from S3 or public sources, transform with Spatial SQL, and store in Havasu tables for efficient querying. Automate your ETL with the Wherobots Airflow Provider.Typical flow: Data Source → WherobotsDB (transform) → Havasu (store) → Downstream consumers
Satellite Imagery Analysis
Satellite Imagery Analysis
RasterFlow + WherobotsDBUse RasterFlow to build mosaics from satellite imagery, run ML inference (classification, segmentation, object detection) with pre-configured or custom models, and vectorize the results. Then analyze and enrich the output with WherobotsDB.Typical flow: Satellite imagery → RasterFlow (mosaic + infer + vectorize) → WherobotsDB (analyze vectors) → Visualization
Location Intelligence
Location Intelligence
WherobotsDB + GeoStatsLoad point-of-interest or movement data, enrich with spatial joins, and detect clusters, hotspots, or anomalies using GeoStats algorithms.Typical flow: Vector data → WherobotsDB (join + enrich) → GeoStats (cluster) → Insights
Traffic & Routing Analysis
Traffic & Routing Analysis
WherobotsDB + Map MatchingLoad GPS traces, match them to road network geometries, and analyze traffic patterns, congestion, or optimal routes.Typical flow: GPS data + road network → Map Matching → WherobotsDB (analyze) → Routing decisions
Key security properties
Wherobots is designed with security as a top priority. Wherobots has the SOC 2 Type 2 attestation. For the full details, see the Security Guide and Wherobots Trust Center.Workload types
Wherobots has the following workload types, each optimized for different use cases:Notebooks
Interactive Jupyter notebooks for exploratory data analysis, prototyping, and visualization
Job Runs
Scheduled or ad-hoc batch jobs for production ETL, analytics, and Machine Learning workloads
SQL Sessions
Submit Spatial SQL queries programmatically to run ad-hoc analyses or power applications with a spatial SQL backend
MCP Server
Natural language interface for exploring spatial data in VS Code Copilot. MCP Server query executions are logged as SQL Sessions.
Data connectivity
Wherobots connects to your data wherever it lives.Managed Storage
Built-in storage included with every organization
Amazon S3
Bring your own S3 buckets with cross-account IAM roles
Unity Catalog
Access Databricks Unity Catalog Delta tables directly
Supported languages & formats
| Category | Supported |
|---|---|
| Languages | Spatial SQL, Python, Scala |
| Vector formats | Parquet, GeoParquet, GeoJSON, Shapefile, GeoPackage, CSV |
| Raster formats | GeoTIFF, COG, NetCDF, Zarr |
| Table formats | Apache Iceberg (Havasu), Delta Lake (via Unity Catalog) |
| Tile formats | PMTiles (vector tiles) |
| APIs | STAC, REST |
Organization editions
Wherobots offers three tiers to match your needs:| Feature | Community (Free) | Professional | Enterprise |
|---|---|---|---|
| WherobotsDB | Yes | Yes | Yes |
| GeoStats | Yes | Yes | Yes |
| RasterFlow | - | Yes Private Preview | Yes Private Preview |
| GPU runtimes | - | Yes | Yes |
| SSO / SAML | - | - | Yes |
| Dedicated support | - | - | Yes |

