Curated Datasets
The Data Hub section of Wherobots Cloud centralizes datasets from various sources to enable geospatial analysis and insights.
These datasets are cleaned and transformed into Havasu format for fast and efficient analytics with WherobotsDB in Wherobots Cloud.
Many of these datasets are available for free as a part of a Wherobots Community Organization within Wherobots Cloud. A subset of datasets reserved for Professional or Enterprise Edition Organizations.
Curated Datasets¶
Wherobots offers the following kind of built-in datasets:
wherobots_open_data
: Datasets available to all Organization Editions (Community, Professional, and Enterprise).wherobots_pro_data
: Datasets restricted to Paid Organization Editions (Professional and Enterprise).
Dataset Name | Availability | Type | Count | Description |
---|---|---|---|---|
Overture Maps buildings/building | All Organization Editions | Polygon | 785 million | Any human-made structures with roofs or interior spaces |
Overture Maps places/place | All Organization Editions | Point | 59 million | Any business or point of interest within the world |
Overture Maps admins/administrativeBoundary | All Organization Editions | LineString | 96 thousand | Any officially defined border between two Administrative Localities |
Overture Maps admins/locality | All Organization Editions | Point | 2948 | Countries and hierarchical subdivisions of countries |
Overture Maps transportation/connector | All Organization Editions | Point | 330 million | Points of physical connection between two or more segments |
Overture Maps transportation/segment | All Organization Editions | LineString | 294 million | Center-line of a path which may be traveled |
Overture Places with Isochrones |
Professional or Enterprise Edition |
Point | 53.2 million precomputed travel isochrones | Wherobots added precomputed 5, 10, 15, and 20-min outbound driving isochrones for all 13.3M US locations in the Overture Places dataset. |
Google & Microsoft open buildings | Professional or Enterprise Edition |
Polygon | 2.5 billion | Google & Microsoft Open Buildings, combined by VIDA |
LandSAT surface temperature | Professional or Enterprise Edition |
Raster (GeoTiff) | 166K images, 10 TB size | The temperature of the Earth's surface in Kelvin, from Aug 2023 to Oct 2023 |
US Census ZCTA codes | Professional or Enterprise Edition |
Polygon | 33144 | ZIP Code Tabulation Areas defined in 2018 |
NYC TLC taxi trip records | Professional or Enterprise Edition |
Point | 200 million | NYC TLC taxi trip pickup and dropoff records per trip |
Open Street Maps all nodes | Professional or Enterprise Edition |
Point | 8 billion | All the nodes of the OpenStreetMap Planet dataset |
Open Street Maps postal codes | Professional or Enterprise Edition |
Polygon | 154 thousand | Boundaries of postal code areas as defined in OpenStreetMap |
Weather events | Professional or Enterprise Edition |
Point | 8.6 million | Events such as rain, snow, storm, from 2016 - 2022 |
Wild fires | Professional or Enterprise Edition |
Point | 1.8 million | Wildfire that occurred in the United States from 1992 to 2015 |
Overture Places with Isochrones Licensing
Access to Overture Places with Isochrones through the Wherobots Data Hub requires a Professional or Enterprise Edition Organization.
The dataset itself is provided under the Community Data License Agreement – Permissive, Version 2.0, governing its use if obtained or redistributed. This dataset also carries the licenses inherited from its source data.
Example notebooks¶
Explore our collection of example notebooks to see how you can link your data to the physical world and drive new insights. To get started, simply start a runtime and open the notebook that best fits your use case.
Plan Requirements
Some advanced example notebooks, like those for raster inference, require a Professional or Enterprise Edition Organization and GPU-Optimized runtimes to execute.
If you are new to the platform, learn how to Start Your First Notebook.
Query datasets in Wherobots¶
You can query datasets in Wherobots Cloud with an interactive notebook or within a Job Run or SQL Session.
For more information on querying data, see Querying Datasets.