
Notebook and data storage

This page provides detailed information about storing and accessing your data in Wherobots Cloud.

You can upload files to Wherobots Cloud for later use within your Jupyter notebooks and jobs.

You can upload individual files directly through your browser using Wherobots Managed Storage, or integrate your existing Amazon S3 storage buckets.

Wherobots notebook and data storage guidance

When creating geospatial solutions, we recommend that you store all of your code, configuration files, and notebook files within the Wherobots File System.

Although you can use other locations inside your Jupyter notebook environment for temporary data storage, we recommend using one of the Wherobots storage options: Wherobots Managed Storage or an S3 storage integration.

Wherobots File System

Jupyter's top-level directory points to the /home/wherobots directory in your Jupyter notebook environment's file system. This directory is backed by the Wherobots File System, persists automatically across your notebook instances, and is visible and accessible only to your Wherobots user.
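As a minimal sketch of working under this persisted directory from a notebook, the snippet below builds paths beneath /home/wherobots with Python's standard pathlib module. The helper name project_path and the my-project folder are hypothetical and chosen only for illustration; they are not part of Wherobots.

```python
from pathlib import PurePosixPath

# Persisted home directory described above.
WHEROBOTS_HOME = PurePosixPath("/home/wherobots")


def project_path(*parts: str) -> PurePosixPath:
    """Return a path under /home/wherobots (hypothetical helper).

    Raises ValueError if the path would fall outside the persisted
    directory, for example through ".." or an absolute component.
    """
    candidate = WHEROBOTS_HOME.joinpath(*parts)
    inside = WHEROBOTS_HOME in (candidate, *candidate.parents)
    if ".." in candidate.parts or not inside:
        raise ValueError(f"{candidate} is outside {WHEROBOTS_HOME}")
    return candidate


# "my-project" is an illustrative folder name.
config_file = project_path("my-project", "config.yaml")
print(config_file)
```

Keeping code and configuration under this one root means they persist when you start a new notebook instance.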

To share notebooks with other members of your Organization, you can download your notebook files from your Jupyter notebook environment and exchange them with your team members. You can also use Wherobots Managed Storage (see below) and upload your notebook files into the S3 data folder /data/shared. You can then upload new notebook files into the Jupyter notebook environment with Jupyter's built-in upload function.

User specific folders

Folders named customer-XXXX/ are unique to each account. Only the account holder can access the files and folders within a customer-XXXX/ folder.

Organization-wide shared folders

Folders named shared/ grant access to everyone within your Organization. Any member of the Organization can create and delete folders, and can read, write, upload, and delete files within this shared folder.

Copy a file or folder's S3 Path

If needed, you can copy the direct S3 path of a folder or file by clicking the "copy" icon on the right-hand side of that folder or file in the listing.
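Once you have copied a path, you may want to split it into its bucket and object key, for example to pass them to other tooling. The sketch below does this with Python's standard urllib.parse module. It assumes the copied path has the usual s3://bucket/key form; the bucket name example-bucket and the file name are hypothetical placeholders, and split_s3_path is an illustrative helper, not a Wherobots function.

```python
from typing import Tuple
from urllib.parse import urlparse


def split_s3_path(s3_path: str) -> Tuple[str, str]:
    """Split an S3 path into (bucket, key). Illustrative helper only."""
    parsed = urlparse(s3_path)
    if parsed.scheme not in ("s3", "s3a") or not parsed.netloc:
        raise ValueError(f"not an S3 path: {s3_path!r}")
    # The URL path starts with "/", which is not part of the object key.
    return parsed.netloc, parsed.path.lstrip("/")


# Hypothetical path, standing in for one copied from the storage page.
bucket, key = split_s3_path("s3://example-bucket/data/shared/roads.parquet")
print(bucket)
print(key)
```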

Folder permissions

The following top-level folders are read-only, meaning that you cannot create new folders, upload files, or delete existing folders directly in them:

  • / (root)
  • /data/

In the following folders, you can create new folders, upload files, and delete any file:

  • /data/customer-XXXX
  • /data/shared
  • /spark-logs

Note

The /spark-logs folder is readable and writable, but we recommend that you do not modify the logs, as doing so may affect your experience.
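The permission rules above can be summarized as a small check. The following is a minimal sketch that only mirrors the documented folder layout; it does not query Wherobots, and the function name is_writable and the customer_folder parameter are hypothetical. Replace customer-XXXX with your own account's folder name.

```python
from pathlib import PurePosixPath


def is_writable(path: str, customer_folder: str = "customer-XXXX") -> bool:
    """Return True if the documented rules allow writing at `path`.

    Mirrors the lists above: / and /data/ are read-only, while
    /data/<your customer folder>, /data/shared, and /spark-logs
    accept new folders and files.
    """
    parts = PurePosixPath(path).parts  # e.g. ('/', 'data', 'shared', 'x')
    if len(parts) < 2 or parts[0] != "/" or ".." in parts:
        return False
    if parts[1] == "spark-logs":
        return True
    if parts[1] == "data" and len(parts) >= 3:
        return parts[2] in ("shared", customer_folder)
    return False


for candidate in ("/", "/data/", "/data/shared/roads.parquet", "/spark-logs"):
    print(candidate, is_writable(candidate))
```

Although the check reports /spark-logs as writable, the recommendation above still applies: treat the logs as read-only in practice.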

Wherobots Managed Storage

Wherobots provides out-of-the-box, managed storage for your geospatial datasets and Havasu tables, accessible from your notebooks and jobs with WherobotsDB. You can browse and manage the contents of your Wherobots Managed Storage on the Managed Storage page in Wherobots Cloud.

Wherobots Managed Storage allows you to see the file and directory structure of your data warehouse on Wherobots Cloud.

Note

You can also use Wherobots Managed Storage to upload files and data for your spatial analytics tasks.

S3 Storage integration

Wherobots’ integration with Amazon Simple Storage Service (S3) allows S3 customers to use Wherobots as the spatial engine that operates on their data while still using Amazon S3 for data storage. If you need to upload large files or more complex folder structures, such as partitioned Parquet files, consider integrating your existing S3 buckets.

For more information, see S3 Storage integration.
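To illustrate what a "more complex folder structure" can look like, the sketch below builds object keys for a partitioned Parquet dataset. It assumes the common Hive-style column=value directory convention, which this page does not itself specify; the dataset name, partition columns, and file names are hypothetical.

```python
from pathlib import PurePosixPath
from typing import Mapping


def partition_key(prefix: str, partitions: Mapping[str, object], filename: str) -> str:
    """Build a Hive-style key: prefix/col1=val1/col2=val2/filename."""
    segments = [f"{column}={value}" for column, value in partitions.items()]
    return str(PurePosixPath(prefix, *segments, filename))


# Hypothetical dataset partitioned by country and year.
for country, year in (("US", 2023), ("US", 2024), ("CA", 2024)):
    print(partition_key("buildings", {"country": country, "year": year},
                        "part-00000.parquet"))
```

A layout like this spans many nested folders, which is why uploading it file by file through a browser is impractical compared with pointing Wherobots at an existing bucket.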