Datalore 2022.3 Help

Use bucket data sources

Datalore provides interface for mounting S3 or Google Cloud Storage buckets directly inside your notebooks. Here are the main benefits of using such data sources:

  • You extend your storage as you do not have to upload to notebooks those files that you store in buckets.

  • You can grant read or write access to the files stored in buckets.

Attach buckets to your notebook

  1. Open the notebook.

  2. Go to Main menu | Tools | Attached data or click the Attached data icon on the left-hand sidebar.

  3. Click New connection and select New cloud storage.

  4. In the New connection dialog, select Amazon S3 or Google cloud storage.

  5. Fill in the fields based on the selected data source type and click Create data source.

  6. Once mounted, the cloud storage is automatically attached to the notebook. Your further actions:

    • Copy the data source path to access it in your code.

    • Click the edit icon next to the source type to change the access type to read-only.

    • Click the vertical ellipsis icon to edit or remove the data source. For S3 buckets, you can also check the connection or choose connection using boto3.

    • Unmount the bucket.

Buckets attached to a specific notebook are saved to the file system and can be attached to any notebook.

Attach an existing bucket datasource

  1. Open the notebook.

  2. Select Main menu > Tools > Attached datasources. This opens a view to the right of your editor.

  3. In the Attached datasources view, click Attach existing datasources and select the required datasource from the list.

Enable SSE-C for the Datalore S3 datasource

In the custom_options field, specify the following:

use_sse=c:/path/to/keys/file

where:

/path/to/keys/file is the file that contain keys. Make sure permissions are 600.

    Last modified: 30 November 2022