AzureStor 1.0 defined several functions for working with storage, which were specific to each storage type: blob, file or ADLSgen2. AzureStor 2.0 organises these functions into a consistent framework, using S3 generics and methods.

The client framework for AzureStor 2.0 is described here. While the original interface is still supported, it's recommended that you use the new framework as it protects against specifying the wrong function for a given storage type.

Storage endpoints and containers

The storage_endpoint creates a storage endpoint object based on the URL specified. The blob_endpoint, file_endpoint and adls_endpoint functions do the same thing, but require that the URL and the function match.

# generic endpoint function: storage type inferred from URL
storage_endpoint("https://mystorage.blob.core.windows.net/", ...)  # blob endpoint
storage_endpoint("https://mystorage.file.core.windows.net/", ...)  # file endpoint
storage_endpoint("https://mystorage.dfs.core.windows.net/", ...)   # ADLSgen2 endpoint

# storage-type-specific functions
blob_endpoint("https://mystorage.blob.core.windows.net/", ...)     # blob endpoint
file_endpoint("https://mystorage.file.core.windows.net/", ...)     # file endpoint
adls_endpoint("https://mystorage.dfs.core.windows.net/", ...)      # ADLSgen2 endpoint

# error: using the wrong function for a given storage type
# this is not possible with the new framework
file_endpoint("https://mystorage.blob.core.windows.net/")

The following generics are for managing storage containers, given a storage endpoint of a given type (blob, file or ADLSgen2):

In turn these dispatch to the following lower-level functions for each type of storage:

| Generic | Blob | File | ADLS2 | | ------- | ---- | ---- | ----- | | storage_container | blob_container | file_share | adls_filesystem | | create_storage_container | create_blob_container | create_file_share | create_adls_filesystem | | delete_storage_container | delete_blob_container | delete_file_share | delete_adls_filesystem | | list_storage_containers | list_blob_containers | list_file_shares | list_adls_filesystems |

# example of working with containers (blob storage)
bl_endp_key <- storage_endpoint("https://mystorage.blob.core.windows.net/", key="mykey")
list_storage_containers(bl_endp_key)
cont <- storage_container(bl_endp, "mycontainer")
newcont <- create_storage_container(bl_endp, "newcontainer")
delete_storage_container(newcont)

# you can also call the lower-level functions directly if desired
bl_endp_key <- blob_endpoint("https://mystorage.blob.core.windows.net/", key="mykey")
list_blob_containers(bl_endp_key)
cont <- blob_container(bl_endp, "mycontainer")
newcont <- create_blob_container(bl_endp, "newcontainer")
delete_blob_container(newcont)

# error: using the wrong function for a given storage type
# this is not possible with the new framework
list_file_shares(bl_endp_key)

Files and blobs

The following generics are for working with objects within a storage container:

As before, these dispatch to a family of lower-level functions for each type of storage:

| Generic | Blob | File | ADLS2 | | ------- | ---- | ---- | ----- | | list_storage_files | list_blobs | list_azure_files | list_adls_files | | create_storage_dir | N/A | create_azure_dir | create_adls_dir | | delete_storage_dir | N/A | delete_azure_dir | delete_adls_dir | | delete_storage_file | delete_blob | delete_azure_file | delete_adls_file | | storage_upload | upload_blob | upload_azure_file | upload_adls_file | | storage_download | download_blob | download_azure_file | download_adls_file | | storage_multiupload | multiupload_blob | multiupload_azure_file | multiupload_adls_file | | storage_multidownload | multidownload_blob | multidownload_azure_file | multidownload_adls_file | | copy_url_to_storage | copy_url_to_blob | N/A | N/A | | multicopy_url_to_storage | multicopy_url_to_blob | N/A | N/A |

# example workflow with generics (blob storage)
cont <- storage_container(bl_endp, "mycontainer")
list_storage_files(cont)
storage_upload(cont, "description.txt", "description")
storage_multiupload(cont, "*.tar.gz")

# using lower-level functions
cont <- blob_container(bl_endp, "mycontainer")
list_blobs(cont)
upload_blob(cont, "description.txt", "description")
multiupload_blob(cont, "*.tar.gz")


cloudyr/AzureStor documentation built on July 31, 2022, 11:22 a.m.