ServerlessInferenceConfig: Configuration object passed in when deploying models to...

ServerlessInferenceConfigR Documentation

Configuration object passed in when deploying models to Amazon SageMaker Endpoints.

Description

This object specifies configuration related to serverless endpoint. Use this configuration when trying to create serverless endpoint and make serverless inference

Public fields

memory_size_in_mb

The memory size of your serverless endpoint.

max_concurrency

The maximum number of concurrent invocations your serverless endpoint can process

Methods

Public methods


Method new()

Initialize a ServerlessInferenceConfig object for serverless inference configuration.

Usage
ServerlessInferenceConfig$new(memory_size_in_mb = 2048, max_concurrency = 5)
Arguments
memory_size_in_mb

(int): Optional. The memory size of your serverless endpoint. Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB. If no value is provided, Amazon SageMaker will choose the default value for you. (Default: 2048)

max_concurrency

(int): Optional. The maximum number of concurrent invocations your serverless endpoint can process. If no value is provided, Amazon SageMaker will choose the default value for you. (Default: 5)


Method to_request_list()

Generates a request dictionary using the parameters provided to the class.

Usage
ServerlessInferenceConfig$to_request_list()

Method format()

Format class

Usage
ServerlessInferenceConfig$format()

Method clone()

The objects of this class are cloneable with this method.

Usage
ServerlessInferenceConfig$clone(deep = FALSE)
Arguments
deep

Whether to make a deep clone.


DyfanJones/sagemaker-r-common documentation built on June 14, 2022, 10:31 p.m.