ServerlessInferenceConfig | R Documentation |
This object specifies the configuration for a serverless endpoint. Use it when creating a serverless endpoint and running serverless inference.
memory_size_in_mb
The memory size of your serverless endpoint.
max_concurrency
The maximum number of concurrent invocations your serverless endpoint can process.
new()
Initialize a ServerlessInferenceConfig object for serverless inference configuration.
ServerlessInferenceConfig$new(memory_size_in_mb = 2048, max_concurrency = 5)
memory_size_in_mb
(int): Optional. The memory size of your serverless endpoint. Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB. If no value is provided, Amazon SageMaker will choose the default value for you. (Default: 2048)
max_concurrency
(int): Optional. The maximum number of concurrent invocations your serverless endpoint can process. If no value is provided, Amazon SageMaker will choose the default value for you. (Default: 5)
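The argument constraints above can be checked before a config is constructed. The following is a minimal base R sketch, not part of the package: `valid_memory_sizes_mb` and `check_serverless_args` are hypothetical names, and the rule that `max_concurrency` must be a positive whole number is an assumption, since the documentation states no bounds for it.

```r
# Hypothetical helper, not part of the package.
# Valid memory sizes as documented: 1024 MB to 6144 MB in 1 GB increments.
valid_memory_sizes_mb <- seq(1024L, 6144L, by = 1024L)

check_serverless_args <- function(memory_size_in_mb = 2048L, max_concurrency = 5L) {
  if (!(memory_size_in_mb %in% valid_memory_sizes_mb)) {
    stop(sprintf(
      "memory_size_in_mb must be one of: %s",
      paste(valid_memory_sizes_mb, collapse = ", ")
    ))
  }
  # Assumption: concurrency must be a positive whole number.
  if (max_concurrency < 1 || max_concurrency != floor(max_concurrency)) {
    stop("max_concurrency must be a positive whole number")
  }
  invisible(TRUE)
}

# Passes silently for documented values.
check_serverless_args(memory_size_in_mb = 4096, max_concurrency = 10)
```

If the check passes, the same values could then be handed to `ServerlessInferenceConfig$new()`.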
to_request_list()
Generates a request list from the parameters provided to the class.
ServerlessInferenceConfig$to_request_list()
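As a rough illustration of what such a request structure might look like, the base R sketch below builds a named list by hand. The function `build_serverless_request` is hypothetical, and the field names `MemorySizeInMB` and `MaxConcurrency` are taken from the SageMaker API's serverless configuration; that the method returns exactly this shape is an assumption, not something this page confirms.

```r
# Hypothetical stand-in for the shape of the request, not the package's code.
build_serverless_request <- function(memory_size_in_mb = 2048L, max_concurrency = 5L) {
  list(
    MemorySizeInMB = as.integer(memory_size_in_mb),  # field name assumed from the SageMaker API
    MaxConcurrency = as.integer(max_concurrency)     # field name assumed from the SageMaker API
  )
}

request <- build_serverless_request(memory_size_in_mb = 4096, max_concurrency = 10)
str(request)
```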
format()
Formats the class for printing.
ServerlessInferenceConfig$format()
clone()
The objects of this class are cloneable with this method.
ServerlessInferenceConfig$clone(deep = FALSE)
deep
Whether to make a deep clone.