LoadBalancing: Debugger LoadBalancing class

Description Super class Methods

Description

This rule helps to detect issues in workload balancing between multiple GPUs. It computes a histogram of utilization per GPU and measures the distance between those histograms. If the histogram exceeds a pre-defined threshold then rule triggers.

Super class

sagemaker.debugger::ProfilerRuleBase -> LoadBalancing

Methods

Public methods

Inherited methods

Method new()

Initialize LoadBalancing class

Usage
LoadBalancing$new(
  threshold = 0.5,
  patience = 1000,
  scan_interval_us = 60 * 1000 * 1000
)
Arguments
threshold

: difference between 2 histograms 0.5

patience

: how many values to record before checking for loadbalancing issues

scan_interval_us

: interval with which timeline files are scanned. Default is 60000000 us.


Method clone()

The objects of this class are cloneable with this method.

Usage
LoadBalancing$clone(deep = FALSE)
Arguments
deep

Whether to make a deep clone.


DyfanJones/sagemaker-r-debugger documentation built on Jan. 20, 2022, 5:49 p.m.