Description Usage Arguments Details Value References Examples
Apply an arg max choice rule
1 |
x |
A numeric vector or matrix with probabilistic predictions for actions. If |
tol |
A small number to avoid numeric tolerance problems in finding the maximum in |
The returned values are typically 1 for the row maximum or 0, except if n actions have the maximum probability the result is 1/n, correcting ties by equal choice probability. If the iput is a vector it is silently coerced to a two-column matrix.
A numeric vector or matrix (depending on the input) of indicators of the maximum.
Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press
1 2 3 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.