steph: The Stephenson Rating System

Description

Implements the Stephenson rating system for estimating the relative skill level of players in two-player games such as chess. It extends the Glicko method by including a second parameter controlling player deviation across time, a bonus parameter, and a neighbourhood parameter.

Usage

steph(x, status = NULL, init = c(2200,300), gamma = 0, cval = 10, 
  hval = 10, bval = 0, lambda = 2, history = FALSE, sort = TRUE, 
  rdmax = 350, ...)

Arguments

x

A data frame containing four variables: (1) a numeric vector denoting the time period in which the game took place; (2) a numeric or character identifier for player one; (3) a numeric or character identifier for player two; and (4) the result of the game expressed as a number, typically equal to one for a player one win, zero for a player two win, and one half for a draw.
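
As a minimal sketch of this layout, the following base R code builds a small games data frame. The player names, results and column names are made up for illustration; the description above identifies the variables by position, so the names used here are only an assumption about what reads well.

```r
# Hypothetical toy data: three games played over two time periods.
games <- data.frame(
  Period  = c(1, 1, 2),              # (1) time period of each game
  Player1 = c("Ann", "Bob", "Ann"),  # (2) identifier for player one
  Player2 = c("Bob", "Cat", "Cat"),  # (3) identifier for player two
  Result  = c(1, 0.5, 0),            # (4) 1 = player one win, 0.5 = draw, 0 = player two win
  stringsAsFactors = FALSE
)
str(games)
```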

status

A data frame with the current status of the system. If not NULL, this needs to be a data frame in the form of the ratings component of the returned list, containing variables named Player, Rating, Deviation, and optionally Games, Win, Draw, Loss and Lag, which are set to zero if not given.
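
A sketch of supplying starting ratings through status might look as follows. The players, ratings and deviations are hypothetical; only the variable names Player, Rating and Deviation come from the description above. The call to steph is guarded so that the snippet still runs where the PlayerRatings package is not installed.

```r
# Hypothetical starting ratings for two players who are already known.
status <- data.frame(
  Player    = c("Ann", "Bob"),
  Rating    = c(2300, 2100),
  Deviation = c(80, 120),
  stringsAsFactors = FALSE
)

# Hypothetical games; "Cat" does not appear in status, so she would be
# initialized from the init argument instead.
games <- data.frame(
  Period  = c(1, 1),
  Player1 = c("Ann", "Bob"),
  Player2 = c("Bob", "Cat"),
  Result  = c(1, 0.5),
  stringsAsFactors = FALSE
)

if (requireNamespace("PlayerRatings", quietly = TRUE)) {
  robj <- PlayerRatings::steph(games, status = status)
  print(robj$ratings)
}
```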

init

The rating vector at which to initialize a new player not appearing in status. Must be a vector of length two giving the initial rating and initial deviation respectively. If different initializations for different players are required, this can be done using status. The initial deviation cannot be greater than rdmax.

gamma

A player one advantage parameter; either a single value or a numeric vector of length equal to the number of rows in x. Positive values favour player one, while negative values favour player two. This could represent the advantage of playing at home, or the advantage of playing white in chess. Note that this is not passed to predict.rating, which has its own gamma parameter.
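
The vector form can be sketched as below, giving an advantage only to games that were not played at a neutral venue. The Neutral column and the advantage value of 30 are assumptions made purely for illustration, not recommended settings.

```r
# Hypothetical games with a flag marking neutral-venue games.
games <- data.frame(
  Period  = c(1, 1, 2),
  Home    = c("Ann", "Bob", "Ann"),
  Away    = c("Bob", "Cat", "Cat"),
  Result  = c(1, 0.5, 0),
  Neutral = c(FALSE, FALSE, TRUE),
  stringsAsFactors = FALSE
)

# One gamma value per row of x: 30 (an arbitrary illustrative value)
# when player one is at home, 0 at a neutral venue.
gamma_vec <- ifelse(games$Neutral, 0, 30)

if (requireNamespace("PlayerRatings", quietly = TRUE)) {
  # Only the first four columns are passed as x.
  robj <- PlayerRatings::steph(games[, 1:4], gamma = gamma_vec)
  print(robj$ratings)
}
```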

cval

The c parameter, which controls the increase in the player deviations across time. Must be a single non-negative number. Note that both cval and hval increase player deviations, so if hval is not zero then cval should typically be lower than the corresponding parameter in glicko.

hval

The h parameter, which also controls the increase in the player deviations across time. Must be a single non-negative number.

bval

The bonus parameter, which gives a per-game bonus to each player on the basis that players who play more often may improve irrespective of whether they win or lose. Must be a single non-negative number. Note that a positive value will create ratings inflation (i.e. ratings will increase over time).

lambda

The neighbourhood parameter, which shrinks player ratings towards those of their opponents. Must be a single non-negative number.

history

If TRUE, returns the entire history for each period in the component history of the returned list.

sort

If TRUE, sort the results by rating (highest to lowest). If FALSE, sort the results by player.

rdmax

The maximum value allowed for the rating deviation.

...

Not used.

Details

The Stephenson rating system is a method for evaluating the skill of players. It was developed by Alec Stephenson in 2012 as a variant of his winning entry in a competition to find the most useful practical chess rating system, organized by Jeff Sonas on Kaggle, a platform for data prediction competitions. The precise details are given in the file doc/ChessRatings.pdf.

This implementation is written so that Glicko is obtained as a special case upon setting all of the parameters hval, bval and lambda to zero. Default values are roughly optimized for the chess data analyzed in the file doc/ChessRatings.pdf, using the binomial deviance criterion.
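
The special case can be checked along the following lines. This is a sketch rather than a verified result: it assumes glicko accepts the same init, cval and sort arguments, and the value cval = 15 is an arbitrary choice shared by both calls so that only the additional Stephenson parameters differ.

```r
# Holds the comparison result; stays NA if PlayerRatings is unavailable.
same <- NA

if (requireNamespace("PlayerRatings", quietly = TRUE)) {
  library(PlayerRatings)
  afl <- aflodds[, c(2, 3, 4, 7)]

  # Switch off the extra Stephenson parameters.
  s0 <- steph(afl, init = c(2200, 300), cval = 15,
              hval = 0, bval = 0, lambda = 0, sort = FALSE)
  # Glicko with the same initialization and c parameter.
  g0 <- glicko(afl, init = c(2200, 300), cval = 15, sort = FALSE)

  # sort = FALSE orders both results by player, so rows line up.
  same <- all.equal(s0$ratings$Rating, g0$ratings$Rating)
  print(same)
}
```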

Value

A list object of class "rating" with the following components:

ratings

A data frame of the results at the end of the final time period. The variables are self-explanatory except for Lag, which represents the number of time periods since the player last played a game. This is equal to zero for players who played in the latest time period, and is also zero for players who have not yet played any games.

history

A three dimensional array, or NULL if history is FALSE. The row dimension is the players, the column dimension is the time periods. The third dimension gives different parameters.
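
One way to inspect this component is sketched below. The code only queries the dimensions and dimension names, since the labels used along the third dimension are not listed here.

```r
# Stays NULL if PlayerRatings is unavailable.
hist_dim <- NULL

if (requireNamespace("PlayerRatings", quietly = TRUE)) {
  library(PlayerRatings)
  afl <- aflodds[, c(2, 3, 4, 7)]
  robj <- steph(afl, history = TRUE)

  # Players x time periods x parameters.
  hist_dim <- dim(robj$history)
  print(hist_dim)

  # Labels of the parameters stored along the third dimension.
  print(dimnames(robj$history)[[3]])
}
```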

gamma

The player one advantage parameter.

cval

The c parameter.

hval

The h parameter.

bval

The bonus parameter.

lambda

The neighbourhood parameter.

type

The character string "Stephenson".

References

Glickman, M.E. (1999) Parameter estimation in large dynamic paired comparison experiments. J. R. Stat. Soc. Ser. C: Applied Statistics, 48(3), 377-394.

Glickman, M.E. (2001) Dynamic paired comparison models with stochastic variances. Journal of Applied Statistics, 28, 673-689.

See Also

glicko

Examples

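# Select the time period (Week), the two player identifiers and the result.
afl <- aflodds[,c(2,3,4,7)]
# Rate all games in a single call.
robj <- steph(afl)
robj

# Incremental update: rate week one, then add each later week in turn,
# passing the previous ratings as status.
robj <- steph(afl[afl$Week==1,])
for(i in 2:max(afl$Week)) robj <- steph(afl[afl$Week==i,], robj$ratings)
robj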

Example output

Stephenson Ratings For 18 Players Playing 675 Games

                      Player Rating Deviation Games Win Draw Loss Lag
1        Collingwood Magpies   2390     78.14    88  68    2   18   0
2               Geelong Cats   2306     77.51    87  68    0   19   0
3             Hawthorn Hawks   2299     74.60    82  48    1   33   1
4          West Coast Eagles   2285     74.85    81  39    0   42   0
5               Sydney Swans   2274     74.29    82  44    1   37   0
6             Adelaide Crows   2224     75.21    80  40    0   40   0
7           Essendon Bombers   2224     75.69    80  37    2   41   0
8            St Kilda Saints   2205     74.68    86  57    3   26   1
9              Carlton Blues   2191     74.24    82  45    1   36   1
10           Richmond Tigers   2171     75.62    78  25    2   51   1
11 North Melbourne Kangaroos   2170     75.75    78  34    1   43   0
12         Fremantle Dockers   2160     74.92    80  35    0   45   0
13          Western Bulldogs   2141     75.08    84  45    0   39   0
14            Brisbane Lions   2107     77.06    80  30    1   49   0
15       Port Adelaide Power   2071     76.12    78  26    0   52   1
16          Melbourne Demons   2070     77.67    78  22    2   54   0
17           Gold Coast Suns   1889     87.06    34   3    0   31   1
18    Greater Western Sydney   1884    117.86    12   1    0   11   0


Stephenson Ratings For 18 Players Playing 675 Games

                      Player Rating Deviation Games Win Draw Loss Lag
1        Collingwood Magpies   2390     78.14    88  68    2   18   0
2               Geelong Cats   2306     77.51    87  68    0   19   0
3             Hawthorn Hawks   2299     74.60    82  48    1   33   1
4          West Coast Eagles   2285     74.85    81  39    0   42   0
5               Sydney Swans   2274     74.29    82  44    1   37   0
6             Adelaide Crows   2224     75.21    80  40    0   40   0
7           Essendon Bombers   2224     75.69    80  37    2   41   0
8            St Kilda Saints   2205     74.68    86  57    3   26   1
9              Carlton Blues   2191     74.24    82  45    1   36   1
10           Richmond Tigers   2171     75.62    78  25    2   51   1
11 North Melbourne Kangaroos   2170     75.75    78  34    1   43   0
12         Fremantle Dockers   2160     74.92    80  35    0   45   0
13          Western Bulldogs   2141     75.08    84  45    0   39   0
14            Brisbane Lions   2107     77.06    80  30    1   49   0
15       Port Adelaide Power   2071     76.12    78  26    0   52   1
16          Melbourne Demons   2070     77.67    78  22    2   54   0
17           Gold Coast Suns   1889     87.06    34   3    0   31   1
18    Greater Western Sydney   1884    117.86    12   1    0   11   0

PlayerRatings documentation built on March 1, 2020, 5:07 p.m.