tennis: Cleaned game-wise dataset with player statistics

Description Usage Format See Also

Description

A combination of player statistics and game-wise pbp data in which there is a lagging and a leading player

Usage

1

Format

A data frame with 17680 rows and 81 variables. Variables that are character specific have been omitted (e.g., ace_lagging and ace_leading have the same description, but refer to different players hence only ace_lagging appears in the list below):

pbp_id

pbp_id for matches in which there is a set-wise difference of at least 4 games between the lagging and leading player

match_num

counter of the matches cleaned from pbp_raw_clean.df

set_num

identifies the set in which the game is played

game_num

counter of the games in each set (resets to 1 after a new set)

pbp

point-by-point record of the game; S (server won), R (returner won), A (ace), D (double fault)

server

points won by the server of the game

returner

points won by the returner of the game

winner

winner of the game i.e. server or returner

player1_serve

1 if player1 is server, 0 otherwise

player1_game

1 if player1 wins game, 0 otherwise

player2_game

1 if player2 wins game, 0 otherwise

player1_game_acc_set

running total of number of games won by player1 in a given set

player2_game_acc_set

running total of number of games won by player2 in a given set

player1_game_acc_total

running total of number of games won by player1 in a given match

player2_game_acc_total

running total of number of games won by player2 in a given match

total_game_acc

running total of number of games in a given match

leading_set

denotes which player has won more games in a given set

player1_set_acc

number of sets player1 wins

player2_set_acc

number of sets player2 wins

diff_game_set

set-wise game deficit; by construction, must be of magnitude at least four

match_winner

winner of the match, i.e., 1 or 2

player1_leading_set

1 if player1 has more games than player2 in a given set

rally_length

length of rally, i.e., nchar(pbp)

server1

Name of server 1

server2

Name of server 2

rank_lagging

Rank

age_lagging

Age and Birth Date

pro_start_lagging

Pro_start

weight_lagging

Weight

height_lagging

Height

residence_lagging

Residence

hand_lagging

Hand

coach_lagging

Coach

aces_lagging

Aces

df_lagging

Double Faults

first_serve_lagging

1st Serve

first_serve_won_lagging

1st Serve Points Won

second_serve_won_lagging

2nd Serve Points Won

bp_faced_lagging

Break Points Faced

bp_saved_lagging

Break Points Saved

serv_game_played_lagging

Service Games Played

serv_game_won_lagging

Service Games Won

total_serv_won_lagging

Total Service Points Won

first_return_lagging

1st Serve Return Points Won

second_return_lagging

2nd Serve Return Points Won

bp_opp_lagging

Break Points Opportunities

bp_conv_lagging

Break Points Converted

ret_game_played_lagging

Return Games Played

ret_game_won_lagging

Return Games Won

ret_won_lagging

Return Points Won

total_ret_won_lagging

Total Points Won

See Also

tennis_cleaned for a player1/player2-agnostic dataset

Other final files: tennis_cleaned, tennis_data


kattaoa/oktennis documentation built on July 13, 2019, 3:15 p.m.