interleaved: Interleaved search results

Description Usage Arguments Format References Examples

Description

Tools for analysis of experiments that use interleaved search results wherein users receive results from multiple sets of retrieval functions.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
interleaved_bootstraps(sessions, clicks, bootstraps = 1000L)

interleaved_confint(sessions, clicks, bootstraps = 1000L, confidence = 0.95)

interleaved_preference(sessions, clicks)

interleaved_data

interleaved_data_a

interleaved_data_b

Arguments

sessions

vector of session IDs used to group positions and ranking_functions

clicks

vector that shows which ranking function the clicked search result came from ("A" or "B")

bootstraps

number of times to sample unique sessions (with replacement); 1000 by default

confidence

level; 0.95 by default

Format

interleaved_data* are data.frame-s of generated search sessions with the following columns:

session_id

10-character alphanumeric ID; for grouping events

timestamp

when the event occurred; uses POSIXct format

event

"serp" or "click"

position

position ("ranking") of the clicked search result

ranking_function

"A" or "B"

Users in interleaved_data have no preference, users in interleaved_data_a have preference for ranking function "A", and users in interleaved_data_b have preference for ranking function "B".

An object of class data.frame with 5396 rows and 5 columns.

An object of class data.frame with 5552 rows and 5 columns.

References

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
data("interleaved_data", package = "wmf")
x <- interleaved_data[interleaved_data$event == "click", ]
x <- x[order(x$session_id, x$timestamp), ]
data("interleaved_data_a", package = "wmf")
y <- interleaved_data_a[interleaved_data_a$event == "click", ]
y <- y[order(y$session_id, y$timestamp), ]
data("interleaved_data_b", package = "wmf")
z <- interleaved_data_b[interleaved_data_b$event == "click", ]
z <- z[order(z$session_id, z$timestamp), ]

# Bootstrapped preference statistics:

## Data without a clear preference:
b <- interleaved_bootstraps(x$session_id, x$ranking_function)
hist(b)

## Data where A is preferred over B:
b <- interleaved_bootstraps(y$session_id, y$ranking_function)
hist(b)

## Data where B is preferred over A:
b <- interleaved_bootstraps(z$session_id, z$ranking_function)
hist(b)

# Preference statistic confidence intervals:

## Data without a clear preference:
interleaved_confint(x$session_id, x$ranking_function)

## Data where A is preferred over B:
interleaved_confint(y$session_id, y$ranking_function)

## Data where B is preferred over A:
interleaved_confint(z$session_id, z$ranking_function)

# Preference statistic calculation:

## Data without a clear preference:
interleaved_preference(x$session_id, x$ranking_function)

## Data where A is preferred over B:
interleaved_preference(y$session_id, y$ranking_function)

## Data where B is preferred over A:
interleaved_preference(z$session_id, z$ranking_function)

wikimedia/wikimedia-discovery-ortiz documentation built on Aug. 23, 2020, 9:46 a.m.