R package of cleaned profile data from the 2021 revision of OkCupid Profile Data for Introductory Statistics and Data Science Courses (Journal of Statistics Education 2015): 59,946 OkCupid users who were living within 25 miles of San Francisco, had active profiles during a period in the 2010s, and had at least one picture in their profile.
The data in this package are a “cleaned” version of the 2021 revised data from the above paper, in that the following variables are modified for easier use by novices:
profiles_revised
:income
values: Previously coded as -1
, they are
now coded as NA
""
, they are
now coded as NA
offspring
and sign
: String instances of "?’"
are
replaced with apostrophesessay0_revised_and_shuffled
essay0
: my
self summary) are included.Note:
location
, last_online
Get the released version from CRAN:
install.packages("okcupiddata")
Or the development version from GitHub:
# If you haven't installed devtools yet, do so:
# install.packages("devtools")
devtools::install_github("rudeboybert/okcupiddata")
To load the revised profile data, run:
library(okcupiddata)
profiles_revised
To load the row-shuffled essay data, run:
essay0_revised_and_shuffled
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.