Nothing
#' Alcohol at Georgetown College
#'
#' Alcohol policy violations on the Georgetown College campus over several years.
#'
#'
#' @name alcohol
#' @docType data
#' @format A data frame with 10 observations on the following 4 variables.
#' \describe{ \item{year}{Academic year ending with Spring of the given
#' year.}
#' \item{enrollment}{Full-time equivalent enrollment.}
#' \item{writeups}{Number of write-ups for alcohol violations.}
#' \item{writeups.per.100}{Number of write-ups per 100 students.} }
#' @source Collected by MAT 111 students as a project.
#' @keywords datasets
#'
NA
#'
#' Attitudes Experiment 2001
#'
#' Study conducted in November 2001 by students in MAT 111. Subjects were 267
#' Georgetown College students. Not all subjects got the same survey form.
#'
#' @name attitudes
#' @docType data
#' @format A data frame with 268 observations on the following 8 variables.
#' \describe{
#' \item{def.race}{Suggested race of the defendant in the survey form.}
#' \item{vic.race}{Suggested race of the victim in the survey form.}
#' \item{conc.situation}{Scenario described in the
#' in the "rock concert" question on the survey form.}
#' \item{sentence}{Sentence, in years, recommended for the defendant.}
#' \item{conc.decision}{Whether or not the subject chose to buy a ticket (or buy another
#' ticket).}
#' \item{year}{Class rank of the subject.}
#' \item{sex}{a factor with level Sex of the survey participant.}
#' \item{major}{possible values: \code{humanities} \code{math.sci} \code{pre.prof}
#' \code{social.sci} Type of major the subject intends.}
#' }
#'
#' @details
#' Here is a sample survey form, with variants noted.
#'
#' Attitudes Survey
#'
#' Crime: You are on a jury for a manslaughter case in Lewistown, PA. The
#' defendant has been found guilty, and in Pennsylvania it is part of the job
#' of the jury to recommend a sentence to the judge. The facts of the case are
#' as follows. The defendant, Tyrone Marcus Watson, a 35-year old native of
#' Lewistown, was driving under the influence of alcohol on the evening of
#' Tuesday July 17, 2001. At approximately 11:00 PM Watson drove through a red
#' light, striking a pedestrian, Betsy Brockenheimer, a 20-year old resident of
#' Lewistown. Brockenheimer was taken unconscious to the hospital and died of
#' her injuries about one hour later. Watson did not flee the scene, nor did
#' he resist arrest.
#'
#' The prior police record for Mr. Watson is as follows: two minor traffic
#' violations, and one previous arrest, five years ago, for DUI. No one was
#' hurt in that incident.
#'
#' Watson has now been convicted of DUI and manslaughter. The minimum jail
#' term for this combination of offenses is two years; the maximum term is
#' fifty years. In the blank below, write a number from 2 to 50 as your
#' recommended length of sentence for Tyrone Marcus Watson. _______________
#'
#' [In the question above, name of defendant could vary: either William Shane
#' Winchester or Tyrone Marcus Watson. The name of the victim could also vary:
#' either Betsy Brockenheimer or Latisha Dawes.]
#'
#' Spending Habits
#'
#' You have purchased a $30 ticket to see a rock concert in Rupp Arena. When
#' you arrive at the Arena on the night of the performance, you find that you
#' have lost the ticket. You have no receipt, so it will not be possible to
#' see the concert unless you purchase another ticket. Would you purchase
#' another ticket? Circle below.
#'
#' YES NO
#'
#' [In other forms, the question above could have been: You plan to see a rock
#' concert in Rupp Arena. Tickets for the performance are $30. When you arrive
#' at the Arena on the night of the performance, you find that you have lost
#' two bills from your purse or wallet: a ten and a twenty. Would you buy the
#' ticket anyway?]
#'
#' Respondent Data
#'
#' I am (circle one): freshman sophomore junior senior
#'
#' I am (circle one) male female
#'
#' (Optional) My intended major is: _____________________
#'
#' @source Georgetown College
#' @keywords datasets
NA
#'
#' Beans
#'
#' Experiment performed at UC-Davis; fifteen students participated. Each
#' student was asked to place as many beans into a cup as he/she could, in 15
#' seconds. Each student performed this task once with the dominant hand, and
#' once with the non-dominant hand, but the order of performance was randomized.
#' The purpose of the study was to see whether manual dexterity was better for
#' the dominant hand. Terminology: your dominant hand is the hand you use the
#' most.
#'
#'
#' @name beans
#' @docType data
#' @format A data frame with 15 observations on the following 3 variables.
#' \describe{ \item{Dom}{Number of beans placed into
#' cup with the dominant hand.}
#' \item{NonDom}{Number
#' of beans placed with the non-dominant hand.}
#' \item{Diff}{Difference in number of beans placed (dominant hand minus non-dominant
#' hand).} }
#' @source Uts and Heckard, Mind on Statistics, Fourth Edition.
#' @keywords datasets
NA
#' Miguel Cabrera
#'
#' PITCHf/x data on nine-time All-Star Miguel Cabrera, who is thought to be one of the best
#' pure hitters in baseball. Covers seasons 2009 through 2012.
#'
#'
#' @name cabrera
#' @docType data
#' @format A data frame with 6265 observations on the following 12 variables. Each observation
#' is a pitch to Cabrera.
#' \describe{ \item{season}{The year of play}
#' \item{gamedate}{Date of the game in which the pitch was thrown}
#' \item{pitch_type}{Type of pitch thrown, as determined by a computer algorithm.}
#' \item{balls}{Current ball count}
#' \item{strikes}{Current strike count}
#' \item{speed}{speed of pitch (in mph). (When crossing plate?)}
#' \item{px}{x-coordinate of pitch (in feet, measured from center of plate)}
#' \item{pz}{vertical coordinate of pitch (in feet above plate)}
#' \item{swung}{Whether or not Cabrera swung at the ball. Factor with levels "no", "yes".}
#' \item{hitx}{x-coordinate of landing point of ball (if it was hit). Relative to park.}
#' \item{hity}{y-coordinate of landing point of ball (if it was hit). Relative to park.}
#' \item{hit_outcome}{Outcome when ball was hit. Factor with levels E (error), H (hit), O (batter out).}
#' }
#' @source Marchi and Albert: analyzing Baseball Data with R, CRC Press 2014. For
#' more on the PITCHf/x system, see \url{http://en.wikipedia.org/wiki/PITCHf/x}.
#'
#' @keywords datasets
NA
#'
#' Time to Chug
#'
#' College-aged males chugging a 12-ounce can of a certain beverage.
#'
#'
#' @name chugtime
#' @docType data
#' @format A data frame with 13 observations on the following 2 variables.
#' \describe{
#' \item{Weight}{Weight of the subject (in pounds).}
#' \item{ChugTime}{How long (in seconds) the subject requires to drink the
#' beverage.}
#' }
#' @source Utts and Heckard, Mind on Statistics, Fourth Edition.
#' @keywords datasets
#'
NA
#' The Death Penalty and Race
#'
#' A dataset recreated from summary data that describes relationships
#' between race of defendant, race of victim, and outcome of trial in a number of capital
#' cases in Florida in 1976-1977.
#' @name deathpen
#' @docType data
#' @format A data frame with 326 rows and 3 variables
#'
#' \describe{
#' \item{defrace}{Race of the defendant in the capital case}
#' \item{vicrace}{Race of the victim}
#' \item{death}{Whether or not the defendant in the case received the death penalty}
#' }
#'
#' @source Michael J. Radelet: "Racial Characteristics and
#' the Imposition of the Death Penalty", \emph{American Sociological Review}, 46 (1981).
#' @keywords datasets
NA
#' Speed and Fuel Efficiency (British Ford Escort)
#'
#' A British Ford Escort was driven along a prescribed course. Each drive was
#' done at a different speed, and the fuel efficiency was recorded for each
#' drive.
#'
#'
#' @name fuel
#' @docType data
#' @format A data frame with 15 observations on the following 2 variables.
#' \describe{
#' \item{speed}{in kilometers per hour.}
#' \item{efficiency}{fuel efficiency, measured in liters of fuel required to travel 100 kilometers. }
#' }
#' @source The Basic Practice of Statistics, by Moore and McCabe.
#' @keywords datasets
NA
#' Galton's Father-Son Data
#'
#' Data on father-son pairs. Collected in 1885 by Francis Galton.
#'
#'
#' @name galton
#' @docType data
#' @format A data frame with 1078 observations on the following 2 variables.
#' \describe{
#' \item{fheight}{Height of the father, in inches.}
#' \item{sheight}{Height of the son, in inches.}
#' }
#' @keywords datasets
NA
#' Feelings About Georgetown College
#'
#' Results of a survey conducted by Georgetown College students on 47 Georgetown College upperclass students.
#'
#'
#' @name gcfeeling
#' @docType data
#' @format A data frame with 47 observations on the following 6 variables.
#' \describe{
#' \item{rating.fresh}{how happy the subjects remembers being as a first-year student, on a scale of 1 to 10.}
#' \item{rating.js}{how happy the subjects feels now, on a scale of 1 to 10.}
#' \item{greek}{whether or not the subject belongs to a greek organization.}
#' \item{athlete}{whether or not the subject is a varsity athlete}
#' \item{rating.diff}{upper-level happiness rating minus remembered first-year rating}
#' \item{happier}{whether or not subject feels happier now than as a first-year student}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' Georgetown College Students
#'
#' Data collected by GC students.
#'
#'
#' @name gcstudents
#' @docType data
#' @format A data frame with 62 observations on the following 4 variables.
#' \describe{ \item{height}{height of the survey participant, in inches}
#' \item{GPA}{grade-point average}
#' \item{enough_Sleep}{Does the participant feel that he/she gets enough sleep?}
#' \item{sex}{sex of the survey participant}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' General Social Survey, 2002
#'
#' The General Social Survey (GSS) is a nationwide poll that has been conducted
#' since 1972 (semiannually since 1994). Most interviews are done
#' face-to-face. For further information, see below.
#'
#'
#' @name gss02
#' @docType data
#' @format A data frame with 2765 observations on the following 13 variables.
#' \describe{
#' \item{sex}{a factor with levels \code{Female}
#' \code{Male}}
#' \item{race}{a factor with levels \code{AfrAm}
#' \code{Hispanic} \code{Other} \code{White}}
#' \item{degree}{a factor
#' with levels \code{Bachelor} \code{Graduate} \code{HighSchool} \code{JunColl}
#' \code{NotHs}}
#' \item{relig}{a factor with levels \code{Catholic}
#' \code{Jewish} \code{Other} \code{Protestant}}
#' \item{polparty}{a
#' factor with levels \code{Democrat} \code{Independent} \code{Other}
#' \code{Republican}}
#' \item{cappun}{a factor with levels \code{Favor}
#' \code{Oppose} Whether or not the subject favors capital punishment.}
#' \item{tvhours}{the subject estimates number of
#' hours per day he or she watches TV.}
#' \item{marijuan}{a factor with
#' levels \code{Legal} \code{NotLegal} Whether or not subject believes that
#' marijuana should be legalized.}
#' \item{owngun}{a factor with levels
#' \code{No} \code{Yes}. Does the subject own a gun?}
#' \item{gunlaw}{a factor with levels
#' \code{Favor} \code{Oppose} Whether or not the subject favors stricter
#' gunlaws.}
#' \item{age}{age of the subject}
#' \item{chldidel}{the ideal number of children the subject would like to have.}
#' \item{emailtime}{estimated number of hours per week subject spends using email.}
#' }
#' @source National Opinion Research Center: \url{http://www3.norc.org/gss+website/}.
#' Found in Uts and Heckard: Mind on Statistics, Fourth Edition.
#' @keywords datasets
NA
#' General Social Survey, 2008
#'
#'
#'
#' @name gss08
#' @docType data
#' @format A data frame with 2023 observations on the following 12 variables.
#' \describe{ \item{sex}{a factor with levels \code{Female}
#' \code{Male}} \item{race}{a factor with levels \code{AfrAm}
#' \code{Other} \code{White}} \item{degree}{a factor with levels
#' \code{Bachelor} \code{Graduate} \code{HighSchool} \code{JunColl}
#' \code{NotHs}} \item{relig}{a factor with levels \code{Catholic}
#' \code{Jewish} \code{None} \code{Other} \code{Protestant}}
#' \item{polparty}{a factor with levels \code{Democrat}
#' \code{Independent} \code{Other} \code{Republican}} \item{cappun}{a
#' factor with levels \code{Favor} \code{Oppose}} \item{tvhours}{a
#' numeric vector} \item{marijuan}{a factor with levels \code{Legal}
#' \code{NotLegal}} \item{owngun}{a factor with levels \code{No}
#' \code{Yes}} \item{gunlaw}{a factor with levels \code{Favor}
#' \code{Oppose}} \item{age}{a numeric vector}
#' \item{chldidel}{a numeric vector} }
#' @references For more information see \code{gss02}
#' @source National Opinion Research Center: \url{http://www3.norc.org/gss+website/}.
#' Found in Uts and Heckard: Mind on Statistics, Fourth Edition.
#' @keywords datasets
NA
#' @title General Social Survey, 2012
#'
#' @description A selection of variables from the 2012 General Social Survey. The variables are as follows:
#'
#' \itemize{
#' \item age. Age of the subject.
#' \item sex. Sex of the subject.
#' \item race. Race of the subject.
#' \item polviews. Subject's political views.
#' \item relig. Religion of the subject.
#' \item cappun. Opinion on capital punishment.
#' \item owngun. Whether or not one owns a gun.
#' \item emailhr. Number of hours per week spent on email.
#' \item bigbang. Whether or not subject believes the Big Bang theory is true.
#' \item premarsx. Opinion on premarital sex.
#' \item pornlaw. Should pornography be legal?
#' \item zodiac. Sign of the Zodiac under which the subject was born.
#' }
#'
#' @docType data
#' @keywords datasets
#' @source \url{http://www3.norc.org/gss+website/}.
#' @format A data frame with 1976 rows and 12 variables
#' @name gss2012
NA
#' Hair Color and ACT Score
#'
#' A study performed by MAT 111 students at Georgetown College.
#'
#'
#' @name hair_and_act
#' @docType data
#' @format A data frame with 100 observations on the following 3 variables.
#' \describe{
#' \item{sex}{a factor with levels \code{female} \code{male}}
#' \item{hair.color}{a factor with levels \code{dark} \code{light}}
#' \item{act}{composite ACT score of
#' subject.}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' Height and Handspan
#'
#' Height and handspan of a few subjects.
#'
#'
#' @name handheight
#' @docType data
#' @format A data frame with 167 observations on the following 3 variables.
#' \describe{
#' \item{Sex}{a factor with levels \code{Female} \code{Male}}
#' \item{Height}{height of subject, in inches.}
#' \item{HandSpan}{handspan of subject, in centimeters.}
#' }
#' @source Uts and Heckard, Mind on Statistics, Fourth Edition.
#' @keywords datasets
NA
#' Hanford Weather Station, 1984-2010
#'
#' The station is located in Hanford, WA.
#'
#'
#' @name hanford1
#' @docType data
#' @format A data frame with 27 observations on the following 2 variables.
#' \describe{
#' \item{year}{calendar year}
#' \item{temp}{average high temperature for that year.}
#' }
#' @source For more on the Hanford station, see \url{http://www.hanford.gov/page.cfm/HMS}
#' @keywords datasets
NA
#' Hanford Weather Station, 1945-2010
#'
#' The weather station is located in Hanford, WA. Note that this dataset is more complete than \code{\link{hanford1}}
#'
#'
#' @name hanford2
#' @docType data
#' @format A data frame with 66 observations on the following 2 variables.
#' \describe{
#' \item{year}{calendar year}
#' \item{temp}{average high temperature for that year.}
#' }
#' @source For more on the Hanford station, see \url{http://www.hanford.gov/page.cfm/HMS}
#' @keywords datasets
NA
#' Ricky Henderson's Career Stats
#'
#' Various year-by-year statistics for the Major League player Ricky Henderson
#'
#'
#' @name henderson
#' @docType data
#' @format A data frame with 23 observations on the following 18 variables.
#' \describe{ \item{Season}{season}
#' \item{TM}{team played for}
#' \item{G}{number of games played in}
#' \item{AB}{number of at-bats}
#' \item{R}{runs scored}
#' \item{H}{number of base hits}
#' \item{X2B}{doubles}
#' \item{X3B}{triples}
#' \item{HR}{home runs}
#' \item{RBI}{runs batted in}
#' \item{BB}{bases on balls}
#' \item{SO}{number of times struck out}
#' \item{SB}{Number of stolen bases}
#' \item{CS}{number of times caught stealing}
#' \item{AVG}{batting average}
#' \item{OBP}{on-base percentage}
#' \item{SLG}{slugging rate}
#' \item{OPS}{on-base plus slugging}
#' }
#' @source unknown (possibly Albert: Teaching Statistics With Baseball)
#' @keywords datasets
NA
#' Hall of Fame Batters
#'
#' Data on the 147 batters inducted into the Major LeagueBaseball Hall of Fame
#' as of the year 2013.
#'
#'
#' @name hofbatting
#' @docType data
#' @format A data frame with 147 observations on the following 29 variables.
#' \describe{ \item{Name}{Player name}
#' \item{Rk}{Unknown}
#' \item{Inducted}{Year inducted into Hall of Fame}
#' \item{Yrs}{Number of years planed in the Majors}
#' \item{From}{First year in the Majors}
#' \item{To}{Last year in the Majors}
#' \item{MidCareer}{Middle year of player's career}
#' \item{Era}{Era of Baseball history in which the player was (for the most part) active.
#' Values are: 19th century (before 1900), Dead Ball (1900-1919), Lively Ball (1920-1940),
#' Integration, (1941-1959), Expansion (1960-1975), Free Agency (1976-1992), Long Ball (1993 +).}
#' \item{ASG}{Number of All-Star games played}
#' \item{WAR.pos}{Wins above replacement (WAR), as defined for a position player}
#' \item{G}{Games played}
#' \item{PA}{Number of plate appearances}
#' \item{AB}{Number of times at bat}
#' \item{R}{Runs scored}
#' \item{H}{Base hits}
#' \item{X2B}{Doubles}
#' \item{X3B}{Triples}
#' \item{Triple.Rate}{Number of triples divided by number of times at bat}
#' \item{HR}{Home runs}
#' \item{HR.Rate}{Number of home runs divided by number of times at bat}
#' \item{RBI}{Runs batted in}
#' \item{SB}{Number of successful stolen base attempts}
#' \item{CS}{Number of times thrown out while attempting to steal a base}
#' \item{BB}{Base on Balls (number of times "walked")}
#' \item{SO}{Number of times struck out}
#' \item{BA}{Batting average}
#' \item{OBP}{On base percentage}
#' \item{SLG}{Slugging average}
#' \item{OPS}{OBP plus SLG}
#' }
#' @source Modified from Marchi and Albert: analyzing Baseball Data with R, CRC Press 2014.
#' @keywords datasets
NA
#' Hall of Fame Pitchers
#'
#' Data on the 70 pitchers inducted into the Major League Baseball Hall of Fame
#' as of the year 2013.
#'
#'
#' @name hofpitching
#' @docType data
#' @format A data frame with 147 observations on the following 32 variables.
#' \describe{ \item{Name}{Player name}
#' \item{Rk}{Unknown}
#' \item{Inducted}{Year inducted into Hall of Fame}
#' \item{Yrs}{Number of years planed in the Majors}
#' \item{From}{First year in the Majors}
#' \item{To}{Last year in the Majors}
#' \item{MidCareer}{Middle year of player's career}
#' \item{Era}{Era of Baseball history in which the player was (for the most part) active.
#' Values are: 19th century (before 1900), Dead Ball (1900-1919), Lively Ball (1920-1940),
#' Integration, (1941-1959), Expansion (1960-1975), Free Agency (1976-1992), Long Ball (1993 +).}
#' \item{ASG}{Number of All-Star games played}
#' \item{WAR}{Wins above replacement}
#' \item{W}{Games won}
#' \item{L}{Games lost}
#' \item{W.L.}{proportion of games won}
#' \item{ERA}{Earned run average}
#' \item{G}{Games played}
#' \item{GS}{Games started}
#' \item{GF}{Games finished}
#' \item{CG}{Complete games}
#' \item{SHO}{Shut-outs}
#' \item{SV}{Saves}
#' \item{IP}{Innings pitched}
#' \item{H}{Hits allowed}
#' \item{R}{Runs allowed}
#' \item{ER}{Earned Runs allowed}
#' \item{HR}{Home Runs allowed}
#' \item{BB}{Bases on Balls (number of "walks")}
#' \item{IBB}{Intentional walks}
#' \item{SO}{Strikeouts}
#' \item{HBP}{Hit batter with pitch (?)}
#' \item{BK}{Balks}
#' \item{WP}{Wild Pitches}
#' \item{BF}{Total batters faced}
#' }
#' @source Modified from Marchi and Albert: analyzing Baseball Data with R, CRC Press 2014.
#' @keywords datasets
NA
#' @title An Imaginary Population
#'
#' @description An imaginary population, used for instructional purposes. The variables are as follows:
#'
#' \itemize{
#' \item sex. (male, female).
#' \item math. Whether or not you were a mathematics major.
#' \item income. Your annual income, rounded to the nearest $100.
#' \item cappun. Opinion about the death penalty (favor, oppose).
#' \item height. Height in inches.
#' \item idealheight. The height you would like to be, in inches.
#' \item diff. ideal height - actual height.
#' \item kkardashtemp. Your feelings about Kim Kardashian on a 0-100 scale (0=very cold, 100=very warm).
#' }
#'
#' @docType data
#' @keywords datasets
#' @format A data frame with 10000 rows and 8 variables
#' @name imagpop
NA
#' IQs of Siblings
#'
#' IQs of pairs of siblings.
#'
#'
#' @name iqsiblings
#' @docType data
#' @format A data frame with 80 observations on the following 2 variables.
#' \describe{
#' \item{First}{IQ of the older sibling.}
#' \item{Second}{IQ of the younger sibling.}
#' }
#' @source William Harris, Georgetown College
#' @keywords datasets
NA
#' Knife or Gun?
#'
#' What will make you yell louder: being killed with a knife or being killed
#' with a gun? Results of an entirely imaginary experiment performed on very
#' strange volunteers. Members of the Knife group are killed by a knife, and
#' members of the Gun group are killed by a gun. The volume of the screams of each
#' subject during slaying is recorded. In order to ensure that the two groups
#' are similar with respect to how loud they can yell in the first place,
#' subjects are blocked by whether or not they have participated in
#' hog-hollering contests. After blocking, subjects are randomly assigned to
#' groups.
#'
#'
#' @name knifeorgunblock
#' @docType data
#' @format A data frame with 20 observations on the following 3 variables.
#' \describe{
#' \item{hoghollerer}{a factor with levels \code{no}
#' \code{yes} whether or not the subject competes in hog-hollering contests}
#' \item{means}{a factor with levels \code{gun}
#' \code{knife} means by which subject is slain}
#' \item{volume}{volume of expiring subject's cries.}
#' }
#' @source A morbid imagination.
#' @keywords datasets
NA
#' Labels and Perception of Quality
#'
#' Students in MAT 111 performed an experiment to see whether the perception of
#' the quality of peanut butter was affected by the labeling on the peanut
#' butter jar. Each subject tasted from two jars, one of which was labeled
#' Jiff, and the other of which was labeled Great Value (a cheaper brand).
#' Unknown to the subjects, both jars contained Great Value peanut butter.
#' Each subject rated the quality of the peanut butter on a scale of 1 to 10.
#'
#'
#' @name labels
#' @docType data
#' @format A data frame with 30 observations on the following 3 variables.
#' \describe{
#' \item{jiffrating}{rating subject gave to the PB in the jar with the Jiff label}
#' \item{greatvaluerating}{rating subject gave to the PB in the jar with the Great Value label}
#' \item{sex}{a factor with levels \code{female} \code{male}}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' @title Crowd Behavior at Ledge-Jumping Incidents
#'
#' @description A dataset recreated from summary data describing the relationship between weather and crowd behavior during 21 recorded
#' incidents in England, in which a (suicidal) person was contemplating jumping from a ledge or other high structure and a crowd
#' gathered to watch. The variables are as follows:
#'
#' \itemize{
#' \item weather. Warm or cool, based on the time of year when the incident occurred.
#' \item crowd.behavior. The crowd either baited the would-be jumper, or was polite.
#' }
#'
#' @docType data
#' @keywords datasets
#' @source "The baiting crowd in episodes of threatened suicide",
#' \emph{Journal of Personality and Social Psychology}, 41, 703-709. See also dataset 59 in
#' \emph{A Handbook of Small Datasets} by Hand et al. See also \url{http://www.ncbi.nlm.nih.gov/pubmed/7288565}.
#' @format A data frame with 21 rows and 2 variables
#' @name ledgejump
NA
#' @title MAT 111 Survey
#'
#' @description Results of a survey of MAT 111 students at Georgetown College.
#'
#' \itemize{
#' \item \strong{height} How tall are you, in inches?
#' \item \strong{ideal_ht} A numeric vector How tall would you
#' LIKE to be, in inches?
#' \item \strong{sleep} How much sleep did you get last night?
#' \item \strong{fastest} What is the highest speed at which
#' you have ever driven a car?
#' \item \strong{weight_feel} How do you feel about your weight?
#' \item \strong{love_first} Do you believe in love at first sight?
#' \item \strong{extra_life} Do you believe in extraterrestrial life?
#' \item \strong{seat} When you have a choice, where do you prefer
#' to sit in a classroom?
#' \item \strong{GPA} What is your college GPA?
#' \item \strong{enough_Sleep} Do you think you get enough sleep?
#' \item \strong{sex} What sex are you?
#' \item \strong{diff.ideal.act.} Your ideal height minus your actual height.
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Georgetown College, MAT 111.
#' @format A data frame with 71 rows and 12 variables
#' @name m111survey
NA
#' MAT 111 Survey, Fall 2012
#'
#' Results of a survey given at beginning of semester, to all students in MAT 111.
#'
#'
#' @name m111surveyfa12
#' @docType data
#' @format A data frame with 89 observations on the following 14 variables.
#' \describe{
#' \item{height}{Your height in inches.}
#' \item{ideal_ht}{How tall you would LIKE to be, in
#' inches.}
#' \item{sleep}{How much sleep you got last
#' night, in hours.}
#' \item{fastest}{What is the
#' highest speed at which you have ever driven a car (in mph)?}
#' \item{wt_feel}{a factor with levels \code{1_underweight}
#' \code{2_about_right} \code{3_overweight} How do you feel about your weight?}
#' \item{love_first}{a factor with levels \code{no} \code{yes} Do you
#' believe in love at first sight?}
#' \item{et_life}{a factor with levels
#' \code{no} \code{yes} Do you believe in life on other planets?}
#' \item{seat}{a factor with levels \code{1_front}
#' \code{2_middle} \code{3_back} When you have a choice, where do you prefer to
#' sit in a classroom?}
#' \item{GPA}{What is your
#' current GPA?}
#' \item{engh_slp}{a factor with levels \code{no}
#' \code{yes} Do you think you get enough sleep?}
#' \item{sex}{a factor
#' with levels \code{female} \code{male} What sex are you?}
#' \item{anchor}{a factor with levels \code{australia}
#' \code{united_states} (Anchor for the next question.) For the next question,
#' either Australia or the US, along with its population, was given in the
#' leadup information to the question. The "anchor" variable records which version of the question you were
#' given.}
#' \item{canada}{"The population of country
#' XXX is YYY million. About what is the population of Canada, in millions?" XXX was either the U.S. or Australia.}
#' \item{diff.ih.ah.}{Your ideal height minus your actual height.}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' MAT 111 Survey, Fall 2013
#'
#' Results of a survey given at beginning of semester, to all students in MAT 111.
#'
#'
#' @name m111surveyfa13
#' @docType data
#' @format A data frame with 85 observations on the following 14 variables.
#' \describe{
#' \item{height}{Your height in inches.}
#' \item{ideal_ht}{How tall you would LIKE to be, in
#' inches.}
#' \item{sleep}{How much sleep you got last
#' night, in hours.}
#' \item{fastest}{What is the
#' highest speed at which you have ever driven a car (in mph)?}
#' \item{weight_feel}{a factor with levels \code{1_underweight}
#' \code{2_about_right} \code{3_overweight} How do you feel about your weight?}
#' \item{love_first}{a factor with levels \code{no} \code{yes} Do you
#' believe in love at first sight?}
#' \item{extra_life}{a factor with levels
#' \code{no} \code{yes} Do you believe in life on other planets?}
#' \item{seat}{a factor with levels \code{1_front}
#' \code{2_middle} \code{3_back} When you have a choice, where do you prefer to
#' sit in a classroom?}
#' \item{GPA}{What is your
#' current GPA?}
#' \item{enough_Sleep}{a factor with levels \code{no}
#' \code{yes} Do you think you get enough sleep?}
#' \item{sex}{a factor
#' with levels \code{female} \code{male} What sex are you?}
#' \item{diff}{ideal height minus actual height}
#' \item{symbol}{a factor with levels \code{a}
#' \code{b} (Anchor for the next question.) For the next question,
#' either Australia or the US, along with its population, was given in the
#' leadup information to the question. The "anchor" variable records which version of the question you were
#' given. If "a", the population of Australia was given. If "b", the U.S. population was given.}
#' \item{pop_Canada}{"The population of country
#' XXX is YYY million. About what is the population of Canada, in millions?" XXX was either the U.S. or Australia.}
#' }
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' @title Music and Reading Comprehension
#'
#' @description An experiment performed by a student at Georgetown College. Forty-four subjects were randomized into four
#' groups. All subjects read an article; one group read in a silent environment, while the other three groups
#' heard each three different genres of music. Each subject took a reading comprehension test afterward.
#'
#' \itemize{
#' \item{\code{sex}} {a factor with levels \code{Female} \code{Male}}
#' \item{\code{year}} {class rank of subject}
#' \item{\code{type}} {type of music subject listened to while reading}
#' \item{\code{score}} {number of questions correct on reading comprehension test}
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Matt Doolin, MAT 111 at Georgetown College
#' @format A data frame with 44 observations on 4 variables.
#' @name music
NA
#' Napkin Use
#'
#' Students at GC observed their fellow students in the Cafe at lunch.
#'
#'
#' @name napkins
#' @docType data
#' @format A data frame with 86 observations on the following 2 variables.
#' \describe{ \item{napkins}{number of napkins used
#' by the subject during the meal.} \item{sex}{a factor with levels
#' \code{female} \code{male} Sex of the person being observed}}
#' @source MAT 111 at Georgetown College
#' @keywords datasets
NA
#' Non-Response to Surveys
#'
#' Results of a study on non-response to a mail survey. Subjects were residents of Denmark.
#'
#'
#' @name nonresponse
#' @docType data
#' @format A data frame with 4229 observations on the following 3 variables.
#' \describe{ \item{residence}{where the subject resides: either in Copenhagen,
#' a city outside of Copenhagen, or
#' in the countryside}
#' \item{gender}{sex of the subject}
#' \item{response}{Whether or not the subject responded to the mail survey}}
#' @source Rebuilt from a contingency table in E. B. Andersen (1991),
#' The Statistical Analysis of Categorical Data, Second Edition. Springer-Verlag,
#' Berlin. Table found in
#' package \code{vcd}.
#' @keywords datasets
NA
#' Nicotine Withdrawal and Accidents
#'
#' Results of study conducted in Great Britain to see if nicotine withdrawal increases the risk of an accident.
#'
#'
#' @name nosmokeday
#' @docType data
#' @format A data frame with 10 observations on the following 3 variables.
#' \describe{ \item{year}{calendar year}
#' \item{Injuries.before.NSD}{number of injury accidents on the day one week prior to
#' National No Smoke Day in the United Kingdom}
#' \item{Injuries.on.NSD}{number of injury accidents on
#' National No Smoke Day in the United Kingdom}}
#' @source J. Knowles, "Nicotine withdrawal and road accidents", Science, 400, 128, (8 July 1999).
#' Found in Whitlock and Schluter, The Analysis of Biological Data.
#' @keywords datasets
NA
#' Old Faithful
#'
#' Old faithful geyser at Yellowstone Park.
#'
#'
#' @name oldfaithful
#' @docType data
#' @format A data frame with 299 observations on the following 2 variables.
#' \describe{
#' \item{Duration}{duration of eruption, in minutes}
#' \item{TimeNext}{time until the next eruption, in minutes}
#' }
#' @source Unknown
#' @keywords datasets
NA
#' Ovarian Cancer Study
#'
#' Results of a retrospective study, conducted in 1973,
#' on 299 women who been surgically treated for ovarian cancer 10 years before.
#'
#'
#' @name Ovarian
#' @docType data
#' @format A data frame with 299 observations on the following 4 variables.
#' \describe{ \item{stage}{factor indicating the stage of the cancer at the time of operation (early, advanced)}
#' \item{operation}{factor indicating the amount of tissue removed during surgery (radical,limited)}
#' \item{survival}{whether or not the subject was still alive after ten years (yes,no)}
#' \item{xray}{factor indicating whether or not the subject also received x-ray treatments (yes,no)}}
#' @source Rebuilt from a contingency table in E. B. Andersen (1991),
#' The Statistical Analysis of Categorical Data. 2nd edition. Springer-Verlag, Berlin. Table found in
#' package \code{vcd}.
#' @keywords datasets
NA
#' Penn State #1
#'
#' A study of students at Penn State University.
#'
#'
#' @name pennstate1
#' @docType data
#' @format A data frame with 190 observations on the following 9 variables.
#' \describe{
#' \item{Sex}{a factor with levels \code{F} \code{M}}
#' \item{HrsSleep}{how many hours of sleep the subject gets per night}
#' \item{SQpick}{a factor
#' with levels \code{Q} \code{S}. Each subject was presented with two letters
#' (S and Q), and asked to pick one. This variable indicates which letter the
#' subject picked.}
#' \item{Height}{height in inches}
#' \item{RandNumb}{a numeric vector: Each subject was asked to choose
#' randomly an integer from 1 to 10.}
#' \item{Fastest}{highest speed, in mph, at which subject has ever driven a car}
#' \item{RtSpan}{span of the right hand, in
#' centimeters.}
#' \item{LftSpan}{span of the left
#' hand, in centimeters.}
#' \item{Form}{a factor with levels \code{QorS}
#' \code{SorQ}. The order of presentation of the S and Q options to the
#' subject varied from one survey form to another. This variable indicates
#' which letter was presented first on the form.}
#' }
#' @source Uts and Heckard, Mind on Statistics, Fourth Edition.
#' @keywords datasets
NA
#' Push-ups by Football Players at Georgetown College
#'
#' Two football players at GC asked their team-mates to do as many push-ups as
#' they could in two minutes.
#'
#'
#' @name pushups
#' @docType data
#' @format A data frame with 30 observations on the following 3 variables.
#' \describe{
#' \item{weight}{weight of subject in
#' pounds.}
#' \item{pushups}{number of push-ups
#' completed.}
#' \item{position}{a factor with levels \code{LINE}
#' \code{SKILL}: type of position played by the subject. Line positions
#' require high body mass, skill positions require a lot of running around.}
#' }
#' @source MAT 111, Georgetown College
#' @keywords datasets
NA
#'Effect of Soil Salinity on Plant Growth
#'
#' @description Result of an experiment conducted to investigate the effect of salinity level in soil
#' on the growth of plants.
#'
#' @details From the source (see below): "Experimental
#' fields of land were located at an agricultural field station, and each field was divided into six smaller
#' plots. Each of the smaller plots was treated with a different amount of salt (measured in ppm) and the
#' biomass at the end of the experiment was recorded."
#'
#'
#' @name saltmarsh
#' @docType data
#' @format A data frame with 24 observations on the following 3 variables.
#' \describe{ \item{salt}{amount of salt applied to the plot (in parts per million)}
#' \item{biomass}{total biomass of plot at the end of the study period (units unknown)}
#' \item{block}{field in which the plot was located} }
#' @source The Course Notes of Carl Schwarz, Simon Fraser University:
#' \url{http://people.stat.sfu.ca/~cschwarz/CourseNotes/}
#' @keywords datasets
NA
#' @title SAT Scores
#'
#' @description SAT scores by state. The variables are as follows:
#'
#' \itemize{
#' \item state. A state in the U.S.
#' \item expend. Mean annual expenditure per student (in 1000$).
#' \item ratio. Mean student-teacher ratio.
#' \item salary. Mean annual teacher salary.
#' \item frac. Percentage of students in the state who take the SAT.
#' \item verbal. Mean SAT Verbal score for the state.
#' \item math. Mean SAT Math score for the state.
#' \item sat. Sum of mean Verbal and mean Math.
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Deborah Lynn Guber, "Getting what you pay for: the debate over equity
#' in public school expenditures" (1999), \emph{Journal of Statistics Education} 7(2).
#' @format A data frame with 50 rows and 8 variables
#' @name sat
NA
#' Weddell Seal Oxygen Consumption
#'
#' Results of an experiment conducted on ten Weddell seals.
#'
#'
#' @name sealsO2
#' @docType data
#' @format A data frame with 10 observations on the following 2 variables.
#' \describe{ \item{o2.nonfeeding}{Oxygen consumption during recovery time after a dive during which
#' no plankton was consumed by the seal, in ml of O2 per kilogram of weight}
#' \item{o2.feeding}{Oxygen consumption during recovery time after a dive during which
#' plankton was consumed, in ml of O2 per kilogram of weight}}
#' @source Williams, T. M., L. A. Fuiman, M. Horning, and R. W. Davis. 2004.
#' The Journal of Experimental Biology 207: 973 to 982. \url{http://jeb.biologists.org/content/207/6/973.full}
#' @keywords datasets
NA
#' A Small Experiment
#'
#' Subjects in a hypothetical experiment
#'
#'
#' @name SmallExp
#' @docType data
#' @format A data frame with 16 observations on the following 3 variables.
#' \describe{ \item{name}{name of the subject}
#' \item{sex}{sex of the subject}
#' \item{athlete}{whether or not the subject is an athlete} }
#' @keywords datasets
NA
#' Larvae on Stumps
#'
#' Biologists were interested in whether beetles prefer areas where beavers have cut down cottonwood trees.
#' (The tree-stumps produce tender green shoots that beetles are thought to like.) 23 circular plots, all of equal area,
#' were studied. For each plot the researchers counted the number of cottonwood stumps, and also the number of
#' clusters of beetle larvae found in the plot.
#'
#'
#' @name stumps
#' @docType data
#' @format A data frame with 23 observations on the following 2 variables.
#' \describe{
#' \item{stumps}{number of stumps in the plot}
#' \item{larvae}{number of larvae clusters in the plot}
#' }
#' @source Basic Practice of Statistics, by Moore and McCabe.
#' @keywords datasets
NA
#'@title Temperature in U.S. Cities
#'
#' @description Average temperatures for cities in the United States.
#'
#' \itemize{
#' \item{\code{city}} {Name of the city}
#' \item{\code{latitude}} {latitude of the city, in degrees north of the Equator}
#' \item{\code{JanTemp}} {mean temperature of the city in January.}
#' \item{\code{AprTemp}} {mean temperature of the city in April.}
#' \item{\code{AugTemp}} {mean temperature of the city in August.}
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Mind on Statistics, Fourth Edition, Uts and Heckard.
#' @format A data frame with 20 observations on 5 variables.
#' @name temperature
NA
#' @title Restaurant Tips
#'
#' @description A waiter recorded all tips earned during a 2.5 month period in early 1990, along with other
#' information about the customers who gave the tips. The variables are as follows:
#'
#' \itemize{
#' \item {obs} {Observation number of event}
#' \item {totbill} {The total bill}
#' \item {tip} {Amount of the tip}
#' \item {sex} {Sex of tipper (F or M)}
#' \item {smoker} {Whether the tipper smokes: Yes or No}
#' \item {day} {Day of the week}
#' \item {time} {Whether the meal was during the Day or the Night}
#' \item {size} {Number of people in the dining party}
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Bryant, P. G. and Smith, M. A.(1995), Practical Data Analysis: Case
#` Studies in Business Statistics, Richard D. Irwin Publishing, Homewood, IL.
#' Found in Interactive and Dynamic Graphics for Data Analysis:
#' With Examples Using R and GGobi, Dianne Cook and Deborah F. Swayne.
#' See also \url{http://www.ggobi.org/book/}.
#' @format A data frame with 244 rows and 8 variables
#' @name tips
NA
#'@title Tornado Damage
#'
#' @description Tornado damage in the U.S., by state. Also includes Puerto Rico.
#'
#' \itemize{
#' \item{\code{state}} {the state}
#' \item{\code{damage}} {mean annual damage from tornadoes, over a five-year period, in millions of dollars}
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Moore and McCabe, The Basic Practice of Statistics.
#' @format A data frame with 51 observations on 2 variables.
#' @name tornado
NA
#' @title UC Davis #1
#'
#' @description Results of a survey of students at UC-Davis.
#'
#' \itemize{
#' \item{\code{Sex}}{ a factor with levels \code{Female} \code{Male}}
#' \item{\code{TV}}{ Number of hours spent watching TV per week}
#' \item{\code{computer}}{ number of hours spent on computer per week}
#' \item{\code{Sleep}}{ hours of sleep per night}
#' \item{\code{Seat}}{ a factor with levels \code{Back} \code{Front} \code{Middle} Where do you prefer to sit in class,
#' when you have a choice?}
#' \item{\code{alcohol}}{ number of alcoholic drinks consumed per week}
#' \item{\code{Height}}{ height in inches}
#' \item{\code{momheight}}{ height of mother, in inches}
#' \item{\code{dadheight}}{ height of father, in inches}
#' \item{\code{exercise}}{ number of hours of exercise per week}
#' \item{\code{GPA}}{ grade point average}
#' \item{\code{class}}{ a factor with levels \code{LibArts} \code{NonLib} Student Category: liberal arts or
#' not}
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Mind on Statistics, Fourth Edition, Uts and Heckard.
#' @format A data frame with 173 observations on 12 variables..
#' @name ucdavis1
NA
#' Justin Verlander
#'
#' PITCHf/x data on Justin Verlander, winner of the 2011 Cy Young Award. Covers his
#' 2009 through 2012 seasons.
#'
#'
#' @name verlander
#' @docType data
#' @format A data frame with 15307 observations on the following 12 variables. Each observation
#' is a single pitch.
#' \describe{ \item{season}{The year of play}
#' \item{gamedate}{Date of the game in which the pitch was thrown}
#' \item{pitch_type}{Type of pitch thrown: CH (Change-up), CU (Curveball), FF (Four-Seam Fastball),
#' FT (Two-Seam Fastball), SL (Slider). Pitch type is determined by computer algorithm.}
#' \item{balls}{Current ball count}
#' \item{strikes}{Current strike count}
#' \item{pitches}{number of pitches previously thrown in the game}
#' \item{speed}{speed of pitch (in mph). (When crossing plate?)}
#' \item{px}{x-coordinate of pitch (in feet, measured from center of plate)}
#' \item{pz}{vertical coordinate of pitch (in feet above plate)}
#' \item{pfx_x}{the horizontal movement, in inches, of the pitch between the release point and
#' home plate, as compared to a theoretical pitch thrown at the same speed with no spin-induced
#' movement. Measured at 40 feet from home plate.}
#' \item{pfx_z}{the vertical movement, in inches, of the pitch between the release point and
#' home plate, as compared to a theoretical pitch thrown at the same speed with no spin-induced
#' movement. Measured at 40 feet from home plate.}
#' \item{batter_hand}{A factor with two values: L (left) and R (right).}
#' }
#' @source Marchi and Albert: analyzing Baseball Data with R, CRC Press 2014. For
#' more on the PITCHf/x system, see \url{http://en.wikipedia.org/wiki/PITCHf/x}.
#'
#' @keywords datasets
NA
#' @title Youth Risk 2003
#'
#' @description A Study of Risky Behaviors in High School Seniors, from year 2003.
#'
#' \itemize{
#' \item \code{Sex}
#' \item \code{Grades} Typical grades you earn in school
#' \item \code{WtAction} What do you plan to do about your weight?
#' \item \code{Seatbelt} How often do you wear a seat-belt?
#' \item \code{Sunscreen} How much do you wear sunscreen?
#' \item \code{Grades_1} Same as grades, but with some groups combined
#' \item \code{Sun_1} Same as sunscreen, but with some groups combined
#' }
#'
#' @docType data
#' @keywords datasets
#' @source Mind on Statistics, Fourth Edition, by Uts and Heckard.
#' @format A data frame with 3042 observations on 7 variables.
#' @name youthrisk03
NA
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.