An Experimental Survey Measuring Plagiarism Using the Crosswise Model


A dataset containing the reponses to sensitive questions about plagiarism and other attributes of 812 students. The crosswise model (CM) and direct questioning (DQ) were utilized to gather the data. Each row holds the response to one question for one student. The variables are as follows:




A data frame with 812 rows and 24 variables


  • id. identification code of the student

  • question. which question was asked (1 and 3: Partial Plagiarism, 2 and 4: Severe Plagiarism)

  • gender. gender of the student (0: male, 1: female)

  • age. age in years

  • nationality. nationality of the student (0: German or Swiss, 1: other)

  • no_papers. number of papers

  • uni. location of data collection (1: ETH Zurich, 2: LMU Munich, 3: University Leipzig)

  • course. course in which the data was collected

  • Aspired_Degree. aspired degree of the student

  • Semester. semesters enrolled

  • ur_none. used resources: none

  • ur_books. used resources: books

  • ur_art. used resources: articles

  • ur_int. used resources: internet

  • ur_fsp. used resources: fellow students' papers

  • ur_other. used resources: other

  • preading. proofreading

  • gradesf. satisfaction with grades

  • pp. Plagiarism indicator (0: Severe Plagiarism, 1: Partial Plagiarism)

  • RR. Randomized Response indicator (0: DQ, 1: Crosswise)

  • RRp1. Randomized Response parameter p1

  • RRp2. Randomized Response parameter p2

  • RRmodel. Randomized Response Model


Ben Jann and Laurcence Brandenberger


Want to suggest features or report bugs for Use the GitHub issue tracker. Vote for new features on Trello.

comments powered by Disqus