| SardiniaHotels | R Documentation |
This data set contains information and satisfaction scores appearing on the TripAdvisor website between the years 2008 and 2016 regarding hotels in Sardinia, Italy.
The satisfaction data refer to the reputation of hotel located along Sardinian coasts, as expressed by clients with respect to different services (e.g., breakfast, restaurant, swimming pool) offered by the hotel.
data(SardiniaHotels)
A data frame with 518 rows and 43 columns (variables). Each row refers to a single hotel.
The following variables are included in the dataset:
municipalitya factor, the municipality where the hotel is located.
starsan ordered factor with levels:
1OR2stars for 1 star or 2 star hotels,
3stars 3 star hotels,
residence,
4stars, 4 star hotels,
5starsORresort, 5 star hotels or resorts.
areaa factor with levels related to the area of the Sardinian coast where each single hotel is located:
AlgheroSassari,
CagliariVillasimius, CostaSmeralda,
DorgaliOrosei, Gallura, NurraAnglona,
Ogliastra, Olbia, OristanoBosa,
PulaChia, Sarrabus,
Sulcis.
seaLocationa factor with levels
yes (if the hotel is located close to the sea)
and no (otherwise).
excellenta numeric vector, the number of people that expressed the highest level of satisfaction.
gooda numeric vector, the number of people that expressed a good level of satisfaction.
averagea numeric vector, the number of people that expressed an average level of satisfaction.
bada numeric vector, the number of people that expressed a bad level of satisfaction.
poora numeric vector, the number of people that expressed the lowest level of satisfaction.
familya numeric vector, the number of people travelling with family.
couplea numeric vector, the number of people travelling with their partner.
singlea numeric vector, the number of people travelling alone.
businessa numeric vector, the number of people travelling for work.
MarMaya numeric vector, the number of people travelling during the period March to May.
JunAuga numeric vector, the number of people travelling during the period June to August.
SepNova numeric vector, the number of people travelling during the period September to November.
DecFeba numeric vector, the number of people travelling during the period December to February.
locationa numeric vector, the satisfaction score expressed by tourists towards the location.
sleepQualitya numeric vector, the satisfaction score expressed by tourists towards the sleep quality.
rooma numeric vector, the satisfaction score expressed by tourists towards the comfort and quality of the room.
servicesa numeric vector, the satisfaction score expressed by tourists towards the quality of the services.
priceQualityRatea numeric vector, the satisfaction score expressed by tourists towards ratio between price and quality.
cleaninga numeric vector, the satisfaction score expressed by tourists towards level of room and hotel cleaning.
bt1a factor with levels breakfast,
cleaning, location, overall,
price, restaurant, room,
services, staff, structure and
Wi-Fi.
It expresses the 1st most used word in reviews for a hotel.
ratebt1a factor with levels -1 (if the
satisfaction score espressed in bt1 is prevalently negative)
and 1 (if the satisfaction score espressed in bt1
is prevalently positive).
bt2a factor with levels breakfast,
cleaning, location, overall,
price, restaurant, room,
services, staff, structure and
Wi-Fi.
It expresses the 2nd most used word in reviews for a hotel.
ratebt2a factor with levels -1 (if the
satisfaction score espressed in bt2 is prevalently negative)
and 1 (if the satisfaction score espressed in bt2
is prevalently positive).
bt3 similar to bt1 and bt2,
but with a corresponding different ranking.
bt4 similar to bt1 and bt2,
but with a corresponding different ranking.
bt5 similar to bt1 and bt2,
but with a corresponding different ranking.
bt6 similar to bt1 and bt2,
but with a corresponding different ranking.
bt7 similar to bt1 and bt2,
but with a corresponding different ranking.
bt8 similar to bt1 and bt2,
but with a corresponding different ranking.
bt9 similar to bt1 and bt2,
but with a corresponding different ranking.
bt10 similar to bt1 and bt2,
but with a corresponding different ranking.
ratebt3similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt4similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt5similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt6similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt7similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt8similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt9similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
ratebt10similar
to ratebt1 and ratebt2, but with a
corresponding different ranking.
These data were manually collected during March–June 2016 by students of the class of "Statistics for Turism" at the University of Cagliari, Italy (Bachelor's degree in Tourism Economics and Managment), under the supervision of Prof. Claudio Conversano and Dr. Giulia Contu.
Many of the variables fall into several natural groups, e.g.,
[municipality,
stars,
area,
seaLocation];
[excellent,
good,
average,
bad,
poor];
[MarMay,
JunAug,
SepNov,
DecFeb];
[family,
couple,
single,
business];
[location,...cleaning];
[bt1,...bt10];
[ratebt1,...ratebt10].
TripAdvisor,
https://www.tripadvisor.it/.
data(SardiniaHotels)
summary(SardiniaHotels)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.