library(ggplot2)
library(pander)
ggAggHist <- getFromNamespace("ggAggHist", "dataMaid")
ggAggBarplot <- getFromNamespace("ggAggBarplot", "dataMaid")

Data report overview

The dataset examined has the following dimensions:


Feature Result


Number of observations 200

Number of variables 11

Checks performed

The following variable checks were performed, depending on the data type of each variable:


  character factor labelled numeric integer logical Date


Identify miscoded missing values × × × × × ×

Identify prefixed and suffixed whitespace × × ×

Identify levels with < 6 obs. × × ×

Identify case issues × × ×

Identify misclassified numeric or integer variables × × ×

Identify non-trailing colons × × ×

Identify outliers × × ×

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table


  Variable class # unique values Missing observations Any problems?


[ArtistID] character 179 0.00 % ×

[ArtistName] character 179 0.00 % ×

[NoOfMiddlenames] numeric 4 0.00 % ×

[Title] character 200 0.00 % ×

[Year] integer 149 0.00 % ×

[Location] character 98 1.50 % ×

[Continent] factor 3 0.00 % ×

[Width] numeric 164 5.00 % ×

[Height] numeric 165 5.00 % ×

[Media] character 28 5.00 % ×

[Movement] character 86 9.00 % ×

Variable list

ArtistID


ArtistName

--------------------------------------------- Feature Result ------------------------- ------------------- Variable type character Number of missing obs. 0 (0 %) Number of unique values 179 Mode "Diego Velazquez" No. zeros 0 ---------------------------------------------
wzxhzdk:2

NoOfMiddlenames

----------------------------------- Feature Result ------------------------- --------- Variable type numeric Number of missing obs. 0 (0 %) Number of unique values 4 Mode "0" No. zeros 157 -----------------------------------
wzxhzdk:3

Title


Year

----------------------------------------- Feature Result ------------------------- --------------- Variable type integer Number of missing obs. 0 (0 %) Number of unique values 149 Median 1851.5 1st and 3rd quartiles 1627.75; 1914 Min. and max. 1150; 1968 No. zeros 0 Mean 1765.73 -----------------------------------------
wzxhzdk:4

Location

---------------------------------------------- Feature Result ------------------------- -------------------- Variable type character Number of missing obs. 3 (1.5 %) Number of unique values 97 Mode "National Gallery" No. zeros 0 ----------------------------------------------
wzxhzdk:5

Continent

------------------------------------ Feature Result ------------------------- ---------- Variable type factor Number of missing obs. 0 (0 %) Number of unique values 3 Mode "Europe" No. zeros 0 ------------------------------------
wzxhzdk:6

Width

------------------------------------- Feature Result ------------------------- ----------- Variable type numeric Number of missing obs. 10 (5 %) Number of unique values 163 Median 122.45 1st and 3rd quartiles 77; 198.3 Min. and max. 10.7; 990 No. zeros 0 Mean 168.46 -------------------------------------
wzxhzdk:7

Height

----------------------------------------- Feature Result ------------------------- --------------- Variable type numeric Number of missing obs. 10 (5 %) Number of unique values 164 Median 113 1st and 3rd quartiles 73.12; 168.75 Min. and max. 12.3; 666 No. zeros 0 Mean 134.51 -----------------------------------------
wzxhzdk:8

Media

--------------------------------------- Feature Result ------------------------- ------------- Variable type character Number of missing obs. 10 (5 %) Number of unique values 27 Mode "oil paint" No. zeros 0 ---------------------------------------
wzxhzdk:9

Movement

------------------------------------- Feature Result ------------------------- ----------- Variable type character Number of missing obs. 18 (9 %) Number of unique values 85 Mode "Baroque" No. zeros 0 -------------------------------------
wzxhzdk:10

Report generation information:



ekstroem/dataMaid documentation built on Jan. 31, 2022, 9:10 a.m.