library("ggplot2")
library("pander")
ggAggHist <- getFromNamespace("ggAggHist", "dataMaid")
ggAggBarplot <- getFromNamespace("ggAggBarplot", "dataMaid")

Data report overview

The dataset examined has the following dimensions:


Feature Result


Number of observations 250

Number of variables 21

Checks performed

The following variable checks were performed, depending on the data type of each variable:


  character factor labelled haven labelled numeric integer logical Date


Identify miscoded missing values $\times$ $\times$ $\times$ $\times$ $\times$ $\times$ $\times$

Identify prefixed and suffixed whitespace $\times$ $\times$ $\times$ $\times$

Identify levels with < 6 obs. $\times$ $\times$ $\times$ $\times$

Identify case issues $\times$ $\times$ $\times$ $\times$

Identify misclassified numeric or integer variables $\times$ $\times$ $\times$ $\times$

Identify outliers $\times$ $\times$ $\times$

Please note that all numerical values in the following have been rounded to 2 decimals.

Summary table


  Variable class # unique values Missing observations Any problems?


[ID] character 250 0.00 % $\times$

[Name] character 250 0.40 % $\times$

[Sex] character 3 0.40 %

[Age] numeric 50 0.40 %

[Race] character 7 0.40 % $\times$

[PreinvasiveComponent] character 3 0.40 %

[LVI] character 2 0.00 %

[PNI] character 3 0.40 %

[LastFollowUpDate] POSIXct 13 0.40 % $\times$

[Death] logical 3 0.40 %

[Group] character 3 0.40 %

[Grade] character 4 0.40 %

[TStage] character 5 0.40 %

[Anti-X-intensity] numeric 4 0.40 %

[Anti-Y-intensity] numeric 4 0.40 %

[LymphNodeMetastasis] character 3 0.40 %

[Valid] logical 3 0.40 %

[Smoker] logical 3 0.40 %

[Grade_Level] character 4 0.40 %

[SurgeryDate] POSIXct 224 0.40 % $\times$

[DeathTime] character 2 0.00 %

Variable list

ID

\fullline

Name

\fullline

Sex

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "Male"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Female", 
"Male"), class = "factor"), y = c(122L, 127L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "Sex")

\emini

\fullline

Age

\bminione


Feature Result


Variable type numeric

Number of missing obs. 1 (0.4 %)

Number of unique values 49

Median 47

1st and 3rd quartiles 37; 61

Min. and max. 25; 73

\emini \bminitwo

ggAggHist(data = structure(list(xmin = c(25L, 30L, 35L, 40L, 
45L, 50L, 55L, 60L, 65L, 70L), xmax = c(30L, 35L, 40L, 45L, 50L, 
55L, 60L, 65L, 70L, 75L), ymin = c(0, 0, 0, 0, 0, 0, 0, 0, 0, 
0), ymax = c(29L, 28L, 31L, 29L, 23L, 24L, 21L, 24L, 24L, 16L
)), class = "data.frame", row.names = c(NA, -10L)), vnam = "Age")

\emini

\fullline

Race

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 6

Mode "White"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:6, .Label = c("Asian", 
"Bi-Racial", "Black", "Hispanic", "Native", "White"), class = "factor"), 
    y = c(13L, 2L, 35L, 36L, 1L, 162L)), class = "data.frame", row.names = c(NA, 
-6L)), vnam = "Race")

\emini

\fullline

PreinvasiveComponent

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "Absent"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Absent", 
"Present"), class = "factor"), y = c(195L, 54L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "PreinvasiveComponent")

\emini

\fullline

LVI

\bminione


Feature Result


Variable type character

Number of missing obs. 0 (0 %)

Number of unique values 2

Mode "Absent"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Absent", 
"Present"), class = "factor"), y = c(156L, 94L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "LVI")

\emini

\fullline

PNI

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "Absent"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Absent", 
"Present"), class = "factor"), y = c(170L, 79L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "PNI")

\emini

\fullline

LastFollowUpDate

\fullline

Death

\bminione


Feature Result


Variable type logical

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "TRUE"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("FALSE", 
"TRUE"), class = "factor"), y = c(72L, 177L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "Death")

\emini

\fullline

Group

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "Treatment"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Control", 
"Treatment"), class = "factor"), y = c(119L, 130L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "Group")

\emini

\fullline

Grade

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 3

Mode "3"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:3, .Label = c("1", 
"2", "3"), class = "factor"), y = c(65L, 70L, 114L)), class = "data.frame", row.names = c(NA, 
-3L)), vnam = "Grade")

\emini

\fullline

TStage

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 4

Mode "4"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:4, .Label = c("1", 
"2", "3", "4"), class = "factor"), y = c(18L, 48L, 67L, 116L)), class = "data.frame", row.names = c(NA, 
-4L)), vnam = "TStage")

\emini

\fullline

Anti-X-intensity

\bminione


Feature Result


Variable type numeric

Number of missing obs. 1 (0.4 %)

Number of unique values 3

Mode "3"

Reference category 1

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:3, .Label = c("1", 
"2", "3"), class = "factor"), y = c(32L, 98L, 119L)), class = "data.frame", row.names = c(NA, 
-3L)), vnam = "Anti-X-intensity")

\emini

\fullline

Anti-Y-intensity

\bminione


Feature Result


Variable type numeric

Number of missing obs. 1 (0.4 %)

Number of unique values 3

Mode "2"

Reference category 1

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:3, .Label = c("1", 
"2", "3"), class = "factor"), y = c(74L, 105L, 70L)), class = "data.frame", row.names = c(NA, 
-3L)), vnam = "Anti-Y-intensity")

\emini

\fullline

LymphNodeMetastasis

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "Absent"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("Absent", 
"Present"), class = "factor"), y = c(155L, 94L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "LymphNodeMetastasis")

\emini

\fullline

Valid

\bminione


Feature Result


Variable type logical

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "FALSE"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("FALSE", 
"TRUE"), class = "factor"), y = c(131L, 118L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "Valid")

\emini

\fullline

Smoker

\bminione


Feature Result


Variable type logical

Number of missing obs. 1 (0.4 %)

Number of unique values 2

Mode "FALSE"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("FALSE", 
"TRUE"), class = "factor"), y = c(141L, 108L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "Smoker")

\emini

\fullline

Grade_Level

\bminione


Feature Result


Variable type character

Number of missing obs. 1 (0.4 %)

Number of unique values 3

Mode "high"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:3, .Label = c("high", 
"low", "moderate"), class = "factor"), y = c(99L, 75L, 75L)), class = "data.frame", row.names = c(NA, 
-3L)), vnam = "Grade_Level")

\emini

\fullline

SurgeryDate

\fullline

DeathTime

\bminione


Feature Result


Variable type character

Number of missing obs. 0 (0 %)

Number of unique values 2

Mode "Within1Year"

\emini \bminitwo

ggAggBarplot(data = structure(list(x = structure(1:2, .Label = c("MoreThan1Year", 
"Within1Year"), class = "factor"), y = c(101L, 149L)), class = "data.frame", row.names = c(NA, 
-2L)), vnam = "DeathTime")

\emini

\fullline

Report generation information:



sbalci/histopathR documentation built on Nov. 12, 2024, 12:28 p.m.