It shows basic statistics for each characteristic in a data frame. The report includes:
Field: Field name.
Type: Factor, numeric, integer, other.
Recs: Number of records.
Miss: Number of missing records.
Unique: Number of unique values.
Min: Minimum value.
Q25: First quartile. It splits off the lowest 25% of data from the highest 75%.
Q50: Median or second quartile. It cuts the data set in half.
Avg: Average value.
Q75: Third quartile. It splits off the lowest 75% of data from the highest 25%.
Max: Maximum value.
StDv: Standard deviation of a sample.
Neg: Number of negative values.
Zero: Number of zero values.
Pos: Number of positive values.
OutLo: Number of outliers. Records below Q25-1.5*IQR, where IQR=Q75-Q25.
OutHi: Number of outliers. Records above Q75+1.5*IQR, where IQR=Q75-Q25.
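The outlier rule above can be sketched in base R. This is only an illustration: `count_outliers` is a hypothetical helper, not part of smbinning, and it assumes R's default quantile method, which may differ from what the package uses internally.

```r
# Hypothetical helper (not part of smbinning): count values outside
# the 1.5*IQR fences described for OutLo and OutHi.
count_outliers <- function(x) {
  x <- x[!is.na(x)]                                  # ignore missing records
  q <- quantile(x, probs = c(0.25, 0.75), names = FALSE)
  iqr <- q[2] - q[1]                                 # IQR = Q75 - Q25
  lower <- q[1] - 1.5 * iqr                          # fence for OutLo
  upper <- q[2] + 1.5 * iqr                          # fence for OutHi
  c(OutLo = sum(x < lower), OutHi = sum(x > upper))
}

x <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 100, NA)
count_outliers(x)  # one high outlier (100), no low outliers
```

Note that when Q25 equals Q75 the IQR is zero, so every value strictly below or above that common quartile is counted as an outlier.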
smbinning.eda(df, rounding = 3, pbar = 1)
df: A data frame.
rounding: Optional. Number of decimal places shown in the output table. Default is 3.
pbar: Optional. Turns the progress bar on or off. Default is 1 (on).
The command smbinning.eda generates two data frames, returned as $eda and $edapct. The first lists each characteristic with basic statistics such as extreme values and quartiles; the second lists percentages of missing values and outliers, among others.
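Because both tables come from the same call, it may be convenient to store the result once and then pick out each data frame. The sketch below assumes that `pbar = 0` switches the progress bar off (the documentation only says the parameter turns it on or off, with default 1); the names `res`, `eda` and `edapct` are illustrative.

```r
library(smbinning)  # provides smbinning.eda and the sample data smbsimdf1

# Run the analysis once; pbar = 0 is assumed to suppress the progress bar
res <- smbinning.eda(smbsimdf1, rounding = 3, pbar = 0)

eda    <- res$eda     # table with basic statistics
edapct <- res$edapct  # table with basic percentages

# Example follow-up: list the fields that have missing records
eda[eda$Miss > 0, c("Field", "Type", "Miss")]
```

Storing the result avoids computing the statistics twice, which is what happens when the function is called separately for each table.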
# Load library and its dataset
library(smbinning) # Load package and its data
# Example: Exploratory data analysis of dataset
smbinning.eda(smbsimdf1,rounding=3)$eda # Table with basic statistics
smbinning.eda(smbsimdf1,rounding=3)$edapct # Table with basic percentages
Field Type Recs Miss Unique Min Q25 Q50 Avg
1 fgood Num/Int 2500 0 2 0.000 1.000 1.000 0.800
2 cbs1 Num/Int 2500 256 1767 11.000 44.078 52.415 52.419
3 cbs2 Num/Int 2500 276 1754 9.540 43.638 51.790 51.822
4 cbinq Factor 2500 0 3 NA NA NA NA
5 cbline Num/Int 2500 0 6 0.000 1.000 2.000 1.742
6 cbterm Factor 2500 0 3 NA NA NA NA
7 cblineut Num/Int 2500 0 2500 0.000 34.197 42.565 42.814
8 cbtob Num/Int 2500 0 6 3.000 5.000 6.000 5.624
9 cbdpd Factor 2500 0 2 NA NA NA NA
10 cbnew Factor 2500 0 2 NA NA NA NA
11 pmt Factor 2500 0 3 NA NA NA NA
12 tob Num/Int 2500 0 6 0.000 2.000 3.000 2.793
13 dpd Factor 2500 0 3 NA NA NA NA
14 dep Num/Int 2500 0 2498 1598.290 10013.923 12106.595 12076.991
15 dc Num/Int 2500 0 39 4.000 19.000 22.000 22.098
16 od Factor 2500 0 3 NA NA NA NA
17 home Factor 2500 0 2 NA NA NA NA
18 inc Factor 2500 220 10 NA NA NA NA
19 dd Factor 2500 0 3 NA NA NA NA
20 online Factor 2500 0 2 NA NA NA NA
21 rnd Num/Int 2500 0 2500 0.001 0.247 0.500 0.498
22 period Other 2500 0 10 NA NA NA NA
Q75 Max StDv Neg Zero Pos OutLo OutHi
1 1.000 1.000 0.400 0 500 2000 500 0
2 60.970 90.910 12.541 0 0 2244 7 4
3 60.190 98.060 12.363 0 0 2224 8 6
4 NA NA NA NA NA NA NA NA
5 2.000 5.000 1.162 0 388 2112 0 173
6 NA NA NA NA NA NA NA NA
7 51.663 91.597 13.266 0 1 2499 18 12
8 7.000 8.000 1.309 0 0 2500 0 0
9 NA NA NA NA NA NA NA NA
10 NA NA NA NA NA NA NA NA
11 NA NA NA NA NA NA NA NA
12 4.000 5.000 1.416 0 134 2366 0 0
13 NA NA NA NA NA NA NA NA
14 14254.487 25000.000 3163.989 0 0 2500 8 7
15 26.000 45.000 5.162 0 0 2500 12 6
16 NA NA NA NA NA NA NA NA
17 NA NA NA NA NA NA NA NA
18 NA NA NA NA NA NA NA NA
19 NA NA NA NA NA NA NA NA
20 NA NA NA NA NA NA NA NA
21 0.748 0.999 0.285 0 0 2500 0 0
22 NA NA NA NA NA NA NA NA
Field Type Recs Miss Neg Zero Pos OutLo OutHi
1 fgood Num/Int 2500 0.000 0 0.200 0.800 0.200 0.000
2 cbs1 Num/Int 2500 0.102 0 0.000 0.898 0.003 0.002
3 cbs2 Num/Int 2500 0.110 0 0.000 0.890 0.003 0.002
4 cbinq Factor 2500 0.000 NA NA NA NA NA
5 cbline Num/Int 2500 0.000 0 0.155 0.845 0.000 0.069
6 cbterm Factor 2500 0.000 NA NA NA NA NA
7 cblineut Num/Int 2500 0.000 0 0.000 1.000 0.007 0.005
8 cbtob Num/Int 2500 0.000 0 0.000 1.000 0.000 0.000
9 cbdpd Factor 2500 0.000 NA NA NA NA NA
10 cbnew Factor 2500 0.000 NA NA NA NA NA
11 pmt Factor 2500 0.000 NA NA NA NA NA
12 tob Num/Int 2500 0.000 0 0.054 0.946 0.000 0.000
13 dpd Factor 2500 0.000 NA NA NA NA NA
14 dep Num/Int 2500 0.000 0 0.000 1.000 0.003 0.003
15 dc Num/Int 2500 0.000 0 0.000 1.000 0.005 0.002
16 od Factor 2500 0.000 NA NA NA NA NA
17 home Factor 2500 0.000 NA NA NA NA NA
18 inc Factor 2500 0.088 NA NA NA NA NA
19 dd Factor 2500 0.000 NA NA NA NA NA
20 online Factor 2500 0.000 NA NA NA NA NA
21 rnd Num/Int 2500 0.000 0 0.000 1.000 0.000 0.000
22 period Other 2500 0.000 NA NA NA NA NA
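Judging from the two tables above, the percentage table appears to divide each count by the number of records and round the result. The following base R sketch checks that reading against the Miss column, using counts copied from the first table; it is an inference from the printed output, not a statement about the package's internals.

```r
# Counts of missing records copied from the first table (Miss column)
miss <- c(cbs1 = 256, cbs2 = 276, inc = 220)
recs <- 2500  # Recs is 2500 for every field in the example

# Share of missing records, rounded to 3 decimals like rounding = 3
pct <- round(miss / recs, 3)
pct  # compare with the Miss column of the percentage table
```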