smbinning.eda: Exploratory Data Analysis (EDA)

Description Usage Arguments Value Examples

View source: R/smbinning.R

Description

It shows basic statistics for each characteristic in a data frame. The report includes:

Usage

1
smbinning.eda(df, rounding = 3, pbar = 1)

Arguments

df

A data frame.

rounding

Optional parameter to define the decimal points shown in the output table. Default is 3.

pbar

Optional parameter that turns on or off a progress bar. Default value is 1.

Value

The command smbinning.eda generates two data frames that list each characteristic with basic statistics such as extreme values and quartiles; and also percentages of missing values and outliers, among others.

Examples

1
2
3
4
5
6
# Load library and its dataset
library(smbinning) # Load package and its data

# Example: Exploratory data analysis of dataset
smbinning.eda(smbsimdf1,rounding=3)$eda # Table with basic statistics
smbinning.eda(smbsimdf1,rounding=3)$edapct # Table with basic percentages

Example output

Loading required package: sqldf
Loading required package: gsubfn
Loading required package: proto
Loading required package: RSQLite
Loading required package: partykit
Loading required package: grid
Loading required package: libcoin
Loading required package: mvtnorm
Loading required package: Formula
Warning message:
no DISPLAY variable so Tk is not available 
 

  |                                                        
  |                                                  |   0%
  |                                                        
  |--                                                |   5%
  |                                                        
  |-----                                             |   9%
  |                                                        
  |-------                                           |  14%
  |                                                        
  |---------                                         |  18%
  |                                                        
  |-----------                                       |  23%
  |                                                        
  |--------------                                    |  27%
  |                                                        
  |----------------                                  |  32%
  |                                                        
  |------------------                                |  36%
  |                                                        
  |--------------------                              |  41%
  |                                                        
  |-----------------------                           |  45%
  |                                                        
  |-------------------------                         |  50%
  |                                                        
  |---------------------------                       |  55%
  |                                                        
  |------------------------------                    |  59%
  |                                                        
  |--------------------------------                  |  64%
  |                                                        
  |----------------------------------                |  68%
  |                                                        
  |------------------------------------              |  73%
  |                                                        
  |---------------------------------------           |  77%
  |                                                        
  |-----------------------------------------         |  82%
  |                                                        
  |-------------------------------------------       |  86%
  |                                                        
  |---------------------------------------------     |  91%
  |                                                        
  |------------------------------------------------  |  95%
  |                                                        
  |--------------------------------------------------| 100%
      Field    Type Recs Miss Unique      Min       Q25       Q50       Avg
1     fgood Num/Int 2500    0      2    0.000     1.000     1.000     0.800
2      cbs1 Num/Int 2500  256   1767   11.000    44.078    52.415    52.419
3      cbs2 Num/Int 2500  276   1754    9.540    43.638    51.790    51.822
4     cbinq  Factor 2500    0      3       NA        NA        NA        NA
5    cbline Num/Int 2500    0      6    0.000     1.000     2.000     1.742
6    cbterm  Factor 2500    0      3       NA        NA        NA        NA
7  cblineut Num/Int 2500    0   2500    0.000    34.197    42.565    42.814
8     cbtob Num/Int 2500    0      6    3.000     5.000     6.000     5.624
9     cbdpd  Factor 2500    0      2       NA        NA        NA        NA
10    cbnew  Factor 2500    0      2       NA        NA        NA        NA
11      pmt  Factor 2500    0      3       NA        NA        NA        NA
12      tob Num/Int 2500    0      6    0.000     2.000     3.000     2.793
13      dpd  Factor 2500    0      3       NA        NA        NA        NA
14      dep Num/Int 2500    0   2498 1598.290 10013.923 12106.595 12076.991
15       dc Num/Int 2500    0     39    4.000    19.000    22.000    22.098
16       od  Factor 2500    0      3       NA        NA        NA        NA
17     home  Factor 2500    0      2       NA        NA        NA        NA
18      inc  Factor 2500  220     10       NA        NA        NA        NA
19       dd  Factor 2500    0      3       NA        NA        NA        NA
20   online  Factor 2500    0      2       NA        NA        NA        NA
21      rnd Num/Int 2500    0   2500    0.001     0.247     0.500     0.498
22   period   Other 2500    0     10       NA        NA        NA        NA
         Q75       Max     StDv Neg Zero  Pos OutLo OutHi
1      1.000     1.000    0.400   0  500 2000   500     0
2     60.970    90.910   12.541   0    0 2244     7     4
3     60.190    98.060   12.363   0    0 2224     8     6
4         NA        NA       NA  NA   NA   NA    NA    NA
5      2.000     5.000    1.162   0  388 2112     0   173
6         NA        NA       NA  NA   NA   NA    NA    NA
7     51.663    91.597   13.266   0    1 2499    18    12
8      7.000     8.000    1.309   0    0 2500     0     0
9         NA        NA       NA  NA   NA   NA    NA    NA
10        NA        NA       NA  NA   NA   NA    NA    NA
11        NA        NA       NA  NA   NA   NA    NA    NA
12     4.000     5.000    1.416   0  134 2366     0     0
13        NA        NA       NA  NA   NA   NA    NA    NA
14 14254.487 25000.000 3163.989   0    0 2500     8     7
15    26.000    45.000    5.162   0    0 2500    12     6
16        NA        NA       NA  NA   NA   NA    NA    NA
17        NA        NA       NA  NA   NA   NA    NA    NA
18        NA        NA       NA  NA   NA   NA    NA    NA
19        NA        NA       NA  NA   NA   NA    NA    NA
20        NA        NA       NA  NA   NA   NA    NA    NA
21     0.748     0.999    0.285   0    0 2500     0     0
22        NA        NA       NA  NA   NA   NA    NA    NA
 

  |                                                        
  |                                                  |   0%
  |                                                        
  |--                                                |   5%
  |                                                        
  |-----                                             |   9%
  |                                                        
  |-------                                           |  14%
  |                                                        
  |---------                                         |  18%
  |                                                        
  |-----------                                       |  23%
  |                                                        
  |--------------                                    |  27%
  |                                                        
  |----------------                                  |  32%
  |                                                        
  |------------------                                |  36%
  |                                                        
  |--------------------                              |  41%
  |                                                        
  |-----------------------                           |  45%
  |                                                        
  |-------------------------                         |  50%
  |                                                        
  |---------------------------                       |  55%
  |                                                        
  |------------------------------                    |  59%
  |                                                        
  |--------------------------------                  |  64%
  |                                                        
  |----------------------------------                |  68%
  |                                                        
  |------------------------------------              |  73%
  |                                                        
  |---------------------------------------           |  77%
  |                                                        
  |-----------------------------------------         |  82%
  |                                                        
  |-------------------------------------------       |  86%
  |                                                        
  |---------------------------------------------     |  91%
  |                                                        
  |------------------------------------------------  |  95%
  |                                                        
  |--------------------------------------------------| 100%
      Field    Type Recs  Miss Neg  Zero   Pos OutLo OutHi
1     fgood Num/Int 2500 0.000   0 0.200 0.800 0.200 0.000
2      cbs1 Num/Int 2500 0.102   0 0.000 0.898 0.003 0.002
3      cbs2 Num/Int 2500 0.110   0 0.000 0.890 0.003 0.002
4     cbinq  Factor 2500 0.000  NA    NA    NA    NA    NA
5    cbline Num/Int 2500 0.000   0 0.155 0.845 0.000 0.069
6    cbterm  Factor 2500 0.000  NA    NA    NA    NA    NA
7  cblineut Num/Int 2500 0.000   0 0.000 1.000 0.007 0.005
8     cbtob Num/Int 2500 0.000   0 0.000 1.000 0.000 0.000
9     cbdpd  Factor 2500 0.000  NA    NA    NA    NA    NA
10    cbnew  Factor 2500 0.000  NA    NA    NA    NA    NA
11      pmt  Factor 2500 0.000  NA    NA    NA    NA    NA
12      tob Num/Int 2500 0.000   0 0.054 0.946 0.000 0.000
13      dpd  Factor 2500 0.000  NA    NA    NA    NA    NA
14      dep Num/Int 2500 0.000   0 0.000 1.000 0.003 0.003
15       dc Num/Int 2500 0.000   0 0.000 1.000 0.005 0.002
16       od  Factor 2500 0.000  NA    NA    NA    NA    NA
17     home  Factor 2500 0.000  NA    NA    NA    NA    NA
18      inc  Factor 2500 0.088  NA    NA    NA    NA    NA
19       dd  Factor 2500 0.000  NA    NA    NA    NA    NA
20   online  Factor 2500 0.000  NA    NA    NA    NA    NA
21      rnd Num/Int 2500 0.000   0 0.000 1.000 0.000 0.000
22   period   Other 2500 0.000  NA    NA    NA    NA    NA

smbinning documentation built on May 1, 2019, 10:06 p.m.