glance_data: Glance Data



Description

Provides a summary of data with the following columns:

name

Name of the column.

type

Type of the column, equal to "numerical", "logical", "factor", "categorical", or "NA only".

distinct_values

Count of distinct values, ignoring NA values. Thus, if a column has only NAs, the value of this field will be zero.

minimum

Minimum of numerical columns excluding NA values.

median

Median of numerical columns excluding NA values.

maximum

Maximum of numerical columns excluding NA values.

mean

Mean of numerical variables. It ignores NAs.

sd

Standard deviation of numerical variables. It ignores NAs.

na_proportion

Proportion of NAs.

count

Tally of values if the column has at most limit2tally distinct values. The limit is set with the parameter limit2tally (default 20, per the signature under Usage).

sample_values

Sample of (different) values in each column.
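
The column definitions above can be illustrated with a short base R sketch. The helpers classify_column and summarise_numeric are hypothetical names introduced here for illustration; they are not part of glancedata and may differ from how the package computes these fields. In particular, checking for "NA only" before the other types is an assumption, made because an all-NA vector such as c(NA, NA) is logical in R.

```r
# Hypothetical helpers for illustration; not part of the glancedata package.

# Assign one of the five type labels listed in the Description.
classify_column <- function(v) {
  if (all(is.na(v))) return("NA only")  # assumed to be checked first
  if (is.numeric(v)) return("numerical")
  if (is.logical(v)) return("logical")
  if (is.factor(v)) return("factor")
  "categorical"
}

# Summaries for a numerical column; NAs are dropped before computing.
summarise_numeric <- function(v) {
  observed <- v[!is.na(v)]
  list(
    distinct_values = length(unique(observed)),
    minimum = min(observed),
    median = median(observed),
    maximum = max(observed),
    mean = mean(observed),
    sd = sd(observed),
    na_proportion = mean(is.na(v))  # share of NAs in the full column
  )
}

values <- c(1, 2, 2, NA, 5)
stats <- summarise_numeric(values)
str(stats)

# A column holding only NAs has zero distinct values.
length(unique(c(NA, NA)[!is.na(c(NA, NA))]))
```

For the vector above, one of five entries is NA, so na_proportion is 0.2, and the three distinct observed values are 1, 2 and 5.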

Usage

glance_data(x, limit2tally = 20)

Arguments

x

A data frame with named columns.

limit2tally

One of the summaries is a tally of the distinct values in each column. If a column has too many distinct values, this summary would be meaningless. limit2tally is the maximum number of distinct values to tally; if a column has more than that, the summary returns "Too many unique values".
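
The thresholding described for limit2tally might look roughly like the sketch below. The function tally_or_message is a hypothetical name, and both the "value (count)" output format and the choice to drop NAs before tallying are assumptions; only the limit and the "Too many unique values" message come from the text above.

```r
# Hypothetical helper for illustration; not glancedata's actual code.
tally_or_message <- function(v, limit2tally = 20) {
  observed <- v[!is.na(v)]  # assumption: NAs are not tallied
  if (length(unique(observed)) > limit2tally) {
    return("Too many unique values")
  }
  counts <- table(observed)
  # Assumed output format: "value (count)" pairs separated by commas.
  paste0(names(counts), " (", as.vector(counts), ")", collapse = ", ")
}

few <- tally_or_message(c("a", "b", "a"))
many <- tally_or_message(1:100)
print(few)
print(many)
```

With the default limit of 20, the three-element vector is tallied, while the vector of 100 distinct integers yields the message instead.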

Value

A tibble.

Author(s)

Guillermo Basulto-Elias

Examples
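
A minimal usage sketch, assuming the glancedata package is installed. The built-in iris dataset and the limit2tally value of 5 are chosen here purely for illustration; the guard skips the call when the package is not available.

```r
# Illustrative call only; runs when glancedata is installed.
summary_tbl <- NULL
if (requireNamespace("glancedata", quietly = TRUE)) {
  # Tally only columns with at most 5 distinct values.
  summary_tbl <- glancedata::glance_data(iris, limit2tally = 5)
  print(summary_tbl)
}
```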


glancedata documentation built on Dec. 1, 2019, 1:22 a.m.