google_apps: Google apps data

Description

Ratings and other information for Google Play Store apps.

Usage

google_apps

Format

google_apps is a nested data frame with 9707 rows and 14 variables:

app

Application name

category

Category the app belongs to

rating

Overall user rating of the app

reviews

Number of user reviews for the app

size_in_MB

Size of the app in megabytes

installs

Number of user downloads/installs for the app

type

Paid or Free

price

Price of the app

content_rating

Age group the app is targeted at - Children / Mature 21+ / Adult

genres

Genres the app belongs to, in addition to its main category. An app can have several; for example, a musical family game belongs to the Music, Game, and Family genres.

last_updated

Date when the app was last updated on Play Store

current_ver

Current version of the app available on Play Store

android_ver

Minimum required Android version

greviews

A list column. See details.

Details

This data set joins two data sets from the Kaggle source. The data was scraped and partly cleaned before being posted to Kaggle. This version makes two further alterations: last_updated is converted to an actual date column (range 2010-2018), and the size is converted to a numeric column after stripping the M (megabyte) and k (kilobyte) suffixes. The final column, greviews, is a list column of actual reviews that can be used for text analysis, sentiment analysis, and similar tasks. Each data set within greviews contains the following columns:

Translated_Review

User review (preprocessed and translated to English)

Sentiment

Positive/Negative/Neutral (preprocessed)

Sentiment_Polarity

Sentiment polarity score

Sentiment_Subjectivity

Sentiment subjectivity score
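
The two alterations described above (a real date column and a numeric size) can be sketched in base R roughly as follows. This is not the package author's code. The raw column names (Size, Last.Updated), the sample values, the helper size_to_mb, and the choice of 1024 kilobytes per megabyte are all assumptions made for illustration.

```r
# Hypothetical raw values in the style of the scraped file (made up, not real rows)
raw <- data.frame(
  Size         = c("19M", "201k", "Varies with device"),
  Last.Updated = c("January 7, 2018", "August 1, 2018", "May 21, 2010"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: strip a trailing M or k and return megabytes.
# Strings without a numeric part (e.g. "Varies with device") become NA.
# Assumption: 1 MB = 1024 kB; the package may have used a different divisor.
size_to_mb <- function(x) {
  value <- suppressWarnings(as.numeric(sub("[Mk]$", "", x)))
  ifelse(grepl("k$", x), value / 1024, value)
}

cleaned <- data.frame(
  size_in_MB   = size_to_mb(raw$Size),
  # %B matches full month names in the current locale (English assumed here)
  last_updated = as.Date(raw$Last.Updated, format = "%B %d, %Y")
)

print(cleaned)
```

Returning NA for sizes that are not numeric keeps size_in_MB a plain numeric column, which is what the Format section describes.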

Note

License is CC-BY

Source

Data set created by Lavanya Gupta

Examples

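A minimal sketch of working with a list column such as greviews, using base R only. To stay self-contained it builds a small stand-in, toy_apps, that borrows the documented column names. The values in it are made up, and toy_apps, reviews_a, and reviews_b are hypothetical names, not part of the package.

```r
# Toy per-app review tables using the column names documented under Details
reviews_a <- data.frame(
  Translated_Review      = c("Works well", "Crashes on start"),
  Sentiment              = c("Positive", "Negative"),
  Sentiment_Polarity     = c(0.8, -0.6),
  Sentiment_Subjectivity = c(0.5, 0.9),
  stringsAsFactors = FALSE
)
reviews_b <- data.frame(
  Translated_Review      = "Nice and simple",
  Sentiment              = "Positive",
  Sentiment_Polarity     = 0.4,
  Sentiment_Subjectivity = 0.7,
  stringsAsFactors = FALSE
)

# Outer data frame: one row per app, with the reviews stored as a list column
toy_apps <- data.frame(
  app    = c("App A", "App B"),
  rating = c(4.1, 4.5),
  stringsAsFactors = FALSE
)
toy_apps$greviews <- list(reviews_a, reviews_b)

# Summarise each nested table: mean polarity, one value per app
toy_apps$mean_polarity <- vapply(
  toy_apps$greviews,
  function(d) mean(d$Sentiment_Polarity, na.rm = TRUE),
  numeric(1)
)

# Flatten to one row per review, repeating the app name alongside each review
flat <- do.call(rbind, lapply(seq_len(nrow(toy_apps)), function(i) {
  cbind(app = toy_apps$app[i], toy_apps$greviews[[i]])
}))

print(toy_apps[, c("app", "rating", "mean_polarity")])
print(flat)
```

With the package installed, the same pattern should carry over by substituting google_apps for toy_apps, provided greviews holds one data frame per app as Details describes.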

m-clark/noiris documentation built on Sept. 9, 2019, 9:08 a.m.