urldiversity

Quantify 'URL' Diversity and Apply Popular Biodiversity Indices to a 'URL' Collection

Description

Methods are provided to compute the 'WSDL Diversity Index' (http://ws-dl.blogspot.com/2018/05/2018-05-04-exploration-of-url-diversity.html) and to apply selected biodiversity indices to a corpus (collection) of 'URLs'.
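
To give a feel for what measuring diversity over a 'URL' collection can look like, here is a minimal base R sketch. It is not the package's implementation. The helpers host_of() and diversity_sketch() are hypothetical names, the hostname extraction is a simplified regex approach, and the normalized unique-host ratio is an assumption made in the spirit of the linked post. The Shannon and Gini-Simpson indices are the standard biodiversity formulas applied to hostname frequencies.

```r
# Hypothetical helper: extract a lower-cased hostname using base R regexes only.
# Simplified on purpose; it does not handle every valid URL form.
host_of <- function(urls) {
  no_scheme   <- sub("^[A-Za-z][A-Za-z0-9+.-]*://", "", urls) # drop "http://" etc.
  no_path     <- sub("[/?#].*$", "", no_scheme)               # drop path, query, fragment
  no_userinfo <- sub("^.*@", "", no_path)                     # drop "user:pass@"
  tolower(sub(":[0-9]+$", "", no_userinfo))                   # drop ":port"
}

# Hypothetical helper: hostname-level diversity measures for a URL vector.
diversity_sketch <- function(urls) {
  hosts  <- host_of(urls)
  counts <- table(hosts)
  n <- length(hosts)             # size of the collection
  u <- length(counts)            # number of distinct hostnames
  p <- as.numeric(counts) / n    # relative frequency of each hostname
  list(
    # Assumed normalization: 0 when every URL shares one host, 1 when all differ
    unique_ratio = if (n > 1) (u - 1) / (n - 1) else 0,
    shannon      = -sum(p * log(p)),  # Shannon entropy (natural log)
    simpson      = 1 - sum(p^2)       # Gini-Simpson index
  )
}

urls <- c(
  "http://example.com/a",
  "http://example.com/b",
  "https://example.org/",
  "https://example.net/x?y=1"
)

res <- diversity_sketch(urls)
str(res)
```

With three distinct hostnames among four 'URLs', the ratio is (3 - 1) / (4 - 1). Repeating a hostname lowers all three measures, which is the behaviour a diversity index over a corpus is meant to capture.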

NOTE

All credit goes to Alexander Nwala for the algorithm research and original Python implementation.

TODO

What's Inside The Tin

The following functions are implemented:

Core function:

Processing Helpers:

Scraping Helpers:

Installation

devtools::install_github("hrbrmstr/urldiversity")
options(width=120)

Usage

library(urldiversity)

# current version
packageVersion("urldiversity")
collection <- readLines(system.file("extdat", "corpus.txt", package = "urldiversity"))

print(collection)

x <- uri_diversity(collection)

dplyr::glimpse(x)

x

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.



hrbrmstr/urldiversity documentation built on May 14, 2019, 4 a.m.