View source: R/uri-diversity.R
Compute WSDL Diversity Index, Shannon's evenness index, and Simpson's diversity index for a corpus (collection) of URLs.
uri_diversity(corpus, corpus_id = uuid::UUIDgenerate(),
  exception_domains = NULL)

url_diversity(corpus, corpus_id = uuid::UUIDgenerate(),
  exception_domains = NULL)
corpus: a collection (character vector) of URLs
corpus_id: an identifier (ideally unique) for the collection; will be generated if not provided.
exception_domains: a character vector of domains; use this to specify domains where the query string is important. Normally, the query string is excluded from the canonicalized URI but in some cases (e.g.
a data frame (tibble) with WSDL, Shannon and Simpson diversity indices for canonical URIs and hostnames.
Algorithm creator: Alexander C. Nwala
Alexander Nwala (anwala@cs.odu.edu); Bob Rudis (bob@rud.is)
http://ws-dl.blogspot.com/2018/05/2018-05-04-exploration-of-url-diversity.html
collection <- readLines(system.file("extdat", "corpus.txt", package = "urldiversity"))
uri_diversity(collection)
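To give a feel for what two of the reported indices measure, the following is a minimal base R sketch that applies the textbook formulas for Shannon's evenness and Simpson's diversity to hostname counts. It is an illustration only: the helper names `shannon_evenness()` and `simpson_diversity()`, the toy URLs, and the simple hostname extraction are assumptions made here, not part of urldiversity, and the package's own canonicalization and index calculations may differ.

```r
# Illustrative only: textbook index formulas applied to hostname counts.
# The helper names below are hypothetical and are not part of urldiversity.

# Toy corpus (made-up URLs on reserved example domains).
urls <- c(
  "http://example.com/a",
  "http://example.com/b",
  "http://example.org/a",
  "http://example.net/a"
)

# Crude hostname extraction: drop the scheme, then everything from the first "/".
hosts <- sub("/.*$", "", sub("^[a-zA-Z][a-zA-Z0-9+.-]*://", "", urls))

# Shannon's evenness: H / ln(S), where H = -sum(p * ln(p))
# and S is the number of distinct items.
shannon_evenness <- function(x) {
  n <- as.numeric(table(x))
  if (length(n) < 2) return(0)  # undefined for one category; 0 is a convention chosen here
  p <- n / sum(n)
  -sum(p * log(p)) / log(length(n))
}

# Simpson's diversity: 1 - sum(n_i * (n_i - 1)) / (N * (N - 1)).
simpson_diversity <- function(x) {
  n <- as.numeric(table(x))
  N <- sum(n)
  if (N < 2) return(0)  # undefined for fewer than two items; 0 is a convention chosen here
  1 - sum(n * (n - 1)) / (N * (N - 1))
}

shannon_evenness(hosts)
simpson_diversity(hosts)
```

Both values lie between 0 and 1: a corpus drawn from a single host scores 0, while a corpus in which every URL comes from a different host scores 1.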