host_handler: host_handler


Description

host_handler is a domain-neutral host extractor for URLs. It returns the host only, dropping generic prefixes (http, https, www) and any path or query string.

Usage

host_handler(urls)

Arguments

urls

a character vector of URLs

Details

Extracts the hostname, including its TLD and any subdomains, from a generic URL.

Value

a vector of hostnames, with "Unknown" in place of any hostname that was invalid
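
The behaviour described above can be illustrated with a minimal base R sketch. This is not the WMUtils source: the function name sketch_host_extractor, the regular expressions, the lowercasing, the port stripping and the rule for what counts as an invalid hostname are all assumptions made for illustration, and the real host_handler may differ on edge cases.

```r
# Hypothetical stand-in for illustration only; NOT the WMUtils implementation.
sketch_host_extractor <- function(urls) {
  # Drop a leading scheme such as "http://" or "https://".
  hosts <- sub("^[A-Za-z][A-Za-z0-9+.-]*://", "", urls)
  # Drop everything from the first "/", "?" or "#" onward (path, query, fragment).
  hosts <- sub("[/?#].*$", "", hosts)
  # Drop a trailing port number, if present (assumed behaviour).
  hosts <- sub(":[0-9]+$", "", hosts)
  # Drop the generic "www." prefix.
  hosts <- sub("^www\\.", "", hosts, ignore.case = TRUE)
  hosts <- tolower(hosts)
  # Assumed validity rule: at least two dot-separated labels made of
  # letters, digits and hyphens. Anything else is reported as "Unknown".
  valid <- !is.na(urls) & grepl("^[a-z0-9-]+(\\.[a-z0-9-]+)+$", hosts)
  hosts[!valid] <- "Unknown"
  hosts
}

sketch_host_extractor(c(
  "https://www.example.org/wiki/Page?action=edit",
  "http://en.m.example.org",
  "not a url"
))
# [1] "example.org"      "en.m.example.org" "Unknown"
```

The sketch is vectorised through sub and grepl, so it returns one result per input URL, matching the one-hostname-per-URL shape described under Value.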

See Also

project_extractor for extracting Wikimedia language codes and projects.


wikimedia-research/WMUtils documentation built on May 4, 2019, 5:23 a.m.