is_spider: check whether a request is from a spider


View source: R/filter.R

Description

Consumes a vector of user agents and checks whether each one matches the ua-parser spider definitions. It also checks for Wikimedia-specific spiders that are not included in the (generalised) ua-parser spider list.

Usage

is_spider(user_agents)

Arguments

user_agents

a vector of user agents, which can be retrieved with read_sampled_log.

Value

a boolean (logical) vector in which each element indicates whether the user agent at the corresponding index of the input vector matched a spider/web crawler.
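
The shape of the input and output can be illustrated with a small stand-in. The sketch below is not the package's implementation: is_spider_sketch and both pattern lists are hypothetical, chosen only to mimic the two-stage idea from the description (a general spider list plus project-specific additions) and to show that the result lines up index-for-index with the input.

```r
# Hypothetical stand-in for is_spider(); NOT the package's implementation.
# Both pattern lists are illustrative assumptions, not the ua-parser
# definitions or the real Wikimedia-specific list.
is_spider_sketch <- function(user_agents) {
  general_patterns <- c("bot", "crawler", "spider")  # assumed generic markers
  specific_patterns <- c("examplewikitool")          # assumed project-specific marker
  pattern <- paste(c(general_patterns, specific_patterns), collapse = "|")
  # grepl() is vectorised: one TRUE/FALSE per input element, in input order
  grepl(pattern, user_agents, ignore.case = TRUE)
}

user_agents <- c(
  "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
  "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:66.0) Gecko/20100101 Firefox/66.0",
  "ExampleWikiTool/1.0"
)

flags <- is_spider_sketch(user_agents)  # TRUE FALSE TRUE

# Because the result aligns with the input, it can be used directly as a filter
human_agents <- user_agents[!flags]
```

With the real function, the same filtering idiom would presumably apply: user_agents[!is_spider(user_agents)] keeps only the requests that were not flagged.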

See Also

read_sampled_log for retrieving user agents, and is_automata for identifying non-crawler automata.


wikimedia-research/pageviews documentation built on May 4, 2019, 5:24 a.m.