wiki_crawler: wiki_crawler

Description

Identifies Wikimedia-specific crawlers from their user agents.

Usage

wiki_crawler(agents)

Arguments

agents

A vector of user agents.

Details

ua_parse is great for identifying spiders, but only generic ones, for obvious reasons. Unfortunately, some Wikimedia-specific spiders also need to be caught; wiki_crawler attempts to identify these.

Value

A logical vector indicating, for each user agent, whether it was identified as a Wikimedia-specific spider.
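
Because the result is a logical vector aligned with the input, it can be used directly as a filter. The following is only a sketch: sketch_wiki_crawler is a hypothetical stand-in written for illustration, and both its patterns and the sample user agents are assumptions, not the rules or data WMUtils actually uses.

```r
# Hypothetical stand-in for wiki_crawler(): flags agents that match a small,
# made-up list of patterns. The real function's rules are not shown here.
sketch_wiki_crawler <- function(agents) {
  patterns <- c("pywikibot", "wikiwix")  # assumed example patterns
  grepl(paste(patterns, collapse = "|"), agents, ignore.case = TRUE)
}

# Made-up sample input: one browser-like agent, one crawler-like agent.
agents <- c(
  "Mozilla/5.0 (Windows NT 6.1; rv:31.0) Gecko/20100101 Firefox/31.0",
  "Pywikibot/2.0 (wikipedia:en)"
)

# One TRUE/FALSE per input element, in the same order as the input.
is_crawler <- sketch_wiki_crawler(agents)

# Keep only the agents that were not flagged.
non_crawler_agents <- agents[!is_crawler]
```

With the real package loaded, the same filtering step would presumably read agents[!wiki_crawler(agents)].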

See Also

hive_query and sampled_logs for extracting request logs, ua_parse for generic user-agent identification.


wikimedia-research/WMUtils documentation built on May 4, 2019, 5:23 a.m.