Prototype pageviews filter for the Wikimedia request logs
log_sieve(log_data)
log_data
an input data.frame or data.table. This should ideally be the output of
log_sieve contains the prototype filter for "pageviews", as applicable to the Wikimedia request logs.
It consumes logs, tags the "actual" pageviews, and returns only those rows. As a side effect, the
X-Forwarded-For (XFF) values are copied into the ip_address field, replacing the IPs recorded there.
It is implemented in R, so the full definition can be seen by printing log_sieve.
log_data, the first argument
a data.table containing those rows of log_data that are pageviews.
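The tag-and-filter behaviour described above can be illustrated with a small base R sketch. This is not the actual log_sieve definition (print log_sieve to see that). The function name toy_sieve, the column names mime_type, status_code and x_forwarded, the "-" placeholder for a missing XFF, and the two filter rules are all assumptions made for illustration. Only ip_address is named in this page. A plain data.frame is used instead of a data.table so the sketch runs without extra packages.

```r
# Toy stand-in for the behaviour described on this page.
# ASSUMPTIONS: the column names (mime_type, status_code, x_forwarded), the "-"
# placeholder for "no XFF", and the filter rules are hypothetical; they are not
# taken from the real log_sieve definition.
toy_sieve <- function(log_data) {
  log_data <- as.data.frame(log_data, stringsAsFactors = FALSE)

  # Copy the XFF into ip_address wherever an XFF is present.
  has_xff <- !is.na(log_data$x_forwarded) & log_data$x_forwarded != "-"
  log_data$ip_address[has_xff] <- log_data$x_forwarded[has_xff]

  # Tag the rows that count as pageviews under the assumed rules.
  is_pageview <- log_data$mime_type == "text/html" & log_data$status_code == 200

  # Return only the tagged rows.
  log_data[is_pageview, , drop = FALSE]
}

# Three made-up requests: two HTML pages (one behind a proxy) and one image.
example_logs <- data.frame(
  ip_address  = c("10.0.0.1", "10.0.0.2", "10.0.0.3"),
  x_forwarded = c("-", "203.0.113.7", "-"),
  mime_type   = c("text/html", "text/html", "image/png"),
  status_code = c(200, 200, 200),
  stringsAsFactors = FALSE
)

pageviews <- toy_sieve(example_logs)
print(pageviews)
```

In this toy input the image request is dropped, and the second row's ip_address is overwritten with its XFF value, mirroring the two effects the page attributes to log_sieve.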
sampled_logs, to read from the sampled logs, or hive_query to read from the HDFS-based logs.