aquarium

Validate 'Phishing' 'URLs' with the 'PhishTank' Service

Description

'PhishTank' is a free community site where anyone can submit, verify, track and share 'phishing' data. Methods are provided to test if a 'URL' is classsified as a 'phishing' site and to download aggregated 'phishing' 'URL' databases.

What's Inside The Tin

The following functions are implemented:

Installation

devtools::install_github("hrbrmstr/aquarium")
options(width=120)

Usage

library(aquarium)
library(hrbrthemes)
library(tidyverse)

# current verison
packageVersion("aquarium")

Test a URL

x <- pt_check_url("http://www.seer.revpsi.org/hhh/1/")

x

glimpse(x)

Get the databases

x <- pt_read_db(.progress = FALSE)

x

glimpse(x)

Top Phishing Targets

filter(x, verified == "yes") %>% 
  count(day = as.Date(verification_time), target) -> targets

count(targets, target, sort=TRUE) %>%
  filter(target != "Other") %>% 
  head(9) -> top_named_targets

filter(targets, target %in% top_named_targets$target) %>% 
  mutate(target = factor(target, levels=rev(top_named_targets$target))) %>% 
  ggplot(aes(day, n, group=target, color=target)) +
  geom_segment(aes(xend=day, yend=0), size=0.25) +
  scale_x_date(name = NULL, limits=as.Date(c("2008-01-01", "2018-06-31"))) +
  scale_y_comma(name = "# entries/day") +
  ggthemes::scale_color_tableau() +
  facet_wrap(~target, scales="free") +
  labs(
    title = "PhishTank Top Phishing Targets 2008-present",
    subtitle = "Note: Free Y scale",
    caption = "Source: PhishTank <phishtank.com>"
  ) +
  theme_ipsum_rc(grid="Y", strip_text_face = "bold") +
  theme(legend.position="none")

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.



hrbrmstr/aquarium documentation built on May 29, 2019, 3:04 p.m.