Read Solr CSV
sparkler.read.csv(vm, pattern, crawl_id, limit, extracted = FALSE)
vm | The instance object, as returned by sparkler.create
pattern | The hostname
crawl_id | The ID of the crawl, as returned by sparkler.crawl
limit | The maximum number of URLs to return
extracted | If TRUE, also retrieves the extracted text
A Solr tutorial is available at https://lucene.apache.org/solr/guide/7_2/solr-tutorial.html
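For readers new to Solr, here is a minimal sketch of what requesting CSV output from Solr and loading it into R can look like. It is not the implementation of sparkler.read.csv. The helper build_solr_csv_url, the base URL, the core name "crawldb" and the field names (url, hostname, crawl_id, extracted_text) are all assumptions made for illustration. Only the q, fl, rows and wt=csv parameters are standard Solr query parameters.

```r
# Hypothetical helper (not part of RsparkleR): builds a Solr select URL
# that asks for CSV output. The field names below are assumed, not confirmed.
build_solr_csv_url <- function(base_url, pattern, crawl_id, limit, extracted = FALSE) {
  fields <- c("url", "hostname", "crawl_id")
  # Only request the (assumed) extracted text field when asked to
  if (extracted) fields <- c(fields, "extracted_text")

  # Filter on hostname and crawl id, then URL-encode the whole query string
  query <- sprintf('hostname:"%s" AND crawl_id:"%s"', pattern, crawl_id)
  paste0(
    base_url, "/select",
    "?q=", utils::URLencode(query, reserved = TRUE),
    "&fl=", paste(fields, collapse = ","),
    "&rows=", as.integer(limit),
    "&wt=csv"
  )
}

# Assumed local Solr address and core name, for illustration only
solr_url <- build_solr_csv_url("http://localhost:8983/solr/crawldb",
                               "example.com", "crawl-1", 1000)

# Parse a CSV body of the kind Solr returns (sample text, not real crawl data)
sample_csv <- "url,hostname,crawl_id\nhttp://example.com/,example.com,crawl-1\n"
crawl_df <- read.csv(text = sample_csv, stringsAsFactors = FALSE)
```

Against a live Solr instance, passing solr_url directly to read.csv would fetch and parse the response in one step, since read.csv accepts a URL as its file argument.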
A data frame
## Not run:
library(RsparkleR)

# Create the OVH client; endpoint and the three keys must be defined beforehand
ovh <- import_ovh()
client <- load_client(ovh, endpoint, application_key, application_secret, consumer_key)

# SSH key pair passed to sparkler.create
sshPubKeyPath <- 'C:/Users/vterrasi/.ssh/id_rsa.pub'
sshPrivKeyPath <- 'C:/Users/vterrasi/.ssh/id_rsa'
vm <- sparkler.create(client, regionVM = "UK1", typeVM = "s1-4", sshPubKeyPath, sshPrivKeyPath)

# Start a crawl; url, topGroups and maxIter must be defined beforehand
topN <- 1000
crawlid <- sparkler.crawl(vm, url, topN, topGroups, maxIter, debug = FALSE, mode = "fast")

# Read the crawl results into a data frame; pattern must be defined beforehand
crawlDF <- sparkler.read.csv(vm, pattern, crawlid, topN, extracted = FALSE)
## End(Not run)