sparkler.read.csv: Read Solr CSV

Description Usage Arguments Details Value Examples

View source: R/sparkler.R

Description

Read Solr CSV

Usage

1
sparkler.read.csv(vm, pattern, crawl_id, limit, extracted = FALSE)

Arguments

vm

The Instance object

pattern

The hostname

crawl_id

The Id of crawl

limit

The max URLs you want

extracted

It TRUE, will get extracted text

Details

You can find a SolR tutorial here : https://lucene.apache.org/solr/guide/7_2/solr-tutorial.html

Value

A dataframe

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
## Not run: 

library(RsparkleR)

ovh <- import_ovh()
client <- load_client(ovh,endpoint,application_key,application_secret,consumer_key)

sshPubKeyPath  <- 'C:/Users/vterrasi/.ssh/id_rsa.pub'
sshPrivKeyPath <- 'C:/Users/vterrasi/.ssh/id_rsa'

vm <- sparkler.create(client, regionVM="UK1", typeVM="s1-4", sshPubKeyPath, sshPrivKeyPath)

topN <- 1000

crawlid <- sparkler.crawl(vm, url, topN, topGroups, maxIter, debug=FALSE, mode="fast")

crawlDF <- sparkler.read.csv(vm, pattern, crawlid, topN, extracted=FALSE)


## End(Not run)

voltek62/RsparkleR documentation built on May 19, 2019, 1:48 a.m.