buildGSlinks: Build GS links

View source: R/buildGSlinks.R

buildGSlinksR Documentation

Build GS links

Description

Constructs series of Google Scholar search page URLs

Usage

buildGSlinks(
  and_terms = "",
  exact_phrase = "",
  or_terms = "",
  not_terms = "",
  year_from = "",
  year_to = "",
  start_page = 1,
  pages = 1,
  language = "en",
  incl_cit = TRUE,
  incl_pat = TRUE,
  titlesearch = FALSE,
  authors = "",
  source = ""
)

Arguments

and_terms

Vector of alphanumeric terms searched using the AND Boolean operator, specified by Google Scholar as 'with all of the words'.

exact_phrase

Vector of alphanumeric terms enclosed in inverted commas and searched as phrases (e.g. "large cat"), specified by Google Scholar as 'with the exact phrase'.

or_terms

Vector of alphanumeric terms searched using the OR Boolean operator, specified by Google Scholar as 'with at least one of the words'.

not_terms

Vector of alphanumeric terms searched using the NOT Boolean operator, specified by Google Scholar as 'without the words'.

year_from

Integer full numeric year (e.g. 2000) from which searching will be performed (inclusive). If no value provided, all years are searched.

year_to

Integer full numeric year (e.g. 2020) to which searching will be performed (inclusive). If no value provided, all years are searched.

start_page

Integer specifying which page(s) of search results should be displayed. If multiple pages are selected, multiple URLs are returned, one for each page of ten search results. The default is set to generate a list of 100 URLs (maximum set of Google Scholar results visible).

pages

Integer for the number of pages of search results to be returned (one link per page). A maximum of 100 pages can be displayed in Google Scholar. The default value is 1.

language

Two-letter language code for search language. The default is 'en' (English).

incl_cit

Logical argument (TRUE or FALSE) specifying whether citations should be included in the search

incl_pat

Logical argument (TRUE or FALSE) specifying whether patents should be included in the search

titlesearch

Logical argument (TRUE or FALSE) specifying whether the search should be performed on article titles only or anywhere in the record. The default is FALSE.

authors

The names of authors searched for.

source

The name of the source of the articles (e.g. academic journal).

Details

Function takes as input a partial Boolean search string and produces a functioning (set of) URLs; one for each page of search results on Google Scholar.

Value

A list containing: 1) ($link) links to the specified search results; and, 2) ($report) a text string containing a report of the links generated and the input variables used.

Examples

## Not run: 
and_terms <- c('river', 'aquatic')
exact_phrase <- c('water chemistry')
or_terms <- c('crayfish', 'fish')
not_terms <- c('lobster', 'coral')
year_from <- 1900
year_to <- 2020
link <- buildGSlinks(and_terms = and_terms,
    exact_phrase = exact_phrase,
    or_terms = or_terms,
    not_terms = not_terms,
    year_from = year_from,
    year_to = year_to,
    pages = 1,
    authors = 'haddaway',
    source = 'freshwater')
link

## End(Not run)

nealhaddaway/GSscraper documentation built on May 6, 2022, 10:52 a.m.