weblmGenerateNextWords: Returns the words most likely to follow a sequence of words.

Description Usage Arguments Value Author(s) Examples

View source: R/weblmGenerateNextWords.R

Description

This function returns the list of words (completions) most likely to follow a given sequence of words. The input string must be in ASCII format.

Internally, this function invokes the Microsoft Cognitive Services Web Language Model REST API documented at https://www.microsoft.com/cognitive-services/en-us/web-language-model-api/documentation.

You MUST have a valid Microsoft Cognitive Services account and an API key for this function to work properly. See https://www.microsoft.com/cognitive-services/en-us/pricing for details.

Usage

1
2
weblmGenerateNextWords(precedingWords, modelToUse = "body",
  orderOfNgram = 5L, maxNumOfCandidatesReturned = 5L)

Arguments

precedingWords

(character) Character string to retrieve completions for. Must be in ASCII format.

modelToUse

(character) Which language model to use, supported values: "title", "anchor", "query", or "body" (optional, default: "body")

orderOfNgram

(integer) Which order of N-gram to use, supported values: 1L, 2L, 3L, 4L, or 5L (optional, default: 5L)

maxNumOfCandidatesReturned

(integer) Maximum number of candidates to return (optional, default: 5L)

Value

An S3 object of the class weblm. The results are stored in the results dataframe inside this object. The dataframe contains the candidate words and their log(probability).

Author(s)

Phil Ferriere pferriere@hotmail.com

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
## Not run: 
 tryCatch({

   # Generate next words
   wordCandidates <- weblmGenerateNextWords(
     precedingWords = "how are you",  # ASCII only
     modelToUse = "title",            # "title"|"anchor"|"query"(default)|"body"
     orderOfNgram = 4L,               # 1L|2L|3L|4L|5L(default)
     maxNumOfCandidatesReturned = 5L  # Default: 5L
   )

   # Class and structure of wordCandidates
   class(wordCandidates)
   #> [1] "weblm"

   str(wordCandidates, max.level = 1)
   #> List of 3
   #>  $ results:'data.frame':  5 obs. of  2 variables:
   #>  $ json   : chr "{"candidates":[{"word":"doing","probability":-1.105}, __truncated__ ]}
   #>  $ request:List of 7
   #>   ..- attr(*, "class")= chr "request"
   #>  - attr(*, "class")= chr "weblm"

   # Print results
   pandoc.table(wordCandidates$results)
   #> ---------------------
   #>   word    probability
   #> ------- -------------
   #>   doing     -1.105
   #>
   #>    in       -1.239
   #>
   #> feeling     -1.249
   #>
   #>   going     -1.378
   #>
   #>   today      -1.43
   #> ---------------------

 }, error = function(err) {

   # Print error
   geterrmessage()

 })

## End(Not run)

philferriere/mscsweblm4r documentation built on May 25, 2019, 5:03 a.m.