demodata | R Documentation |
This demo data contains a sample of 8000 English words with 300-dimension word vectors pre-trained using the "word2vec" algorithm based on the Google News corpus. Most of these words are from the Top 8000 frequent wordlist, whereas a few are selected from less frequent words and appended.
data(demodata)
A data.table
(of new class wordvec
) with two variables word
and vec
,
transformed from the raw data (see the URL in Source) into .RData
using the data_transform
function.
Google Code - word2vec (https://code.google.com/archive/p/word2vec/)
class(demodata)
demodata
embed = as_embed(demodata, normalize=TRUE)
class(embed)
embed
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.