Seed_Vectors: Word embeddings for seed dictionaries

Description Usage Format

Description

Word embeddings for seed dictionaries

Usage

1

Format

A data frame with 30 rows and 901 variables:

Dictionary

Name of the dictionary that the word embeddings are representing.

fasttext

variables starting in fasttext are the word embedding dimensions for Fasttext trained on 2 million word vectors trained with subword information on Common Crawl (https://fasttext.cc/docs/en/english-vectors.html)

Glove

variables starting in Glove are the word embedding dimensions for Glove trained on Common Crawl (840B tokens, 2.2M vocab, cased, 300d vectors; https://nlp.stanford.edu/projects/glove/) (https://fasttext.cc/docs/en/english-vectors.html)

W2v

variables starting in W2v are the word embedding dimensions for Word2vec trained Google News (https://code.google.com/archive/p/word2vec/)

...


gandalfnicolas/ADCAT documentation built on April 15, 2020, 8:26 p.m.