A dataset with 24 338 textual items.
A data frame with 24338 rows and 8 columns:
the id is a composite string, built so that the identifier stays unique even when the dataset is used together with other similarly shaped datasets. Elements are separated by a hyphen-minus. An example doc_id would be president_ru-en-012345.
this includes the full text of the document, including the title and the textual string with date and location (when present).
date of publication, in date format.
the title of the document
the location from where the document was issued as reported at the beginning of each post, e.g. "Novo-Ogaryovo, Moscow Region"; if not given, an empty string.
a URL, source of the document
numeric id; includes only the numeric part of doc_id, which may be useful if only a numeric identifier is needed.
a character string referring to the presidential term. The period after Yeltsin's resignation but before Putin's first inauguration in May 2000 is indicated as "Putin 0"; the following terms are "Putin 1", "Putin 2", "Medvedev 1", "Putin 3", and "Putin 4".
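As an illustration of how the identifier and term columns described above relate to the other columns, here is a minimal sketch in base R. Both helper names (`numeric_id_from_doc_id`, `term_from_date`) are hypothetical and not part of the package. The sketch assumes that the numeric part is the last hyphen-separated element of doc_id, and that terms change on the inauguration dates hard-coded below; the dataset itself may draw the boundaries slightly differently.

```r
# Hypothetical helper: extract the numeric part of a doc_id such as
# "president_ru-en-012345". Assumes the numeric part is the last
# element after splitting on the hyphen-minus.
numeric_id_from_doc_id <- function(doc_id) {
  parts <- strsplit(doc_id, "-", fixed = TRUE)
  vapply(parts, function(p) as.numeric(p[length(p)]), numeric(1))
}

# Hypothetical helper: map a publication date to a term label.
# The boundaries are an assumption: Yeltsin's resignation
# (1999-12-31) and the inauguration dates of 7 May in 2000, 2004,
# 2008, 2012 and 2018. Inauguration day is counted as part of the
# new term.
term_from_date <- function(date) {
  breaks <- as.Date(c("1999-12-31", "2000-05-07", "2004-05-07",
                      "2008-05-07", "2012-05-07", "2018-05-07"))
  labels <- c("Putin 0", "Putin 1", "Putin 2",
              "Medvedev 1", "Putin 3", "Putin 4")
  idx <- findInterval(as.numeric(as.Date(date)), as.numeric(breaks))
  idx[idx == 0] <- NA  # dates before the first boundary get NA
  labels[idx]
}

numeric_id_from_doc_id("president_ru-en-012345")  # 12345
term_from_date(c("2000-01-15", "2013-01-01"))     # "Putin 0" "Putin 3"
```

Leading zeros are dropped when the id is converted to a number, which is why the character doc_id remains the safer key when combining several datasets.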