data.frame describing names containing character codes
rare or non-existent in standard English text, e.g., with various
accent marks that may not be coded consistenty in different locales or
by different software.
data.frame with two columns:
a character vector containing names that often have non-standard characters with the non-standard characters replaced by "_"
a character vector containing a standard English-character
1 2 3 4