
auditr: Classify Surnames by Race Across U.S. Counties Demographics

This is a companion package to Crabtree and Chykina (2018). It contains one function, names_probabilites. The function takes a vector of last names, builds a matrix of name and county pairs from packaged data, computes for every county the probability that each name denotes one of four racial or ethnic groups (Asian, Black, Hispanic, and White), and plots these values. This lets users see how much the racial information conveyed by a surname varies across geographic contexts and identify potentially problematic surnames.

For the reasons why you might want to do this, see Crabtree and Chykina (2018) and Gaddis (2017).

Package Installation

The latest development version (0.1.0) is on GitHub and can be installed using devtools.
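A minimal installation sketch follows. It assumes the repository lives at cdcrabtree/auditr, which is the path shown in the documentation footer rather than in an install command given in this README.

```r
# Install devtools from CRAN if it is not already available.
if (!requireNamespace("devtools", quietly = TRUE)) {
  install.packages("devtools")
}

# Assumption: the repository path "cdcrabtree/auditr" comes from the
# documentation footer, not from an install command in the original text.
devtools::install_github("cdcrabtree/auditr")

# Load the package once it is installed.
library(auditr)
```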


Support or Contact

Please use the issue tracker for problems, questions, or feature requests. If you would rather email with questions or comments, you can contact Charles Crabtree and he will address the issue.

If you would like to contribute to the package, that is great! We welcome pull requests and new developers.


To test the software, users and potential contributors can use the example code provided in the documentation for each function.
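One way to reach that example code from an R session is sketched below. It assumes the package is already installed and that the function is exported under the name given above, names_probabilites; adjust the name if your installed version spells it differently.

```r
library(auditr)

# Open the help page for the package's single function.
# Assumption: the function name is spelled as in this README.
help("names_probabilites", package = "auditr")

# Run the examples from that help page, if the page provides any.
example("names_probabilites", package = "auditr")
```

Because the function plots its results, running the examples may open a graphics device.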


cdcrabtree/auditr documentation built on Dec. 4, 2017, 12:12 a.m.