company_names_data: A public dataset of company names

Description Usage Format Details Source

Description

List of registered Broker-Dealers provided by the SEC to the general public.

Usage

1

Format

An unnamed character vector.

Details

The sample dataset provided in the eztfidf package is a good example of where the tfidf adjustment is a critical improvement over simple string distances. Overused domain-specific language (such as 'securities' or 'associates') can be effectively mitigated with this approach.

Source

https://www.sec.gov/help/foiadocsbdfoiahtm.html


patricklyngrutz/eztfidf documentation built on May 6, 2019, 8:31 p.m.