Description Usage Format Details
A dataset containing sample security input data for 492 securities and 65 weekdays, from 2020-06-01 to 2020-08-31. Data items include average trading dollar volume, market cap, and normalized size and value factors. The pricing data used to construct the dataset was downloaded using the Tiingo Stock API and is used with permission. Fundamental data items were downloaded from EDGAR.
1 |
A data frame with 31980 rows and 7 variables:
Input date. It is assumed that the input data for day X is known at the beginning of day X (e.g., the data is as-of the previous day's close).
Security identifier.
Average dollar trading volume for the security over the past 20 trading days.
Market capitalization, in dollars. The shares outstanding value used to calculate market cap is the latest value available at the beginning of the month.
Ratio of total equity to market cap. The stockholders' equity value used to calculate book to price is the latest value available at the beginning of the month.
Market cap factor normalized to be N(0,1) for each day.
Book to price factor normalized to be N(0,1) for each day.
Data for most members of the S&P 500 are present. Some securities have been omitted due to data processing complexities. For example, securities for companies with multiple share classes have been omitted in the current version.
Values for shares outstanding and stockholders' equity downloaded from EDGAR may be inaccurate due to XBRL parsing issues.
Full code for reconstructing the dataset can be found in the pystrand repository.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.