as_url | Turn PTT board name to URL |
browse | Open web page in default browser |
check_404 | Check web page for 404 error |
comment2corpus | Convert 'comment' list-column to 'corpus' list-column |
down_html | Dowload HTML files to local directory |
example_posts | Retreive example data set of posts data frame |
extr_post_category | Extract post category from title |
get_post | Get all information from an individual PTT post |
get_post_comment | Retrieve user comments from an individual PTT post |
get_post_content | Retrieve content from an individual PTT post |
get_post_meta | Retrieve mata data from an individual PTT post |
get_ptt_dict | Get PTT dictionary |
hotboards | Return a data frame with popular boards info |
index2df | Extract data from multiple index pages of a PTT board. |
mutate_content_len | Word count 'content' col of get_post |
mutate_content_url | Extract and remove URL from 'content' column of a data frame... |
parse_comment_date | Parse dates to add year in PTT post comments |
parse_post_date | Extract the publish date of a PTT post |
ping2zh | Pingyin-Character translation |
post2corpus | Convert post data frame to corpus objects |
post2df | Extract information from PTT posts |
ptt | Get PTT info |
read_html2 | Read PTT pages with "over18-confirmation" |
scrape-index | Helper functions for scraping index pages |
segment | Word segmentation for PTT post content and comments. |
seg_ptt | Word segmentation for PTT post content |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.