Man pages for abjur/djt
A toolbox to download Brazilian DJTs

chop_at               Chop string at given positions
clean_all_but_first   Special cleaner: trim all texts but keep left trim on first...
clean_right_side      Special cleaner: trim just the right side of a text and remove...
djt_better_names      Special cleaner: better names to help merge work
djt_parse             Parse DJT files
djt_parse_summary     Heuristics to parse a DJT file into a TOC table
djt_remove_footer     Remove footer from DJT
djt_remove_header     Remove header from DJT
djt_remove_regex      Remove regex from DJT
djt_stack_columns     Stack columns
djt_subsection_dict   DJT subsection list
djt_toc               Get TOC from text of PDF file and text
djt_trim              Special cleaner: split raw text into escapes and trim output
download_djt          Download DJT PDFs based on date, booklet, and TRT
extract_part          Extract name of the part from trimmed column based on...
find_all              Find all occurrences of multiple patterns
find_html_styles      Find HTML styles
get_id_summary        The word "sumário" appears several times, because of the "RITO...
is_lawsuit            Verify whether the trimmed column has a lawsuit
match_lawsuits        Get lawsuits that match a pattern
parse_single_section  Parse a section from TOC obtained from pdftools::pdf_toc()...
pdf_to_text           Convert a PDF file to text (wraps a call to Poppler's...
pipe                  Pipe operator
pre_process           Preprocess text (remove headers and footers)
remove_first_header   Remove text's first header
remove_footers        Remove text's footers
remove_headers        Remove text's headers (except first)
abjur/djt documentation built on May 10, 2019, 4:12 a.m.