docxtractr: Extract Data Tables and Comments from Microsoft Word Documents

Microsoft Word docx files provide an XML structure that is fairly straightforward to navigate, especially when it applies to Word tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from Microsoft Word docx documents.

Author
Bob Rudis [aut, cre]
Date of publication
2016-07-20 09:34:55
Maintainer
Bob Rudis <bob@rudis.net>
License
MIT + file LICENSE
Version
0.2.0

View on CRAN

Man pages

assign_colnames
Make a specific row the column names for the specified...
docx_cmnt_count
Get number of comments in a Word document
docx_describe_cmnts
Returns information about the comments in the Word document
docx_describe_tbls
Returns a description of all the tables in the Word document
docx_extract_all
Extract all tables from a Word document
docx_extract_all_cmnts
Extract all comments from a Word document
docx_extract_all_tbls
Extract all tables from a Word document
docx_extract_tbl
Extract a table from a Word document
docx_tbl_count
Get number of tables in a Word document
docxtractr
docxtractr is an R package for extracting tables and comments...
print.docx
Display information about the document
read_docx
Read in a Word document for table extraction

Files in this package

docxtractr
docxtractr/inst
docxtractr/inst/examples
docxtractr/inst/examples/comments.docx
docxtractr/inst/examples/data3.docx
docxtractr/inst/examples/none.docx
docxtractr/inst/examples/data.docx
docxtractr/inst/examples/realworld.docx
docxtractr/inst/examples/complex.docx
docxtractr/tests
docxtractr/tests/testthat.R
docxtractr/tests/testthat
docxtractr/tests/testthat/test-docxtractr.R
docxtractr/NAMESPACE
docxtractr/NEWS.md
docxtractr/R
docxtractr/R/extract_all.r
docxtractr/R/aaa.r
docxtractr/R/read_docs.r
docxtractr/R/utils.r
docxtractr/R/docxtractr-package.r
docxtractr/R/describe.r
docxtractr/R/assign_colnames.r
docxtractr/R/docx_find_tbls.r
docxtractr/MD5
docxtractr/DESCRIPTION
docxtractr/man
docxtractr/man/docx_extract_all_cmnts.Rd
docxtractr/man/docx_describe_cmnts.Rd
docxtractr/man/read_docx.Rd
docxtractr/man/docx_describe_tbls.Rd
docxtractr/man/docxtractr.Rd
docxtractr/man/assign_colnames.Rd
docxtractr/man/docx_tbl_count.Rd
docxtractr/man/docx_extract_tbl.Rd
docxtractr/man/docx_cmnt_count.Rd
docxtractr/man/docx_extract_all.Rd
docxtractr/man/print.docx.Rd
docxtractr/man/docx_extract_all_tbls.Rd
docxtractr/LICENSE