docxtractr: Extract Data Tables and Comments from Microsoft Word Documents
Version 0.2.0

Microsoft Word docx files provide an XML structure that is fairly straightforward to navigate, especially when it applies to Word tables and comments. Tools are provided to determine table count/structure, comment count and also to extract/clean tables and comments from Microsoft Word docx documents.

AuthorBob Rudis [aut, cre]
Date of publication2016-07-20 09:34:55
MaintainerBob Rudis <bob@rudis.net>
LicenseMIT + file LICENSE
Version0.2.0
Package repositoryView on CRAN
InstallationInstall the latest version of this package by entering the following in R:
install.packages("docxtractr")

Popular man pages

assign_colnames: Make a specific row the column names for the specified...
docx_describe_cmnts: Returns information about the comments in the Word document
docx_describe_tbls: Returns a description of all the tables in the Word document
docx_extract_all_cmnts: Extract all comments from a Word document
docx_extract_all_tbls: Extract all tables from a Word document
docxtractr: docxtractr is an R package for extracting tables and comments...
read_docx: Read in a Word document for table extraction
See all...

All man pages Function index File listing

Man pages

assign_colnames: Make a specific row the column names for the specified...
docx_cmnt_count: Get number of comments in a Word document
docx_describe_cmnts: Returns information about the comments in the Word document
docx_describe_tbls: Returns a description of all the tables in the Word document
docx_extract_all: Extract all tables from a Word document
docx_extract_all_cmnts: Extract all comments from a Word document
docx_extract_all_tbls: Extract all tables from a Word document
docx_extract_tbl: Extract a table from a Word document
docx_tbl_count: Get number of tables in a Word document
docxtractr: docxtractr is an R package for extracting tables and comments...
print.docx: Display information about the document
read_docx: Read in a Word document for table extraction

Functions

assign_colnames Man page Source code
docx_cmnt_count Man page Source code
docx_describe_cmnts Man page Source code
docx_describe_tbls Man page Source code
docx_extract_all Man page Source code
docx_extract_all_cmnts Man page Source code
docx_extract_all_tbls Man page Source code
docx_extract_tbl Man page Source code
docx_tbl_count Man page Source code
docxtractr Man page
docxtractr-package Man page
ensure_docx Source code
has_header Source code
is_docx Source code
is_url Source code
print.docx Man page Source code
read_docx Man page Source code

Files

inst
inst/examples
inst/examples/comments.docx
inst/examples/data3.docx
inst/examples/none.docx
inst/examples/data.docx
inst/examples/realworld.docx
inst/examples/complex.docx
tests
tests/testthat.R
tests/testthat
tests/testthat/test-docxtractr.R
NAMESPACE
NEWS.md
R
R/extract_all.r
R/aaa.r
R/read_docs.r
R/utils.r
R/docxtractr-package.r
R/describe.r
R/assign_colnames.r
R/docx_find_tbls.r
MD5
DESCRIPTION
man
man/docx_extract_all_cmnts.Rd
man/docx_describe_cmnts.Rd
man/read_docx.Rd
man/docx_describe_tbls.Rd
man/docxtractr.Rd
man/assign_colnames.Rd
man/docx_tbl_count.Rd
man/docx_extract_tbl.Rd
man/docx_cmnt_count.Rd
man/docx_extract_all.Rd
man/print.docx.Rd
man/docx_extract_all_tbls.Rd
LICENSE
docxtractr documentation built on May 19, 2017, 6:12 p.m.

Questions? Problems? Suggestions? Tweet to @rdrrHQ or email at ian@mutexlabs.com.

Please suggest features or report bugs in the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.