Man pages for hrbrmstr/warc
Tools to Work with the Web Archive Ecosystem

as_warc                 Convert an 'httr::response' object to WARC response objects
create_cdx              Create a CDX from a WARC file
create_warc_wget        Use wget to create a WARC archive for a URL list
expand                  Expand a compressed raw buffer
find_sequence           Find the first occurrence (if any) of a sequence of raw bytes...
gz_close                Close the gz file
gz_eof                  Test for end of file
gz_flush                Flush current gzip stream
gz_fseek                Sets the starting position for the next 'gz_read()' or...
gz_gets                 Read a line from a gz file
gz_gets_raw             Read a line from a gz file
gzip_inflate_from_pos   Inflate a gzip stream from a file
gz_offset               Return the current raw compressed offset in the file
gz_open                 Open a gzip file for reading or writing
gz_read_char            Read from a gz file into a character vector
gz_read_raw             Read from a gz file into a raw vector
gz_seek                 Sets the starting position for the next 'gz_read()' or...
gz_tell                 Return the current raw uncompressed offset in the file
gz_write_char           Write an atomic character vector to a file
gz_write_raw            Write a raw vector to a gz file
print.cdx               Display a CDX object
read_cdx                Read a WARC CDX index file
read_warc_entry         Read a WARC entry from a WARC file
warc                    Tools to Work with the Web Archive Ecosystem
warc_headers            Extract WARC headers from a WARC response object
write_warc_record       Write a WARC record to a file
hrbrmstr/warc documentation built on May 17, 2019, 5:53 p.m.