bdpar: Big Data Preprocessing Architecture

Provide a tool to easily build customized data flows to pre-process large volumes of information from different sources. To this end, 'bdpar' allows to (i) easily use and create new functionalities and (ii) develop new data source extractors according to the user needs. Additionally, the package provides by default a predefined data flow to extract and pre-process the most relevant information (tokens, dates, ... ) from some textual sources (SMS, Email, tweets, YouTube comments).

Package details

AuthorMiguel Ferreiro-Díaz [aut, cre], David Ruano-Ordás [aut, ctr], Tomás R. Cotos-Yañez [aut, ctr], José Ramón Méndez Reboredo [aut, ctr], University of Vigo [cph]
MaintainerMiguel Ferreiro-Díaz <miguel.ferreiro.diaz@gmail.com>
LicenseGPL-3
Version3.0.3
URL https://github.com/miferreiro/bdpar
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("bdpar")

Try the bdpar package in your browser

Any scripts or data that you put into this service are public.

bdpar documentation built on Aug. 22, 2022, 5:08 p.m.