stream - Infrastructure for Data Stream Mining - R package


A framework for data stream modeling and associated data mining tasks such as clustering and classification. The development of this package was supported in part by NSF IIS-0948893 and NIH R21HG005912.

Additional packages in the stream family are:


Stable CRAN version: install from within R with
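The usual way to install a released CRAN package from within R is `install.packages()`. For this package the call would look like this:

```r
# Install the released version of the package from CRAN
install.packages("stream")
```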


Current development version: Download package from AppVeyor or install from GitHub (needs devtools).
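A minimal sketch of the GitHub route using devtools. The repository path `mhahsler/stream` is an assumption (it is not stated here), so adjust it if the sources live elsewhere.

```r
# install.packages("devtools")  # only needed if devtools is not yet installed

# Repository path is an assumption; replace it if the project is hosted elsewhere
devtools::install_github("mhahsler/stream")
```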



Load the package and create micro-clusters via sampling.

library("stream")
stream <- DSD_Gaussians(k=3, noise=0)

sample <- DSC_Sample(k=20)
update(sample, stream, 500)
sample
## Reservoir sampling
## Class: DSC_Sample, DSC_Micro, DSC_R, DSC
## Number of micro-clusters: 20
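To illustrate the idea behind reservoir sampling, here is a base-R sketch of the classic "Algorithm R". It is not the package's implementation of `DSC_Sample`. The function name `reservoir_sample` and the simulated data are hypothetical and exist only for this example.

```r
# Reservoir sampling (Algorithm R): keep a uniform random sample of k rows
# from a sequence of rows that is processed one at a time.
# `reservoir_sample` is a hypothetical helper, not part of the stream package.
reservoir_sample <- function(points, k) {
  n <- nrow(points)
  # Fill the reservoir with the first k rows (or all rows if fewer than k)
  reservoir <- points[seq_len(min(k, n)), , drop = FALSE]
  if (n > k) {
    for (i in (k + 1):n) {
      # Row i replaces a random reservoir slot with probability k / i
      j <- sample.int(i, 1)
      if (j <= k) reservoir[j, ] <- points[i, ]
    }
  }
  reservoir
}

set.seed(1)
points <- matrix(rnorm(1000), ncol = 2)  # 500 simulated two-dimensional points
res <- reservoir_sample(points, k = 20)
nrow(res)  # 20
```

Each retained row plays the role of one micro-cluster center, which is why the reservoir size matches the number of micro-clusters reported above.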

Recluster the micro-clusters using k-means and plot the results.

kmeans <- DSC_Kmeans(k=3)
recluster(kmeans, sample)
plot(kmeans, stream, type="both")
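The reclustering step can be pictured with base R alone. The sketch below runs `stats::kmeans` on a small matrix standing in for 20 micro-cluster centers. It is an assumption-laden illustration, not how `DSC_Kmeans` is implemented: the simulated centers are made up, and any micro-cluster weights are ignored.

```r
set.seed(1)

# Hypothetical stand-in for 20 micro-cluster centers scattered around
# three well-separated locations
true_centers <- matrix(c(0, 0,
                         5, 5,
                         0, 5), ncol = 2, byrow = TRUE)
idx <- rep(1:3, length.out = 20)
micro <- true_centers[idx, ] + matrix(rnorm(40, sd = 0.3), ncol = 2)

# Group the micro-cluster centers into k = 3 macro-clusters
macro <- stats::kmeans(micro, centers = 3, nstart = 10)

macro$centers  # one row per macro-cluster
macro$cluster  # macro-cluster assignment of each micro-cluster
```

Clustering the summarized micro-clusters rather than the raw stream keeps the offline step cheap, since only 20 points need to be processed.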



stream documentation built on May 29, 2017, 9:29 a.m.