Clusters sequences hierarchically using regular expressions. At each step, the number of degrees of freedom of all regular expressions needed to describe the data is minimized.

```
cluster_reg_exp(ngrams)
```

`ngrams`: list of elements

A regular expression is a list whose length equals the length of the input sequences. Each element of the list represents a position in the sequence and contains the amino acids that are likely to occur at that position.
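As a minimal sketch of this representation (not the package's internal code), a regular expression for sequences of length 4 can be written in base R as a list of character vectors, one per position. The names `reg_exp` and `matches_reg_exp` and the example residues are hypothetical and chosen only for illustration.

```r
# Hypothetical regular expression for sequences of length 4:
# each element lists the amino acids allowed at that position.
reg_exp <- list(
  c("A", "V"),       # position 1
  c("L"),            # position 2
  c("A", "G", "S"),  # position 3
  c("K", "R")        # position 4
)

# Illustrative helper (an assumption, not part of the package):
# TRUE if `seq` has the right length and every residue is allowed
# at its position.
matches_reg_exp <- function(seq, reg_exp) {
  length(seq) == length(reg_exp) &&
    all(mapply(function(residue, allowed) residue %in% allowed, seq, reg_exp))
}

matches_reg_exp(c("A", "L", "G", "K"), reg_exp)  # TRUE
matches_reg_exp(c("A", "L", "P", "K"), reg_exp)  # FALSE: "P" is not allowed at position 3
```

Under this sketch, a sequence matches the regular expression only if it has the same length as the list and each of its residues is among those allowed at the corresponding position.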

A list of four elements:

"regExps": regular expressions in the best clustering

"seqClustering": clustering of sequences in the best clustering

"allRegExps": all regular expressions

"allIndices": all clusterings

```
data(human_cleave)
# cluster_reg_exp is computationally expensive
results <- cluster_reg_exp(human_cleave[1L:10, 1L:4])
```
