fine_cluster_seqs: Calculate distances and perform hierarchical clustering on a...
In CellaRepertorium: Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

Description Usage Arguments Value See Also Examples

The distances between AA sequences is defined to be 1-score/max(score) times the median length of the input sequences. The distances between nucleotide sequences is defined to be edit_distance/max(edit_distance) times the median length of input sequences.

fine_cluster_seqs(
  seqs,
  type = "AA",
  big_memory_brute = FALSE,
  method = "levenshtein",
  substitution_matrix = "BLOSUM100",
  cluster_fun = "none",
  cluster_method = "complete"
)

`seqs`	character vector, DNAStringSet or AAStringSet
`type`	character either `AA` or `DNA` specifying type of `seqs`
`big_memory_brute`	attempt to cluster more than 4000 sequences? Clustering is quadratic, so this will take a long time and might exhaust memory
`method`	one of 'substitutionMatrix' or 'levenshtein'
`substitution_matrix`	a character vector naming a substitution matrix available in Biostrings, or a substitution matrix itself
`cluster_fun`	`character`, one of "hclust" or "none", determining if distance matrices should also be clustered with `hclust`
`cluster_method`	character passed to `hclust`

list

hclust(), Biostrings::stringDist()

fasta_path = system.file('extdata', 'demo.fasta', package='CellaRepertorium')
aaseq = Biostrings::readAAStringSet(fasta_path)[1:100]
cls = fine_cluster_seqs(aaseq, cluster_fun = 'hclust')
plot(cls$cluster)

CellaRepertorium documentation built on Nov. 8, 2020, 7:48 p.m.

CellaRepertorium index

README.md An Introduction to CellaRepertorium Clustering and differential usage of repertoire CDR3 sequences Combining Repertoire with Expression with SingleCellExperiment Quality control and Exploration of UMI-based repertoire data

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

CellaRepertorium
Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

fine_cluster_seqs: Calculate distances and perform hierarchical clustering on a...
In CellaRepertorium: Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

Description

Usage

Arguments

Value

See Also

Examples

Related to fine_cluster_seqs in CellaRepertorium...

R Package Documentation

Browse R Packages

We want your feedback!

CellaRepertorium Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

fine_cluster_seqs: Calculate distances and perform hierarchical clustering on a... In CellaRepertorium: Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

Description

Usage

Arguments

Value

See Also

Examples

Related to fine_cluster_seqs in CellaRepertorium...

R Package Documentation

Browse R Packages

We want your feedback!

CellaRepertorium
Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)

fine_cluster_seqs: Calculate distances and perform hierarchical clustering on a...
In CellaRepertorium: Data structures, clustering and testing for single cell immune receptor repertoires (scRNAseq RepSeq/AIRR-seq)