augment_annotation: Include additional variables and UniProt accessions

augment_annotationR Documentation

Include additional variables and UniProt accessions

Description

All UniProt accessions that match each amino acid sequence will be added to the data.table. Currently the data.table only consists of the highest scoring UniProt accession for each amino acid sequence. Additional variables are also created from the current variables.

Usage

augment_annotation(x, fst_path, fst_name)

Arguments

x

A data.table output from the read_toppic function.

fst_path

A character string specifying the path to the protein database (.fasta) file.

fst_name

A character string containing the name of the .fasta file.

Value

A data.table with all protein accessions that match each amino acid sequence, not just the highest scoring accession. The number of rows in the output could be much larger than the number of rows in the input because all accessions for each sequence are included.

Author(s)

Vlad Petyuk, Evan A Martin


evanamartin/TopPICR documentation built on Dec. 9, 2022, 8:05 p.m.