parse_ids: Parse protein ids. This is useful to remove additionnal...

View source: R/pannot.R

parse_idsR Documentation

Parse protein ids. This is useful to remove additionnal information such as isoform or entry name.

Description

Parse protein ids. This is useful to remove additionnal information such as isoform or entry name.

Usage

parse_ids(x, sep_split = ";", sep_secondary = c("|", "-"), sep_collapse = ";")

Arguments

x

Character string with protein ids

sep_split

Character separating different protein ids

sep_secondary

Character separating UniProt entry from other infromation in protein ids

sep_collapse

Character used to separate different protein ids after parsing

Examples

ids <- "A2AMW0|A2AMW0_MOUSE; P47757-2|CAPZB_MOUSE; P47757-4|CAPZB_MOUSE; Q3TVK4|Q3TVK4_MOUSE"
parse_ids(ids, sep = "; ", sep_secondary=c("|", "-"), sep_collapse = ";")

ids <- c("Q5SWU9|ACACA_MOUSE", "Q9ES52-2|SHIP1_MOUSE; Q9ES52|SHIP1_MOUSE", "Q8VDD5|MYH9_MOUSE")
parse_ids(ids, sep = "; ", sep_secondary=c("|", "-"), sep_collapse = ";")


VoisinneG/pannot documentation built on July 5, 2023, 4:19 p.m.