fasta_filter: Filter a fasta file

Description Usage Arguments Value To do Examples

View source: R/bioinfo.R

Description

Read a fasta file and write back a fasta file keeping only sequences with less than max_ambig and between min and max length. The file name is changed from xxx.fas to xxx.filtered.fas. Prints also the number of sequences in and out.

Usage

1
2
3
4
5
6
7
fasta_filter(
  file_name,
  min_length = 100,
  max_length = 10000,
  type = "AA",
  max_ambig = 5
)

Arguments

file_name

Name the input fasta file

min_length

Minimum length of sequences to keep

max_length

Maximum length of sequences to keep

type

"DNA" or "AA"

max_ambig

Maximum number of ambiguites

Value

TRUE if it terminates OK

To do

Nothing =)

Examples

1
fasta_filter("protein.faa", min_length=100, max_length=10000, type="AA", max_ambig=5)

vaulot/dvutils documentation built on Nov. 20, 2021, 11:01 a.m.