remove_duplicate_biopax_components: Remove Duplicate Components from BioPAX Object

Description Usage Details Author(s)

Description

BioPAX objects frequently have components that are identical in everything, but id. This function removes all such duplicates, and changes all property_attr_values accordingly.

Usage

1

Details

The biopax is processed iteratively.
First, two sets of properties are used to identify and remove:
1) duplicate term-db-id, db-id, and position-status instances;
2) duplicate instances that refer to instance listed in 1) – they have properties xref, term, evidenceCode, featureLocation, modificationType, sequenceIntervalBegin, sequenceIntervalEnd.

Second, two sets of classes are used to identify and remove:
1) duplicate Protein, SmallMolecule, PhysicalEntity, RNA, DNA instances;
2) duplicate Complex instances.

Author(s)

Ivan Grishagin


grishagin/RIGbiopax documentation built on May 24, 2019, 1:33 a.m.