hardclip_correct_umi_features: Hard clip correct UMI features (ce and cd)

View source: R/load_BAM.R

hardclip_correct_umi_features    R Documentation


Description

This function identifies and corrects for hard-clipped bases in the UMI (Unique Molecular Identifier) features of sequencing data. Hard clipping, indicated by 'H' in the CIGAR string of a BAM record, marks bases at the ends of a read that were not aligned to the reference genome and have been removed from the stored read sequence. The function adjusts the UMI feature values so that they cover only the bases that remain in the read, which keeps them consistent with the sequence used in downstream analyses. The correction is applied to both the 'ce' and 'cd' UMI feature columns of the supplied data.frame.

Usage

hardclip_correct_umi_features(df)

Arguments

df

A data.frame converted from the list-of-lists output of scanBam (Rsamtools).

Value

The input data.frame with hard-clip-corrected 'ce' and 'cd' UMI feature columns.
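
Examples

The following is a minimal sketch of the kind of correction described above, not the package's implementation. It assumes that a UMI feature such as 'cd' is stored as a list column holding one per-base integer vector per read, that each vector still covers the hard-clipped bases, and that correcting means dropping the leading and trailing entries counted by the 'H' operations in the CIGAR string. The helpers count_hardclips and trim_hardclipped and the column cd_trimmed are hypothetical names introduced only for illustration.

```r
# Hypothetical helper: number of hard-clipped bases at each end of a CIGAR string.
count_hardclips <- function(cigar) {
  lead <- regmatches(cigar, regexpr("^[0-9]+(?=H)", cigar, perl = TRUE))
  trail <- regmatches(cigar, regexpr("[0-9]+(?=H$)", cigar, perl = TRUE))
  c(
    leading = if (length(lead) == 1) as.integer(lead) else 0L,
    trailing = if (length(trail) == 1) as.integer(trail) else 0L
  )
}

# Hypothetical helper: drop the per-base values that belong to hard-clipped bases.
# Assumes `values` has one entry per base of the unclipped read.
trim_hardclipped <- function(values, cigar) {
  clips <- count_hardclips(cigar)
  keep_from <- clips[["leading"]] + 1L
  keep_to <- length(values) - clips[["trailing"]]
  if (keep_to < keep_from) {
    return(values[0])
  }
  values[keep_from:keep_to]
}

# Toy data standing in for a data.frame derived from scanBam output.
df <- data.frame(cigar = c("2H4M", "6M", "1H3M2H"), stringsAsFactors = FALSE)
df$cd <- list(
  c(9L, 9L, 8L, 7L, 7L, 6L),
  1:6,
  c(5L, 4L, 3L, 2L, 1L, 0L)
)

# Apply the trimming read by read; the same call would be repeated for 'ce'.
df$cd_trimmed <- Map(trim_hardclipped, df$cd, df$cigar)
```

In this toy input, the first read loses its first two values, the second read is unchanged, and the third read loses one leading and two trailing values. With real data, hardclip_correct_umi_features(df) would be called directly on the data.frame.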


JakobPedersenLab/dreams documentation built on Feb. 2, 2024, 3:14 p.m.