calculate_fdr: Calculates qvalues on the subset PSMs.

Description Usage Arguments Value Examples

View source: R/functions.R

Description

Calculates qvalues on the subset PSMs.

Usage

1
calculate_fdr(df, score_higher = TRUE)

Arguments

df

dataframe with at least 3 columns:

score

score assigned to the peptide to spectrum match (PSM).

subset

TRUE if PSM belongs to the subset in interest, FALSE or otherwise.

decoy

TRUE if decoy PSM, FALSE otherwise.

Additional columns are allowed but ignored. Target and decoy PSMs are assumbed to be from a competitive target decoy database search.

score_higher

TRUE if a higher score means a better PSM.

Value

A data frame containing all columns in “df”. Following columns are added:

pi_0_cons

conservative estimation of π_0.

FDR

estimated subset PSM qvalues calculated according the competitive target decoy approach.

FDR_BH

estimated subset PSM qvalues calculated according the Benjamini Hochbergh procedure. When provided, non-subset decoy PSMs are used to stabilize estimates in small subsets

FDR_stable

estimated subset PSM qvalues calculated with “pi_0_cons”. When provided, non-subset decoy PSMs are used to stabilize estimates in small subsets

Examples

1
2
3
4
5
6
7
8
## Simulate a dataset with 140 correct target subset PSMs, 60 incorrect target subset PSMS,
## 60 decoy subset PSMs and 2000 additional decoy PSMs.
set.seed(10)
d = sample_dataset(H1_n = 140,H0_n = 60, decoy_n = 60 ,decoy_large_n = 2000)

## calculate the qvalues in the subset target PSMS according the classical target-decoy approach
## and our more stable estimation method.
calculate_fdr(d)

compomics/search-all-assess-subset documentation built on May 13, 2019, 9:55 p.m.