README.md

Panel Informativity Optimizer (PIO)

PIO is an open-source software created to help researchers and biologists in designing their NGS panels.

Based on individual mutational profiles available on public cancer genomics databases or in private databases, PIO proposes a tool to either select the minimal set of genes or exons to achieve absolute informativity (100% of the patients with at least one mutation) or assess the informativity of a given panel in a cancer of interest. Several others features are proposed by PIO, such as the exploration of mutation data frequency and distribution. By combining metrics on informativity and panel length, PIO can help making accurate choices in the design of panels, and might also be used to easily benchmark available commercial solutions.

Getting started

Online version

PIO has a ready-to-use online version available at https://vincentalcazer.shinyapps.io/Panel_informativity_optimizer/.

Local version

You can install the local development version from GitHub either by cloning the repository or directly downloading the package in R. Note that dependencies will have to be installed first in case of repository cloning.


install.packages("remotes")
remotes::install_github("VincentAlcazer/PIO")

shiny::runApp("path_to_PIO")

Quick tutorial

For a complete tutorial, please visit the vignette.

Parameters overview

Parameters can be selected in the left panel.

1. Datasets

The training dataset can be selected here.

An example dataset is provided (Custom_dataset_example.tsv). Other example of datasets from cbioportal can be found in the raw_mutation_data folder.

Note that preloaded datasets can be merged with custom datasets to enable a global analysis on a larger base (can be done with a local use only).

2. Analysis parameters

Analysis mode

Optimal panel selection (optimal mode), custom panel interrogation (custom mode) or panel test. See the vignette for more informations about the mods.

Group mutations by

Should mutations (and their respective length) be grouped by gene or exon/intron? (default: gene)

Informativity metric

The number of unique patients per kilobase (UPKB) or the number of unique patients (UP) can be used as informativity metric. (default: UPKB)

Min patients/mutation

A minimal number of patients per mutation, calculated on the overall cohort, can be set in order to avoid overfitting to private mutations. (default: 2)

Min. mutations/patient

How many mutations per patient you would like PIO to aim at. (default: 2)

Panel upload

For custom analysis: a custom panel should be uploaded. The uploaded file should contain only one column containing the list of gene or exon names. An example is provided (Custom_panel_example.tsv).

Proposed parameters according to applications

Best panel establishment

  1. Dataset: select a preloaded or upload a custom dataset
  2. Parameters: PIO optimal
  3. Group mutation by: exon/intron or gene
  4. Informativity metric: UP or UPKB
  5. Min. patients/mutations: at least 2
  6. Min. mutations/patient: between 1 and 5

With these parameters, PIO will selects an optimal set of mutations allowing to maximize informativity in a given disease, targeting a selected number of mutations per patient (informativity level).

Custom panel evaluation

  1. Dataset: select a preloaded or upload a custom dataset
  2. Parameters: PIO custom (do not forget to upload your custom panel below)
  3. Group mutation by: exon/intron or gene
  4. Informativity metric: UP or UPKB
  5. Min. patients/mutations: at least 2
  6. Min. mutations/patient: between 1 and 5

Using these parameters, PIO will select an optimal set of mutations among the uploaded panel allowing to maximize informativity in a given disease. PIO will automatically suggest the most informative mutations to add to optimize panel informativity.

Custom panel size optimization

  1. Dataset: select a preloaded or upload a custom dataset
  2. Parameters: PIO custom (do not forget to upload your custom panel below)
  3. Group mutation by: exon/intron is recommanded for panel size optimization.
  4. Informativity metric: UPKB is the most adapted metric for panel size optimization.
  5. Min. patients/mutations: at least 2
  6. Min. mutations/patient: between 1 and 5

In this configuration, PIO will propose an optimized panel allowing to maximize informativity with for the minimal size. Different panels can be compared by manually uploading individual panels.

Comparison of different panel performances

  1. Dataset: select a preloaded or upload a custom dataset
  2. Parameters: Panel test (do not forget to upload your custom panel below)
  3. Group mutation by: according to the custom panel: gene or exon/intron
  4. Informativity metric: UP or UPKB
  5. Min. patients/mutations: 1
  6. Min. mutations/patient: between 1 and 5

In this configuration, PIO will show the performances of the uploaded panel in a given dataset without further mutation selection/panel optimization.

Mutation exploration

  1. Dataset: select a preloaded or upload a custom dataset
  2. Parameters: PIO optimal
  3. Group mutation by: gene (or exon/intron)
  4. Informativity metric: UP or UPKB
  5. Min. patients/mutations: 1
  6. Min. mutations/patient: 1

Using these parameters will optimize mutations exploration for a given disease in the mutations tab.



VincentAlcazer/PIO documentation built on April 14, 2022, 4:03 a.m.