| fgga | R Documentation | 
A hierarchical graph-based machine learning model for the consistent GO, PO, ZFA, HPO annotation of protein coding genes.
fgga(graphOnto, tableOntoTerms, dxCharacterized, dxTestCharacterized,
    kFold, kernelSVM, tmax, epsilon)
graphOnto | 
 A graphNEL graph with ‘m’ Ontology node labels.  | 
tableOntoTerms | 
 A binary matrix with ‘n’ proteins (rows) by ‘m’ Ontology node labels (columns).  | 
dxCharacterized | 
 A data frame with ‘n’ proteins (rows) by ‘f’ features (columns).  | 
dxTestCharacterized | 
 A data frame with ‘k’ proteins (rows) by ‘f’ features (columns).  | 
kFold | 
 An integer for the number of folds.  | 
kernelSVM | 
 The kernel used to calculate the variance (default: radial).  | 
tmax | 
 An integer indicating the maximum number of iterations (default: 200).  | 
epsilon | 
 A real value less than 1 that represents the convergence criteria (default: 0.001).  | 
The FGGA model is built in two main steps. In the first step, a core Factor Graph (FG) modeling hidden Ontology-term predictions and relationships is created. In the second step, the FG is enriched with nodes modeling observable Ontology-term predictions issued by binary SVM classifiers. In addition, probabilistic constraints modeling learning gaps between hidden and observable Ontology-term predictions are introduced. These gaps are assumed to be independent among Ontology-terms, locally additive with respect to observed predictions, and zero-mean Gaussian. FGGA predictions are issued by the native iterative message passing algorithm of factor graphs.
A named matrix with ‘k’ protein coding genes (rows) by ‘m’ cross-Ontology node labels (columns) where each element indicates a probabilistic prediction value.
Flavio E. Spetale and Elizabeth Tapia <spetale@cifasis-conicet.gov.ar>
Spetale F.E., Tapia E., Krsticevic F., Roda F. and Bulacio P. “A Factor Graph Approach to Automated GO Annotation”. PLoS ONE 11(1): e0146986, 2016.
Spetale Flavio E., Arce D., Krsticevic F., Bulacio P. and Tapia E. “Consistent prediction of GO protein localization”. Scientific Report 7787(8), 2018
fgga2bipartite, sumProduct, svmOnto
data(CfData)
mygraphGO <- as(CfData[["graphCfGO"]], "graphNEL")
dxCfTestCharacterized <- CfData[["dxCf"]][CfData[["indexGO"]]$indexTest[1:2], ]
myTableGO <- CfData[["tableCfGO"]][
                    CfData[["indexGO"]]$indexTrain[1:300], ]
dataTrain <- CfData[["dxCf"]][
                    CfData[["indexGO"]]$indexTrain[1:300], ]
fggaResults <- fgga(graphOnto = mygraphGO,
                tableOntoTerms = myTableGO, dxCharacterized = dataTrain,
                dxTestCharacterized = dxCfTestCharacterized, kFold = 2,
                tmax = 50, epsilon = 0.05)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.