README.md

ypssc: Yeast-Proteome Secondary-Structure Calculator

An extension for 'NetSurfP-2.0' which is specifically designed to analyze the results of bottom-up-proteomics that is primarily analyzed with 'MaxQuant'. This tool is designed to process a large number of yeast peptides that produced as a results of whole yeast cell-proteome digestion and provide a coherent picture of secondary structure of proteins.

What is ypssc?

ypssc is an extension for NetSurfP-2.0 which is specifically designed to analyze the results of bottom-up proteomics that is primarily analyzed with MaxQuant. We call this tool Yeast Proteome Secondary Structure Calculator (ypssc). This tool is designed to process a large number of yeast peptides that produced as a results of whole yeast cell-proteome digestion and provide a coherent picture of secondary structure of proteins.

Functionalities in ypssc:

  1. findSecondary
  2. findAlpha
  3. findBeta
  4. findChain

Note: NetSurfP

Why this package?

This tool is designed to process large number of yeast peptides that produced as a results of whole yeast cell proteome digestion and provide a coherent picture of secondary structure of proteins. NetSurfP-2.0 is not designed to do this task.

Drawbacks of NetSurfP-2.0**

Advantages of ypssc

Methodology

The SSDYP contains structural information for all amino acids of whole yeast proteome (Over 3000,000 amino acids) which contains over 6700 proteins. For a hypothetical protein, the SSDYP contains the ID of the protein, amino acids with numbers and structural information for each amino acid. Focusing on the hypothetical protein, in the real sample, there are many peptides identified from the hypothetical protein. ypssc first finds all the peptides that belongs to the hypothetical protein and arrange them based on the numbers of the amino acids; then it removes the parts of the protein that have been identified more than once in multiple peptides and collapses the population of identified peptides in the sample into one sequence that represents the coverage of the hypothetical protein. The result would show that which part of the protein is identified, and which part is missing. Then, ypssc matches the the sequence that identified in the sample with SSDYP to find the structural information about amino acids.



Stasharofi/ypssc documentation built on Dec. 18, 2021, 2:08 p.m.