Why Choose PRESGENE?
Unique ML strategies for Gene Essentiality prediction in Prokaryotes and Eukaryotes
Unique ML strategies for Gene Essentiality prediction in Prokaryotes and Eukaryotes
Total 289 Features (Network topological features, Sequence-based features)
Prediction of essential genes help to find minimal genes indispensable for the survival of any organism.
Available machine learning techniques for essential gene predictions are inherent with problems like imbalanced provision of training datasets, choice of a best model biased for a given balanced dataset, choice of a complex machine learning algorithm, and data-based automated selection of biologically relevant features for classification.
Essential gene prediction helps to find minimal genes indispensable for appropriate cellular function and survival of any organism. Machine learning (ML) algorithms have been useful for prediction of gene essentiality and annotation. ...
The server provides the users with three channels or ways of predicting the essential genes via the PRESGENE server
The prediction of essential genes by ML Strategy 1 or ML Strategy 2 for 14 sample model organisms, including both prokaryotes and eukaryotes with PRESGENE feature matrix (289 Features i.e., diverse set of biological features such as sequence and network topological features derived from flux coupling and reaction network)
The prediction of essential genes by ML Strategy 1 or ML Strategy 2 for a new organism with PRESGENE feature matrix (289 Features i.e., diverse set of biological features such as sequence and network topological features derived from flux coupling and reaction network)
The prediction of essential genes by ML Strategy 1 or ML Strategy 2 for a new organism with User Created Feature Matrix
Essential genes are the genes which are indispensable for the survival of any organism.
Five types of Input Files are required. These are, Genome Scale Reconstructed Metabolic Network, fasta file containing the coding nucleotide sequences of the genes, fasta file containing the ribosomal sequences of the genes, protein sequences of the target organisms and gene essentiality label from experiments.
Yes
PRESGENE has two types of Machine learning strategies. Strategy 1 is developed for essential gene prediction where sufficient labeled data (>30%) is available. Strategy 2 is developed for essential gene prediction where limited data (1% labeled data of the genome) is available.
Total 289 features can be calculated.