DTU Health Tech
Department of Health Technology
This link is for the general contact of the DTU Health Tech institute.
If you need help with the bioinformatics programs, see the "Getting Help" section below the program.
The ChloroP server predicts the presence of chloroplast transit peptides (cTP) in protein sequences and the location of potential cTP cleavage sites.
NOTE: This service is superseded by TargetP which integrates predictions of chloroplast transit peptides, signal peptides and mitochondrial transit peptides. |
Include the N-terminus
It is strongly recommended to include the N-terminus of the submitted sequence.
The further from the N-terminal residue the submitted sequence starts, the more
difficult and unreliable the prediction will be.
Submit preferably around 100 residues
Submit if possible about 100 N-terminal residues. The suggested length is due
to the fact that the "cTP"/"no cTP" predictor was trained with input sequences
of length 100 residues. However, shorter sequences may also be satisfactory
predicted (it is more important that the N-terminal part is intact). The
cleavage site prediction is restricted to search for a potential cleavage site
within the 100 most N-terminal residues and is in itself not influenced by
sequence length.
Sequence names
The name of the sequences may be of any length, but only the first 20
characters will be preserved throughout the prediction and presented on the
ChloroP prediction result page.
Detailed output
Check this box if you would like the neural network score also for each
residue (and not only for each sequence) to be presented.
A derivative of the network score used for finding the area in which the
cleavage site is searched for (the 40 residues surrounding the residue with
the highest derivative score) is also presented, along with the cleavage site
score (CS-score) for each residue.
If "Detailed output" is chosen, the neural network score for each residue will also be presented. The higher this score, the more certain is the network that this residue is part of a cTP. A derivative of the network score is also presented. This score is used for finding the area in which the cleavage site is searched for - namely among the 40 residues surrounding the residue with the highest derivative score. Finally, the cleavage site score (CS-score) is presented for each residue. This score is calculated from a scoring matrix derived from an automatic motif finding algorithm called MEME at sdsc. The cleavage site score is defined so that the predicted cleavage site is directly N-terminal of the highest scoring residue within the 40 residues. Thus, there might exist one or several CS-score(s) that happen(s) to be greater than the score of the proposed cTP length, but since they are located outside of the 40 residues around the highest derivative score, the presented cTP length is still what ChloroP considers as the most likely presequence length (ie. corresponding to the most likely cleavage site).
Here are the 75 sequences containing cTP:
Here are the 75 sequences not containing cTP:
1
Department of Biochemistry, Stockholm University,
S-106 91 Stockholm, Sweden
2
Center for Biological Sequence Analysis, BioCenterum-DTU,
Technical University of Denmark, DK-2800 Lyngby, Denmark
We present a neural network based method (ChloroP) for identifying chloroplast transit peptides and their cleavage sites. Using cross-validation, 88% of the sequences in our homology reduced training set were correctly classified as transit peptides or nontransit peptides. This performance level is well above that of the so far only publicly available chloroplast localization predictor PSORT. Cleavage sites are predicted using a scoring matrix derived by an automatic motif-finding algorithm. Approximately 60% of the known cleavage sites in our sequence collection were predicted to within +- 2 residues from the cleavage sites given in SWISS-PROT. An analysis of 715 A. thaliana sequences from SWISS-PROT suggests that the ChloroP method should be useful for the identification of putative transit peptides in genome-wide sequence data.
3. The first 20 characters of sequence names will be preserved
throughout the prediction and presented on the ChloroP prediction result page
(as compared to 11 in previous versions. Wow! Great improvement, isn't it?).
If you need help regarding technical issues (e.g. errors or missing results) contact Technical Support. Please include the name of the service and version (e.g. NetPhos-4.0) and the options you have selected. If the error occurs after the job has started running, please include the JOB ID (the long code that you see while the job is running).
If you have scientific questions (e.g. how the method works or how to interpret results), contact Correspondence.
Correspondence:
Technical Support: