Training and testing data sets

The datasets for training and testing SignalP5.0 can be found here. Both datasets are in 3-line FASTA format:

>Uniprot_AC|Kingdom|Type|Partition No
amino-acid sequence
annotation [S: Sec/SPI signal peptide | T: Tat/SPI signal peptide | L: Sec/SPII signal peptide | I: cytoplasm | M: transmembrane | O: extracellular]

Training set: download

Benchmark set: download