Here, one can download the data used for training, testing and evaluating this method.
File Format: Fasta
Header: <Positive/Negative>ID_<IEDB_Epitope_ID>
The header explains whether the sequence has a negative or positive epitope sequence mapped onto it.
Epitope: Uppercased
Non-Epitope: Lowercased
Download: IEDB Linear Epitope Dataset
File Format: Fasta
Header: <PDBID>_<Heavy_Chain_ID><Light_Chain_ID> <Antigen_Chain_ID> <Partition>
The header explains which antibody chains and antigen chains in a given PDB have been used. The partition is which of the 5 randomly split partitions it belonged to and the partition EVAL is the completely left out PDBs.
Epitope: Uppercased
Non-Epitope: Lowercased
Download: PDB Combined Epitopes Dataset