The data sets used in the benchmark calculationis are given below in the FASTA format. For each entry is given the Uniprot of the protein "hosting" the epitope, the epitope sequence, and the HLA full-type information (if applicable), and the HLA supertype (if applicable).
An example showing part of such a fasta file is given below
>sp|O43707|ACTN4_HUMAN AIDQLHLEY HLA-A*0101 A1 MVDYHAANQSYQYGPSSAGNGAGGGGSMGDYMAQEDDWDRDLLLDPAWEKQQRKTFTAWC NSHLRKAGTQIENIDEDFRDGLKLMLLLEVISGERLPKPERGKMRVHKINNVNKALDFIA SKGVKLVSIGAEEIVDGNAKMTLGMIWTIILRFAIQDISVEETSAKEGLLLWCQRKTAPY KNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAFEVAEKYLDIPKM LDAEDIVNTARPDEKAIMTYVSSFYHAFSGAQKAETAANRICKVLAVNQENEHLMEDYEK LASDLLEWIRRTIPWLEDRVPQKTIQEMQQKLEDFRDYRRVHKPPKVQEKCQLEINFNTL