Additional material


Additional data from publication:

New light on the HLA-DR immunopeptidomic landscape
Emilie Egholm Bruun Jensen, Birkir Reynisson, Carolina Barra, and Morten Nielsen

Overview of the includes MS samples (samples.tsv)

In the format

sampleID #ligs                    HLA_typing 
   144.1  4382 DRB1_0101,DRB1_0301,DRB3_0101 
   144.2  3975 DRB1_0101,DRB1_0301,DRB3_0101 
   183.1  3929 DRB1_0801,DRB1_1501,DRB5_0101 

..

File with the complete annotated data set (ANNOTATED_DATA.tsc)

In the format

          Clus_ID+DonorID               Clus_ID               Peptide       HLA          Rank      Core   ProtID SampleID DeepLoc DonorID      Best_Rnk Nclus 
      AAAAFNKDALLNWLK+158       AAAAFNKDALLNWLK       AAAAFNKDALLNWLK DRB3_0101  6.5508117676 FNKDALLNW P42338.1    158.2    Dual     158  6.5508117676     1 
      AAAAVRQMNPHIRVT+978       AAAAVRQMNPHIRVT       AAAAVRQMNPHIRVT DRB5_0202  0.5007802844 VRQMNPHIR P22314.3    978.1    Cyto     978  0.5007802844     1 
      AAADDIKPCPRCAAY+154       AAADDIKPCPRCAAY       AAADDIKPCPRCAAY DRB1_0404 38.7160453796 IKPCPRCAA Q9NV58.3    154.3    Endo     154 38.7160453796     1 
AAADDIKPCPRCAAYIIKMND+786 AAADDIKPCPRCAAYIIKMND AAADDIKPCPRCAAYIIKMND DRB1_1302 52.2107086182 IKPCPRCAA Q9NV58.3    786.1    Endo     786 52.2107086182     1 
...

where 
Clus_ID+DonorID: The unique cluster ID
Clus_ID: The peptide definting the cluster 
Peptide: The peptide member of the cluster 
HLA: The predicted HLA restriction
Rank: The predicted percentile rank score of the peptide
Core: The predicted binding core
ProtID: Protein ID
SampleID: Sample ID
DeepLoc: DeepLoc subcellular prediction [Cyto, Endo or Dual]
DonorID: Donor ID
Best_Rnk: Lowest rank of all peptides within the cluster
Nclus: Number of peptides within the cluster (1 indicates singletons)

File with Crapome annotations for the 21 most contaminant containing proteins (Crapome_contaminants.csv)

Here we find the 11 of the 21 (52%) proteins have an Average Spectral Count value beyond 3. In comparision, this is only the case for 2329 of the 21935 unique entries in the Single Step Epitope tag CRAPome database. This result thus strongly indicating that the identified proteins have a high propensity to be a contaminant.