DTU Health Tech
Department of Health Technology
This link is for the general contact of the DTU Health Tech institute.
If you need help with the bioinformatics programs, see the "Getting Help" section below the program.
The NetMHCIIpan-4.1 server predicts peptide binding to any MHC II molecule of known sequence using Artificial Neural Networks (ANNs). It is trained on an extensive dataset of over 500.000 measurements of Binding Affinity (BA) and Eluted Ligand mass spectrometry (EL), covering the three human MHC class II isotypes HLA-DR, HLA-DQ, HLA-DP, as well as the mouse molecules (H-2). The introduction of EL data extends the number of MHC II molecules covered, since BA data covers 59 molecules and EL data covers 74. As mentioned, the network can predict for any MHC II of known sequence, which the user can specify as FASTA format. The network can predict for peptides of any length.
The output of the model is a prediction score for the likelihood of a peptide to be naturally presented by and MHC II receptor of choice. The output also includes %rank score, which normalizes prediction score by comparing to prediction of a set of random peptides. Optionally, the model also outputs BA prediction and %rank scores.
New in version 4.1: The method is trained on a extented set of EL data compared to version 4.0, and novel and correct BA for HLA-DQA1*04:01-DQB1*04:02 are included.
Refer to the instructions page for more details.
The project is a collaboration between DTU-Bioinformatics, and LIAI.
View the version history of this server.
For publication of results, please cite:
Benchmark data used to develop this server were obtained from:
# NetMHCIIpan version 4.1 # Input is in FASTA format # Peptide length 15 # Prediction Mode: EL # Threshold for Strong binding peptides (%Rank) 1% # Threshold for Weak binding peptides (%Rank) 5% # Allele: DRB1_0101 -------------------------------------------------------------------------------------------------------------------------------------------- Pos MHC Peptide Of Core Core_Rel Identity Score_EL %Rank_EL Exp_Bind BindLevel -------------------------------------------------------------------------------------------------------------------------------------------- 40 DRB1_0101 QGQWRGAAGTAAQAA 3 WRGAAGTAA 1.000 P9WNK5 0.867119 0.56 NA <=SB 39 DRB1_0101 LQGQWRGAAGTAAQA 4 WRGAAGTAA 1.000 P9WNK5 0.775118 0.99 NA <=SB 41 DRB1_0101 GQWRGAAGTAAQAAV 2 WRGAAGTAA 1.000 P9WNK5 0.585897 2.04 NA <=WB 38 DRB1_0101 SLQGQWRGAAGTAAQ 5 WRGAAGTAA 1.000 P9WNK5 0.513786 2.57 NA <=WB 73 DRB1_0101 STNIRQAGVQYSRAD 3 IRQAGVQYS 1.000 P9WNK5 0.184990 7.08 NA 26 DRB1_0101 KTQIDQVESTAGSLQ 3 IDQVESTAG 0.993 P9WNK5 0.122282 9.40 NA 15 DRB1_0101 AGNFERISGDLKTQI 3 FERISGDLK 1.000 P9WNK5 0.121026 9.45 NA 53 DRB1_0101 AAVVRFQEAANKQKQ 3 VRFQEAANK 0.873 P9WNK5 0.118252 9.59 NA 14 DRB1_0101 EAGNFERISGDLKTQ 4 FERISGDLK 1.000 P9WNK5 0.118084 9.60 NA 72 DRB1_0101 ISTNIRQAGVQYSRA 4 IRQAGVQYS 0.993 P9WNK5 0.103021 10.47 NA
The prediction output for each molecule consists of the following columns:
Here, you will find the data set used for training of NetMHCIIpan-4.1.
Download the file and untar the content using
cat NetMHCIIpan_train.tar.gz | uncompress | tar xvf -
This will creat the directory called NetMHCIIpan_train. In this directory you will find 12 files. 10 files (c00?_ba, c00?_el) with partitions with binding affinity (ba) with eluted ligand data (el). The format for each file is (here shown for an el file)
AAAAAAAAAAAAA 1 Bergseng__9037_SWEIG AGRAAAAAAAAG AAAAAAAAAAAAA 1 Bergseng__9064_AMALA AGRAAAAAAAAG AAAAAAAAAAAAA 1 Bergseng__9089_BOB AGRAAAAAAAAG AAAAAAAAAAAAAA 1 Bergseng__9037_SWEIG AGRAAAAAAAGA AAAAAAAAAAAAAA 1 Bergseng__9064_AMALA AGRAAAAAAAGA AAAAAAAAAAAAAA 1 Bergseng__9089_BOB AGRAAAAAAAGA AAAAAAAAAAAAAAA 1 Bergseng__9037_SWEIG AGRAAAAAAGAG AAAAAAAAAAAAAAA 1 Bergseng__9064_AMALA AGRAAAAAAGAG AAAAAAAAAAAAAAA 1 Bergseng__9089_BOB AGRAAAAAAGAG AAAAAAAAAAAAAAAAAAAAA 1 Abelin__MAPTAC_HLA_DQB10602_DQA10102 KHPAAAAAAYYQwhere the different columns are peptide, target value, MHC_molecule/cell-line, and context. In cases where the 3rd columns is a cell-line ID, the MHC molecules expressed in the cell-line are listed in the allelelist.txt file.
The allelelist.txt file contains the information about alleles expressed in each MA cell-line data set, and pseudosequence.2016.all.X.dat the MHC pseudo sequenes for each MHC molecule.
Please click on the version number to activate the corresponding server.
4.0 |
The current server (online since April 2020). New in this version:
|
3.2 |
(online since January 2018). New in this version:
|
3.1 |
(online since December 2014). New in this version:
|
3.0 |
(online since June 2013). New in this version:
|
2.1 |
(online since 6 June 2011). New in this version:
|
2.0 |
(online since 17 Nov 2010). New in this version:
|
1.1 |
(online since 15 April 2010). New in this version:
|
1.0 |
Original version (online version until April 15 2010):
Main publication:
|
If you need help regarding technical issues (e.g. errors or missing results) contact Technical Support. Please include the name of the service and version (e.g. NetPhos-4.0) and the options you have selected. If the error occurs after the job has started running, please include the JOB ID (the long code that you see while the job is running).
If you have scientific questions (e.g. how the method works or how to interpret results), contact Correspondence.
Correspondence:
Technical Support: