1. TRAIN or UPLOAD a model

Paste peptides in PEPTIDE format

or submit a file directly from your local disk:

To load some SAMPLE DATA click here:

More sample training data:


2. EVALUATION data (optional)

Paste in evaluation examples in PEPTIDE format

or upload evaluation examples:


Sample evaluation data in FASTA or PEPTIDE format

3. SUBMIT job




PRESET parameter configurations


MHC CLASS I ligands of variable length
MHC CLASS II ligands:
DNA/RNA data:


CUSTOMIZE your run

Hover the mouse cursor over the symbol for a short description of the options

BASIC options

Job name

Motif length

DATA PROCESSING options

Order of the data
High values are positive instances
Low values are positive instances

Data rescaling
Linear rescale
Log-transform
No rescale

Average target values of identical sequences

Folds for cross-validation

Perform nested cross-validation

Stop training on best test-set performance

Method to create subsets
Random subsets
Homology clustering
Common-motif clustering
User-defined partitions

Alphabet

NEURAL NETWORK architecture

Number of training cycles

Number of seeds

Number of hidden neurons

Amino acid encoding

Maximum length for Deletions

Maximum length for Insertions

Only allow insertions in sequences shorter than the motif length

Burn-in period

Impose amino acid preference at P1 during burn-in

Length of the PFR for composition encoding

Encode PFR composition as sparse

Encode PFR length

Expected peptide length for encoding

Binned peptide length encoding

Load receptor pseudo-sequences

Example

SORTING and VISUALIZATION options

Number of networks (per fold) in the final network ensemble

Sort results by prediction value

Exclude offset correction

Show all logos in the final ensemble

EVALUATION DATA options

Length of peptides generated from FASTA entries

Sort evaluation results by prediction value

Threshold on evaluation set predictions


SUBMIT job



NOTE, depending on the size of your datasets and selected parameters it might take up to a few hours to complete the query.
Please be patient.

Confidentiality:
The sequences are kept confidential and will be deleted after processing.


CITATIONS

For publication of results, please cite: