In a given population, certain HLA alleles are more frequently observed than others (http://www.allelefrequencies.net/ ). One practical outcome of this is that a small set of alleles can be compiled to cover most of the population. Such set may be useful in applications such as development of vaccines. Towards this end, we provide such sets for both HLA class I and II molecules:
Class I [file:hla_ref_set.class_i.txt] (cite: Weiskopf et al.)
Class II [file:hla_ref_set.class_ii.txt] (cite: Greenbaum et al.)
Note: These files can be used with the 'upload allele file' feature for the MHC binding prediction tools. Also for the class II set of alleles, there is no predictor for "HLA-DPA1*02:01/DPB1*14:01".
The reference sets were prepared using the following criteria:
1) the most common specificities in the general population, based on data available from DbMHC and allelefrequencies.net
2) representative of commonly shared binding specificities (i.e., supertypes).
In terms of population coverage, the reference sets for class I and II should provide > 97% and >99%, respectively.
Greenbaum J. et al. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics 2011. (http://www.ncbi.nlm.nih.gov/pubmed/21305276)
Weiskopf D. et al. Comprehensive analysis of dengue virus-specific responses supports an HLA-linked protective role for CD8+ T cells. PNAS 2012. (http://www.ncbi.nlm.nih.gov/pubmed/23580623)