Concepedia

Publication | Open Access

OptiType: precision HLA typing from next-generation sequencing data

806

Citations

31

References

2014

Year

TLDR

The HLA gene cluster is central to adaptive immunity, yet its high sequence similarity and variability make genotype inference from routine NGS data challenging, and existing methods require costly enrichment and sequencing. OptiType is introduced as a novel HLA genotyping algorithm that uses integer linear programming to accurately predict HLA types from non‑enriched NGS data. The method is benchmarked on a comprehensive dataset comprising RNA, exome, and whole‑genome sequencing data. OptiType achieves 97 % accuracy, outperforming prior in‑silico approaches and enabling broad application. Contact szolek@informatik.uni-tuebingen.de and supplementary data are available online.

Abstract

Abstract Motivation: The human leukocyte antigen (HLA) gene cluster plays a crucial role in adaptive immunity and is thus relevant in many biomedical applications. While next-generation sequencing data are often available for a patient, deducing the HLA genotype is difficult because of substantial sequence similarity within the cluster and exceptionally high variability of the loci. Established approaches, therefore, rely on specific HLA enrichment and sequencing techniques, coming at an additional cost and extra turnaround time. Result: We present OptiType, a novel HLA genotyping algorithm based on integer linear programming, capable of producing accurate predictions from NGS data not specifically enriched for the HLA cluster. We also present a comprehensive benchmark dataset consisting of RNA, exome and whole-genome sequencing data. OptiType significantly outperformed previously published in silico approaches with an overall accuracy of 97% enabling its use in a broad range of applications. Contact: szolek@informatik.uni-tuebingen.de Supplementary information: Supplementary data are available at Bioinformatics online.

References

YearCitations

Page 1