Concepedia

TLDR

Sequence analysis of immunoglobulin transcripts refines B cell subpopulation categorization and reveals selective forces in immune responses, with high‑throughput sequencing producing large datasets that the IMGT/HighV‑QUEST program analyzes but requires further processing for figures and comparative analysis. We present an easy‑to‑use Microsoft Excel‑based software, Immunoglobulin Analysis Tool (IgAT), for summarizing, interrogating, and processing IMGT/HighV‑QUEST output files. IgAT generates descriptive statistics and high‑quality figures for 1–150 000 human or mouse Ig transcripts, evaluates germline usage, CDR‑3 length and composition, antigen‑selection probability, hydrophobicity, and CDR‑H3 structural properties using Shirai’s H3‑rules, and runs on any computer with Excel 2003 or newer. These refined analyses provide in‑depth insight into selective forces and functional properties of Ig repertoires, enabling statistical and graphical comparison of multiple sequence sets and helping researchers fully mine their data.

Abstract

Sequence analysis of immunoglobulin (Ig) heavy and light chain transcripts can refine categorization of B cell subpopulations and can shed light on the selective forces that act during immune responses or immune dysregulation, such as autoimmunity, allergy, and B cell malignancy. High-throughput sequencing yields Ig transcript collections of unprecedented size. The authoritative web-based IMGT/HighV-QUEST program is capable of analyzing large collections of transcripts and provides annotated output files to describe many key properties of Ig transcripts. However, additional processing of these flat files is required to create figures, or to facilitate analysis of additional features and comparisons between sequence sets. We present an easy-to-use Microsoft(®) Excel(®) based software, named Immunoglobulin Analysis Tool (IgAT), for the summary, interrogation, and further processing of IMGT/HighV-QUEST output files. IgAT generates descriptive statistics and high-quality figures for collections of murine or human Ig heavy or light chain transcripts ranging from 1 to 150,000 sequences. In addition to traditionally studied properties of Ig transcripts - such as the usage of germline gene segments, or the length and composition of the CDR-3 region - IgAT also uses published algorithms to calculate the probability of antigen selection based on somatic mutational patterns, the average hydrophobicity of the antigen-binding sites, and predictable structural properties of the CDR-H3 loop according to Shirai's H3-rules. These refined analyses provide in-depth information about the selective forces acting upon Ig repertoires and allow the statistical and graphical comparison of two or more sequence sets. IgAT is easy to use on any computer running Excel(®) 2003 or higher. Thus, IgAT is a useful tool to gain insights into the selective forces and functional properties of small to extremely large collections of Ig transcripts, thereby assisting a researcher to mine a data set to its fullest.

References

YearCitations

Page 1