Concepedia

Publication | Open Access

Direct high‐throughput amplification and sequencing of immunoglobulin genes from single human B cells

36

Citations

10

References

2015

Year

Abstract

Single cell Ig gene amplification and sequencing has been widely used for the molecular and functional assessment of human antibody repertoires and has led to the identification of recombinant monoclonal antibodies with therapeutic potential against diverse pathogens 1-3. Due to the high reagent and Sanger sequencing costs, these antibody-cloning strategies are limited to the analysis of relatively small B-cell numbers and do not allow in-depth repertoire measurements to assess the clonality and clonal evolution of B-cell responses. A major goal in the field has been the development of platforms that allow the high-throughput analysis of antibody repertoires at low costs 4. Next-generation sequencing (NGS) of Ig heavy and light chain genes facilitates the high-throughput analysis of antibody repertoires. So far only one platform preserves natural Ig gene associations at single-cell level through linkage of Ig heavy and light chain amplicons before sequencing 5. However, due to the short NGS read-lengths full-length Ig genes are not readily available for cloning and recombinant monoclonal antibody production. Here, we describe a platform for the high-throughput analysis of human antibody repertoires at single cell level that is fully compatible with direct Ig gene cloning and expression. We have previously developed a strategy that combines single-cell RT-PCR of murine Igh, Igk, and Igl transcripts with NGS 6. Use of a 2D bar-coded primer matrix in the second amplification round of the nested PCR allows pooling of amplicons for NGS. To adapt this approach for the amplification and sequencing of full-length human Ig genes, we modified existing protocols 7-9. A major goal was to reduce the number of primers (especially in the second PCR to keep the costs for barcoded primers at minimum) while enabling the unbiased amplification of all human Ig heavy and light chain gene transcripts at high efficiency. We initially designed a set of nine primers (complete mix) for the amplification of all VH families. The set is a mixture of previously published and newly designed VH leader primers for use in the primary PCR (Supporting Information Table 1, 8, 9). In addition, we designed a single forward primer (RMX2-A) for the secondary IGH-V PCR. RMX2-A anneals at the 5′ end of exon 2 encoding the N-terminus of FWR1, a region that is strongly conserved even in IGH-V genes with high loads of somatic hypermutations (Supporting Information Fig. 1). To determine whether the newly designed PCR strategy could efficiently amplify human Ig genes from single B cells, we initially compared our primers to the established primer sets. Single B-cell sort was performed using PBMCs isolated from three healthy donors (HD1-3) upon informed consent and approval by the local Ethical Committee. Similar amplification efficiency was observed for IGH transcripts from IgG+CD27+ memory B cells (Fig. 1A). The overall amplification efficiency varied among isotypes, but we observed no differences in the amplification efficiency between the new and published primer sets for IgM, IgA, and IgE PCRs or when all constant region primers were mixed (Fig. 1B). Next, we aimed to design Ig light chain gene amplification strategies that are compatible with the matrix PCR approach. Whereas the secondary PCR for IGK already uses a single forward primer, six forward primers are needed for the amplification of IGL genes 8. We therefore designed a new set of only two forward IGL primers that amplified IGL genes with the same efficiency as the established 6-primer combination resulting in a combined IGK/IGL amplification efficiency of near 80% (Fig. 1C). To evaluate whether our newly developed PCR strategy was compatible with the high-throughput amplification of Ig gene transcripts, we generated uniquely barcoded IGH, IGK, and IGL forward and reverse primers for use in the secondary PCR. By using these primers in a 2D matrix format in which forward and reverse primers with different barcodes were used in each column and row, respectively, we generated amplicons with unique combinations of 5′ and 3′ tags to facilitate parallel sequencing while preserving single cell/well identity 6. To test the efficiency of the strategy, we amplified the Ig genes of 1152 single-sorted IgG+ memory B cells from three healthy donors (Fig. 1D). Full-length IGH sequences were obtained from 53% of cells, whereas the combined IGK or IGL amplification efficiency reached 76% (IGK 49%, IGL 30%). Thus, the relatively lower IGH PCR efficiency limited the number of cells with successful amplification of corresponding Ig heavy and Ig light chain transcripts (41%). The primary PCR efficiency is critical for the overall amplification success. To increase the amplification likelihood at single cell level, we tested if we could boost the efficiency by running two independent nested IGH PCRs. Indeed, two PCRs boosted the amplification efficiency from 40% to near 60% for all IGH isotypes (Fig. 1E and data not shown). Of note, this 10–20% increase in the overall IGH amplification success was also reached if we pooled the products of the two independent primary PCRs as template in only one secondary PCR (data not shown). Sequence analysis and comparison to previous studies confirmed that our strategy facilitates unbiased Ig gene repertoire studies (Supporting Information Fig. 2; 7, 8). Over 95% of the IGH-V and paired IGK-/IGL-V genes present in our dataset were also detected with a different high-throughput amplification and sequencing approach 8. We conclude that our newly developed PCR strategy is fully compatible with the efficient high-throughput amplification and parallel sequencing of corresponding Ig heavy and light chain genes from mutated and naïve single human B cells of all isotypes. It is critical to point out that our protocol does not include any cell stimulation step, which might increase the overall amplification efficiency but harbors the risk of inducing a repertoire bias. The relatively lower IGH amplification efficiency compared to Ig light chain genes is likely due to limited transcript levels. Indeed, IGH amplification efficiencies are about 50% higher for antibody secreting cells compared to memory B cells (data not shown). Due to the use of random hexamers, amplification is not limited to Ig genes and enables the parallel assessment of non-Ig transcripts. Sequencing costs are significantly reduced (up to tenfold) compared to strategies based on Sanger sequencing. The platform is ideally suited for the functional characterization of small subpopulations as it allows deep sampling to identify rare antibodies and is fully compatible with direct cloning of the full-length Ig genes 9, 10. A major advantage is the possibility to perform reliable analyses of the clonality and clonal evolution of B-cell responses based on error-corrected full-length Ig heavy and associated light chain gene sequences. Thus, applications range from basic repertoire analyses under steady state to the analysis of human B-cell responses in infectious disease, autoimmunity, or cancer. The authors would like to thank the blood donors, the Sequencing core facility at the Max Planck Institute for Molecular Genetics, Berlin, for 454 sequencing; and the Flow Cytometry core facility at the Max Planck Institute for Infection Biology, Berlin for single cell sorting. Research reported in this publication was supported by the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number UM1AI100663 to H.W. R.M. was supported by International Max Planck Research School for Infectious Diseases and Immunology (IMPRS-IDI), Berlin and Research Training Group “Genetic and Immunological Determinants of Pathogen-Host-Interactions” (GRK1121) funded by the German Research Foundation (D.F.G.). K.I. was supported by IMPRS-IDI, Berlin and Helmholtz International Graduate School for Cancer Research, German Cancer Research Center, Heidelberg. The authors declare no financial or commercial conflict of interest. As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re-organized for online delivery, but are not copy-edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors. Figure 1: Conservation of N–-terminal IGH–-V region Figure 2: Ig gene repertoire analysis Table 1: Ig gene primers Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.

References

YearCitations

Page 1