- Sequencing the human immune system will help define genetic basis for individual disease response
- The human immune system is billions of times larger than the human genome and was previously considered impossible to sequence
- Recent improvements in computing power & new genomic tools put the task within reach
For the first time ever, researchers are comprehensively sequencing the human immune system, which is billions of times larger than the human genome. In a new study from the Human Vaccines Project, scientists have sequenced a key part of this vast and mysterious system: the genes encoding the circulating B cell receptor repertoire.
In sequencing these receptors in both adults and infants, the scientists found surprising overlaps that could provide potential new antibody targets for vaccines and therapeutics that work across populations. As part of a large multi-year initiative, this work seeks to define the genetic underpinnings of people’s ability to respond and adapt to an immense range of disease.
Led by scientists at Vanderbilt University Medical Center and the San Diego Supercomputer Center (SDSC) at UC San Diego, this advancement is possible due to the merging of biological research with high-powered frontier supercomputing. While the Human Genome Project sequenced the human genome and led to the development of novel genomics tools, it did not tackle the size and complexity of the human immune system.
“A continuing challenge in the human immunology and vaccine development fields has been that we do not have comprehensive reference data for what the normal healthy human immune system looks like,” said James E. Crowe, Jr., Director of the Vanderbilt Vaccine Center.
“Prior to the current era, people assumed it would be impossible to do such a project because the immune system is theoretically so large, but this new paper shows it is possible to define a large portion, because the size of each person’s B cell receptor repertoire is unexpectedly small.”
The new study specifically looks at one part of the adaptive immune system, the circulating B cell receptors that are responsible for the production of antibodies that are considered the main determinant of immunity in people.
The receptors randomly select and join gene segments, forming unique sequences of nucleotides known as receptor ‘clonotypes.’ In this way, a small number of genes can lead to an incredible diversity of receptors, allowing the immune system to recognize almost any new pathogen.
Conducting leukapheresis on three individual adults, the researchers cloned and sequenced up to 40 billion cells to sequence the combinations of gene segments that comprise the circulating B cell receptors, achieving a depth of sequencing never before done. They also sequenced umbilical cord blood from three infants. The idea was to collect a vast amount of data on a few individuals, rather than the traditional model of collecting only a few points of data on many.
“The overlap in antibody sequences between individuals was unexpectedly high,” Crowe explained, “even showing some identical antibody sequences between adults and babies at the time of birth.” Understanding this commonality is key to identifying antibodies that can be targets for vaccines and treatments that work more universally across populations.
A central question was whether the shared sequences across individuals were the result of chance, rather than the result of some shared common biological or environmental factor. To address this issue, the researchers developed a synthetic B cell receptor repertoire and found that “the overlap observed experimentally was significantly greater than what would be expected by chance,” said Robert Sinkovits, director of scientific computing applications at SDSC.
As part of a unique consortium created by the Human Vaccines Project, SDSC applied its considerable computing power to working with the multiple terabytes of data. A central tenet of the Project is the merger of biomedicine and advanced computing.
"The Human Vaccines Project allows us to study problems at a larger scale than would be normally possible in a single lab and brings together groups that might not normally collaborate." ~Robert Sinkovits, SDSC
Continued collaborative work is now under way to expand this study, including sequencing other areas of the adaptive immune system, the T cell repertoire; adding additional demographics such as supercentenarians and international populations; and applying AI-driven algorithms to further mine the datasets for insights. The goal is to continue to interrogate the shared components of the immune system to develop safer and highly targeted vaccines and immunotherapies that work across populations.
“Due to recent technological advances, we now have an unprecedented opportunity to harness the power of the human immune system to fundamentally transform human health,” said Wayne Koff, CEO of the Human Vaccines Project.
“Decoding the human immune system is central to tackling the global challenges of infectious and non-communicable diseases, from cancer to Alzheimer’s to pandemic influenza. This study marks a key step toward understanding how the human immune system works, setting the stage for developing next-generation health products through the convergence of genomics and immune monitoring technologies with machine learning and artificial intelligence.”
- Supercomputing the flu vaccine
- Deadly handshake: how staph bacteria cling to human cells
- Stopping HIV in its tracks