Abstract

Robert Green
Robert Green, M.D., MPH, is director of the Boston-based Genomes2People Research Program and new Preventive Genomics Clinic at Brigham and Women's Hospital, the Broad Institute, and Harvard Medical School. Through his work there on such research initiatives as the REVEAL Study, the MedSeq, and BabySeq Projects, Green has become one of the leading authorities, if not the leading authority, on the health benefits and economic effects of genomic medicine. He recently spoke with Clinical OMICs Senior Editor Julianna LeMieux about this work and current landscape for genomic-driven medicine.
This focus on population genomics is pretty new. There has been plenty of attention and research around the ways in which genomics can assist with diagnosis, prognosis, and treatment of heritable conditions and with personalizing the treatment of cancer, but far less scientific attention to the potential for genomics to provide risk information that could prevent disease and maintain wellness. Our most recent scientific studies in the Genomes2People Research Program have been examining the amazing wealth of information that is available, even in a healthy individual, if a human genome is comprehensively sequenced and interpreted. And the data we have generated over the past decade have encouraged us to recently create the Brigham Preventive Genomics Clinic, the world's first clinical service to offer comprehensive DNA sequencing and interpretation to healthy adults and their children.
To put that in perspective, we have frequently talked about how genomes can now be sequenced for under $1,000 and by some companies more recently for much less than this. But these rates for sequencing typically offer little or no clinical interpretation. For most vendors of very low-cost sequencing, specific genes and variants are not even examined. In contrast our MedSeq and BabySeq Projects have not only offered sequencing, but also in-depth, careful analysis of thousands of disease-associated genes.
So, when you are talking about clinical sequencing, it helps to talk about all of the different elements of the interpretive step and how many genes you are planning to analyze. Are you going to select 10 or 50 or 147 or 5000 genes to interpret? And does your interpretive process involve a trained molecular geneticist? Does it involve a manual curation step where a variant curation scientist takes difficult-to-interpret variants and goes into the literature and explores what the papers have written about this gene and this variance?
Remember, there are all sorts of weird boobytraps in analyzing variants for pathogenicity in the genome… it's not as simple as just finding where you have a predicted loss of function. For example, there are some conditions, like some of the inherited cardiomyopathies, where a loss of function mutation does not cause the disease, but a particular missense mutation, with a single change in one amino acid, does.
The other bucket, which we are starting to hear more about, is pre-dispositional or preventive genomics, where you do not have something obviously wrong with you, but we want to find out if you are carrying a DNA change that puts you at risk for a specific condition—either a rare condition that you are at risk for by virtue of a DNA change in a single gene, or a more common condition where you have a constellation of many genetic markers that puts you at higher risk.
My work has been demonstrating that a surprisingly high percentage of people are walking around with a pathogenic or likely pathogenic mutation in a disease-associated gene for a rare condition. This discovery is going to fundamentally change what it means to talk about “rare” genetic diseases because while they remain individually rare, in the aggregate they are not at all rare!
At the same time, polygenic risks that were first promoted by consumer-facing companies as early as 2007 are coming into maturity, in which thousands or even millions of markers contribute to defining a small group of people for each common complex disease—heart disease, atrial fibrillation, type 2 diabetes (as example)—where a small percentage of people are at a significantly higher risk.
These are some ways in which genetics in the province of rare disease is becoming more and more salient to everyday medicine.
So, there is a tremendous amount of genetic risk for supposedly rare disease that is out there. As part of our MedSeq Project, we asked the question that if money were no object and we sequenced 100 healthy people and simply asked—if you look at every well-established disease-associated gene and you applied the very best practices in variant curation—how many of them would be found to be carrying well-characterized pathogenic, or likely pathogenic, mutations for a dominantly heritable condition?
In MedSeq, we looked at over 5,000 genes for each person, which was tremendously difficult, time-consuming and labor-intensive. We looked at them extremely carefully and found that 20% of these 100 people were carrying such a mutation. Twenty percent!
Then, in our BabySeq Project, we looked at about 150 such newborn babies, perfectly healthy, cherubic, baby boys and girls. We looked at a smaller gene set—closer to 2,000 genes—because we wanted to restrict ourselves in this case to conditions that manifested in childhood and adolescence. We still found 11% of these healthy babies were carrying pathogenic variations in these genes.
This discovery has shaken up the field. We have discovered that a predisposition to rare diseases is, in fact, not rare at all. In addition to this we have found that such a comprehensive analysis of the genomes reveals that over 90% of humans are carrying pathogenic changes in genes for a recessive condition which would have implications for their reproductive health. And on top of all this, there are polygenic risk scores. If you just take the top six conditions that have polygenic risk scores, our colleagues have calculated that 50% of men and women, approximately, are going to have an odds ratio of greater than 2.5 for one of those conditions.
It's important to remember that all of these markers are risk factors, not diagnostic markers, but with this kind of frequency, all of a sudden, genetics is everywhere. And I think we have to ask ourselves why genetics is not a more important part of our day-to-day medical care.
It turns out there are a lot of “myths” about why genomics cannot, or should not, be part of day-to-day medicine, and we are systematically developing high quality evidence to disprove these myths.
There are algorithms for doing this and, of course, the first thing any new biotech expert might say is that if there is an algorithm for curating these variants, let us just put it in the computer and have the computer do it. Up to a point you can do that. But the state of current best practice in clinical molecular genetics interpretation does still require some thoughtful time, what you might consider to be manual curation, before the final report is generated.
Now, you see that this manual curation step is in tension with how many genes you want to evaluate. Because if you only evaluate 10 genes, you can do those algorithms and do that tiny bit of manual curation on 10 genes, and you can offer that to a lot of people. But if you want to really evaluate each genome for 5,000 disease-associated genes that you have to put through your algorithm and find markers in almost all of them that will require manual curation, you have bogged the system down in the manual curation step.
This tension will gradually be resolved because we are doing a better and better job of categorizing all the variants that are being found, and agreeing on how they should be categorized, and updating that agreement in powerful ways. This will lend itself to faster and faster algorithms as time goes on, and I'm thrilled to say that we have a new NIH grant to take genomes that have already been sequenced in the Framingham and Jackson Heart Studies and try to build faster, better interpretation pipelines that will reduce the manual curation component. At the same time, this will be the first research grant to return genomic results to research participants in a population-based epidemiological study, and the very first to do so in a population of African Americans in Jackson, Mississippi.
