Wednesday, August 8, 2007

Science on Wednesday

During junior high, I attended a weekly lecture series at the Princeton Plasma Physics Laboratory designed to introduce students to science research. The program was called Science on Saturday, and it featured scientists in fields as diverse as cosmology and forensics talking about their work to a largely non-technical audience that consisted of students and their parents.

The Broad Institute had a similar program this summer called Midsummer Nights' Science, an apt name given this year's Shakespeare on the Common production. On the four Wednesdays following Independence Day, scientists at the Broad described their work to the greater Boston community, with a different researcher speaking each week. While the projects and interests described each week were quite different, all of them implicitly promoted the idea that large data sets can enable new kinds of research.

The first talk featured David Reich, who discussed how he and his colleagues conducted a comparative analysis of the DNA of humans, chimpanzees, and gorillas, which led them to a new model for the evolution of these species from a common ancestor. The way I first learned about evolution was that it starts when two groups of the same species are physically isolated from one another. Then, under appropriate environmental conditions, the two groups eventually evolve into different species, after which any hybrids between these two species are less fertile and die out. This is called allopatric speciation. If this is true, then one can model the DNA sequences as following a branching process, so the evolution of species looks like a tree, where each fork in the tree indicates one species dividing into two. Reich and his colleagues discovered that this model does not describe the evolution of humans, chimps, and gorillas very well. In fact, if one constructs a phylogenetic tree for these three species using DNA from one section of the genome, one tree emerges, indicating that the most recent split was between humans and chimps. But if the same analysis is performed using a sequence from another section, which comprises between a fifth and a third of the genome, a different tree emerges, indicating that the most recent split was between humans and gorillas. The group proposed an alternate hypothesis, that hybridization among these species took place, and by a careful analysis of the sequence data they had, they were able to confirm that this was a better model for the speciation of these three species. Indeed, the study probably would not have been possible without all the DNA sequence information available for these three species.
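To make the idea of conflicting trees concrete, here is a minimal Python sketch. It is not the analysis Reich's group performed: it uses a plain count of mismatched bases as a stand-in for a real phylogenetic method, and the two short "regions" are made-up sequences chosen only so that each one points to a different closest pair.

```python
from itertools import combinations


def hamming(a, b):
    """Count the positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def closest_pair(seqs):
    """Return the two species names whose sequences differ the least.

    With only three species, the closest pair is a crude proxy for
    "the most recent split" in a tree built from this region.
    """
    pairs = combinations(sorted(seqs), 2)
    return min(pairs, key=lambda p: hamming(seqs[p[0]], seqs[p[1]]))


# Hypothetical toy data, not real genomic sequence.
region_a = {
    "human": "ACGTACGTAC",
    "chimp": "ACGTACGTAT",
    "gorilla": "ACGAACTTAT",
}
region_b = {
    "human": "GGCTTAACGA",
    "chimp": "GACTTTACGG",
    "gorilla": "GGCTTAACGG",
}

print(closest_pair(region_a))  # human and chimp are closest here
print(closest_pair(region_b))  # human and gorilla are closest here
```

When different stretches of the genome disagree like this, a single branching tree cannot account for all of the data, which is the kind of observation that motivates looking at alternative models.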

During the following week, Pardis Sabeti explained how the HapMap project, another data gathering effort, is enabling researchers to determine the role natural selection has played in humans and pathogens. The HapMap project collects DNA samples from different populations around the world. The samples of DNA they collect account for 90% of the genetic variation among humans. These samples are divided into haplotypes, which represent sections of DNA that are inherited as a group. If there is no selective pressure on an organism, one would expect the prevalence of a particular haplotype to decay as its size gets larger. By similar reasoning, if a larger haplotype is highly prevalent in a population, then there is evidence that the corresponding section of DNA is under selective pressure. Sabeti explained how this has allowed researchers to track lactose tolerance in European populations, who domesticated cattle relatively early, and link the sickle cell trait to malaria resistance. Once again, the availability of this data enabled such an analysis.
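The intuition that longer haplotypes should be rarer in the absence of selection can be sketched with a very simple model. The Python below is my own illustration, not the statistical test described in the talk: it assumes that a segment of length 1 centimorgan has roughly a 1% chance of being broken by recombination in each generation (reasonable only for short segments), and it ignores mutation, drift, and population structure. The function name and parameters are hypothetical.

```python
def unbroken_probability(length_cm, generations):
    """Chance that a haplotype survives intact for a number of generations.

    length_cm   -- genetic length of the haplotype in centimorgans
    generations -- number of generations elapsed

    Assumes a per-generation break probability of length_cm / 100,
    an approximation that only holds for short segments.
    """
    r = length_cm / 100.0
    if not 0.0 <= r <= 1.0:
        raise ValueError("length_cm must be between 0 and 100")
    if generations < 0:
        raise ValueError("generations must be non-negative")
    return (1.0 - r) ** generations


# Compare short and long haplotypes after the same number of generations.
for length in (0.1, 0.5, 1.0, 2.0):
    p = unbroken_probability(length, generations=200)
    print(f"{length:>4} cM: {p:.3f}")
```

Under these assumptions the probability falls off quickly with length, so a long haplotype that is nonetheless common is a candidate for having risen in frequency recently, before recombination had time to break it up.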

The third talk, given by Todd Golub, was about cancer research in the era of genomics. He started the talk by describing two patients, both the same age, both diagnosed with the same type of leukemia at a similar stage of progression, and both given similar doses of chemotherapy. However, Patient A lived and Patient B did not. Golub then explained that the mutations that had occurred at the genomic level in these patients were actually quite different. If one looks at patient survival separately for these two mutations, the group with the same mutation as Patient A had a survival rate much closer to 1, and the group with the same mutation as Patient B had a survival rate close to 0 within a few years of diagnosis. Golub then went on to describe a treatment that had been customized to target Patient B's mutation. That work led to Gleevec, a drug now available to patients with this version of the disease. Since its introduction, patients diagnosed with this specific mutation have had a 100% survival rate with minimal side effects from the medication. Golub appeared optimistic that similar treatments could be developed for most of the mutations that result in cancer.

Unlike the preceding speakers, the final speaker barely mentioned genomics in his talk. Vamsi Mootha described mitochondria and his group's efforts to understand them. Mitochondria are found inside the cell and produce much of the energy a cell uses. Unlike most other organelles, mitochondria contain their own DNA. However, the proteins found in mitochondria are a mix: some are encoded by genes in the mitochondrial DNA and others by genes in the cell's nuclear DNA. It turns out that metabolic diseases are closely related to problems with mitochondrial function, which show up as changes in the mitochondria's protein composition. Mootha's group is building an atlas of the protein content of mitochondria in different parts of the body. Their hope is that this data will enable researchers to characterize the specific problems associated with certain metabolic diseases. In this instance, the hope of a future payoff inspired his group to assemble a large data set.

Midsummer Nights' Science showcased how biological and medical research have benefited and can continue to do so when certain kinds of data are available in significant quantities. Hopefully this message reached the students who attended and will capture the imagination of those who decide to pursue research in the future.
