Our research group is part of the ENCODE project. I am one of four authors of one of the papers in Nature that are featured in this study, and a contributor to a second paper.
We found several interesting things about the human genome:
• control regions are everywhere, in areas previously and poorly described as "junk DNA"
• control regions are cell-specific, even categorizing cancerous and healthy tissues
• these regions, or "footprints" can be discovered reliably with high-throughput sequencing
• we can associate footprints with SNPs, which in turn can be associated with various diseases of genetic provenance
• we can build regulatory networks on a wider scale than previously done
We can find footprints by simply counting the sequencing fragments that show up for a given genomic region. Where there are fewer fragments, we can make inferences as to whether there is protein binding to those specific parts of the human genome being sequenced.
Once we know where those footprints are located, we can associate them with known and putative transcription factors, and start building large-scale regulatory networks that we can compare between cell types, to explore and understand the differences that make a specialized heart muscle cell different from an unspecialized embryonic stem cell, or what differentiates a particular type of cancer tissue from healthy tissue. posted by Blazecock Pileon at 1:27 PM on September 5, 2012 [7 favorites]
A companion paper describing the semi-automated annotation analysis I led is now available, and is open access. It was one of many that wasn't actually published at the same time as the rest. posted by grouse at 4:47 PM on December 14, 2012
« Older
Visualizing the Web Index...
| Game of Thrones - Iron Throne ...
Newer »
We found several interesting things about the human genome:
• control regions are everywhere, in areas previously and poorly described as "junk DNA"
• control regions are cell-specific, even categorizing cancerous and healthy tissues
• these regions, or "footprints" can be discovered reliably with high-throughput sequencing
• we can associate footprints with SNPs, which in turn can be associated with various diseases of genetic provenance
• we can build regulatory networks on a wider scale than previously done
We can find footprints by simply counting the sequencing fragments that show up for a given genomic region. Where there are fewer fragments, we can make inferences as to whether there is protein binding to those specific parts of the human genome being sequenced.
Once we know where those footprints are located, we can associate them with known and putative transcription factors, and start building large-scale regulatory networks that we can compare between cell types, to explore and understand the differences that make a specialized heart muscle cell different from an unspecialized embryonic stem cell, or what differentiates a particular type of cancer tissue from healthy tissue.
posted by Blazecock Pileon at 1:27 PM on September 5, 2012 [7 favorites]