discovery - Fred Hutchinson Cancer Research Center
discovery - Fred Hutchinson Cancer Research Center
discovery - Fred Hutchinson Cancer Research Center
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
10<br />
Robert Gentleman, Ph.D.<br />
Computational Biologist<br />
S earching<br />
M I N I N G G E N O M I C D A T A F O R C A N C E R ’ S<br />
MOLECULAR SECRETS<br />
for the molecular roots Common of cancer duplicate is akin to reads after biological filtering research. Just a decade ago, scientists could only<br />
looking for the proverbial > needle df1 in lapply(split(df1[, which is where Dr. Robert c("sequence", advances now "count")], enable efficient df1$lane), analysis of half head) a million genes.<br />
Gentleman comes in. Technology $s_1_export.txt<br />
and the need to sift through The new tools enable researchers to uncover novel potential<br />
a deluge of genomic information has transformed biology from targets sequence for therapies count as well as to explore the underlying genetic<br />
a purely lab-based science to an 101 information TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA science as well. causes of many human 243 diseases.<br />
102 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 237<br />
Gentleman, head of the <strong>Hutchinson</strong> <strong>Center</strong>’s Herbold There has also been an explosion in the amount of informa-<br />
103 GTGGAAAATTTAGAAATGTCCACTGTAGGACGTGG 177<br />
Computational Biology Program, 104creates CCATATTCCACGTCCTACAGTGGACATTTCTAAAT computing tools to tion available about the 165DNA<br />
sequence of the human genome.<br />
analyze massive amounts of data 105 from AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT biological experiments Consequently, researchers 162 have identified a large number of<br />
and uses mathematics and statistics 106 to CATGATTTTCAGTTTTCTTGCCATATTCCACGTCC generate new insights novel genes within these 158previously<br />
unknown sequences. The<br />
incorporating the existing<br />
data. The approach<br />
$s_2_export.txt<br />
sequence count<br />
challenge currently facing<br />
scientists is to find a way to<br />
helps researchers<br />
251 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 357 organize and catalog this vast<br />
understand cancer at<br />
its most fundamental<br />
level by illuminating<br />
252 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA<br />
253 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT<br />
254 ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />
255 CATGATTTTCAGTTTTCTTGCCATATTCCACGTCC<br />
324<br />
194<br />
184<br />
160<br />
amount of information into a<br />
usable form.<br />
“Technology drove enor-<br />
which cellular proteins 256 GCCATATTCCACGTCCTACAGTGGACATTTCTAAA 151 mous data sets, which drove<br />
interact and how they<br />
the need for clear, statistical<br />
$s_3_export.txt<br />
work together within a<br />
thinking,” Gentleman said.<br />
sequence count<br />
cell. The process also saves both 401 time GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC and money over other “We have no true comprehension 313 of cancer on a molecular level<br />
lab methods.<br />
402 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA yet. We need tools to 291 understand how the genome works and<br />
A former auto mechanic might<br />
403<br />
seem<br />
ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />
like an unlikely we need to know how 235 things interact. If we’re going to go in<br />
404 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT 205<br />
candidate to become a research<br />
405<br />
scientist.<br />
GTGGAAAATTTAGAAATGTCCACTGTAGGACGTGG<br />
But if you follow and poke something, we better know what the domino effect<br />
188<br />
Gentleman’s path from Hondas 406 to Harvard CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAA to the <strong>Hutchinson</strong> will be.” 177<br />
<strong>Center</strong>, his trajectory seems perfectly probable. The former He and his colleagues around the globe collaborate to write<br />
grease monkey has a penchant for<br />
$s_4_export.txt<br />
challenges, so it makes sense free software for the entire scientific community to use. “This is<br />
sequence count<br />
that he’d choose a research endeavor<br />
551 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA<br />
that’s moving at breakneck a high-risk area, which is not where commercial software goes,”<br />
266<br />
speed. For brilliant young minds, 552biocomputing GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC is the most he said. “We’re only interested 232 in being as close to the edge of<br />
challenging field around. 553 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT scientific investigation 187 as we can be. We share to learn. Real<br />
Advances in knowledge and<br />
554<br />
technology<br />
ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />
have transformed change comes from people 164 working well together.”<br />
555 CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAA 161<br />
556 TGTAGGACGTGGAATATGGCAAGAAAACTGAAAAT 139<br />
$s_6_export.txt<br />
sequence count<br />
701 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA 395<br />
702 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 354<br />
703 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT 263<br />
704UNRAVELING ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA VAST DNA SEQUENCES, WRITTEN AS A-G-T-C 232 CODES, REQUIRES POWERFUL TOOLS.<br />
5