13.10.2013 Views

discovery - Fred Hutchinson Cancer Research Center

discovery - Fred Hutchinson Cancer Research Center

discovery - Fred Hutchinson Cancer Research Center

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

10<br />

Robert Gentleman, Ph.D.<br />

Computational Biologist<br />

S earching<br />

M I N I N G G E N O M I C D A T A F O R C A N C E R ’ S<br />

MOLECULAR SECRETS<br />

for the molecular roots Common of cancer duplicate is akin to reads after biological filtering research. Just a decade ago, scientists could only<br />

looking for the proverbial > needle df1 in lapply(split(df1[, which is where Dr. Robert c("sequence", advances now "count")], enable efficient df1$lane), analysis of half head) a million genes.<br />

Gentleman comes in. Technology $s_1_export.txt<br />

and the need to sift through The new tools enable researchers to uncover novel potential<br />

a deluge of genomic information has transformed biology from targets sequence for therapies count as well as to explore the underlying genetic<br />

a purely lab-based science to an 101 information TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA science as well. causes of many human 243 diseases.<br />

102 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 237<br />

Gentleman, head of the <strong>Hutchinson</strong> <strong>Center</strong>’s Herbold There has also been an explosion in the amount of informa-<br />

103 GTGGAAAATTTAGAAATGTCCACTGTAGGACGTGG 177<br />

Computational Biology Program, 104creates CCATATTCCACGTCCTACAGTGGACATTTCTAAAT computing tools to tion available about the 165DNA<br />

sequence of the human genome.<br />

analyze massive amounts of data 105 from AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT biological experiments Consequently, researchers 162 have identified a large number of<br />

and uses mathematics and statistics 106 to CATGATTTTCAGTTTTCTTGCCATATTCCACGTCC generate new insights novel genes within these 158previously<br />

unknown sequences. The<br />

incorporating the existing<br />

data. The approach<br />

$s_2_export.txt<br />

sequence count<br />

challenge currently facing<br />

scientists is to find a way to<br />

helps researchers<br />

251 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 357 organize and catalog this vast<br />

understand cancer at<br />

its most fundamental<br />

level by illuminating<br />

252 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA<br />

253 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT<br />

254 ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />

255 CATGATTTTCAGTTTTCTTGCCATATTCCACGTCC<br />

324<br />

194<br />

184<br />

160<br />

amount of information into a<br />

usable form.<br />

“Technology drove enor-<br />

which cellular proteins 256 GCCATATTCCACGTCCTACAGTGGACATTTCTAAA 151 mous data sets, which drove<br />

interact and how they<br />

the need for clear, statistical<br />

$s_3_export.txt<br />

work together within a<br />

thinking,” Gentleman said.<br />

sequence count<br />

cell. The process also saves both 401 time GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC and money over other “We have no true comprehension 313 of cancer on a molecular level<br />

lab methods.<br />

402 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA yet. We need tools to 291 understand how the genome works and<br />

A former auto mechanic might<br />

403<br />

seem<br />

ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />

like an unlikely we need to know how 235 things interact. If we’re going to go in<br />

404 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT 205<br />

candidate to become a research<br />

405<br />

scientist.<br />

GTGGAAAATTTAGAAATGTCCACTGTAGGACGTGG<br />

But if you follow and poke something, we better know what the domino effect<br />

188<br />

Gentleman’s path from Hondas 406 to Harvard CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAA to the <strong>Hutchinson</strong> will be.” 177<br />

<strong>Center</strong>, his trajectory seems perfectly probable. The former He and his colleagues around the globe collaborate to write<br />

grease monkey has a penchant for<br />

$s_4_export.txt<br />

challenges, so it makes sense free software for the entire scientific community to use. “This is<br />

sequence count<br />

that he’d choose a research endeavor<br />

551 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA<br />

that’s moving at breakneck a high-risk area, which is not where commercial software goes,”<br />

266<br />

speed. For brilliant young minds, 552biocomputing GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC is the most he said. “We’re only interested 232 in being as close to the edge of<br />

challenging field around. 553 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT scientific investigation 187 as we can be. We share to learn. Real<br />

Advances in knowledge and<br />

554<br />

technology<br />

ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA<br />

have transformed change comes from people 164 working well together.”<br />

555 CTGTAGGACGTGGAATATGGCAAGAAAACTGAAAA 161<br />

556 TGTAGGACGTGGAATATGGCAAGAAAACTGAAAAT 139<br />

$s_6_export.txt<br />

sequence count<br />

701 TAGGACGTGGAATATGGCAAGAAAACTGAAAATCA 395<br />

702 GTAGGACGTGGAATATGGCAAGAAAACTGAAAATC 354<br />

703 AGGACGTGGAATATGGCAAGAAAACTGAAAATCAT 263<br />

704UNRAVELING ACTGTAGGACGTGGAATATGGCAAGAAAACTGAAA VAST DNA SEQUENCES, WRITTEN AS A-G-T-C 232 CODES, REQUIRES POWERFUL TOOLS.<br />

5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!