2008 Barcelona - European Society of Human Genetics
2008 Barcelona - European Society of Human Genetics
2008 Barcelona - European Society of Human Genetics
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Concurrent Sessions<br />
reporting, and literature mining .<br />
Array-CGH technology is currently on its way to replace classical<br />
karyotyping as primary diagnostic tool in copy number screening . Its<br />
vast importance in clinical diagnosis and research are underlined by<br />
large number <strong>of</strong> genes in human development and disease still unknown.<br />
A data storage and mining tool aimed specifically at leveraging<br />
the power <strong>of</strong> Array-CGH in a clinical context will aid aetiology <strong>of</strong><br />
rare and complex genetic diseases by characterising genomic rearrangements,<br />
annotating and analysing clinical features, and providing<br />
advanced data mining and integration . For example, through an<br />
automated analysis <strong>of</strong> PUBMED abstracts, Bench allows to prioritize<br />
candidate genes in genomic deletions and duplications by phenotype<br />
characteristics annotated to the patient . Also, known Copy Number<br />
Variations, genes and their functions, and other genome annotations,<br />
when integrated with patient related data, aid in diagnosis and aetiology<br />
<strong>of</strong> new submicroscopic chromosomal imbalance syndromes .<br />
Bench provides a collaborative Array-CGH LIMS system that allows<br />
to maintain records <strong>of</strong> phenotype and chromosome rearrangements<br />
in patients, augmented with data mining, reporting and visualisation<br />
facilitating research, diagnostics, research and genetic counseling by<br />
involving relevant information from a variety <strong>of</strong> sources . Bench can<br />
be used free <strong>of</strong> charge in a research collaboration . http://www .esat .<br />
kuleuven .be/cghgate/ .<br />
c05.1<br />
Functional interactions <strong>of</strong> conserved non-coding (cNc)<br />
sequences with other cNc using circular chromosome<br />
conformation capture (4c)<br />
D. Robyr, G. Duriaux-Sail, S. E. Antonarakis;<br />
University <strong>of</strong> Geneva Medical School, Geneva, Switzerland.<br />
The comparison <strong>of</strong> human chromosome 21 (Hsa21) sequences with<br />
the mouse syntenic regions led to the identification <strong>of</strong> roughly 3500<br />
regions displaying an identity <strong>of</strong> >70% over a length <strong>of</strong> a least 100<br />
nucleotides <strong>of</strong> ungaped alignment . About 65% (~ 2300) <strong>of</strong> these are<br />
conserved non-coding sequences (CNCs) . Very little is known about<br />
the function <strong>of</strong> most CNCs . We speculated that a functional CNC may<br />
interact with its genomic target (i .e . an enhancer would bind to it’s cognate<br />
gene promoter). Thus, the identification <strong>of</strong> any part <strong>of</strong> the genome<br />
that interacts directly with a CNC could provide clues on the function <strong>of</strong><br />
the latter . We have generated libraries <strong>of</strong> CNC-interacting DpnII fragments<br />
by chromosome conformation capture (4C) whose identity is<br />
determined by subsequent high-throughput sequencing .<br />
We are currently screening for the interactions <strong>of</strong> 18 CNCs located in<br />
the two ENCODE regions <strong>of</strong> HSA21 in different cell lines . Preliminary<br />
results for two CNCs in K562 cells indicate that these may interact<br />
near regions <strong>of</strong> the genome that show conservation among vertebrates<br />
. Indeed, the median distances from the sequenced DpnII fragments<br />
to the nearest conserved region are 381 .5 bp (P = 0 .0583) and<br />
764bp (P = 0 .023) respectively for the 2 CNCs analysed . These results<br />
provide initial evidence that the function <strong>of</strong> CNCs is mediated by their<br />
interactions with other conserved regions . Interestingly, these CNCs<br />
are capable <strong>of</strong> interactions with loci not only in cis and over several Mb,<br />
but also in trans with loci located on other chromosomes .<br />
c05.2<br />
A high-resolution structural variation map <strong>of</strong> a human genome<br />
by next-generation, high-throughput paired-end sequencing<br />
F. M. De La Vega 1 , H. E. Peckham 2 , S. S. Ranade 2 , S. F. McLaughlin 2 , C. C.<br />
Lee 2 , Y. Fu 2 , Z. Zhang 1 , F. C. L. Hyland 1 , C. L. Clouser 2 , A. A. Antipova 2 , J. M.<br />
Manning 2 , C. L. Hendrickson 2 , L. Zhang 2 , E. T. Dimalanta 2 , T. D. Sokolsky 2 ,<br />
M. W. Laptewicz 2 , B. E. Coleman 2 , J. K. Ichikawa 2 , J. B. Warner 2 , B. Li 1 , J. M.<br />
Kidd 3 , J. A. Malek 4 , G. L. Costa 2 , E. E. Eichler 3 , K. J. McKernan 2 ;<br />
1 Applied Biosystems, Foster City, CA, United States, 2 Applied Biosystems, Beverly,<br />
MA, United States, 3 HHMI, University <strong>of</strong> Washington, Seattle, WA, United<br />
States, 4 Weill Cornell Medical College in Qatar, Doha, Qatar.<br />
The human genome and HapMap projects have considerably increased<br />
our understanding <strong>of</strong> the role <strong>of</strong> sequence variation in evolution<br />
and disease . Hybridization microarrays and fosmid-end sequencing<br />
reveal that structural variants (SVs) including insertions, deletions,<br />
duplications, inversions and translocations are common and extensive .<br />
Microarray methods, however, lack resolution and are blind to unbalanced<br />
events, while clone-based end-sequencing is time consuming<br />
and expensive . Here, we present a high-resolution survey <strong>of</strong> SVs <strong>of</strong> a<br />
human genome, a HapMap Yoruba sample (NA18507), by ultra-high<br />
throughput sequencing <strong>of</strong> paired-end libraries with the AB SOLiD(TM)<br />
System . We sequenced a variety <strong>of</strong> 2x25-bp paired-end libraries<br />
(>15Gb) with insert sizes ranging from 600bp to 6kb (SD 10-23%) .<br />
Each library provides over 10x physical (clone) coverage, with a total<br />
combined physical coverage >60x for 90% <strong>of</strong> the genome . The high<br />
physical coverage and diverse insert sizes allowed detecting small indels<br />
within tags (1-10 bp), and approximately 70,000 indels <strong>of</strong> length<br />
20 bp to >100 kb . Additionally, we sequenced 7Gb <strong>of</strong> 50-bp fragment<br />
libraries, which combined with the paired libraries provided over 12x<br />
sequence coverage, allowing us to discover millions <strong>of</strong> SNPs <strong>of</strong> which<br />
75% are found in dbSNP . Inferred SVs were compared to a database<br />
<strong>of</strong> end-sequence pairs <strong>of</strong> 10x physical coverage obtained by di-deoxy<br />
sequencing <strong>of</strong> 40kb fosmid ends . A subset <strong>of</strong> novel SVs were validated<br />
by PCR and Sanger sequencing . Our results serves as a model for<br />
further high-resolution exploration <strong>of</strong> genetic variation in human populations<br />
and cancer with next-generation sequencing .<br />
c05.3<br />
Expression analysis using deep solexa sequencing shows<br />
major advances in robustness, resolution and inter-lab<br />
portability over microarray platforms<br />
J. T. den Dunnen 1,2 , Y. Ariyurek 2 , H. H. Thygesen 1 , E. Vreugdenhil 3 , J. M.<br />
Boer 1 , G. B. van Ommen 1 , P. A. C. ‘t Hoen 1 ;<br />
1 <strong>Human</strong> and Clinical Gentics, Leiden, The Netherlands, 2 Leiden Genome<br />
Technology Center, Leiden, The Netherlands, 3 Medical Pharmacology, Leiden /<br />
Amsterdam Center for Drug Research, Leiden, The Netherlands.<br />
The hippocampal transcriptomes <strong>of</strong> wild-type mice and mice transgenic<br />
for δC-doublecortin-like kinase were analyzed with the Solexa deep<br />
sequencing technology . We determined around 2 million sequence<br />
tags per sample and compared these data with results <strong>of</strong> the same<br />
samples analysed using five different microarray platforms. Seventy<br />
percent <strong>of</strong> the sequence tags were mapped to approximately 30,000<br />
unique, high-confidence transcripts, their abundance spanning four<br />
orders <strong>of</strong> magnitude . Antisense transcription, undetectable by microarrays,<br />
was found in 51% <strong>of</strong> all genes, and alternative poly-adenylation<br />
in 47% . With a dedicated Bayesian model and false discovery rate<br />
<strong>of</strong> 8.5%, we measured statistically significant differential expression<br />
for 3179 transcripts; many more and with higher fold-changes than<br />
observed using microarrays . The deep sequencing technology demonstrates<br />
superb reproducibility, not only between biological replicates<br />
but even across laboratories . The described major advance in robustness,<br />
comparability and richness <strong>of</strong> sequence -based transcriptomics<br />
data is expected to boost in-depth collaborative, comparative and integrative<br />
genomics studies .<br />
c05.4<br />
studying gene dosage imbalance in embryonic stem cells<br />
G. Cobellis 1,2 , A. Romito 1 , R. De Cegli 1 , S. Iacobacci 1 , A. Fedele 1 , D. di Bernardo<br />
1 , A. Ballabio 1 ;<br />
1 TIGEM, Napoli, Italy, 2 Second University <strong>of</strong> Naples, Naples, Italy.<br />
To gain insight into the alterations <strong>of</strong> the transcriptional pathways underlying<br />
the pathogenesis <strong>of</strong> Down syndrome, we decided to use an<br />
integrated strategy combining the systematic overexpression <strong>of</strong> chromosome<br />
21 genes in ES cells, transcriptome analysis and systems<br />
biology approaches .<br />
We generated an ES cell clone bearing an inducible/exchangeable<br />
cassette in the ROSA26 locus to be used to insert, one by one, every<br />
murine orthologs <strong>of</strong> human chromosome 21 genes. Using this flexible<br />
system, we developed a library <strong>of</strong> ES cells over-expressing, in<br />
an inducible manner, murine orthologs <strong>of</strong> human transcription factors,<br />
kinases and miRNAs mapping on HSA21 in order to perturb the physiological<br />
genetic network at the cellular level . The biological pathways<br />
affected by the over-expression <strong>of</strong> each coding and non-coding gene<br />
and their regulators and regulated genes have been inferred using<br />
system biology approach . We already mapped the regulatory gene<br />
networks specifically altered by each transcription factors and kinases,<br />
opening new hypothesis toward the understanding <strong>of</strong> pathogenesis <strong>of</strong><br />
Down syndrome .<br />
Finally, the above-described ES cell lines have been used to study the<br />
gene dosage imbalance effects on ES cell differentiation to cardiomyocytes,<br />
myeloid and neuronal lineages, tissues affected by the DS .<br />
This project represent the first to involve a systematic overexpression