Despite our knowledge of the sequence of the human genome, the relation of its three-dimensional dynamic architecture with its function – the storage and expression of genetic information – remains one of the central unresolved issues of our age. It became very clear meanwhile that this link is crucial for the entire holistic function of the genome on all genomic coding levels from the DNA sequence to the entire chromosomes. To fulfil the dreams for better diagnostics and treatment in the 21st century (e.g. by gene therapy by inserting a gene into a new global context), we propose here in a unique interdisciplinary project to combine experiment with theory to analyze the (epi-)genomic structure function relationships within the dynamic organization of the -Globin locus, the Immuno Globin loci, and the Tumor Necrosis Factor Alpha regulated SAMD4 region in mouse and human active and inactive cell states, and their global genomic context. The project consists of five work packages (WP1-WP5) and corresponding tasks connected in a system biological approach with iterative use of data, modelling, simulation and experiments via a unique data sharing and visualization platform: In WP1 (Längst, Rippe, Wedemann, Knoch/Grosfeld; T1-T5) to investigate nucleosomal association changes in relation to the DNA sequence and the activity of ATP-driven chromatin remodelling complexes, nucleosome positions will be determined by high-throughput sequencing. The resulting nucleosomal localization probability maps will be evaluated by a novel combination of analysis tools and innovative generic data ontologies. The relation to epigenetic modifications, to the activity of ATP-driven remodelling complexes and compaction degree of nucleosomes will be analysed to understand chromatin morphogenesis and fiber formation. In parallel, in WP2 (Grosveld/Knoch, Cook, Rippe, Längst; T1-T3) we determine by high-throughput monitoring of intra/inter chromosomal contacts and architecture absolute DNA-DNA interaction probability maps for the individual loci and their global context using a novel chromosome conformation capture approach based on deep sequencing. From these the 3D conformation of the chromatin fiber and its higher-order folding into loops and loop clusters can be derived using algorithms recently developed by us. WP3 (Cook, Grosveld/Knoch, Längst; T1-T5) focuses on the determination of transcription rates and structure by qRT-PCR, DNA and RNA fluorescence in situ hybridization using intronic probes and high-resolution laser-scanning and single molecule imaging with advanced image analysis tools. Transcription-dependent changes of active and inactive loci as well as rapid synchronous transcription alteration against the unchanged background is one main interest here. This will yield results in a detailed cartography of the structure-transcription-function dependency and its importance. To rationalize the experimental results theoretically, in WP4 (Wedemann Knoch/Grosveld, Rippe; T1-T3) simulations are made of nucleosomal structure, chromatin fiber conformation and chromosomal architecture using parallel and grid super-computers with ~10.000 CPUs. The impact of different nucleosomal positions and epigenetic modifications on the nucleosomal structure and the chromatin fiber conformation will be assessed by novel Monte Carlo approaches. To understand the higher-order architecture Brownian Dynamics simulations of entire cell nuclei with molecular resolution, morphogenic processes and transcriptional states will be made. This results in a virtual multi-scale model with unseen spatial and time resolution providing novel insights into genome organization. All the resulting virtual architectures will be compared to experiment to prompt again new experiments and vice versa in reiterative cycles. In WP5 (Knoch/Grosveld, Cook, Rippe, Längst, Wedemann; T1-T5) all partners together will integrate the experimental and theoretic results to achieve a system biological model with existing genome-wide data in the GLOBE 3D Genome Browser – a novel platform developed by us for the analysis, manipulation and understanding of multi-dimensional complex genome wide data in an easy to understand 3D visualization environment. The entire experimental data will be archived, all simulations and analysis on high-performance computing infrastructures (e.g. German MediGRID, Dutch Erasmus Computing Grid) will be controlled by a new management system. Together with a novel generic correlation finder and a process/pathway data base this will result together with our Portable Genome Format (PGF) in a unique system biological platform publicly available to understand genome complexity. We are strongly convinced, that the reiterative combination of quantitative experiment with theory leads to a virtual system biological model of the (epi-)genomic structure-function relationship with major impact and great valorization opportunities in research, training, diagnosis and treatment due to the uniqueness, novelty, and frontier position (~20 academic, ~10 industry collaborations underway). This is stressed by our i) scientific excellence, ii) interdisciplinary experience, iii) participative (SOP) management, and iv) IP valorization successes, embedded within strong education/training regimes and famous infrastructures. Consequently, our “EpiGenSys” virtual laboratory is a prime example for systems biology combining high-throughput/performance techniques of cell biology, mathematics, physics and informatics to solve one of the most fundamental issues of personalized genomic medicine. Beyond, we believe to make a major contribution to e-Science, e-Health, e-Learning, as well as e-Commerce creating a novel awareness and understanding of genomic complexity within society.

Additional Metadata
Keywords Genome, genomics, genome organization, genome architecture, structural sequencing, architectural sequencing, systems genomics, coevolution, holistic genetics, genome mechanics, genome statistical mechanics, genomic uncertainty principle, genome function, genetics, gene regulation, replication, transcription, repair, homologous recombination, simultaneous co-transfection, cell division, mitosis, metaphase, interphase, cell nucleus, nuclear structure, nuclear organization, chromatin density distribution, nuclear morphology, chromosome territories, subchromosomal domains, chromatin loop aggregates, chromatin rosettes, chromatin loops, chromatin fibre, chromatin density, persistence length, spatial distance measurement, histones, H1.0, H2A, H2B, H3, H4, mH2A1.2, DNA sequence, complete sequenced genomes, molecular transport, obstructed diffusion, anomalous diffusion, percolation, long-range correlations, fractal analysis, scaling analysis, exact yard-stick dimension, box-counting dimension, lacunarity dimension, local nuclear dimension, nuclear diffuseness, parallel super computing, grid computing, volunteer computing, Brownian Dynamics, Monte Carlo, fluorescence in situ hybridization, chromatin cross-linking, chromosome conformation capture (3C), selective high-resolution high-throughput chromosome interaction capture (T2C), confocal laser scanning microscopy, fluorescence correlation spectroscopy, super resolution microscopy, spatial precision distance microscopy, auto-fluorescent proteins, CFP, GFP, YFP, DsRed, fusion protein, in vivo labelling, information browser, visual data base access, holistic viewing system, integrative data management, extreme visualization, three-dimensional virtual environment, virtual paper tool.
Persistent URL
Conference EraSysBio+ Kick-Off Meeting, ANR Biopark, Paris, France, 17th - 18st May, 2010.
Knoch, T.A, Cook, P.R, Rippe, K, Gernot Längst, Wedemann, G, & Grosveld, F.G. (2010, May 17). Systems Biological Determination of the Epi-Genomic Structure Function Relation:. Presented at the EraSysBio+ Kick-Off Meeting, ANR Biopark, Paris, France, 17th - 18st May, 2010. Retrieved from