Publication | Open Access
Decoding topologically associating domains with ultra-low resolution Hi-C data by graph structural entropy
125
Citations
35
References
2018
Year
Megabase-size Tad-like DomainsEngineeringGeneticsMolecular BiologyNetwork AnalysisGenomicsBioinformatics DatabaseEpigeneticsComputational TopologyData ScienceHi-c Contact MatrixComputational GenomicsSequence AnalysisGenome StructureTopological RepresentationTopological Data AnalysisComputer ScienceFunctional GenomicsBioinformaticsChromatinGraph TheoryEntropyComputational BiologyProper Bin SizeStructure DiscoveryHigh-dimensional NetworkGraph Structural EntropySystems BiologyMedicine
Submegabase-size topologically associating domains (TAD) have been observed in high-throughput chromatin interaction data (Hi-C). However, accurate detection of TADs depends on ultra-deep sequencing and sophisticated normalization procedures. Here we propose a fast and normalization-free method to decode the domains of chromosomes (deDoc) that utilizes structural information theory. By treating Hi-C contact matrix as a representation of a graph, deDoc partitions the graph into segments with minimal structural entropy. We show that structural entropy can also be used to determine the proper bin size of the Hi-C data. By applying deDoc to pooled Hi-C data from 10 single cells, we detect megabase-size TAD-like domains. This result implies that the modular structure of the genome spatial organization may be fundamental to even a small cohort of single cells. Our algorithms may facilitate systematic investigations of chromosomal domains on a larger scale than hitherto have been possible.
| Year | Citations | |
|---|---|---|
Page 1
Page 1