Publication | Open Access
Nonlinear sequence similarity between the <i>Xist</i> and <i>Rsx</i> long noncoding RNAs suggests shared functions of tandem repeat domains
38
Citations
60
References
2019
Year
The marsupial inactive X chromosome expresses a long noncoding RNA (lncRNA) called <i>Rsx</i> that has been proposed to be the functional analog of eutherian <i>Xist</i> Despite the possibility that <i>Xist</i> and <i>Rsx</i> encode related functions, the two lncRNAs harbor no linear sequence similarity. However, both lncRNAs harbor domains of tandemly repeated sequence. In <i>Xist</i>, these repeat domains are known to be critical for function. Using <i>k</i>-mer based comparison, we show that the repeat domains of <i>Xist</i> and <i>Rsx</i> unexpectedly partition into two major clusters that each harbor substantial levels of nonlinear sequence similarity. <i>Xist</i> Repeats B, C, and D were most similar to each other and to <i>Rsx</i> Repeat 1, whereas <i>Xist</i> Repeats A and E were most similar to each other and to <i>Rsx</i> Repeats 2, 3, and 4. Similarities at the level of <i>k</i>-mers corresponded to domain-specific enrichment of protein-binding motifs. Within individual domains, protein-binding motifs were often enriched to extreme levels. Our data support the hypothesis that <i>Xist</i> and <i>Rsx</i> encode similar functions through different spatial arrangements of functionally analogous protein-binding domains. We propose that the two clusters of repeat domains in <i>Xist</i> and <i>Rsx</i> function in part to cooperatively recruit PRC1 and PRC2 to chromatin. The physical manner in which these domains engage with protein cofactors may be just as critical to the function of the domains as the protein cofactors themselves. The general approaches we outline in this report should prove useful in the study of any set of RNAs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1