Identification and characterization of centromeric sequences in Xenopus laevis
AbstractCentromeres play an essential function in cell division by specifying the site of kinetochore formation on each chromosome for mitotic spindle attachment. Centromeres are defined epigenetically by the histone H3 variant CEntromere Protein A (CENP-A). CENP-A nucleosomes maintain the centromere by designating the site for new CENP-A assembly after dilution by replication. Vertebrate centromeres assemble on tandem arrays of repetitive sequences but the function of repeat DNA in centromere formation has been challenging to dissect due to the difficulty in manipulating centromeres in cells. Xenopus laevis egg extracts assemble centromeres in vitro, providing a system for studying centromeric DNA functions. However, centromeric sequences in X. laevis have not been extensively characterized. In this study we combine CENP-A ChIP-seq with a k-mer based analysis approach to identify the X. laevis centromere repeat sequences. By in situ hybridization we show that X. laevis centromeres contain diverse repeat sequences and we map the centromere position on each X. laevis chromosome using the distribution of centromere enriched k-mers. Our identification of X. laevis centromere sequences enables previously unapproachable centromere genomic studies. Our approach should be broadly applicable for the analysis of centromere and other repetitive sequences in any organism.