Using long-read sequencing to shed light on complex allele-specific loci
Ontology highlight
ABSTRACT: In recent years, long-read sequencing technologies have detected transcript isoforms with unprecedented accuracy and resolution. However, it remains unclear whether long-read sequencing can effectively disentangle complex allele-specific loci that arise from genetic or epigenetic differences between alleles. Here, we combine the PacBio Iso-Seq workflow with the established phasing approach WhatsHap to assign long reads to the corresponding allele in polymorphic F1 mouse hybrids. Upon comparing the long-read sequencing results with matched short reads, we observed general consistency in the allele-specific information and were able to confirm the imprinting status of all the known imprinted genes. We then explored the complex imprinted Gnas locus known for allele-specific noncoding and coding isoforms and were able to benchmark historical observations. This approach also allowed us to elucidate allele-specific isoforms of genes that escape X-chromosome inactivation. The described workflow offers a promising framework and demonstrates the power of long-read transcriptomic data to provide mechanistic insight into complex allele-specific loci.
ORGANISM(S): Mus musculus
PROVIDER: GSE246857 | GEO | 2025/02/03
REPOSITORIES: GEO
ACCESS DATA