Uncovering the Potential Pan Proteomes Encoded by Genomic Strand RNAs of Influenza A Viruses.
Ontology highlight
ABSTRACT: Influenza A virus genomes are composed of eight negative sense RNAs. In total, 16 proteins encoded by eight positive sense RNAs were identified. One putative protein coding sequence (PCS) encoded by genomic strand RNA of segment 8 has been previously proposed. In this study, 95,608, 123,965 and 35,699 genomic strand RNA sequences from influenza A viruses from avian, human and mammalian hosts, respectively, were used to identify PCSs encoded by the genomic strand RNAs. In total, 326,069 PCSs with lengths equal to or longer than 80 amino acids were identified and clustered into 270 PCS groups. Twenty of the 270 PCS groups which have greater than 10% proportion in influenza A viruses from avian, human or mammalian hosts were selected for detailed study. Maps of the 20 PCSGs in the influenza A virus genomes were constructed. The proportions of the 20 PCSGs in influenza A viruses from different hosts and serotypes were analyzed. One secretory and five membrane proteins predicted from the PCS groups encoded by genomic strand RNAs of segments 1, 2, 4, 6, 7 and 8 were identified. These results suggest the possibility of the ambisense nature of the influenza A virus genomic RNAs and a potential coding sequence reservoir encoding potential pan proteomes of influenza A viruses.
SUBMITTER: Yang CW
PROVIDER: S-EPMC4711952 | biostudies-literature |
REPOSITORIES: biostudies-literature
ACCESS DATA