Identification and investigation of ORFans in the viral world

Yin, Yanbin; Fischer, Daniel
January 2008
BMC Genomics;2008, Vol. 9, Special section p1
Academic Journal
Background: Genome-wide studies have already shed light on the evolution and enormous diversity of the viral world. Nevertheless, one of the unresolved mysteries in comparative genomics today is the abundance of ORFans, that is, ORFs with no detectable sequence similarity to any other ORF in the databases. Recently, studies attempting to understand the origin and functions of bacterial ORFans have been reported. Here we present a first genome-wide identification and analysis of ORFans in the viral world, with a focus on bacteriophages.

Results: Almost one-third of all ORFs in 1,456 complete virus genomes correspond to ORFans, a figure significantly larger than that observed in prokaryotes. Like prokaryotic ORFans, viral ORFans are shorter and have a lower GC content than non-ORFans. Nevertheless, a statistically significant lower GC content is found in only a minority of viruses. By focusing on phages, we find that 38.4% of phage ORFs have no homologs in other phages, and 30.1% have homologs in neither the viral nor the prokaryotic world. Phages with different host ranges have different percentages of ORFans, reflecting different sampling status and suggesting differing levels of diversity. Similarity searches of the phage ORFeome (ORFans and non-ORFans) against prokaryotic genomes show that almost half of the phage ORFs have prokaryotic homologs, suggesting the major role that horizontal transfer plays in bacterial evolution. Surprisingly, the percentage of phage ORFans with prokaryotic homologs is only 18.7%. This suggests that phage ORFans play a lesser role in horizontal transfer to prokaryotes, but may be among the major players contributing to the vast phage diversity.

Conclusion: Although the current sampling of viral genomes is extremely low, ORFans and near-ORFans are likely to continue to grow in number as more genomes are sequenced. The abundance of phage ORFans may be partially due to the expected vast viral diversity, and may be instrumental in understanding viral evolution. The functions, origins and fates of the majority of viral ORFans remain a mystery. Further computational and experimental studies are likely to shed light on the mechanisms that have given rise to so many bacterial and viral ORFans.
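
The ORFan definition used in the abstract (an ORF with no detectable similarity to any other ORF in the databases) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `find_orfans`, the hit-tuple layout, the E-value cutoff of 1e-3, and the choice to ignore hits within the same genome are all assumptions made for illustration, since the abstract does not state them.

```python
from typing import Dict, Iterable, Set, Tuple

# Hypothetical hit record: (query ORF id, subject ORF id, E-value).
Hit = Tuple[str, str, float]


def find_orfans(
    orf_to_genome: Dict[str, str],
    hits: Iterable[Hit],
    max_evalue: float = 1e-3,  # assumed cutoff, not taken from the abstract
) -> Set[str]:
    """Return ORFs with no significant hit to an ORF in a different genome."""
    has_homolog: Set[str] = set()
    for query, subject, evalue in hits:
        if evalue > max_evalue:
            continue  # similarity too weak to count as detectable
        if query not in orf_to_genome or subject not in orf_to_genome:
            continue  # skip hits involving ORFs outside the data set
        # Assumption: hits within the same genome do not count as homologs.
        if orf_to_genome[query] != orf_to_genome[subject]:
            has_homolog.add(query)
            has_homolog.add(subject)
    return set(orf_to_genome) - has_homolog


# Toy data, invented for illustration only.
orf_to_genome = {"a1": "phageA", "a2": "phageA", "b1": "phageB", "c1": "phageC"}
hits = [
    ("a1", "b1", 1e-20),  # significant hit across genomes
    ("a2", "a1", 1e-10),  # significant, but within the same genome
    ("c1", "b1", 5.0),    # not significant
]
orfans = find_orfans(orf_to_genome, hits)
orfan_fraction = len(orfans) / len(orf_to_genome)
```

In this toy data set, `a2` and `c1` are classified as ORFans, so the ORFan fraction is 0.5. Percentages such as those reported in the abstract would come from the same kind of count applied to real similarity-search output.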


Related Articles

  • Visualization of ordered genomic RNA and localization of transcriptional complexes in rotavirus. Prasad, B.V. Venkataram; Rothnagel, R. // Nature;8/1/1996, Vol. 382 Issue 6590, p471 

    Discusses a three-dimensional characterization of viral genomes and the transcription complex in a prototypical double-stranded-RNA virus. Dodecahedral structure formed by viral dsRNA; Number of base pairs.

  • Genetic characterization of Pseudomonas aeruginosa bacteriophage KPP10. Uchiyama, Jumpei; Rashel, Mohammad; Takemura, Iyo; Kato, Shin-ichiro; Ujihara, Takako; Muraoka, Asako; Matsuzaki, Shigenobu; Daibata, Masanori // Archives of Virology;Apr2012, Vol. 157 Issue 4, p733 

    Bacteriophage (phage) KPP10 has been used in experimental phage therapies directed against P. aeruginosa infections. To examine the eligibility of phage KPP10 as a therapeutic phage, its genome was analyzed. The genomic DNA was shown to be 88,322 bp long, with 158 open reading frames (ORFs), and...

  • Genomic characterization of Salmonella bacteriophages isolated from India. Karpe, Yogesh; Kanade, Gayatri; Pingale, Kunal; Arankalle, Vidya; Banerjee, Kalyan // Virus Genes;Feb2016, Vol. 52 Issue 1, p117 

    Salmonella is a medically important Gram-negative foodborne pathogen. The genomic diversity of Salmonella is increasingly studied, but knowledge of Salmonella phage diversity remains limited. In this study, we have isolated Salmonella phages from sewage and river water. Genomic...

  • Evolution and the complexity of bacteriophages. Serwer, Philip // Virology Journal;2007, Vol. 4, p30 

    Background: The genomes of both long-genome (> 200 Kb) bacteriophages and long-genome eukaryotic viruses have cellular gene homologs whose selective advantage is not explained. These homologs add genomic and possibly biochemical complexity. Understanding their significance requires a definition...

  • Stability of Marek’s disease virus 132-bp repeats during serial in vitro passages. Niikura, M.; Dodgson, J. B.; Cheng, H. H. // Archives of Virology;Jul2006, Vol. 151 Issue 7, p1431 

    The Marek’s disease virus (MDV) genome contains 2 sets of 132-bp tandem repeat sequences. An increase in 132-bp repeat units has been associated with attenuation of oncogenicity during in vitro passage. By cloning entire genomes, we demonstrated that the copy number of 132-bp repeats can...

  • Genomic Encyclopedia of Bacteria and Archaea: Sequencing a Myriad of Type Strains. Kyrpides, Nikos C.; Hugenholtz, Philip; Eisen, Jonathan A.; Woyke, Tanja; Göker, Markus; Parker, Charles T.; Amann, Rudolf; Beck, Brian J.; Chain, Patrick S. G.; Chun, Jongsik; Colwell, Rita R.; Danchin, Antoine; Dawyndt, Peter; Dedeurwaerdere, Tom; DeLong, Edward F.; Detter, John C.; De Vos, Paul; Donohue, Timothy J.; Dong, Xiu-Zhu; Ehrlich, Dusko S. // PLoS Biology;Aug2014, Vol. 12 Issue 8, p1 

    This manuscript calls for an international effort to generate a comprehensive catalog from genome sequences of all the archaeal and bacterial type strains.

  • Genomic Analysis of Freshwater Cyanophage Pf-WMP3 Infecting Cyanobacterium Phormidium foveolarum: The Conserved Elements for a Phage. Liu, Xinyao; Kong, Shuanglei; Shi, Miao; Fu, Liwen; Gao, Yin; An, Chengcai // Microbial Ecology;Nov2008, Vol. 56 Issue 4, p671 

    Cyanophages are ecologically abundant, genetically diverse in aquatic environments, and affect the population and evolutionary trajectories of their hosts. After reporting the cyanophage Pf-WMP4 genome (Liu et al. in Virology 366:28–39, ), we hereby present a related cyanophage, Pf-WMP3,...

  • Heterogeneity in codon usages of sobemovirus genes. H. Zhou; H. Wang; L. F. Huang; Naylor, M.; Clifford, P. // Archives of Virology;Aug2005, Vol. 150 Issue 8, p1591 

    When conventional phylogenetic trees were built using 14 genome sequences of 9 sobemoviruses, two main lineages were apparent: monocot-infecting viruses and dicot-infecting viruses. To investigate whether members of the genus Sobemovirus originated from monocot hosts or from dicot hosts, we...

  • Evidence for inter- and intra-genotypic variations in dengue serotype 4 viruses representing predominant and non-predominant genotypes co-circulating in Thailand from 1977 to 2001. Zhao, Richard; Chinnawirotpisan, Piyawan; Klungthong, Chonticha; Zhang, Chunlin; Putnak, Robert // Virus Genes;Aug2010, Vol. 41 Issue 1, p5 

    In order to characterize viral genetic variation among predominant and non-predominant genotypes of Thai dengue serotype 4 viruses (DENV-4) and follow mutations that occur during virus evolution, we performed a comparative analysis of the complete genomic sequences of six DENV-4 isolates...

