Evolution of protein and non-coding RNA genes studied with comparative genomics
Abstract
The identification of protein and non-coding RNA (ncRNA) genes is one important step in the analysis of a genome. This thesis focuses on the identification and analysis of proteins and ncRNAs homologues by exploiting a variety of computational methods in order to reach conclusions as to their structure, function, evolution and regulation. This work is composed of two different parts. One deals with computational prediction of protein and ncRNA homologues from different ribonucleoprotein (RNP) complexes and the other addresses problems related to non-random gene order in eukaryotes.
In the first part RNPs that were previously not well explored with respect to their phylogenetic distribution were examined. Thus, homology-based methods were employed to analyze the RNP complexes of RNase P, RNase MRP and the spliceosome as well as RNPs and RNA structures involved in the 3' end processing of histone mRNAs. We identified a large number of previously unrecognized homologues that improved our
understanding of the evolution of the different RNPs. For example, homology relationships of the RNases P and MRP proteins were identified providing further evidence of homology between the human and the yeast RNPs. We presented evidence that the histone 3’ end processing machinery is more ancient than previously anticipated and can be traced to the root of the eukaryotic phylogenetic tree. We presented a detailed map of the distribution of the spliceosomal U12-type RNA genes, supporting an early origin of the minor spliceosome and pointing to a number of occasions where it was lost during evolution.
In the second part we generated gene order maps to show the localization of both protein and ncRNA genes in a wide range of eukaryotic organisms. Non-random gene order was then examined to identify the most important determinants of gene order conservation. One important conclusion was that gene pairs that are evolutionarily conserved and that are divergently transcribed are much more likely to be related by function as compared to poorly conserved gene pairs. The genes of such pairs are likely to be related also in terms of transcriptional control. Moreover, we presented the eukaryotic Gene Order Browser (eGOB), where data related to this project is available and where researchers can visualize and compare the evolution of gene organization in different organisms. In addition, the browser may be used to identify pairs of adjacent genes that are evolutionarily conserved and likely to be transcriptionally linked. eGOB is available at http://egob.bioimedicine.gu.se.
Parts of work
Rosenblad MA, López MD, Piccinelli P, Samuelsson T. (2006). Inventory and analysis of the protein subunits of the ribonucleases P and MRP provides further evidence of homology between the yeast and human enzymes. Nucleic Acids Res. 34(18):5145-56. ::PMID::16998185 Dávila López M, Rosenblad MA, Samuelsson T. (2008). Computational screen for spliceosomal RNA genes aids in defining the phylogenetic distribution of major and minor spliceosomal components. Nucleic Acids Res. 36(9):3001-10. ::PMID::18390578 Dávila López M, Samuelsson T. (2008). Early evolution of histone mRNA 3' end processing.
RNA. 14(1):1-10. ::PMID::17998288 Dávila López M, Martínez Guerra JJ, Samuelsson T. (2010). Analysis of gene order conservation in eukaryotes identifies transcriptionally and functionally linked genes. PLoS One. 14;5(5):e10654. ::PMID::20498846 Davila Lopez, M. and Samuelsson, T. (2010). eGOB: Eukaryotic Gene Order Browser. Submitted for publication
Degree
Doctor of Philosophy (Medicine)
University
University of Gothenburg. Sahlgrenska Academy
Institution
Institute of Biomedicine. Department of Medical Biochemistry and Cell Biology
Disputation
Fredagen den 18 januari 2011, kl. 9.00, Hörsal Tör Bjurström
Date of defence
2011-01-18
marcela.davila@medkem.gu.se
Date
2010-12-16Author
Davila Lopez, Marcela
Keywords
Bioinformatics
evolution
non-coding RNA
gene prediction
gene order
bidirectional promoter
homologue prediction
secondary structure
RNAse P
RNase MRP
Publication type
Doctoral thesis
ISBN
978-91-628-8183-2
Language
eng