An operon-based data science approach for the inference of tRNA and rRNA gene evolution

dc.contributor.author: Pawliszak, Tomasz
dc.contributor.examiningcommittee: Domaratzki, Michael (Computer Science); Hausner, Georg (Microbiology) [en_US]
dc.contributor.supervisor: Leung, Carson K. (Computer Science); Tremblay-Savard, Olivier (Computer Science) [en_US]
dc.date.accessioned: 2019-12-11T15:20:02Z
dc.date.available: 2019-12-11T15:20:02Z
dc.date.issued: 2019-12-10 [en_US]
dc.date.submitted: 2019-12-10T17:00:00Z [en]
dc.degree.discipline: Computer Science [en_US]
dc.degree.level: Master of Science (M.Sc.) [en_US]
dc.description.abstract: With advancements in technology, big data can be easily generated and collected. Big data mining and analytics are in demand for discovering important information and useful knowledge from these data. One example of big data in bioinformatics and biological data mining is the set of ribonucleic acid (RNA) genes in bacterial genomes. Specifically, in bacterial genomes, ribosomal ribonucleic acid (rRNA) and transfer ribonucleic acid (tRNA) genes are often organized into operons, i.e., segments of closely located genes that share a single promoter and are transcribed as a single unit. Analyzing how these genes and operons evolve can help us understand which evolutionary events most commonly affect them and give us a better picture of ancestral codon usage and protein synthesis. We introduce a new approach for the inference of evolutionary histories of rRNA and tRNA genes in bacteria, called BOPAL (Bacterial Operon Aligner), which is based on the identification of orthologous operons. This approach allows for a better inference of orthologous genes in genomes that have been affected by many rearrangements, which in turn helps with the inference of more realistic evolutionary scenarios and ancestors. From our comparisons of BOPAL with other gene order alignment programs using simulated data, we have found that BOPAL infers evolutionary events and ancestral gene orders more accurately than other alignment-based methods. An analysis of 12 Bacillus genomes also showed that BOPAL performs well in building ancestral histories with a minimal number of events. [en_US]
dc.description.note: February 2020 [en_US]
dc.identifier.citation: Tomasz Pawliszak, Meghan Chua, Carson K. Leung, and Olivier Tremblay-Savard. Operon-based approach for the inference of rRNA and tRNA evolutionary histories in bacteria. BMC Genomics (in press), 2019. [en_US]
dc.identifier.uri: http://hdl.handle.net/1993/34397
dc.language.iso: eng [en_US]
dc.rights: open access [en_US]
dc.subject: Data mining, Data science, Bioinformatics, Biological data mining [en_US]
dc.title: An operon-based data science approach for the inference of tRNA and rRNA gene evolution [en_US]
dc.type: master thesis [en_US]
Files
Original bundle
Name: pawliszak_tomasz.pdf
Size: 2.57 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 2.2 KB
Format: Item-specific license agreed to upon submission