Strain classification of genomic data using variation graphs and gene ranking

dc.contributor.authorJayamanna, Vasena
dc.contributor.examiningcommitteeVan Domselaar, Gary (Medical Microbiology and Infectious Diseases)
dc.contributor.examiningcommitteeDurocher, Stephane (Computer Science)
dc.contributor.supervisorTremblay-Savard, Olivier
dc.date.accessioned2025-01-13T21:17:48Z
dc.date.available2025-01-13T21:17:48Z
dc.date.issued2024-12-20
dc.date.submitted2024-12-22T04:46:19Zen_US
dc.degree.disciplineComputer Science
dc.degree.levelMaster of Science (M.Sc.)
dc.description.abstractGenomics is the study of an organism’s genetic information; of how organism traits and characteristics are developed and inherited. A common problem in genomic analysis is identifying new strains of pathogens and classifying them. In scenarios involving pathogenic bacteria, for example, this can help with outbreak analysis, prediction, and prevention. There are existing high resolution classifiers that perform at the species level, but for organisms with high rates of gene transfer, classification at the sub-species (strain) level can still prove challenging. In this work, we implement a bioinformatics pipeline that tests the use of several metrics (one novel) for identifying specific loci of the foodborne disease pathogen Campylobacter jejuni that may be associated with particular strains. The pipeline itself is highly adaptable to user-provided bacterial genome data, and shows how certain tools can be used with our metric approach to classify novel strains into cluster groups from the user-provided metadata.
dc.description.noteFebruary 2025
dc.identifier.urihttp://hdl.handle.net/1993/38792
dc.language.isoeng
dc.subjectstrain
dc.subjectclassification
dc.subjectbacteria
dc.subjectvariation
dc.subjectgenomics
dc.titleStrain classification of genomic data using variation graphs and gene ranking
local.subject.manitobano
project.funder.identifierhttp://dx.doi.org/10.13039/100011094
project.funder.namePublic Health Agency of Canada
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis-VasenaJayamanna-CS-MSc.pdf
Size:
3.29 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
770 B
Format:
Item-specific license agreed to upon submission
Description: