Approaching “Big Data” in Biological Research Imaging Spectroscopy with Novel Compression

dc.contributor.authorChen, Yixuan
dc.contributor.examiningcommitteePaliwal, Jitendra (Biosystems Engineering) Leung, Carson Kai-Sang (Computer Science)en_US
dc.contributor.supervisorMorrison, Jason (Biosystems Engineering)en_US
dc.date.accessioned2014-04-10T16:28:08Z
dc.date.available2014-04-10T16:28:08Z
dc.date.issued2014-04-10
dc.degree.disciplineBiosystems Engineeringen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractThis research focuses on providing a fast and space efficient compression method to answer information queries on spectroscopic data. Our primary hypothesis was whether a conversion from decimal data to character/integer space could be done in a manner that enables use of succinct structures and provides good compression. This compression algorithm is motivated to handle queries on spectroscopic data that approaches limits of main computer memory. The primary hypothesis is supported in that the new compression method can save 79.20% - 94.07% computer space on the average. The average of maximum error rates is also acceptable, being 0.05% - 1.36% depending on the subject that the data was collected from. Additionally, the data’s compression rate and entropy are negatively correlated; while compression rate and maximum error were positively correlated when the max error rates were performed on a natural logarithm transformation. The effects of different types of data sources on compression rate have been studied as well. Fungus datasets achieved highest compression rates, while mouse brain datasets obtained the lowest compression rates among four types of data sources. Finally, the effect of the studied compression algorithm and method on integrating spectral bands has been investigated in this study. The spectral integration for determining lipid, CH2 and dense core plaque obtained good image quality and the errors can be considered inconsequential except the case of determining creatine deposits. Despite the fact that creatine deposits are still recognizable in the reconstructed image, the image quality was reduced.en_US
dc.description.noteMay 2014en_US
dc.identifier.urihttp://hdl.handle.net/1993/23434
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectImage Compressionen_US
dc.subjectBig Dataen_US
dc.titleApproaching “Big Data” in Biological Research Imaging Spectroscopy with Novel Compressionen_US
dc.typemaster thesisen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Chen_Yixuan.pdf
Size:
1.98 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.25 KB
Format:
Item-specific license agreed to upon submission
Description: