Improving cross-dataset generalization in image classification with contrastive representation learning

dc.contributor.authorSaffar, Najmeh
dc.contributor.examiningcommitteeKhoshdarregi, Matt (Mechanical Engineering)en_US
dc.contributor.examiningcommitteeYahampath, Pradeepa (Electrical and Computer Engineering)en_US
dc.contributor.examiningcommitteeKhan, Sheroz (KITE Research Institute)en_US
dc.contributor.supervisorAshraf, Ahmed
dc.date.accessioned2022-08-26T20:53:39Z
dc.date.available2022-08-26T20:53:39Z
dc.date.copyright2022-08-25
dc.date.issued2022-08-18
dc.date.submitted2022-08-25T07:32:49Zen_US
dc.date.submitted2022-08-25T19:58:42Zen_US
dc.degree.disciplineElectrical and Computer Engineeringen_US
dc.degree.levelMaster of Science (M.Sc.)en_US
dc.description.abstractRegular monitoring of marine wildlife is essential for rapid detection of changes in the marine ecosystem allowing for adaptive strategies. However, the manual analysis of large volumes of underwater images taken by cameras is highly time-consuming. Deep learning techniques have been adopted in marine wildlife for the automatic classification of underwater photos to accelerate image analysis. However, water quality varies at different locations, depths, and acquisition times during data collection. This, along with differences in other acquisition parameters, leads to datasets with idiosyncratic footprints and, therefore, limited generalization of the trained deep learning model to other sets of images different from the training set. As a result, more work is required toward improving the cross-dataset generalization of deep learning models. In our research, we started by assessing dataset biases' impact on cross-dataset generalization in the classification of beluga whale images from empty underwater image frames. We used three underwater image datasets with varying image acquisition profiles: a dataset of good water quality photos, moderately bad water quality photos, and a dataset of images with both the horizon and water in the same frame. Then, we investigated two frameworks to improve cross-dataset generalization. One attempts to unlearn dataset-specific information for explicitly handling the dataset bias problem. The other uses a contrastive loss for learning a representation by contrasting the images with beluga whales against the images with empty frames regardless of their dataset membership. We conducted an exhaustive evaluation of proposed deep learning architectures and compared performance using cross-dataset approaches with traditional architectures. The supervised contrastive approach outperforms the other architectures. To the best of our knowledge, this was the first use of contrastive settings to implicitly address the dataset bias problem.en_US
dc.description.noteOctober 2022en_US
dc.description.sponsorshipMitacs Accelerate Program University of Manitoba Startup Granten_US
dc.identifier.urihttp://hdl.handle.net/1993/36792
dc.language.isoengen_US
dc.rightsopen accessen_US
dc.subjectImage classificationen_US
dc.subjectObject detectionen_US
dc.subjectDeep learningen_US
dc.subjectConvolutional neural networken_US
dc.titleImproving cross-dataset generalization in image classification with contrastive representation learningen_US
dc.typemaster thesisen_US
local.subject.manitobayesen_US
oaire.awardNumberRGPIN-06457-2020en_US
oaire.awardTitleNSERC Discovery Granten_US
project.funder.identifierNSERC: https://doi.org/10.13039/501100000038en_US
project.funder.nameNatural Sciences and Engineering Research Council of Canadaen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
saffar_najmeh.pdf
Size:
2.39 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.2 KB
Format:
Item-specific license agreed to upon submission
Description: