CLIP for point cloud understanding
dc.contributor.author | Ghose, Shuvozit | |
dc.contributor.examiningcommittee | Livi, Lorenzo (Computer Science) | |
dc.contributor.examiningcommittee | Kai-Sang Leung, Carson (Computer Science) | |
dc.contributor.supervisor | Wang, Yang | |
dc.date.accessioned | 2023-08-02T16:47:14Z | |
dc.date.available | 2023-08-02T16:47:14Z | |
dc.date.issued | 2023-08-01 | |
dc.date.submitted | 2023-08-02T02:07:56Z | en_US |
dc.degree.discipline | Computer Science | en_US |
dc.degree.level | Master of Science (M.Sc.) | |
dc.description.abstract | Contrastive Vision-Language Pre-training (CLIP) based point cloud classification model has added a new direction in the point cloud classification research domain. In this thesis, we propose two novel methods for CLIP-based point cloud classification. First, we propose a Pretrained Point Cloud to Image Translation Network (PPCITNet) that produces generalized colored images along with additional salient visual cues to the point cloud depth maps for CLIP based point cloud classification. In addition, we propose a novel viewpoint adapter that combines the view feature processed by each viewpoint as well as the global intertwined knowledge that exists across the multi-view features. Next, we propose a novel meta-episodic learning framework for CLIP-based point cloud classification. In addition, we introduce dynamic task sampling within the episode based on performance memory. The experimental results demonstrate the superior performance of the proposed model over existing state-of-the-art CLIP-based models on ModelNet10, ModelNet40, and ScanobjectNN datasets. | |
dc.description.note | October 2023 | |
dc.identifier.uri | http://hdl.handle.net/1993/37441 | |
dc.language.iso | eng | |
dc.rights | open access | en_US |
dc.subject | CLIP | |
dc.subject | Point Cloud understanding | |
dc.subject | Meta Learning | |
dc.subject | Few shot classification | |
dc.subject | Contrastive Language-Image Pre-Training | |
dc.subject | Point Cloud to Image Translation | |
dc.title | CLIP for point cloud understanding | |
dc.type | master thesis | en_US |
local.subject.manitoba | no | |
project.funder.name | University of Manitoba |