Recurrent Neural Network for Learning Spatial and Temporal Information from Videos

Nabavi, Seyed shahabeddin

Recurrent Neural Network for Learning Spatial and Temporal Information from Videos

dc.contributor.author	Nabavi, Seyed shahabeddin
dc.contributor.examiningcommittee	Hu, Pingzhao (BMG/Computer Science) Ashraf, Ahmed (ECE)	en_US
dc.contributor.supervisor	Wang, Yang (Computer Science)	en_US
dc.date.accessioned	2019-07-19T13:40:35Z
dc.date.available	2019-07-19T13:40:35Z
dc.date.issued	2019	en_US
dc.date.submitted	2019-06-22T21:58:41Z	en
dc.degree.discipline	Computer Science	en_US
dc.degree.level	Master of Science (M.Sc.)	en_US
dc.description.abstract	Recurrent Neural Network is a well-established tool for sequential modelling. It includes a variety of techniques and models to extract temporal information from a sequence of data (e.g. frames of a video sequence). This thesis presents novel end-to-end deep learning recurrent based architectures for two computer vision problems: semantic segmentation prediction and camera pose estimation. Firstly, we investigate the problem of extracting temporal information in the context of semantic segmentation prediction. we demonstrate the capability of recurrent architecture in feature prediction by presenting a novel encoder-decoder convolutional LSTM architecture. We also utilize a bidirectional convolutional LSTM as an extension of our work. Furthermore, we explore a step-by-step extraction of spatial information in the problem of monocular camera pose estimation with an end-to-end unsupervised training scheme which relies on a recurrent based pose estimator. We illustrate the contribution of recurrent estimation (a.k.a step-by-step estimation) in the estimation of large displacements and complex transformations. We also show the impact of this process on the monocular depth estimation process.	en_US
dc.description.note	October 2019	en_US
dc.identifier.citation	Nabavi, Seyed shahabeddin. Rochan, Mrigank.Wang, Yang. (2018). Future Semantic Segmentation with Convolutional LSTM. British Machine Vision Conference	en_US
dc.identifier.uri	http://hdl.handle.net/1993/34039
dc.language.iso	eng	en_US
dc.rights	open access	en_US
dc.subject	Future semantic segmentation	en_US
dc.subject	Recurrent neural network	en_US
dc.subject	Unsupervised camera pose estimation	en_US
dc.subject	Spatial information	en_US
dc.subject	Deep learning	en_US
dc.subject	Computer vision	en_US
dc.subject	temporal information	en_US
dc.subject	Video prediction	en_US
dc.title	Recurrent Neural Network for Learning Spatial and Temporal Information from Videos	en_US
dc.type	master thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Nabavi_Seyed shahabeddin.pdf
Size:: 12.92 MB
Format:: Adobe Portable Document Format
Description:: Master's Thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.2 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

FGS - Electronic Theses and Practica