University of Oulu

Patch-based 3D reconstruction of deforming objects from monocular grey-scale videos

Saved in:
Author: Mostafa, Abdelrahman1
Organizations: 1University of Oulu, Faculty of Information Technology and Electrical Engineering, Computer Science
Format: ebook
Version: published version
Access: open
Online Access: PDF Full Text (PDF, 3.7 MB)
Pages: 56
Persistent link:
Language: English
Published: Oulu : A. Mostafa, 2020
Publish Date: 2020-06-30
Thesis type: Master's thesis (tech)
Tutor: Pedone, Matteo
Reviewer: Heikkilä, Janne
Pedone, Matteo


The ability to reconstruct the spatio-temporal depth map of a non-rigid object surface deforming over time has many applications in many different domains. However, it is a challenging problem in Computer Vision. The reconstruction is ambiguous and not unique as many structures can have the same projection in the camera sensor.

Given the recent advances and success of Deep Learning, it seems promising to use and train a Deep Convolutional Neural Network to recover the spatio-temporal depth map of deforming objects. However, training such networks requires a large-scale dataset. This problem can be tackled by artificially generating a dataset and using it in training the network.

In this thesis, a network architecture is proposed to estimate the spatio-temporal structure of the deforming object from small local patches of a video sequence. An algorithm is presented to combine the spatio-temporal structure of these small patches into a global reconstruction of the scene. We artificially generated a database and used it to train the network. The performance of our proposed solution was tested on both synthetic and real Kinect data. Our method outperformed other conventional non-rigid structure-from-motion methods.

see all

Copyright information: © Abdelrahman Mostafa, 2020. This publication is copyrighted. You may download, display and print it for your own personal use. Commercial use is prohibited.