Spelling suggestions: "subject:"sense depth"" "subject:"dense depth""
1 |
Multi-view Video Coding Via Dense Depth FieldOzkalayci, Burak Oguz 01 September 2006 (has links) (PDF)
Emerging 3-D applications and 3-D display technologies raise
some transmission problems of the next-generation multimedia data.
Multi-view Video Coding (MVC) is one of the challenging topics in
this area, that is on its road for standardization via ISO MPEG. In
this thesis, a 3-D geometry-based MVC approach is proposed and
analyzed in terms of its compression performance. For this purpose,
the overall study is partitioned into three preceding parts. The
first step is dense depth estimation of a view from a fully
calibrated multi-view set. The calibration information and
smoothness assumptions are utilized for determining dense
correspondences via a Markov Random Field (MRF) model, which is
solved by Belief Propagation (BP) method. In the second part, the
estimated dense depth maps are utilized for generating (predicting)
arbitrary (other camera) views of a scene, that is known as novel
view generation. A 3-D warping algorithm, which is followed by an
occlusion-compatible hole-filling process, is implemented for this
aim. In order to suppress the occlusion artifacts, an intermediate
novel view generation method, which fuses two novel views generated
from different source views, is developed. Finally, for the last
part, dense depth estimation and intermediate novel view generation
tools are utilized in the proposed H.264-based MVC scheme for the
removal of the spatial redundancies between different views. The
performance of the proposed approach is compared against the
simulcast coding and a recent MVC proposal, which is expected to be
the standard recommendation for MPEG in the near future. These
results show that the geometric approaches in MVC can still be
utilized, especially in certain 3-D applications, in addition to
conventional temporal motion compensation techniques, although the
rate-distortion performances of geometry-free approaches are quite
superior.
|
2 |
Dense Depth Map Estimation For Object Segmentation In Multi-view VideoCigla, Cevahir 01 August 2007 (has links) (PDF)
In this thesis, novel approaches for dense depth field estimation and object segmentation from mono, stereo and multiple views are presented. In the first stage, a novel graph-theoretic color segmentation algorithm is proposed, in which the popular Normalized Cuts 59H[6] segmentation algorithm is improved with some modifications on its graph structure. Segmentation is obtained by the recursive partitioning of the weighted graph. The simulation results for the comparison of the proposed segmentation scheme with some well-known segmentation methods, such as Recursive Shortest Spanning Tree 60H[3] and Mean-Shift 61H[4] and the conventional Normalized Cuts, show clear improvements over these traditional methods.
The proposed region-based approach is also utilized during the dense depth map estimation step, based on a novel modified plane- and angle-sweeping strategy. In the proposed dense depth estimation technique, the whole scene is assumed to be region-wise planar and 3D models of these plane patches are estimated by a greedy-search algorithm that also considers visibility constraint. In order to refine the depth maps and relax the planarity assumption of the scene, at the final step, two refinement techniques that are based on region splitting and pixel-based optimization via Belief Propagation 62H[32] are also applied.
Finally, the image segmentation algorithm is extended to object segmentation in multi-view video with the additional depth and optical flow information. Optical flow estimation is obtained via two different methods, KLT tracker and region-based block matching and the comparisons between these methods are performed. The experimental results indicate an improvement for the segmentation performance by the usage of depth and motion information.
|
3 |
Improving deep monocular depth predictions using dense narrow field of view depth imagesMöckelind, Christoffer January 2018 (has links)
In this work we study a depth prediction problem where we provide a narrow field of view depth image and a wide field of view RGB image to a deep network tasked with predicting the depth for the entire RGB image. We show that by providing a narrow field of view depth image, we improve results for the area outside the provided depth compared to an earlier approach only utilizing a single RGB image for depth prediction. We also show that larger depth maps provide a greater advantage than smaller ones and that the accuracy of the model decreases with the distance from the provided depth. Further, we investigate several architectures as well as study the effect of adding noise and lowering the resolution of the provided depth image. Our results show that models provided low resolution noisy data performs on par with the models provided unaltered depth. / I det här arbetet studerar vi ett djupapproximationsproblem där vi tillhandahåller en djupbild med smal synvinkel och en RGB-bild med bred synvinkel till ett djupt nätverk med uppgift att förutsäga djupet för hela RGB-bilden. Vi visar att genom att ge djupbilden till nätverket förbättras resultatet för området utanför det tillhandahållna djupet jämfört med en existerande metod som använder en RGB-bild för att förutsäga djupet. Vi undersöker flera arkitekturer och storlekar på djupbildssynfält och studerar effekten av att lägga till brus och sänka upplösningen på djupbilden. Vi visar att större synfält för djupbilden ger en större fördel och även att modellens noggrannhet minskar med avståndet från det angivna djupet. Våra resultat visar också att modellerna som använde sig av det brusiga lågupplösta djupet presterade på samma nivå som de modeller som använde sig av det omodifierade djupet.
|
Page generated in 0.0388 seconds