Global ETD Search

Return to search

Adaptive Streaming and Packet Scheduling for VR Video

Over the past few years, the surge in VR (Virtual Reality) video traffic on networks has been remarkable. Nonetheless, a key challenge remains: ensuring a top-notch quality of experience (QoE) for VR video playback, especially when network bandwidth is limited. Prior studies have mainly focused on tile-based adaptive bitrate (ABR) streaming operating at the application layer on the server/client side to improve QoE, using single viewport prediction to conserve bandwidth. However, single-viewpoint prediction models face limitations due to uncertainties linked with head movement, making it difficult to handle sudden user motions effectively. To overcome these constraints, we propose a lightweight multimodal spatial-temporal transformer architecture, which generates multiple viewpoint trajectories and their corresponding probabilities while leveraging historical trajectory information. Consequently, we introduce a multi-agent reinforcement learning (MARL)-based ABR algorithm that capitalizes on multiple viewport prediction for VR video streaming at the application layer. Our algorithm strives to optimize various QoE objectives under diverse network conditions. To address the ABR problem, we formulate it as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP) problem. To tackle this effectively, we develop a MAPPO (Multi-Agent Proximal Policy Optimization) algorithm within a centralized training and decentralized execution (CTDE) framework.
Meanwhile, we also improve QoE at the network layer by utilizing network resources
in different network nodes during VR video streaming. We present an innovative system called tile-weighted rate-distortion (TWRD) packet scheduling optimization, which takes advantage of viewpoint prediction. The system dynamically assigns weights to tiles and their corresponding packets using the probability of viewpoint prediction. Due to limited bandwidth, the problem of packet scheduling arises, requiring the determination of which packets should be dropped. To address this challenge, we formulate the problem as an optimization task, taking into account error propagation in the video. Our system leverages the weighted rate-distortion information of packets and applies dynamic programming techniques to design an optimal packet scheduling scheme. By selectively dropping packets at network nodes, our proposed system effectively reduces network congestion and enhances the overall performance of VR video streaming systems operating within bandwidth limitations.

VR video

rate adaptation

reinforcement learning

viewpoint prediction

packet scheduling

transformer attention

Identifer	oai:union.ndltd.org:uottawa.ca/oai:ruor.uottawa.ca:10393/45886
Date	25 January 2024
Creators	Wang, Haopeng
Contributors	El Saddik, Abdulmotaleb
Publisher	Université d'Ottawa / University of Ottawa
Source Sets	Université d’Ottawa
Language	English
Detected Language	English
Type	Thesis
Format	application/pdf

Page generated in 0.0026 seconds

Adaptive Streaming and Packet Scheduling for VR Video

Description

Links & Downloads

Tags

Additional Fields