PPO and A2C based adaptive bitrate algorithms (Variants of Pensieve, Pytorch version)

Re-implementation of existing neural ABR algorithms for Video-on-demand services with Pytorch.

User guide

To train the policy with a linear or logarithmic video quality metric, refer to ./main.py

To train the policy with a perceptual video quality metric, i.e., VMAF, refer to ./variant_vmaf/main_vmaf.py

In these files, you can run the Pensive training by python main.py --a2c or python ./variant_vmaf/main_vmaf.py --a2c. Please refer to ./script/train.sh for more details.

We also have implemented two variants of Pensieve: Pensieve with A3C algorithm (a well-established DRL method), and Pensieve with MAML algorithm (a meta-reinforcement learning method). You can run their training processes by python ./variant_vmaf/main_vmaf.py --a2c and python ./variant_vmaf/main_vmaf.py --a2br, respectively.

implemented by pytorch and trained using GPU

Note that the original version of Pensieve using asynchronous advantage actor-critic algorithm (A3C) to train the policy, which can only implementated on CPU. Our A2C version removes the asynchronous setting and use GPU to accelerate the speed of NNs training.

Further improvements are ongoing...

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
cooked_test_traces		cooked_test_traces
cooked_traces		cooked_traces
model		model
script		script
test_traces		test_traces
variant_vmaf		variant_vmaf
video_size		video_size
.gitignore		.gitignore
README.md		README.md
a3c.py		a3c.py
env.py		env.py
env_wrapper.py		env_wrapper.py
fixed_env.py		fixed_env.py
load_trace.py		load_trace.py
main.py		main.py
maml_ppo.py		maml_ppo.py
model_ac_torch.py		model_ac_torch.py
model_ppo_torch.py		model_ppo_torch.py
model_torch.py		model_torch.py
mpc.py		mpc.py
mpc_v2.py		mpc_v2.py
plot_results.py		plot_results.py
replay_memory.py		replay_memory.py
rl_no_training.py		rl_no_training.py
test.py		test.py
test_maml_torch.py		test_maml_torch.py
test_ppo_torch.py		test_ppo_torch.py
test_torch.py		test_torch.py
train.py		train.py
train_a2c.py		train_a2c.py
train_ac.py		train_ac.py
train_maml.py		train_maml.py
train_ppo.py		train_ppo.py
train_ppo_gae.py		train_ppo_gae.py

confiwent/NeuralABR-Pensieve-PPO-MAML

Folders and files

Latest commit

History

Repository files navigation

PPO and A2C based adaptive bitrate algorithms (Variants of Pensieve, Pytorch version)

User guide

About

Topics

Resources

Stars

Watchers

Forks

Languages