dc.contributor.author | Shakir Jan, 01-134202-062 | |
dc.contributor.author | Adnan Musa, 01-134202-113 | |
dc.date.accessioned | 2024-07-09T06:19:38Z | |
dc.date.available | 2024-07-09T06:19:38Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | http://hdl.handle.net/123456789/17489 | |
dc.description | Supervised by Mr. Adil Khan | en_US |
dc.description.abstract | An agent that learns to make decisions by interacting with its surroundings is called a reinforcement learner. Through reinforcement learning, the agent gradually learns to behave in a way that maximizes its overall reward. Instead of receiving instructions from a teacher, it learns through trial and error. VizDoom is an easy-to-use research platform for reinforcement learning built on the game Doom. The task is to train agents in different VizDoom scenarios, including the basic scenario, defend the center, and deadly corridor, with the agents making decisions based on visual information from the screen. This project compares three common reinforcement learning algorithms: Double Dueling Deep Q-Learning Network (DDDQN), Advantage Actor-Critic (A2C), and Proximal Policy Optimization (PPO). We examine each algorithm's effectiveness in enhancing agent performance in the VizDoom environment through testing and analysis. Each of these algorithms is used to train the agents in our study, and their performance is assessed using a range of measures, including average score, number of kills, and reward. To further optimize each algorithm's performance, we tuned hyperparameters during training, changing critical hyperparameter values that influence overall agent behavior and lead to its improvement. The study's conclusions offer insight into the advantages and disadvantages of PPO, A2C, and DDDQN for improving agent performance in VizDoom. Furthermore, our findings on hyperparameter optimization clarify the importance of parameter adjustment in optimizing the performance of reinforcement learning algorithms in difficult scenarios. The present study explores how reinforcement learning (RL), a type of AI training, can be better applied to video games. By understanding how RL works in games, researchers can develop better ways to train AI agents, making them smarter and more fun to play against. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Computer Sciences | en_US |
dc.relation.ispartofseries | BS(CS);P-02177 | |
dc.subject | Enhancing Agent's Performance in Viz Doom | en_US |
dc.subject | Reinforcement Learning | en_US |
dc.title | Enhancing Agent's Performance in Viz Doom Using Reinforcement Learning | en_US |
dc.type | Project Reports | en_US |
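The abstract above describes training agents in the basic, defend-the-center, and deadly-corridor VizDoom scenarios and evaluating them by average score, kills, and reward. The snippet below is a minimal sketch of how an agent interacts with a VizDoom scenario through the vizdoom Python API; it is not the project's actual training code. A learned DDDQN, A2C, or PPO policy would replace the random action choice, and the config path, frame-repeat value, and episode count are assumptions for illustration.

# Minimal interaction sketch (assumed setup, not the project's code):
# run a placeholder random policy through a VizDoom scenario and report
# the average episode score, one of the measures mentioned in the abstract.
import random
import vizdoom as vzd

game = vzd.DoomGame()
game.load_config("scenarios/basic.cfg")  # assumed path to the basic scenario config
game.set_window_visible(False)           # run headless during training
game.init()

# One-hot action space for the basic scenario: move left, move right, attack.
actions = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

episode_rewards = []
for episode in range(10):
    game.new_episode()
    while not game.is_episode_finished():
        state = game.get_state()
        frame = state.screen_buffer          # raw pixels the agent sees
        action = random.choice(actions)      # placeholder for policy(frame)
        reward = game.make_action(action, 4) # repeat the action for 4 game tics
    episode_rewards.append(game.get_total_reward())

game.close()
print("average score over episodes:", sum(episode_rewards) / len(episode_rewards))

In a full training setup, the frame returned by get_state() would be preprocessed (resized, grayscaled, stacked) and fed to the policy network, and the reward returned by make_action() would drive the DDDQN, A2C, or PPO update.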