合肥生活安徽新闻合肥交通合肥房产生活服务合肥教育合肥招聘合肥旅游文化艺术合肥美食合肥地图合肥社保合肥医院企业服务合肥法律

代做COMP532、代写a video game from OpenAI Gym

时间:2024-04-19  来源:合肥网hfw.cc  作者:hfw.cc 我要纠错



COMP532-202324 Assignment 2
You need to solve each of the following problems. The assignment aims to design and
implement a deep reinforcement learning agent for a video game from OpenAI Gym or
Gymnasium. You must also include a brief report describing and discussing your solutions to the
problems. Students can do the assignment in groups or individuals.
● This assignment is worth 15% of the total mark for COMP532
● 80% of the assignment marks will be awarded for correctness of results
● 20% of the assignment marks will be awarded for the quality of the accompanying report
● Students will do the assignment in groups
● The assignment marks will be awarded for correctness of results
● We expect 5 students in one group (it would be fine to have groups of 1, 2, 3, and 4 as
well, but it is suggested to have groups of 5), please find your team members on your
own.
● Only one single submission is needed for each group
● The same marks will be granted to all the members in the same group
● Please list all your group members (names, emails, student ids) and individual
contributions in your submitted report
Submission Instructions
● Deadline: 22 Apr 2024 17:00 (UK Time)
● Send all solutions as a single PDF document containing your answers, results, and
discussion of the results. Attach the source code for the programming problems as
separate files.
● Submit your solution via Canvas.
● Penalties for late submission apply in accordance with departmental policy as set
out in the student handbook, which can be found at
https://intranet.csc.liv.ac.uk/student/msc-handbook.pdf and the University Code of
Practice on Assessment, found at
https://www.liverpool.ac.uk/media/livacuk/tqsd/code-of-practice-on-assessment/code_of_
practice_on_assessment.pdf
Problem 1 (80 marks)
Implement a deep reinforcement learning agent for a game or environment of OpenAI Gym or
Gymnasium.
Use the lunar_lander environment:
https://gymnasium.farama.org/environments/box2d/lunar_lander/.
Please plot the learning progress of your method from 0 to 1000 episodes. You can have a
figure to show rewards and another figure to show training loss.
Please use a video or gifs or figures to demonstrate how your agent works.
Prepare a report explaining your solution and containing your results, and discussion of the
results.
Attach the source code as separate files. For example, .ipnb - an ipython notebook file.
Problem 2 (20 marks)
Explain exploration and exploitation for deep reinforcement learning.

请加QQ:99515681  邮箱:99515681@qq.com   WX:codinghelp













 

扫一扫在手机打开当前页
  • 上一篇:代做CSE340、代写Parsing编程语言
  • 下一篇:泰国留学签离境后要注销吗(泰国留学签注销的流程是什么)
  • 无相关信息
    合肥生活资讯

    合肥图文信息
    海信罗马假日洗衣机亮相AWE  复古美学与现代科技完美结合
    海信罗马假日洗衣机亮相AWE 复古美学与现代
    合肥机场巴士4号线
    合肥机场巴士4号线
    合肥机场巴士3号线
    合肥机场巴士3号线
    合肥机场巴士2号线
    合肥机场巴士2号线
    合肥机场巴士1号线
    合肥机场巴士1号线
    合肥轨道交通线路图
    合肥轨道交通线路图
    合肥地铁5号线 运营时刻表
    合肥地铁5号线 运营时刻表
    合肥地铁4号线 运营时刻表
    合肥地铁4号线 运营时刻表
  • 关于我们 | 打赏支持 | 广告服务 | 联系我们 | 网站地图 | 免责声明 | 帮助中心 | 友情链接 |

    Copyright © 2020 hfw.cc Inc. All Rights Reserved. 合肥网 版权所有
    ICP备06013414号-3 公安备 42010502001045