Reinforcement Learning MCQs

26 min Score: 0 Attempted: 0/26 Subscribe
1. What is the main objective of reinforcement learning (RL)?





2. In RL, what is an “agent”?





3. What does the term “policy” refer to in reinforcement learning?





4. Which of the following is a common approach to solving RL problems?





5. What is the “reward” in reinforcement learning?





6. In RL, what does the “value function” represent?





7. What is the “Q-function” in Q-Learning?





8. Which of the following is an off-policy algorithm?





9. In the context of RL, what does “exploration” mean?





10. What is “exploitation” in reinforcement learning?





11. What does the “Bellman Equation” describe?





12. Which algorithm uses a model of the environment to predict future states and rewards?





13. In RL, what does “Temporal Difference (TD) Learning” refer to?





14. What is the “discount factor” in reinforcement learning?





15. What is “Policy Gradient” in reinforcement learning?





16. What is the main advantage of using “Deep Reinforcement Learning”?





17. In the “Actor-Critic” method, what are the two main components?





18. What is “Monte Carlo Tree Search (MCTS)” used for in RL?





19. What does “SARSA” stand for in reinforcement learning?





20. What is “Reward Shaping”?





21. What does “Bootstrapping” refer to in reinforcement learning?





22. What is “Experience Replay” in deep reinforcement learning?





23. In the context of RL, what is a “Markov Decision Process (MDP)”?





24. What is “Dynamic Programming” in reinforcement learning?





25. Which of the following is a challenge in reinforcement learning?





26. What is “Double Q-Learning”?





Leave a Comment

All copyrights Reserved by MCQsAnswers.com - Powered By T4Tutorials