Reinforcement learning (RL) focuses on training agents to choose, in each state of an environment, the action that maximises cumulative reward.
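To make the agent, environment, and reward in this definition concrete, here is a minimal sketch. It is illustrative only: the five-cell corridor environment, the `step` and `train` helpers, and all hyperparameter values are assumptions introduced for this example, and tabular Q-learning is just one of many RL algorithms that could fill the "training" role.

```python
import random

N_STATES = 5          # hypothetical corridor: cells 0..4, cell 4 is the goal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    # Break ties at random so the untrained agent does not get stuck.
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning; every default value here is illustrative."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state, done, steps = 0, False, 0
        while not done and steps < 1000:  # step cap guards against long episodes
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward now plus discounted best future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            steps += 1
    return q


q_table = train()
# Greedy action for each non-terminal state after training.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(greedy_policy)
```

Under these assumptions the learned greedy policy should settle on stepping right in every cell, since that is the only route to the reward. The discount factor `gamma` is what makes the agent value reaching the goal sooner rather than later.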