SSRN 4768234
SSRN 4768234
• Discount factor (𝛾): The discount factor 𝛾 determines Where, Q (S, A) is the Q-value at state S for action A, and for
the influence of the future rewards and determines the computing Q- value immediate reward R(S, A) and maximum