For an introduction to Reinforcement Learning, its basic terminologies, concepts and types read Reinforcement Learning - Part 1 by following this link: http://blog.cerelabs.com/2017/04/reinforcement-learning-part-1.html Q-Learning Q learning is an algorithm in reinforcement learning. It originates from the model based reinforcement learning. It can be referred to as a different kind of value function. The values are called Q values and are denoted by Q(s,a). It signifies the Q value when in a state 's' and taking an action 'a'. Mathematically, Q(s,a) = R(s) + γ Σ s' P(s,a,s') max a' Q(s',a') It can be defined as the value for arriving in a state which is obtained by learning via action 'a' and proceeding optimally thereafter. Also, ...