I will try to explain the map of RL in my own words here. Bear with me for a while, as it involves some interesting maths and arguments too.

We start with some model.