Let's understand RL intuitively.
A brief summary of non-linear optimization.
Quick bites of some interesting paper