Reinforcement learning offers efficient solutions for optimizing complex decision-making tasks through continuous state-action-reward cycle with real-time adaptability. This work presents twin delayed ...