Abstract: In this paper, Q-learning and Double Q-learning reinforcement learning algorithms were used to fine-tune sliding mode controller parameters to balance the Ball-and-Beam system. Each ...