COMPARING THE PERFORMANCE OF A PID CONTROLLER WITH THAT OF A THERMOSTAT.
GOAL
To compare the control of the indoor temperature of a room between a PID controller and our DDPG Reinforcement Learning.
QUESTIONS
DEEP DETERMINISTIC POLICY GRADIENT
The deep deterministic policy gradient (DDPG) algorithm is a model-free, online, off-policy reinforcement learning method. A DDPG agent is an actor-critic reinforcement learning agent that computes an optimal policy that maximizes the long-term reward.
During training, a DDPG agent:
PID TEMPERATURE CONTROLLER
A PID temperature controller, as its name implies, is an instrument used to control temperature, mainly without extensive operator involvement. A PID controller in a temperature control system will accept a temperature sensor such as a thermocouple or RD as input and compare the actual temperature to the desired control temperature or setpoint. It will then provide an output to a control element.
SIMULINK MODELS
Controller
THERMOSTAT TEMPERATURE CONTROLLER
THERMOSTAT RESULTS
PID TEMPERATURE CONTROLLER
PID RESULTS
DDPG REINFORCEMENT LEARNING TEMPERATURE CONTROLLER
RL RESULTS
TRAINING PROCESS
CONCLUSIONS