I am trying to train my RL agent (ddpg) but it's performing quite poorly. I think it may be a problem with the hyperparameter values since I have not tuning. Now I have two questions-- If there is anything in MATLAB that may help solve this problem of hyperparameter optimization other than manual trial-and-error? How do I save the best performing agent given I don't know the critical values (i.e. don't know the range of the reward)? Basically, I want to save the agent that provides maximum reward or, say, top-5 highest rewarding agents?
John Williams answered .
2025-11-20