Hi there,
Thanks for the good work! Just one question: is the DQN model trained (by selecting 1 participating device, as mentioned in the paper) before being deployed to the server for FL communications? Or is it trained in parallel with the FL communications? If so, how? I ask because, during FL communications, you are using the DQN to select the top k devices for training, right?
Can anyone else help me understand this? Thanks!