After training for a period of time, new episodes cannot be generated.The phenomenon manifested is that training stops. This bug is hard to reproduce because it always appears after a long period of training, such as 100 epochs. I suspect it may be related to the communication between multiple processes.
After training for a period of time, new episodes cannot be generated.The phenomenon manifested is that training stops. This bug is hard to reproduce because it always appears after a long period of training, such as 100 epochs. I suspect it may be related to the communication between multiple processes.