sorry I try to show my screen picture but fail to load
@awjuliani many thanks.
I think I miss the debug configure setting, sorry for the previous message. Thanks.
what the... I haven't trained my agent on puzzle rooms at all (floors 10+) and it somehow got a score of 25 on one of the runs (14 floors probably)?! It gets to floor 10 from time to time, but not often. I hope I made a mistake in reward calculation... otherwise it might have become self aware D:
Hi @KarolisRam Could you possibly record a video of the agent? It might have discovered a bug that allows it to skip floors :)
I'll try. Sadly I wasn't setting/saving random seeds, I will try to do a sweep of seeds and figure out what's happening.
hmm even with a set seed the agent starting position seems to be a bit random (the agent is sometimes a bit further forward). Is this intended?