A Closer Look at Invalid Action Masking in Policy Gradient Algorithms (2020-06-25T00:00:00.000000Z)