Clip-low increases entropy and clip-high decreases entropy in reinforcement learning of large language models (submitted)