Long Context Reasoning Benchmark (LCRB) - Pump