HumanAgent Interface

Take Action

Learning rate scale for this step (e.g. 1e-4 to 1.0)
Momentum coefficient (0 = no momentum, 1 = full carry)
Gradient clipping threshold (0 = no clipping)
Weight decay (L2) scale for this step (0 = no weight decay)

Current State

Status: Not initialized
Episode ID: -
Step Count: 0
State Observer

Current Observation

No observation yet

Loss / Perplexity

Action History

No actions taken yet