
Each test run contains a run config that references the checkpoint from the corresponding train run. The same config also holds the settings needed to reproduce the demonstration data by rolling out the expert policy. state_logs ...
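
As a rough illustration of how a test run's config might point back at a train-run checkpoint, here is a minimal sketch. The source does not show the actual config format, so every key name, path, and value below (`train_run`, `checkpoint`, `rollout`, and the `checkpoint_path` helper) is a hypothetical placeholder.

```python
from pathlib import Path

# Hypothetical test-run config. All keys and paths are assumptions made
# for illustration; the real config schema is not shown in the source.
test_run_config = {
    "train_run": "runs/train/example_run",   # directory of the train run (assumed)
    "checkpoint": "checkpoints/final.pt",    # checkpoint file inside that run (assumed)
    "rollout": {"num_episodes": 10, "seed": 0},  # expert-policy rollout settings (assumed)
}


def checkpoint_path(config):
    """Join the train-run directory with the checkpoint file it provides."""
    return Path(config["train_run"]) / config["checkpoint"]


print(checkpoint_path(test_run_config).as_posix())
```

Keeping the checkpoint reference and the rollout settings in one config means a single file is enough to regenerate the demonstration data, provided the referenced train run is still available.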