ou_strategy noise missing dt? #194

zacwellmer · 2017-11-07T06:07:26Z

I could be wrong, but it seems like evolve_state in ou_strategy.py needs a dt parameter?

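def evolve_state(self):
    x = self.state
    # Euler-Maruyama step: the drift scales with dt, the diffusion with sqrt(dt).
    # size=x.shape draws one noise sample per state dimension.
    dx = self.theta * (self.mu - x) * self.dt + self.sigma * np.sqrt(self.dt) * np.random.normal(size=x.shape)
    self.state = x + dx
    return self.state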
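
For anyone who wants to try the update outside of rllab, here is a minimal standalone sketch of the same Euler-Maruyama step. The names ou_step and simulate, and the defaults theta=0.15 and sigma=0.3, are assumptions made for illustration and are not part of ou_strategy.py. With dt = 1 the step collapses to an update with no dt term at all.

```python
import numpy as np


def ou_step(x, mu, theta, sigma, dt, rng):
    """One Euler-Maruyama step of dx = theta * (mu - x) * dt + sigma * dW."""
    noise = rng.normal(size=x.shape)  # one standard-normal sample per dimension
    return x + theta * (mu - x) * dt + sigma * np.sqrt(dt) * noise


def simulate(n_steps, dt, mu=0.0, theta=0.15, sigma=0.3, dim=1, seed=0):
    """Roll the process forward from x = mu and return the visited states."""
    rng = np.random.default_rng(seed)
    x = np.full(dim, mu, dtype=float)
    path = np.empty((n_steps, dim))
    for t in range(n_steps):
        x = ou_step(x, mu, theta, sigma, dt, rng)
        path[t] = x
    return path


if __name__ == "__main__":
    # Same simulated time horizon (100 time units) at two step sizes.
    coarse = simulate(n_steps=100, dt=1.0)
    fine = simulate(n_steps=10_000, dt=0.01)
    print(coarse.std(), fine.std())
```

Scaling the noise by sqrt(dt) is what keeps the variance accumulated per unit of simulated time roughly independent of the step size.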

dementrock · 2017-11-20T07:29:05Z

Makes sense. Would you like to submit a pull request?

zacwellmer · 2017-11-20T12:07:23Z

#197 submitted

schneimo · 2019-06-21T10:59:10Z

Another question regarding the OU noise process here:
In baselines the previous state is initialized to an array of zeros, whereas here in rllab the state is initialized to an array filled with ones!?
I think the baselines version is correct, because that way the noise is centered around 0, or am I missing something?

Baselines:

def reset(self):
    self.x_prev = self.x0 if self.x0 is not None else np.zeros_like(self.mu)

rllab:

def reset(self):
    self.state = np.ones(self.action_space.flat_dim) * self.mu
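
One way to check this is to compare the two initial states directly. The sketch below is a standalone approximation, not code from either library: rllab_style_reset and baselines_style_reset are hypothetical names that mirror the two reset bodies quoted above. It suggests the rllab state is an array filled with mu rather than with ones, so the two initializations would only differ when mu is nonzero.

```python
import numpy as np


def rllab_style_reset(flat_dim, mu):
    """Mirrors np.ones(flat_dim) * mu: every entry equals mu."""
    return np.ones(flat_dim) * mu


def baselines_style_reset(mu, x0=None):
    """Mirrors x0 if x0 is not None else np.zeros_like(mu)."""
    return x0 if x0 is not None else np.zeros_like(mu)


if __name__ == "__main__":
    # With mu = 0 both variants start from an all-zero state.
    print(rllab_style_reset(3, 0.0), baselines_style_reset(np.zeros(3)))
    # With mu = 0.5 the first starts at the mean, the second at zero.
    print(rllab_style_reset(3, 0.5), baselines_style_reset(np.full(3, 0.5)))
```

Starting at mu means the process begins at its long-run mean, while starting at zero (when mu is nonzero) gives a transient that drifts toward mu at a rate set by theta.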
