Making perplexity calculations consistent across sample length #27

mcleish7 · 2024-07-13T15:25:26Z

When calculating perplexity, lines 110 and 115 operate on half of the sequence (for example token_probs[mid_index:], where mid_index = len(token_probs) // 2) but divide by len(token_probs), which is roughly twice the length of that half. Line 120, which handles the whole sequence, also divides by len(token_probs), which is the correct length in that case.
This means the half-sequence perplexities are normalized differently from the whole-sequence perplexity, so there is a slight inconsistency in the calculation.

I have corrected this by using a single function for all cases, which also removes the repeated code.
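
To illustrate the idea, here is a minimal sketch rather than the code from this PR. It assumes the values are per-token log-probabilities; the name `perplexity`, the variable `token_log_probs`, and the sample values are hypothetical. The key point is that the helper always divides by the length of the span it is given.

```python
import math


def perplexity(log_probs):
    """Perplexity of a token span: exp of the negative mean log-probability."""
    if not log_probs:
        raise ValueError("need at least one token")
    # Normalize by the length of the span actually passed in.
    return math.exp(-sum(log_probs) / len(log_probs))


# Hypothetical data: eight tokens, each with probability 0.5.
token_log_probs = [math.log(0.5)] * 8
mid_index = len(token_log_probs) // 2
second_half = token_log_probs[mid_index:]

# The inconsistent variant: sum over the half, divide by the full length.
inconsistent = math.exp(-sum(second_half) / len(token_log_probs))

# The consistent variant: one helper for the whole sequence and for any slice.
full_ppl = perplexity(token_log_probs)
half_ppl = perplexity(second_half)
```

With uniform token probabilities the whole sequence and its second half should have the same perplexity, which holds for `full_ppl` and `half_ppl` but not for `inconsistent`.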

…probabilities

CLAassistant · 2024-07-13T15:25:32Z

All committers have signed the CLA.

Creating single function to handle calculating perplexity from token …

2c29229

…probabilities
