Guessit name/language problem #660

Open
Entixs opened this issue Oct 12, 2020 · 2 comments

Entixs commented Oct 12, 2020

Guessit has a problem with the anime title Hi Score Girl: https://www.thetvdb.com/series/high-score-girl

It parses "Hi" as the language code for Hindi, as shown in this log entry from Medusa:

2020-10-12 19:25:39 DEBUG FORCEDSEARCHQUEUE-MANUAL-346673 :: [AnimeBytes] :: [e194cb2] Error during parsing of release name: HI.SCORE.GIRL.II.S01E01-E09.Blu-ray.MKV.h264.1080p.FLAC2.0-SonicBoom, with error: Unable to match HI.SCORE.GIRL.II.S01E01-E09.Blu-ray.MKV.h264.1080p.FLAC2.0-SonicBoom to a series in your database. Parser result: language: hi, title: SCORE GIRL II, season: 1, episode: [1, 2, 3, 4, 5, 6, 7, 8, 9], source: Blu-ray, container: mkv, video_codec: H.264, screen_size: 1080p, audio_codec: FLAC, audio_channels: 2.0, release_group: SonicBoom, type: episode, parsing_time: 0.37002110481262207, absolute_episode: [], quality: 1080p BluRay, total_time: 0.4190239906311035

Can this be fixed? Thank you!

Toilal (Member) commented Dec 16, 2020

You can configure the list of allowed languages (the "allowed_languages" option).

See https://guessit.readthedocs.io/en/latest/configuration.html#configuration

I have to admit that "Hi" should not be in this list by default, as it's quite a common word in English.

Toilal added the Bug label Dec 16, 2020

p0psicles commented Dec 17, 2020

I checked. But this is our current allowed_languages array:
    allowed_languages = [
        'de',
        'en',
        'es',
        'ca',
        'fr',
        'he',
        'hu',
        'it',
        'jp',
        'nl',
        'pl',
        'pt',
        'ro',
        'ru',
        'sv',
        'uk',
        'mul',  # multi language
        'und',  # undetermined
    ]

I also disabled our rebulk rules to be sure, but allowed_languages still doesn't seem to have any effect.
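
To make the expected behaviour concrete, here is a minimal sketch of what an allowlist filter would be expected to do with this release name. It is an illustration only and not guessit's implementation: the `KNOWN_CODES` table and the `find_language_tokens` helper are hypothetical names made up for this example, and the tokenising is deliberately naive.

```python
import re

# Hypothetical, tiny lookup table of two-letter codes (illustration only).
KNOWN_CODES = {"hi": "Hindi", "en": "English", "fr": "French", "de": "German"}


def find_language_tokens(release_name, allowed_languages=None):
    """Return tokens of release_name that look like language codes.

    If allowed_languages is given, codes outside that list are ignored.
    """
    # Split on the separators commonly found in release names.
    tokens = re.split(r"[.\s_-]+", release_name)
    matches = []
    for token in tokens:
        code = token.lower()
        if code not in KNOWN_CODES:
            continue  # not a language code we know about
        if allowed_languages is not None and code not in allowed_languages:
            continue  # known code, but filtered out by the allowlist
        matches.append(code)
    return matches


name = "HI.SCORE.GIRL.II.S01E01-E09.Blu-ray.MKV.h264.1080p.FLAC2.0-SonicBoom"
allowed = ["de", "en", "es", "ca", "fr", "he", "hu", "it", "jp", "nl",
           "pl", "pt", "ro", "ru", "sv", "uk", "mul", "und"]

print(find_language_tokens(name))           # ['hi']
print(find_language_tokens(name, allowed))  # []
```

Since 'hi' is not in our list, the second call is the result we would expect from the parser: no language match, leaving "HI" available as part of the title.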
