Supported Languages #41
-
Hi, What languages are supported?
Thank you |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 2 replies
-
We should distinguish between 1) programmatic documents and 2) scanned documents. In the first case, we are language independent, we have tested Asian languages with good success. We are actually extending Docling with a simpler way to change OCR backend and customize the parameters. For the moment changing the config requires you to make a new |
Beta Was this translation helpful? Give feedback.
This comment was marked as spam.
This comment was marked as spam.
This comment was marked as spam.
This comment was marked as spam.
This comment was marked as spam.
This comment was marked as spam.
-
EasyOCR does NOT support Hebrew. Any work-around to process Hebrew image files in Docling? Thank you. |
Beta Was this translation helpful? Give feedback.
We should distinguish between 1) programmatic documents and 2) scanned documents.
In the first case, we are language independent, we have tested Asian languages with good success.
In the second case, we depend on the underlying OCR engine. At the moment we have binding for EasyOCR which has support for 80+ languages. On their website you find the language parameters to provide.
We are actually extending Docling with a simpler way to change OCR backend and customize the parameters. For the moment changing the config requires you to make a new
ModelPipeline
object.