Tesseract.js is the JavaScript/WebAssembly port of Tesseract. It intentionally inherits all code and default behaviors from the main Tesseract project, including the default language data. The default language data used by Tesseract.js can be found here and is copied directly from the main Tesseract project.
That being said, the language data used by Tesseract and Tesseract.js is simply a reasonable default. It is entirely possible to create and use custom language files that support any characters or language you like, or use language files created by other users. Instructions on how to use custom language files with Tesseract.js are here. For more information about language data, you should refer to the main Tesseract documentation, as the process of creating new language data is in the scope of the main Tesseract project rather than the JavaScript port.
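As a rough illustration of pointing Tesseract.js at custom language data, here is a minimal sketch. It assumes the Tesseract.js v5-style `createWorker(langs, oem, options)` signature and the `langPath` and `gzip` worker options; older versions initialize workers differently. The URL, the language code `custom`, and both function names are hypothetical placeholders, not values from this project.

```javascript
// Hypothetical values: replace with the location of your own .traineddata
// file and the language code it was trained under.
const CUSTOM_LANG_PATH = 'https://example.com/tessdata';
const CUSTOM_LANG = 'custom';

// Build worker options that point Tesseract.js at custom language data.
function customWorkerOptions(langPath) {
  return {
    langPath,     // directory or URL that contains <lang>.traineddata
    gzip: false,  // set to true if the file is served as <lang>.traineddata.gz
  };
}

// Run OCR on an image using the custom language data.
async function recognizeWithCustomLanguage(image) {
  // Required lazily so the options helper works without the dependency installed.
  const { createWorker } = require('tesseract.js');
  // Assumption: v5-style call; the second argument 1 selects the LSTM-only engine.
  const worker = await createWorker(
    CUSTOM_LANG,
    1,
    customWorkerOptions(CUSTOM_LANG_PATH)
  );
  try {
    const { data } = await worker.recognize(image);
    return data.text;
  } finally {
    // Always release the worker, even if recognition throws.
    await worker.terminate();
  }
}
```

Whether the symbols in question are actually recognized still depends on the traineddata file itself, which is created with the main Tesseract tooling.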
Is your feature request related to a problem? Please describe.
Yes. When running a test that checks for various currency symbols, some of them are not recognized.
Describe the solution you'd like
I want OCR to recognize the following currency symbols: {₡, ֏, ₨, ﷼, ₱, GH₵}.
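For reference, a check of this kind could look roughly like the sketch below. It only inspects a string of OCR output and does not call Tesseract.js itself; the helper name `findMissingSymbols` is hypothetical and not part of any library.

```javascript
// The symbols listed in this request.
const EXPECTED_SYMBOLS = ['₡', '֏', '₨', '﷼', '₱', 'GH₵'];

// Return the expected symbols that do not appear in the OCR output.
function findMissingSymbols(ocrText, expected = EXPECTED_SYMBOLS) {
  // NFC keeps compatibility characters such as ₨ and ﷼ intact;
  // NFKC would expand them into plain letters and hide a match.
  const text = ocrText.normalize('NFC');
  return expected.filter((symbol) => !text.includes(symbol.normalize('NFC')));
}

// Example: only ₱ and ₨ were recognized in this output.
console.log(findMissingSymbols('Total: ₱100 and ₨50'));
```

Feeding the recognized text for each test image into a helper like this makes it easy to report exactly which symbols the current language data misses.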
Describe alternatives you've considered
A different Tesseract npm package that supports these currency symbols.
Additional context
Below is a list of Languages - Countries : {ISO, symbols}