Support Recognition for the following Currency Symbols #1012

Open
craitose opened this issue Apr 3, 2025 · 1 comment

Comments

@craitose

craitose commented Apr 3, 2025

Is your feature request related to a problem? Please describe.

Yes. When running a test that checks for various currency symbols, some of them are not recognized.

Describe the solution you'd like

I want the OCR to recognize the following currency symbols: {₡, ֏, ₨, ﷼, ₱, GH₵}.

Describe alternatives you've considered

Using a different Tesseract npm package that does support these currency symbols.

Additional context
Below is a list of the affected currencies, in the form Language - Currency: { ISO, symbol }

Spanish - Costa Rican Colon: { ISO: "CRC", symbol: "₡" },
Armenian - Dram: { ISO: "AMD", symbol: "֏" },
French - Seychellois Rupee: { ISO: "SCR", symbol: "₨" },
Arabic - Yemeni Rial: { ISO: "YER", symbol: "﷼" },
Spanish - Cuban Peso: { ISO: "CUP", symbol: "₱" },
English - Ghanaian Cedi: { ISO: "GHS", symbol: "GH₵" }
@Balearica
Member

Tesseract.js is the JavaScript/WebAssembly port of Tesseract. Tesseract.js intentionally inherits all code and default behaviors from the main Tesseract project, including the default language data. The language data used by default by Tesseract.js can be found here, and is copied directly from the main Tesseract project.

That being said, the language data used by Tesseract and Tesseract.js is simply a reasonable default. It is entirely possible to create and use custom language files that support any characters or language you like, or use language files created by other users. Instructions on how to use custom language files with Tesseract.js are here. For more information about language data, you should refer to the main Tesseract documentation, as the process of creating new language data is in the scope of the main Tesseract project rather than the JavaScript port.
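
As a rough illustration of the custom language file route, here is a minimal sketch. It assumes the Tesseract.js v5+ signature `createWorker(langs, oem, options)` with the `langPath` and `gzip` options, and an uncompressed `.traineddata` file hosted at a location you control. The helper names `missingSymbols` and `recognizeWithCustomData`, and the language name and path in the usage comment, are hypothetical and not part of Tesseract.js.

```javascript
// Currency symbols listed in this issue. "GH₵" is a multi-character
// string, so matching is done by substring rather than per character.
const EXPECTED_SYMBOLS = ['₡', '֏', '₨', '﷼', '₱', 'GH₵'];

// Hypothetical helper: return the expected symbols that do not appear
// anywhere in the OCR output text.
function missingSymbols(ocrText, expected = EXPECTED_SYMBOLS) {
  return expected.filter((symbol) => !ocrText.includes(symbol));
}

// Hypothetical helper: run OCR with custom language data.
// `createWorker` is passed in (e.g. from require('tesseract.js')) so the
// function can also be exercised with a stub.
// Assumptions: Tesseract.js v5+ call shape, OEM 1 (LSTM only), and a
// non-gzipped traineddata file named `${lang}.traineddata` under langPath.
async function recognizeWithCustomData(createWorker, image, lang, langPath) {
  const worker = await createWorker(lang, 1, { langPath, gzip: false });
  try {
    const { data } = await worker.recognize(image);
    return data.text;
  } finally {
    // Always release the worker, even if recognition throws.
    await worker.terminate();
  }
}

// Usage sketch (placeholder language name and path):
//   const { createWorker } = require('tesseract.js');
//   const text = await recognizeWithCustomData(
//     createWorker, 'receipt.png', 'currency', './lang-data');
//   console.log(missingSymbols(text));
```

Whether a given symbol is then recognized depends entirely on the characters the custom language data was trained on. The check above only reports which symbols are absent from the output.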
