README.md (11 additions & 13 deletions)
@@ -99,18 +99,27 @@ The following are examples and projects built by the community using Tesseract.j
 
 If you have a project or example repo that uses Tesseract.js, feel free to add it to this list using a pull request. Examples submitted should be well documented such that new users can run them; projects should be functional and actively maintained.
 
+## Major changes in v6
+Version 6 changes are documented in [this issue](https://github.com/naptha/tesseract.js/issues/993). Highlights are below.
+- Fixed memory leak in previous versions
+- Overall reductions in runtime and memory usage
+- Breaking changes:
+  - All output formats other than `text` are disabled by default.
+    - To re-enable the `hocr` output (for example), set the following: `worker.recognize(image, {}, { hocr: true })`
+  - Minor changes to the structure of the JavaScript object (`blocks`) output
+  - See [this issue](https://github.com/naptha/tesseract.js/issues/993) for full list
+
 ## Major changes in v5
 Version 5 changes are documented in [this issue](https://github.com/naptha/tesseract.js/issues/820). Highlights are below.
 
 - Significantly smaller files by default (54% smaller for English, 73% smaller for Chinese)
   - This results in a ~50% reduction in runtime for first-time users (who do not have the files cached yet)
 - Significantly lower memory usage
-- Compatible with iOS 17 (using default settings)
 - Breaking changes:
   - `createWorker` arguments changed
     - Setting non-default language and OEM now happens in `createWorker`
       - E.g. `createWorker("chi_sim", 1)`
-  - `worker.initialize` and `worker.loadLanguage` functions now do nothing and can be deleted from code
+  - `worker.initialize` and `worker.loadLanguage` functions should be deleted from code
   - See [this issue](https://github.com/naptha/tesseract.js/issues/820) for full list
 
 Upgrading from v2 to v5? See [this guide](https://github.com/naptha/tesseract.js/issues/771).
@@ -125,17 +134,6 @@ Version 4 includes many new features and bug fixes--see [this issue](https://git
 - `createWorker` is now async
 - `getPDF` function replaced by `pdf` recognize option
 
-## Major changes in v3
-- Significantly faster performance
-  - Runtime reduction of 84% for Browser and 96% for Node.js when recognizing the [example images](./examples/data)
-- Upgrade to Tesseract v5.1.0 (using emscripten 3.1.18)
-- Added SIMD-enabled build for supported devices
-- Added support:
-  - Node.js version 18
-- Removed support:
-  - ASM.js version, any other old versions of Tesseract.js-core (<3.0.0)
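
The v6 and v5 breaking changes listed in the first hunk can be illustrated with a short sketch. This is not the library's own code: `buildOutputOptions` and `recognizeWithFormats` are hypothetical helper names, and the shape of the result (`data.text`, `data.hocr`) is an assumption inferred from the `worker.recognize(image, {}, { hocr: true })` call quoted in the README. The helpers take the worker as a parameter, so they can be tried with a stub before wiring in a real one.

```javascript
// Hypothetical helper (not part of Tesseract.js): turns a list of format
// names into the object passed as the third argument of worker.recognize,
// for example ['hocr'] becomes { hocr: true }.
function buildOutputOptions(formats) {
  const output = {};
  for (const name of formats) {
    output[name] = true;
  }
  return output;
}

// Hypothetical helper: runs recognition with the requested extra output
// formats enabled. `worker` is any object exposing
// recognize(image, options, output), such as one returned by createWorker.
async function recognizeWithFormats(worker, image, formats) {
  const { data } = await worker.recognize(image, {}, buildOutputOptions(formats));
  return data;
}

// Usage with Tesseract.js v6 (assumes the package is installed and that
// 'image.png' is a placeholder path):
//   const { createWorker } = require('tesseract.js');
//   const worker = await createWorker('chi_sim', 1); // language and OEM are set here (v5+)
//   // No worker.loadLanguage() or worker.initialize() calls are needed.
//   const data = await recognizeWithFormats(worker, 'image.png', ['hocr']);
//   console.log(data.text, data.hocr);
//   await worker.terminate();
```

Passing the format list explicitly keeps the v6 default (only `text` enabled) visible at each call site, so a missing `hocr` result is easy to trace back to the options that were requested.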