README.md
Lines changed: 9 additions & 7 deletions
@@ -197,17 +197,19 @@ LLama 3.2 Vision Strategy is licensed on [Meta Community License Agreement](http
197
197
Enabled by default. Use the `strategy=llama_vision` CLI and URL parameter to select it. It is also the default strategy.
198
198
199
199
200
-
### `marker`
200
+
### `remote`
201
201
202
-
[Marker, state of the art PDF OCR](https://github.com/VikParuchuri/marker) - works really great for more than 50 languages, including great accuracy for Polish and other languages - let's say that are "diffult" to readfor standard OCR.
202
+
Some OCR's - like [Marker, state of the art PDF OCR](https://github.com/VikParuchuri/marker) - works really great for more than 50 languages, including great accuracy for Polish and other languages - let's say that are "diffult" to read for standard OCR.
203
203
204
204
The `marker-pdf` package is, however, licensed under the GPL3 license and **therefore it's not included** by default in this application (as we're bound to MIT).
205
205
206
206
The weights for the models are licensed cc-by-nc-sa-4.0, but I will waive that for any organization under $5M USD in gross revenue in the most recent 12-month period AND under $5M in lifetime VC/angel funding raised. You also must not be competitive with the Datalab API. If you want to remove the GPL license requirements (dual-license) and/or use the weights commercially over the revenue limit, check out the options here.
207
207
208
-
To have it up and running please execute the following steps:
208
+
To have it up and running you can execute the following steps:
209
209
210
210
```bash
211
+
mkdir marker-distribution # this should be outside of the `text-extract-api` folder!
212
+
cd marker-distribution
211
213
pip install marker-pdf
212
214
pip install -U uvicorn fastapi python-multipart
213
215
marker_server --port 8002
@@ -216,16 +218,16 @@ marker_server --port 8002
216
218
**Note:** you might run `marker_server` on a different port - then just make sure you export the proper env setting before starting the `text-extract-api` server:
raise Exception("Failed to generate text with Marker PDF API. Make sure marker-pdf server is up and running: marker_server --port 8002. Details: https://github.com/VikParuchuri/marker")