Skip to content

Commit 4fda050

Browse files
committed
fix: readme updates
1 parent 5adb8b3 commit 4fda050

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ The API is built with FastAPI and uses Celery for asynchronous task processing.
88

99
## Features:
1010
- **No Cloud/external dependencies** all you need: PyTorch based OCR (EasyOCR) + Ollama are shipped and configured via `docker-compose` no data is sent outside your dev/server environment,
11-
- **PDF/Office to Markdown** conversion with very high accuracy using different OCR strategies including [llama3.2-vision](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/), [easyOCR](https://github.com/JaidedAI/EasyOCR), [minicpm-v](https://github.com/OpenBMB/MiniCPM-o?tab=readme-ov-file#minicpm-v-26)
11+
- **PDF/Office to Markdown** conversion with very high accuracy using different OCR strategies including [llama3.2-vision](https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/), [easyOCR](https://github.com/JaidedAI/EasyOCR), [minicpm-v](https://github.com/OpenBMB/MiniCPM-o?tab=readme-ov-file#minicpm-v-26), [marker-pdf](https://github.com/VikParuchuri/marker)
1212
- **PDF/Office to JSON** conversion using Ollama supported models (eg. LLama 3.1)
1313
- **LLM Improving OCR results** LLama is pretty good with fixing spelling and text issues in the OCR text
1414
- **Removing PII** This tool can be used for removing Personally Identifiable Information out of document - see `examples`
@@ -467,7 +467,7 @@ apiClient.uploadFile(formData).then(response => {
467467
- **Method**: POST
468468
- **Parameters**:
469469
- **file**: PDF, image or Office file to be processed.
470-
- **strategy**: OCR strategy to use (`llama_vision`, `minicpm_v` or `easyocr`).
470+
- **strategy**: OCR strategy to use (`llama_vision`, `minicpm_v`, `marker` or `easyocr`). See the [available strategies](#text-extract-stratgies)
471471
- **ocr_cache**: Whether to cache the OCR result (true or false).
472472
- **prompt**: When provided, will be used for Ollama processing the OCR result
473473
- **model**: When provided along with the prompt - this model will be used for LLM processing
@@ -486,7 +486,7 @@ curl -X POST -H "Content-Type: multipart/form-data" -F "file=@examples/example-m
486486
- **Method**: POST
487487
- **Parameters** (JSON body):
488488
- **file**: Base64 encoded PDF file content.
489-
- **strategy**: OCR strategy to use (`llama_vision`, `minicpm_v` or `easyocr`).
489+
- **strategy**: OCR strategy to use (`llama_vision`, `minicpm_v`, marker or `easyocr`). See the [available strategies](#text-extract-stratgies)
490490
- **ocr_cache**: Whether to cache the OCR result (true or false).
491491
- **prompt**: When provided, will be used for Ollama processing the OCR result.
492492
- **model**: When provided along with the prompt - this model will be used for LLM processing.

0 commit comments

Comments
 (0)