PDF Processing Service with OCR

A Sanic-based web service for processing PDF files with OCR text detection and insertion. It allows users to upload PDF files, which are processed asynchronously using PaddleOCR, and return the modified PDF with detected text embedded at the detected positions. The UI is built using Vue.js for simplicity.

Installation

Clone the repository:

git clone https://github.com/[your-repo]/pdf-ocr-service.git
cd pdf-ocr-service

Install dependencies (create a virtual environment first if preferred):

pip install -r requirements.txt  # Ensure requirements.txt includes:
# sanic sanic-jinja2 sanic-session paddleocr numpy pypdfium2

Install PaddleOCR models:

# The Latin language model is required (ensure your environment meets PaddleOCR's prerequisites)
pip install paddleocr[extra]

Usage

Running the Server

Start the Sanic application:
```
sanic app --port 5000
```
The server runs on http://localhost:5000.

Running the Server on docker

docker build -t ocr-pdf-tool .
docker run -it --rm -p 5000:5000 -v .paddleocr:/root/.paddleocr ocr-pdf-tool

Uploading PDF via Web UI:

Open the browser and go to http://localhost:5000.
Select a PDF file from your computer and click "Process PDF". The output PDF will be downloaded automatically.

Command-Line Usage (cURL):

curl -X POST -F 'file=@path/to/your/file.pdf' http://localhost:5000/process-pdf -o output.pdf

Dependencies

Package	Description
Sanic	Fast Python web server
PaddleOCR	OCR engine from PaddlePaddle
pypdfium2	PDF rendering/pdf modification library
numpy	Array processing
Vue.js (template)	Lightweight frontend framework

License

This project is licensed under the Apache License 2.0. See LICENSE for details.

Contributions

Contributions are welcome! For issues or feature requests, please open an issue.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
app		app
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDF Processing Service with OCR

Table of Contents

Installation

Usage

Running the Server

Running the Server on docker

Uploading PDF via Web UI:

Command-Line Usage (cURL):

Dependencies

License

Contributions

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

walter-bd/ocr-pdf-tool

Folders and files

Latest commit

History

Repository files navigation

PDF Processing Service with OCR

Table of Contents

Installation

Usage

Running the Server

Running the Server on docker

Uploading PDF via Web UI:

Command-Line Usage (cURL):

Dependencies

License

Contributions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages