textsense-ocr / app.py

Commit History

Implement PP-OCRv5 using official model names
edb3860
Running

Marc Allen Lopez commited on

Enable PP-OCRv5 support with fallback
e4fceaf

Marc Allen Lopez commited on

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors
5b041e6

Marc Allen Lopez commited on

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n
aafb1d3

Marc Allen Lopez commited on

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality
88f879c

Marc Allen Lopez commited on

Fix Qwen2-VL model loading - use correct model class
e0b420c

Marc Allen Lopez commited on

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct
778d70f

Marc Allen Lopez commited on

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies
cef0c83

Marc Allen Lopez commited on

OCR: switch to TrOCR printed model and better decoding (beam search)
30a9f52

Marc Allen Lopez commited on

Improve error handling for network issues and connection failures
2b73fdf

Marc Allen Lopez commited on

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction
198cd52

Marc Allen Lopez commited on