textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors
5b041e6
Marc Allen Lopezcommited on
Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n
aafb1d3
Marc Allen Lopezcommited on
Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality
88f879c
Marc Allen Lopezcommited on
Fix Qwen2-VL model loading - use correct model class
e0b420c
Marc Allen Lopezcommited on
Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct
778d70f
Marc Allen Lopezcommited on
Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies
cef0c83
Marc Allen Lopezcommited on
OCR: switch to TrOCR printed model and better decoding (beam search)
30a9f52
Marc Allen Lopezcommited on
Improve error handling for network issues and connection failures
2b73fdf
Marc Allen Lopezcommited on
Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction