Commits · mlopez6132/textsense-ocr

Implement PP-OCRv5 using official model names

edb3860

Running

Marc Allen Lopez commited on Sep 15

Enable PP-OCRv5 support with fallback

e4fceaf

Marc Allen Lopez commited on Sep 15

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors

5b041e6

Marc Allen Lopez commited on Aug 13

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n

aafb1d3

Marc Allen Lopez commited on Aug 13

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality

88f879c

Marc Allen Lopez commited on Aug 13

Fix Qwen2-VL model loading - use correct model class

e0b420c

Marc Allen Lopez commited on Aug 12

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct

778d70f

Marc Allen Lopez commited on Aug 12

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies

cef0c83

Marc Allen Lopez commited on Aug 12

OCR: switch to TrOCR printed model and better decoding (beam search)

30a9f52

Marc Allen Lopez commited on Aug 12

Improve error handling for network issues and connection failures

2b73fdf

Marc Allen Lopez commited on Aug 12

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction

198cd52

Marc Allen Lopez commited on Aug 12

Spaces:

mlopez6132
/

textsense-ocr

Running

Commit History

Implement PP-OCRv5 using official model names

edb3860

Running

Enable PP-OCRv5 support with fallback

e4fceaf

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors

5b041e6

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n

aafb1d3

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality

88f879c

Fix Qwen2-VL model loading - use correct model class

e0b420c

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct

778d70f

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies

cef0c83

OCR: switch to TrOCR printed model and better decoding (beam search)

30a9f52

Improve error handling for network issues and connection failures

2b73fdf

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction

198cd52

Commit History

Implement PP-OCRv5 using official model names edb3860 Running

Enable PP-OCRv5 support with fallback e4fceaf

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors 5b041e6

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n aafb1d3

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality 88f879c

Fix Qwen2-VL model loading - use correct model class e0b420c

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct 778d70f

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies cef0c83

OCR: switch to TrOCR printed model and better decoding (beam search) 30a9f52

Improve error handling for network issues and connection failures 2b73fdf

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction 198cd52

Implement PP-OCRv5 using official model names

edb3860

Running

Enable PP-OCRv5 support with fallback

e4fceaf

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors

5b041e6

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n

aafb1d3

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality

88f879c

Fix Qwen2-VL model loading - use correct model class

e0b420c

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct

778d70f

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies

cef0c83

OCR: switch to TrOCR printed model and better decoding (beam search)

30a9f52

Improve error handling for network issues and connection failures

2b73fdf

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction

198cd52