5 points | by pkarwatka 5 hours ago
Features:
- No cloud or external dependencies. All you need, PyTorch-based OCR (Marker) plus Ollama, is shipped and configured via docker-compose; no data is sent outside your dev/server environment.
- PDF to Markdown conversion with very high accuracy, using different OCR strategies including marker, surya-ocr, or tesseract.
- PDF to JSON conversion using Ollama-supported models (e.g. Llama 3.1).
- LLM-improved OCR results. Llama is pretty good at fixing spelling and text issues in the OCR text.
- Removing PII. This tool can be used to remove Personally Identifiable Information from PDFs (see examples).
- Distributed queue processing using Celery.
- Caching using Redis. The OCR results can be easily cached prior to LLM processing.
- CLI tool for sending tasks and processing results.
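To illustrate the idea of caching OCR output before the LLM step, here is a minimal sketch. It is not the project's actual code: `cache_key`, `ocr_with_cache`, and the `run_ocr` callable are hypothetical names, and a plain dict stands in for Redis so the example runs without any services.

```python
import hashlib
from typing import Callable, MutableMapping


def cache_key(pdf_bytes: bytes, strategy: str) -> str:
    """Build a key from the OCR strategy and a hash of the file contents.

    Hypothetical key layout; the real project may use a different scheme.
    """
    digest = hashlib.sha256(pdf_bytes).hexdigest()
    return f"ocr:{strategy}:{digest}"


def ocr_with_cache(
    pdf_bytes: bytes,
    strategy: str,
    run_ocr: Callable[[bytes], str],
    cache: MutableMapping[str, str],
) -> str:
    """Return cached OCR text if present, otherwise run OCR and store it."""
    key = cache_key(pdf_bytes, strategy)
    if key in cache:
        return cache[key]
    text = run_ocr(pdf_bytes)  # the expensive step we want to avoid repeating
    cache[key] = text
    return text


if __name__ == "__main__":
    # A dict stands in for Redis here (assumption for the sake of a runnable demo).
    store: dict = {}
    calls = []

    def fake_ocr(data: bytes) -> str:
        # Placeholder for a real OCR engine such as marker or tesseract.
        calls.append(1)
        return "extracted text"

    ocr_with_cache(b"%PDF-demo", "marker", fake_ocr, store)
    ocr_with_cache(b"%PDF-demo", "marker", fake_ocr, store)
    print(f"OCR ran {len(calls)} time(s) for 2 requests")
```

Keying on the file hash plus the strategy name means the same PDF can be re-sent to different LLM prompts without repeating OCR, while switching OCR strategy still produces a fresh result.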
Looking for some motivation to move the needle! Contributions are welcome.