A reliable PDF parser that extracts text from PDFs and handles image-based or scanned PDFs via OCR. Output is returned as Pydantic models with rich metadata, giving LLMs the context they need for analysis.
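
To make the idea of structured output concrete, here is a minimal sketch of how parsed pages and their metadata could be modelled with Pydantic. The class and field names (`ParsedDocument`, `ParsedPage`, `PageMetadata`, `extraction_method`, `ocr_confidence`) are hypothetical and chosen for illustration only; they are not this project's actual API, and the real models may carry different fields.

```python
from typing import List, Optional

from pydantic import BaseModel, Field


class PageMetadata(BaseModel):
    """Hypothetical per-page metadata; the real schema may differ."""

    page_number: int
    extraction_method: str  # assumed values: "text" or "ocr"
    ocr_confidence: Optional[float] = None  # only set for OCR pages


class ParsedPage(BaseModel):
    """One page of extracted text plus its metadata."""

    text: str
    metadata: PageMetadata


class ParsedDocument(BaseModel):
    """Hypothetical top-level result for one PDF."""

    source: str
    pages: List[ParsedPage] = Field(default_factory=list)

    def full_text(self) -> str:
        # Join page texts with blank lines so page boundaries stay visible.
        return "\n\n".join(page.text for page in self.pages)


# Build a result by hand to show the shape; a real parser would fill this in.
doc = ParsedDocument(
    source="example.pdf",
    pages=[
        ParsedPage(
            text="Embedded text layer.",
            metadata=PageMetadata(page_number=1, extraction_method="text"),
        ),
        ParsedPage(
            text="Text recovered by OCR.",
            metadata=PageMetadata(
                page_number=2, extraction_method="ocr", ocr_confidence=0.91
            ),
        ),
    ],
)

# Metadata lets downstream code (or an LLM prompt) single out OCR pages.
ocr_pages = [p.metadata.page_number for p in doc.pages if p.metadata.extraction_method == "ocr"]
print(doc.full_text())
print(ocr_pages)
```

Keeping the extraction method and confidence next to each page's text is one way to let an LLM prompt flag OCR-derived passages as less trustworthy than text read from an embedded text layer.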