mineru-document-extractor
MinerU document extraction CLI that converts PDFs, images, and web pages into Markdown, HTML, LaTeX, or DOCX via the MinerU API. Supports token-free flash extraction for a quick start, precision extraction with table and formula recognition, web crawling, batch processing, and piped workflows.
Changelog: Source: GitHub https://github.com/opendatalab/MinerU-Ecosystem