grobid
GROBID is a machine learning library that transforms scientific PDFs into structured XML/TEI formats, with functionalities such as header and reference extraction. Known for its accuracy, GROBID is used by platforms like Semantic Scholar and ResearchGate, offering API, Docker, and batch processing for efficient deployment across Linux and macOS.