Project Icon

grobid

Efficient Machine Learning-Based Structuring of Scientific PDFs

Product DescriptionGROBID is a machine learning library that transforms scientific PDFs into structured XML/TEI formats, with functionalities such as header and reference extraction. Known for its accuracy, GROBID is used by platforms like Semantic Scholar and ResearchGate, offering API, Docker, and batch processing for efficient deployment across Linux and macOS.
Project Details