This Python client is used to process a single PDF document by the GROBID service. Results are returned in an XML format.
You need first to install and start the grobid service, latest stable version, see the documentation. The default server host is localhost
and port is 8080
. The GrobidClient
can be configure via host and port.
This client has been developed and tested with Python 3.7.
pip install pygrobid
You can take a quick test via python tests.py pdf_file -h host -p port
In your code:
from grobid.client import GrobidClient
client = GrobidClient(host, port)
rsp = client.serve(service_name, pdf_file)
rsp = client.serve(service_name, pdf_file, consolidate_header=1)
- 0.1.6: support coordinates
This project is based on grobid-python-client by Patrice Lopez ([email protected])