Jina AI
Jina AI
Tool
extract_pdf
Extract figures, tables, and equations from PDF documents using layout detection. Perfect for extracting visual elements from academic papers on arXiv or any PDF URL. Returns base64-encoded images of detected elements with metadata.
Pricing
Per call
$0.01
Model
flat
Pay only for what you use. No subscriptions.
Inputs
id
stringmax_edge
numbertype
stringurl
stringTry It
API
MCP Config
Input Parameters
arXiv paper ID (e.g., '2301.12345' or 'hep-th/9901001'). Either id or url is required.
Maximum edge size for extracted images in pixels (default: 1024)
Filter by float types (comma-separated): figure, table, equation. If not specified, returns all types.
Direct PDF URL. Either id or url is required.

