Back to Tools

Document Data Extractor

Extract fields from PDFs, images, and Word docs with pixel-exact bounding boxes in a standalone interactive HTML viewer.

Click to launch Document Data Extractor

How It Works

Document Data Extractor takes a document (PDF, image, or Word doc) and a natural-language description of the fields you need, then produces a standalone interactive HTML file showing the original document with editable bounding boxes drawn around each extracted field.

The key insight: instead of asking a vision model to guess pixel coordinates, the skill splits the work cleanly—Claude handles the semantics (deciding which text is the invoice number, vendor, etc.) while a Python script handles the geometry (using the PDF text layer or OCR to get pixel-exact coordinates).

Privacy by design: All extraction runs locally through Claude Code. The output HTML is self-contained and runs in your browser with no external dependencies at view time.

Key Features

Tutorial: Extract Fields with Claude Code

1
Install the Claude Code skill

Download the Document Field Extractor skill and add it to your Claude Code skills directory.

2
Provide your document and field list

Point the skill at your document and describe the fields you need. For example: "Extract the vendor name, invoice number, date, and total from this PDF."

3
AI extracts and locates fields

Claude identifies field values semantically, then a Python script maps each value to pixel-exact coordinates using the document's text layer or OCR.

4
Review in the interactive viewer

Open the generated HTML file to see your document with color-coded bounding boxes. Click any field in the sidebar to jump to its location. Drag boxes to adjust, or draw new ones for anything the AI missed.

Open source under the MIT License. Free to use, modify, and distribute.