OneOffTech's

The Chunk List

Open-Source database of document parsers, their pricing, and capabilities

Cardinal

State of the Art Document Intelligence. Extract text, tables, and structure from any document with our powerful OCR technology. Choose from three specialized models to match your exact needs.

Cloud Self-hosted
Plan Price Quota Features
Pay as you go Usage $0.03/page
$0.03/page
  • Our flagship model included
  • Bounding box detection
  • Signatures recognition
  • Redlines extraction
  • Image metadata extraction
  • Barcode scanning
  • Handwriting recognition
Growth Subscription $1095/month
$0.016/page
60000 pages
  • Our flagship model included
  • Bounding box detection
  • Signatures recognition
  • Redlines extraction
  • Image metadata extraction
  • Barcode scanning
  • Handwriting recognition
  • Priority support via email and chat
  • HIPAA BAA available
  • Enhanced throughput with higher rate limits
  • US/EU regional data residency options
Growth Subscription $1795/month
$0.014/page
125000 pages
  • Our flagship model included
  • Bounding box detection
  • Signatures recognition
  • Redlines extraction
  • Image metadata extraction
  • Barcode scanning
  • Handwriting recognition
  • Priority support via email and chat
  • HIPAA BAA available
  • Enhanced throughput with higher rate limits
  • US/EU regional data residency options
Growth Subscription $4195/month
$0.010/page
350000 pages
  • Our flagship model included
  • Bounding box detection
  • Signatures recognition
  • Redlines extraction
  • Image metadata extraction
  • Barcode scanning
  • Handwriting recognition
  • Priority support via email and chat
  • HIPAA BAA available
  • Enhanced throughput with higher rate limits
  • US/EU regional data residency options

Chunkr

Chunkr AI is an API service to convert complex documents into LLM/RAG-ready data.

Cloud Self-hosted
Plan Price Quota Features
Free Usage $0/month 200 pages
  • No payment info required
  • Discord community support
Dev Subscription, Usage $375/month
$0.015 / credit (page)
25,000 credits
  • Priority support channel
Growth Subscription, Usage $750/month
$0.010 / credit (page)
75,000 credits
  • Dedicated founder support
Scale Subscription, Usage $2000/month
$0.008 / credit (page)
250,000 credits
  • Dedicated founder support

Datalab

Datalab builds state-of-the-art document intelligence models to convert complex PDFs and other unstructured formats into structured, machine-readable outputs — fast, accurately, and at scale.

Cloud Self-hosted
Plan Price Quota Features
Managed Usage $25 / month
$2 per 1000 OCR pages, $4 per 1000 markdown, layout, table recognition pages
15000 credits
  • Table detection
  • Markdown conversion
  • Reading order
  • OCR
  • Bounding box detection
  • Layout analysis
  • Standard email support
Self-hosted Subscription $5K/year 50,000 pages per month
  • Docker image for easy-to-run, single-GPU deployment
  • Self-serve checkout via Stripe
  • Priority email support
Custom Usage
  • Custom API or Self-Hosting needs
  • Throughput, latency, and accuracy tuning
  • Premium Support (Slack channel)
  • Custom rate limits
  • Training & Consulting
  • Custom Agreements (MSA, DPA, BAAs)

Landing AI Agentic Document Extraction

Intelligent Document Understanding with Visual Context. Convert decades of archived documents into LLM-ready data in hours rather than weeks.

Cloud
Plan Price Quota Features
Explore Usage pay as you go
$1 buys 100 credits
1000 credits
  • Parse
  • Field extraction
  • Visual grounding
  • Document splitting & classification
  • Multilingual documents
  • API & library access
Team - 27.5k Subscription, Usage $250/month
$1 buys 110 credits. Charged at $0.01 per credit
27,500 credits
  • Team management and shared usage
  • Unlimited API key creation with usage tracking
  • Email support
  • Zero data retention available
  • HIPAA-compliant processing with BAA agreement available
Team - 55 Subscription, Usage $500/month
$1 buys 110 credits. Charged at $0.01 per credit
55,000 credits
  • Team management and shared usage
  • Unlimited API key creation with usage tracking
  • Email support
  • Zero data retention available
  • HIPAA-compliant processing with BAA agreement available
Team - 110 Subscription, Usage $1000/month
$1 buys 110 credits. Charged at $0.01 per credit
110,000 credits
  • Team management and shared usage
  • Unlimited API key creation with usage tracking
  • Email support
  • Zero data retention available
  • HIPAA-compliant processing with BAA agreement available
Team - 165 Subscription, Usage $1500/month
$1 buys 110 credits. Charged at $0.01 per credit
165,000 credits
  • Team management and shared usage
  • Unlimited API key creation with usage tracking
  • Email support
  • Zero data retention available
  • HIPAA-compliant processing with BAA agreement available
Visionary - 260k Subscription, Usage $2000/month
$1 buys 130 credits. Charged at $0.01 per credit
260,000 credits
  • Team management and shared usage
  • Confidence scoring
  • Slack support
Visionary - 455k Subscription, Usage $3500/month
$1 buys 130 credits. Charged at $0.01 per credit
455,000 credits
  • Team management and shared usage
  • Confidence scoring
  • Slack support
Visionary - 650k Subscription, Usage $5000/month
$1 buys 130 credits. Charged at $0.01 per credit
650,000 credits
  • Team management and shared usage
  • Confidence scoring
  • Slack support
Enterprise Subscription, Usage custom
  • SaaS, VPL (Virtual Private LandingAI), VPC, and on-prem deployments
  • Custom processing pipeline
  • SLAs and uptime guarantees
  • Priority rate limits
  • Snowflake integration support

LlamaParse

LlamaParse is a highly accurate parser for complex documents like financial reports, research papers, and scanned PDFs.

Cloud
Plan Price Quota Features
Free Free 0 10000 credits -
Starter Subscription, Usage $50 /month
1.00 $/1000 credits US, 1.50 $/1000 credits EU
50000 credits -
Pro Subscription, Usage $500 /month
1.00 $/1000 credits US, 1.50 $/1000 credits EU
500000 credits -
Enterprise Subscription Custom
  • Customer-hosted deployment

Reducto

Turn documents into data. Build without constraints. Reducto combines the best of computer vision and new vision-language models to produce the most accurate, LLM-ready results.

Cloud
Plan Price Quota Features
Standard Usage $350/month
$0.020/credit
15000 credits
  • Intelligent Chunking
  • No page limits
Growth - 50k Subscription, Usage $840/month
$0.015 / credit
50,000 pages included
  • Studio Access
  • Priority support in Slack
  • Business Associate Agreement
  • Zero Data Retention
Growth - 150k Subscription, Usage $1950/month
$0.010 / credit
150,000 pages included
  • Studio Access
  • Priority support in Slack
  • Business Associate Agreement
  • Zero Data Retention
Growth - 300k Subscription, Usage $2825/month
$0.008 / credit
300,000 pages included
  • Studio Access
  • Priority support in Slack
  • Business Associate Agreement
  • Zero Data Retention
Enterprise Subscription, Usage custom
  • Custom SLAs
  • SSO and SAML Authentication
  • Data Processing Agreement
  • Priority Rate Limits
  • VPC and On Prem Deployments
  • Custom Processing Pipelines
  • EU/AU endpoints

Unstruct LLMWhisperer

LLMWhisperer is a technology that presents data from complex documents (different designs and formats) to LLMs in a way that they can best understand.

Cloud
Plan Price Quota Features
Pay as you go pay-as-you-go $1-15 /1000 pages
$0.01/page
100 pages/month
  • API or UI access for document processing
  • Access to latest VLMs for partitioning and enrichment
  • Balance speed, performance and cost with different partition strategies
  • Transform 80+ file types
  • HIPAA, SOC2 Type 2, GDPR and ISO 27001 compliance
  • Basic support
Enterprise Subscription Custom
  • Customer-hosted deployment

Unstructured

Unstructured provides a platform and tools to ingest and process unstructured documents for retrieval-augmented generation (RAG), agentic AI, and model fine-tuning.

Cloud Self-hosted
Plan Price Quota Features
Starter Subscription, Usage $500/month
$0.03/page
15000 pages/month
  • API or UI access for document processing
  • Access to latest VLMs for partitioning and enrichment
  • Balance speed, performance and cost with different partition strategies
  • Transform 80+ file types
  • HIPAA, SOC2 Type 2, GDPR and ISO 27001 compliance
  • Basic support
Team Subscription Custom
  • Team accounts with enterprise grade Role Based Access Control
  • Access to workspaces
  • 24/7 connector maintenance
  • Robust error handling and observability
  • Access to the Unstructured support portal
Enterprise Subscription Custom
  • Dedicated Instance or Customer-hosted deployment (VPC)
  • Support for fine-tuned, private models
  • Modular plugin architecture for custom integrations
  • Dedicated customer success team