LDX hub tools for document processing, translation refinement, and structured data extraction
Document AI for Dify workflows — structured data extraction and translation refinement, powered by leading LLMs through a unified gateway.

✅ Free to try — 25,000 credits/month, no credit card required
✅ One key for everything — OpenAI, Anthropic, Google, AWS, Azure, xAI
✅ 30-second sign-up — GitHub, Google, or email; your API key is shown immediately
LDX hub is a document AI gateway built by LDX Lab. This plugin brings two of its capabilities into Dify: StructFlow, for structured data extraction, and RefineLoop, for translation refinement.
Both tools support OpenAI, Microsoft Azure OpenAI, Google, Anthropic, Amazon Bedrock, and xAI through a single API and a single billing system. You only need one API key.
StructFlow turns unstructured text into structured JSON. You define an extraction schema with a system prompt and an example output, then run a JSONL of input records through the model of your choice.
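As an illustration of the JSONL input described above, here is a minimal sketch of how a batch of records might be prepared before handing it to the tool. The `id` and `text` field names and the helper functions are assumptions made for this example; they are not the plugin's documented schema.

```python
import json

def to_jsonl(records):
    """Serialize an iterable of dicts as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)

def from_jsonl(text):
    """Parse JSON Lines back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.split("\n") if line.strip()]

# Hypothetical input records; "id" and "text" are illustrative field names only.
records = [
    {"id": "note-001", "text": "Patient reports a sore throat for three days."},
    {"id": "note-002", "text": "患者は三日前から喉の痛みを訴えている。"},
]

jsonl = to_jsonl(records)
```

Using `ensure_ascii=False` keeps non-English text readable in the file, which matters when the input mixes languages.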

We have documented 8 real-world use cases across industries — patents, healthcare, finance, legal, customer support, HR, real estate, and e-commerce.
See examples/USE_CASES.md for the full list with prompts and sample inputs/outputs.
Input (one record from a JSONL file — clinical note):
Output (actual JSON extracted by StructFlow):
The extracted JSON captures eight distinct findings, including granular physical exam observations like "Marked erythema on the palatine tonsils" and "Tender anterior cervical lymphadenopathy". It also records the formal assessment and preserves the full prescription details with dosage and frequency.
A free-form clinical note becomes structured, queryable data. Physical findings, rapid test results, diagnostic assessment, and prescriptions are all extracted with their relevant context. StructFlow handles unstructured text in any language natively, supporting English, Japanese, Chinese, and others.
RefineLoop is built on StructFlow's iterative refinement engine, but specialized for translation quality review on XLIFF files. Each segment goes through multiple revision rounds where the AI critiques and improves the translation, with structured revision notes.

RefineLoop is built on the same iterative refinement engine as StructFlow. When you give it an XLIFF, RefineLoop groups all segments by source/target language pair across every element in the XLIFF, then dispatches them to the engine in a single batch per pair.
That means the number of engine invocations is bounded by — not by the number of elements. Segments that converge (i.e., produce the same translation as a previous revision) are dropped from subsequent rounds, so actual usage is often well below the worst case.
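The grouping and convergence behavior described above can be sketched roughly as follows. This is not the plugin's implementation: the segment dictionaries, their `source_lang` / `target_lang` / `id` keys, and both function names are hypothetical, chosen only to show the idea of batching per language pair and dropping segments whose latest revision repeats an earlier one.

```python
from collections import defaultdict

def group_by_language_pair(segments):
    """Collect segments into one batch per (source language, target language)."""
    batches = defaultdict(list)
    for seg in segments:
        batches[(seg["source_lang"], seg["target_lang"])].append(seg)
    return dict(batches)

def drop_converged(segments, history):
    """Keep only segments whose latest revision differs from all earlier ones.

    `history` maps a segment id to its list of revisions, oldest first.
    """
    active = []
    for seg in segments:
        revisions = history[seg["id"]]
        latest, earlier = revisions[-1], revisions[:-1]
        if latest not in earlier:
            active.append(seg)
    return active

# Hypothetical segments drawn from different parts of one XLIFF document.
segments = [
    {"id": "s1", "source_lang": "en", "target_lang": "ja"},
    {"id": "s2", "source_lang": "en", "target_lang": "de"},
    {"id": "s3", "source_lang": "en", "target_lang": "ja"},
]
batches = group_by_language_pair(segments)
```

In this sketch the three segments produce two batches, one for each language pair, regardless of where the segments sit in the file.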
XLIFF tag integrity is validated after every revision. If a revision breaks the tag structure, that revision is marked as failed and the next round retries, so your final XLIFF stays compatible with your CAT tool.
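To make the idea of a tag integrity check concrete, here is a simplified sketch that compares the inline tags of a segment before and after a revision. It is an assumption about how such a check could work, not the plugin's actual validator: it only compares tag counts with a regular expression, whereas a real check would likely also verify ordering and nesting. The `<g>` and `<x/>` tags in the sample strings are just examples.

```python
import re
from collections import Counter

# Matches anything that looks like an XML tag, e.g. <g id="1">, </g>, <x id="2"/>.
TAG_PATTERN = re.compile(r"<[^<>]+>")

def inline_tags(text):
    """Count each distinct tag string that appears in the segment text."""
    return Counter(TAG_PATTERN.findall(text))

def tags_preserved(original, revised):
    """Return True when the revision contains exactly the same tags as the original."""
    return inline_tags(original) == inline_tags(revised)

original = 'Click <g id="1">Save</g> to continue.<x id="2"/>'
good_revision = '<g id="1">保存</g>をクリックして続行します。<x id="2"/>'
bad_revision = '<g id="1">保存をクリックして続行します。'
```

A revision like `bad_revision`, which loses the closing tag and the placeholder, would be rejected under this sketch and retried in the next round.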
Real-world benchmark: A translation of the BERT paper (about 65,000 source characters; approximately 145,000 with XLIFF tags) was refined with Gemini 3 Flash Preview in 3 minutes 46 seconds end-to-end on Dify Cloud, including upload and download. Higher-accuracy models typically converge in fewer rounds and finish faster.

Sign up at https://gw.portal.ldxhub.io — free, no credit card. Sign up with GitHub, Google, or email; your API key is shown immediately.
Open the LDX hub plugin authorization screen and enter:

Your API key is encrypted by Dify (PKCS1_OAEP) and stored within your workspace.
Both tools currently support models from OpenAI, Microsoft Azure OpenAI, Google, Anthropic, Amazon Bedrock, and xAI.
The model lineup is maintained dynamically on the LDX hub side, but the plugin embeds the current list at build time. New models become available to your Dify workflow through plugin version updates.
The plugin polls the LDX hub API for job completion. For typical workloads this is sufficient — the BERT paper benchmark above completed in under 4 minutes.
For exceptionally long jobs that may exceed Dify's execution timeout, an optional webhook URL parameter is exposed. The plugin passes it to LDX hub so the server can notify a URL of your choice on completion.
Note: Server-side webhook delivery is currently being implemented and is not active yet. Synchronous polling works normally in the meantime.
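The synchronous polling approach mentioned above follows a common pattern, sketched below in generic form. The status strings (`completed`, `failed`), the default interval and timeout, and the `fetch_status` callable are all assumptions for illustration; the plugin's real status values and timing are not documented here.

```python
import time

def poll_until_done(fetch_status, interval=2.0, timeout=600.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status() repeatedly until the job finishes or the timeout passes.

    fetch_status is any zero-argument callable returning a status string.
    The terminal status names used here are hypothetical.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in ("completed", "failed"):
            return status
        if clock() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        sleep(interval)

# Example with a fake job that finishes on the third check.
fake_statuses = iter(["queued", "running", "completed"])
result = poll_until_done(lambda: next(fake_statuses), sleep=lambda seconds: None)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to test without real waiting. A caller that hits the timeout is the case the webhook option is meant to cover.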
This plugin transmits your data to LDX hub servers and the AI providers configured for your selected model. Your API key is encrypted by Dify (PKCS1_OAEP). The plugin itself stores no data and sends no telemetry.
See PRIVACY.md for details.
LDX hub is built by LDX Lab. It provides a unified API gateway for document AI processing across multiple LLM providers — one API, one key, one billing system.
For questions about the API itself, see https://ldxlab.io.
For questions or bug reports about this Dify plugin, please use GitHub Issues.