upstage-document-parser
0.0.1

Parse documents with Upstage Document Parser

grayashh/upstage-document-parser · 273 installs

Upstage Document Parsing Plugin for Dify

A powerful document parsing plugin for the Dify platform that leverages the Upstage Document Parse API to convert various document formats into structured Markdown, HTML, or plain text.

Features

  • Broad format support: Handles PDF, DOCX, and various image formats
  • Intelligent document understanding: Extracts text, tables, charts, and figures while preserving original structure
  • Multiple output formats: Converts documents to Markdown, HTML, or plain text
  • Efficient caching: Content-based caching prevents reprocessing of identical files
  • OCR capability: Extracts text from scanned documents and images
  • Chart recognition: Identifies and extracts charts from documents
  • Batch processing: Efficiently processes multi-page documents
  • Coordinate extraction: Obtains bounding box coordinates for document elements

Installation

This plugin is under active development.

Configure the plugin in the Dify platform.

Configuration

Required credentials

The plugin requires the following credentials:

Parameter options

You can configure the following parameters when using the tool:

  • : Output format (options: "md", "html", "text")
  • : Whether to return the result as a file or text (options: "file", "text")
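
The two options above could be checked before a call roughly as follows. This is only a sketch: the names `output_format`, `result_type`, and `validate_options`, as well as the default values, are hypothetical, since the plugin's actual parameter names are not shown here. Only the allowed values come from the list above.

```python
# Allowed values as listed in the parameter options above.
ALLOWED_OUTPUT_FORMATS = ("md", "html", "text")
ALLOWED_RESULT_TYPES = ("file", "text")


def validate_options(output_format: str = "md", result_type: str = "text") -> dict:
    """Normalize both options, raising ValueError on an unsupported value.

    The parameter names and defaults are assumptions made for illustration.
    """
    fmt = output_format.strip().lower()
    kind = result_type.strip().lower()
    if fmt not in ALLOWED_OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {output_format!r}")
    if kind not in ALLOWED_RESULT_TYPES:
        raise ValueError(f"unsupported result type: {result_type!r}")
    return {"output_format": fmt, "result_type": kind}
```

Rejecting an unsupported value up front avoids spending an API call on a request whose result could not be rendered in the requested form.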

Usage

In a Dify application

  1. Add the Upstage Document Parse tool to your application.
  2. Configure the required credentials.
  3. Use the tool within your application flow to process documents.

Direct usage in Python

You can also use the client directly in Python:

API Parameters

The plugin uses the following parameters when calling the Upstage Document Parse API:

| Parameter | Type | Description | Default |
| | File | The document file to process | Required |
| | String | Control OCR behavior: "auto" (applies only to images) or "force" (convert everything to images first) | "auto" |
| | Boolean | Whether to return bounding box coordinates | false |
| | Boolean | Whether to use chart recognition | true |
| | List[String] | Formats for layout elements: "text", "html", "markdown" | ["html", "markdown", "text"] |
| | String | Model used for inference | "document-parse-250618" |
| | List[String] | Layout categories to provide as base64-encoded strings | ["table", "figure", "chart"] |
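
As an illustration of how these defaults might be assembled into request fields, here is a minimal sketch. The field names (`ocr`, `coordinates`, `chart_recognition`, `output_formats`, `model`, `base64_encoding`) and the helper `build_form_fields` are assumptions, as is the choice to serialize booleans and lists as JSON text. Only the types, allowed values, and defaults come from the table above. The document file itself would be attached separately as an upload.

```python
import json

# Defaults from the table above; the field names are assumed, not confirmed.
DEFAULT_FIELDS = {
    "ocr": "auto",
    "coordinates": False,
    "chart_recognition": True,
    "output_formats": ["html", "markdown", "text"],
    "model": "document-parse-250618",
    "base64_encoding": ["table", "figure", "chart"],
}


def build_form_fields(**overrides) -> dict:
    """Merge overrides into the defaults and encode every value as a string."""
    unknown = set(overrides) - set(DEFAULT_FIELDS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    merged = {**DEFAULT_FIELDS, **overrides}
    if merged["ocr"] not in ("auto", "force"):
        raise ValueError("ocr must be 'auto' or 'force'")
    encoded = {}
    for name, value in merged.items():
        # Strings pass through; booleans and lists become JSON text.
        encoded[name] = value if isinstance(value, str) else json.dumps(value)
    return encoded
```

For example, `build_form_fields(ocr="force", coordinates=True)` keeps the remaining defaults and returns string values that could accompany the uploaded file in a form-style request.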

Caching Mechanism

The plugin caches parsed results so that identical files are not sent to the API again:

  1. File content hashing to identify duplicate documents
  2. Result caching based on content hash and output format
  3. TTL-based cache expiration (default: 1 hour)
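
The three steps above can be sketched as a small in-memory cache. This is not the plugin's actual implementation: the class name `ResultCache`, the choice of SHA-256, and the in-memory dictionary are assumptions. Only the keying by content hash plus output format and the one-hour default TTL come from the list above.

```python
import hashlib
import time


class ResultCache:
    """In-memory cache keyed by (content hash, output format) with TTL expiry."""

    def __init__(self, ttl_seconds: float = 3600.0, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._entries = {}   # key -> (expires_at, result)

    @staticmethod
    def make_key(content: bytes, output_format: str) -> str:
        # Hash the file bytes so identical content maps to the same key,
        # regardless of file name. SHA-256 is an assumed choice.
        digest = hashlib.sha256(content).hexdigest()
        return f"{digest}:{output_format}"

    def get(self, content: bytes, output_format: str):
        key = self.make_key(content, output_format)
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, result = entry
        if self._clock() >= expires_at:
            # Entry has outlived its TTL: drop it and report a miss.
            del self._entries[key]
            return None
        return result

    def put(self, content: bytes, output_format: str, result) -> None:
        key = self.make_key(content, output_format)
        self._entries[key] = (self._clock() + self._ttl, result)
```

Because the output format is part of the key, the same file parsed to Markdown and to HTML occupies two separate entries, and a renamed copy of a file still hits the cache since only its bytes are hashed.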

Examples

Convert a PDF to Markdown

Handling large documents

Development

Project structure

  • : Main Dify plugin integration
  • : Core client that interacts with the Upstage API
  • : Python dependencies

Category: Tool
Version: 0.0.1
Published: grayashh · 08/11/2025 06:12 AM
Requirements: Tool invocation, App invocation, Endpoint registration
Maximum memory: 256MB