An intelligent data analysis tool based on LLM and natural language interaction, featuring built-in data query and data visualization, as well as report generation capabilities.
Author: ChartGen AI
Version: 2.0.4
Type: tool
This plugin enables codeless data analysis through natural language interaction. It supports Text2SQL, Text2Data, and Text2Code analysis. Simply upload Excel/CSV files to automatically execute data queries, data interpretation, data cleaning, and data visualization (ChatBI).
New support for multi-sheet queries and cross-sheet analysis, capable of automatically recognizing and parsing structured data in multiple worksheets, improving multi-sheet data processing capabilities.The plugin will intelligently parse time, metrics, and analytical dimensions through conversational queries , then generate SQL queries for data, and create interactive BI charts, structured analysis reports, data cleaning operations, and data quality assessments. Optimized for standardized vertical datasets, powered by enterprise-grade analytics engine for reliable results.
This plugin is supported by ChartGen AI
You can easily create and manage your API Key in the ChartGen AI - API. To begin with, You need to register for an ChartGen AI account.
Once on the homepage, click the bottom left corner to access the API management dashboard.
[Image blocked: apply_apikey_1.png]
Here, you can create new APIs and set the credit consumption limit for each API. A single account can create up to 10 APIs.
[Image blocked: apply_apikey_2.png]
After successful creation, you can copy the API Key to Dify for verification. You can also view the credit consumption of each API and manage your APIs.
[Image blocked: apply_apikey_3.png]
The tools could be found at the plugin Marketplace, please install it.
[Image blocked: set_apikey.png]
[Image blocked: apply_apikey_4.png]
The following are the parameter descriptions and usage scenario examples of each tool.
Used to connect mainstream databases such as MySQL, PostgreSQL, Starrocks and Doris, allowing users to query database data using natural language. Once data is retrieved, it can be seamlessly integrated with our other tools for analysis, interpretation, and visualization.
The query results support downloading as an .xlsx file for easier local viewing and further processing.
💡 If you want the output to include files, please ensure to add the ' files ' output type in the last component of the flow to get the download link.
Note: For optimal browsing experience, results are limited to 100 rows by default. When working with large datasets, user may retrieve the full dataset by using the intelligently generated SQL query provided by the tool.
Example input: For the database with url="mysql+pymysql://aaaadmin:[email protected]:11110/dify?charset=utf8", fill in the parameters as shown in the following figures.
[Image blocked: data_connector_1.png]
[Image blocked: data_connector_2.png]
[Image blocked: data_connector_3.png]
Common Precautions:
Note: Only one of input_data or file is needed. If both are provided, file takes precedence. File types support both row-metric-column data files and column-metric-row data files.
[Image blocked: data_analysis_1.png]
[Image blocked: data_analysis_4.png]
The query results support downloading as an .docx file for easier local viewing and further processing.
💡 If you want the output to include files, please ensure to add the ' files ' output type in the last component of the flow to get the download link.
Note: Only one of input_data or file is needed. If both are provided, file takes precedence.
[Image blocked: data_interpretation_1.png]
[Image blocked: data_interpretation_4.png]
The query results support downloading as an .docx file for easier local viewing and further processing.
💡 If you want the output to include files, please ensure to add the ' files ' output type in the last component of the flow to get the download link.
Note: Only one of input_data or file is needed. If both are provided, file takes precedence. File types support both row-metric-column data files and column-metric-row data files.
[Image blocked: data_visualization_1.png]
[Image blocked: data_visualization_4.png]
The query results support downloading as an .html file for easier local viewing and further processing.
💡 If you want the output to include files, please ensure to add the ' files ' output type in the last component of the flow to get the download link.
Used to parse the time required for analysis based on the problem description
Note: Any time range excludes today and future dates. When the user asks about the last 7 days, the end time of the returned does not include today, and it is calculated backwards 7 days from yesterday.
[Image blocked: time_identify_1.png]
Merge multiple files into a single file with multiple worksheets.
Note: The uploaded files must meet the size and quantity requirements of the Dify platform.
[Image blocked: merge_to_multisheet_1.png] [Image blocked: merge_to_multisheet_2.png]
This tool performs data cleaning through natural language interaction. Users can describe data cleaning requirements in natural language, and the system will automatically process the data based on the description. This tool supports Markdown format data input and file upload (xlsx/xls/csv).
Note: Only one of input_data or file is needed. If both are provided, file takes precedence. This tool can handle various data cleaning tasks, such as deleting duplicate items, handling missing values, data type conversion, etc. If no requirements are mentioned, it will be processed according to commonly used data cleaning methods.
This tool generates data quality reports through natural language interaction. Users can describe their data quality requirements in natural language, and the system will automatically analyze and generate comprehensive data quality reports. This tool supports Markdown format data input and file upload (xlsx/xls/csv).
Note: Only one of input_data or file is needed. If both are provided, file takes precedence. This tool analyzes various aspects of data quality, such as missing values, duplicates, data types, statistical summaries, and data distribution. The generated report can help users understand the overall quality of their data.
Contact us for inquiries or feedback.