HyperAI超神经

一、教程简介

Dolphin 是由字节跳动团队于 2025 年 5 月推出的多模态文档解析模型。该模型基于先解析结构后解析内容的两阶段方法第一阶段生成文档布局元素序列，第二阶段用元素作为锚点并行解析内容。 Dolphin 在多种文档解析任务上表现出色，性能超越 GPT-4.1 、 Mistral-OCR 等模型。相关论文成果为「Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting」。已被 ACL 2025 接受。

本教程采用资源为单卡 RTX 4090 。

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

若显示「Bad Gateway」，这表示模型正在初始化，由于模型较大，请等待约 1-2 分钟后刷新页面。

2. 使用示例

2.1. Document Recognition

结果

2.2. Element Recognition

结果

四、交流探讨

🖌️ 如果大家看到优质项目，欢迎后台留言推荐！另外，我们还建立了教程交流群，欢迎小伙伴们扫码备注【SD 教程】入群探讨各类技术问题、分享应用效果↓

引用信息

本项目引用信息如下：

@inproceedings{dolphin2025,
  title={Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting},
  author={Feng, Hao and Wei, Shu and Fei, Xiang and Shi, Wei and Han, Yingdong and Liao, Lei and Lu, Jinghui and Wu, Binghong and Liu, Qi and Lin, Chunhui and Tang, Jingqun and Liu, Hao and Huang, Can},
  year={2025},
  booktitle={Proceedings of the 65rd Annual Meeting of the Association for Computational Linguistics (ACL)}
}

一、教程简介

本教程采用资源为单卡 RTX 4090 。

引用信息

本项目引用信息如下：

@inproceedings{dolphin2025, title={Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting}, author={Feng, Hao and Wei, Shu and Fei, Xiang and Shi, Wei and Han, Yingdong and Liao, Lei and Lu, Jinghui and Wu, Binghong and Liu, Qi and Lin, Chunhui and Tang, Jingqun and Liu, Hao and Huang, Can}, year={2025}, booktitle={Proceedings of the 65rd Annual Meeting of the Association for Computational Linguistics (ACL)} }

一、教程简介

本教程采用资源为单卡 RTX 4090 。

引用信息

本项目引用信息如下：

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters

Command Palette

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters

Command Palette

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters

Command Palette

Dolphin 多模态文档图像解析

一、教程简介

二、项目示例

三、运行步骤

1. 启动容器后点击 API 地址即可进入 Web 界面

2. 使用示例

2.1. Document Recognition

2.2. Element Recognition

四、交流探讨

引用信息

用 AI 构建 AI

Hyper Newsletters