Commit 91c8300

Merge pull request #50 from RapidAI/add_slanet_plus_teds
chore: update teds test for slanet plus
2 parents 1d5b3b0 + 7e7cd9f commit 91c8300

1 file changed: +18 -12 lines changed

README.md

Lines changed: 18 additions & 12 deletions
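@@ -13,11 +13,10 @@
 </div>
 
 ### Recent updates
-- **2024.9.26**
-  - Fixed incorrect evaluation results caused by RapidTable defaulting to the English model.
-  - Added more evaluation data and evaluation results for more models from the open-source community.
 - **2024.9.30**
   - Improved table-type detection and added an online demo.
+- **2024.10.13**
+  - Added evaluation results for the latest paddlex-SLANet-plus (the model is already integrated into the rapid_table repository).
 
 ### Introduction
 💖 This repository is an inference library for structured recognition of tables in documents, including table recognition models from paddle,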
@@ -48,16 +47,21 @@
 
 : StructEqTable outputs LaTeX; only outputs that were successfully converted to HTML, with style tags removed, are evaluated
 
-| Method                                                                                                                     | TEDS        | TEDS-only-structure |
-|:---------------------------------------------------------------------------------------------------------------------------|:-----------:|:-------------------:|
-| [deepdoctection(rag-flow)](https://github.com/deepdoctection/deepdoctection?tab=readme-ov-file)                            | 0.59975     | 0.69918             |
-| [ppstructure_table_master](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure)                                | 0.61606     | 0.73892             |
-| [ppstructure_table_engine](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure)                                | 0.67924     | 0.78653             |
-| table_cls + wired_table_rec v1 + lineless_table_rec                                                                        | 0.68507     | 0.75140             |
-| [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy)                                                | 0.67310     | **0.81210**         |
-| [RapidTable](https://github.com/RapidAI/RapidStructure/blob/b800b156015bf5cd6f5429295cdf48be682fd97e/docs/README_Table.md) | 0.71654     | 0.81067             |
-| table_cls + wired_table_rec v2 + lineless_table_rec                                                                        | **0.73702** | 0.80210             |
+| Method                                                                                          | TEDS        | TEDS-only-structure |
+|:------------------------------------------------------------------------------------------------|:-----------:|:-------------------:|
+| [deepdoctection(rag-flow)](https://github.com/deepdoctection/deepdoctection?tab=readme-ov-file) | 0.59975     | 0.69918             |
+| [ppstructure_table_master](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure)     | 0.61606     | 0.73892             |
+| [ppstructure_table_engine](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure)     | 0.67924     | 0.78653             |
+| table_cls + wired_table_rec v1 + lineless_table_rec                                             | 0.68507     | 0.75140             |
+| [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy)                     | 0.67310     | 0.81210             |
+| [RapidTable(SLANet)](https://github.com/RapidAI/RapidTable)                                     | 0.71654     | 0.81067             |
+| table_cls + wired_table_rec v2 + lineless_table_rec                                             | 0.73702     | 0.80210             |
+| [RapidTable(SLANet-plus)](https://github.com/RapidAI/RapidTable)                                | **0.84481** | **0.91369**         |
 
+### Usage recommendations
+wired_table_rec_v2 (highest accuracy on wired tables): wired tables in general scenarios (papers, magazines, journals, receipts, vouchers, bills)
+
+paddlex-SLANet-plus (highest overall accuracy): tables in document scenarios (tables in papers, magazines, and journals)
 
 ### Installation
 
@@ -153,6 +157,8 @@ flowchart TD
 
 ### Acknowledgements
 
+[PaddleX table recognition](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/table_structure_recognition.md)
+
 [PaddleOCR table recognition](https://github.com/PaddlePaddle/PaddleOCR/blob/4b17511491adcfd0f3e2970895d06814d1ce56cc/ppstructure/table/README_ch.md)
 
 [Duguang table structure recognition (wired tables)](https://www.modelscope.cn/models/damo/cv_dla34_table-structure-recognition_cycle-centernet/summary)
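
Reviewer's sketch: the updated benchmark table and the two usage recommendations added by this commit can be checked with a few lines of Python. The scores below are copied from the new table, with the `ppstructure_table_engine` label spelled correctly. The names `RESULTS`, `best_method`, `gain`, `recommend`, and the scenario keys `"general_wired"` and `"document"` are hypothetical, introduced only for illustration; they are not part of RapidTable or any other library.

```python
# Scores copied from the README table as updated in this commit:
# method -> (TEDS, TEDS-only-structure)
RESULTS = {
    "deepdoctection(rag-flow)": (0.59975, 0.69918),
    "ppstructure_table_master": (0.61606, 0.73892),
    "ppstructure_table_engine": (0.67924, 0.78653),
    "table_cls + wired_table_rec v1 + lineless_table_rec": (0.68507, 0.75140),
    "StructEqTable": (0.67310, 0.81210),
    "RapidTable(SLANet)": (0.71654, 0.81067),
    "table_cls + wired_table_rec v2 + lineless_table_rec": (0.73702, 0.80210),
    "RapidTable(SLANet-plus)": (0.84481, 0.91369),
}

TEDS, TEDS_STRUCTURE = 0, 1  # column indexes into the score tuples


def best_method(metric: int) -> str:
    """Return the method with the highest score for the given column."""
    return max(RESULTS, key=lambda name: RESULTS[name][metric])


def gain(new: str, old: str, metric: int) -> float:
    """Absolute score difference between two methods for one column."""
    return RESULTS[new][metric] - RESULTS[old][metric]


def recommend(scenario: str) -> str:
    """Hypothetical helper mirroring the README's usage recommendations."""
    recommendations = {
        # wired tables in general scenarios (papers, receipts, bills, ...)
        "general_wired": "wired_table_rec_v2",
        # tables inside documents (papers, magazines, journals)
        "document": "paddlex-SLANet-plus",
    }
    if scenario not in recommendations:
        raise ValueError(f"unknown scenario: {scenario!r}")
    return recommendations[scenario]


if __name__ == "__main__":
    print("best TEDS:", best_method(TEDS))
    print("best structure-only TEDS:", best_method(TEDS_STRUCTURE))
    delta = gain("RapidTable(SLANet-plus)", "RapidTable(SLANet)", TEDS)
    print(f"SLANet-plus vs SLANet, TEDS gain: {delta:.5f}")
```

On these numbers, RapidTable(SLANet-plus) leads both columns, and its gain over RapidTable(SLANet) is about 0.128 TEDS and about 0.103 on TEDS-only-structure, which is why the bold markers move to the new row in the diff.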
