測試幾個 ocr 對日語的識別情況

1. EasyOCR
2. PaddleOCR
3. Deepdoc（識別pdf中圖片）
4. Deepdoc（識別pdf中文字）
5. Nvidia neva-22b
6. Claude 3.5 sonnet 識別圖片中的文字
7. Claude 3.5 sonnet 識別 pdf 中表格
8. OpenAI gpt-4o 識別圖片中文字
9. OpenAI gpt-4o 識別 pdf 中表格

1. EasyOCR

github: https://github.com/JaidedAI/EasyOCR

jaided: https://www.jaided.ai/easyocr/

測試圖片：

在這里插入圖片描述
識別效果：

在這里插入圖片描述

結論：
效果不行

2. PaddleOCR

Github: https://github.com/PaddlePaddle/PaddleOCR

WebUI: https://aistudio.baidu.com/community/app/91660/webUI

測試圖片：

在這里插入圖片描述
識別效果：

在這里插入圖片描述

結論：
效果不行

3. Deepdoc（識別pdf中圖片）

Github: https://github.com/infiniflow/ragflow/tree/main/deepdoc

測試內容：

在這里插入圖片描述
識別效果：

在這里插入圖片描述
結論：
沒識別成功

4. Deepdoc（識別pdf中文字）

Github: https://github.com/infiniflow/ragflow/tree/main/deepdoc

測試內容：

在這里插入圖片描述
識別效果：

Oのra開c発le チDaーtaムbaはse、2A3Iとaiの開提発供者開の始生を産発性表向で上きにる重こ點とをを置嬉いしてく、思OrいacまleすD。atこabのas4e年の間次、のO長ra期cサle ポDaーtaトba?seリリースの提供に向けて懸命に取り組んできました。このリリースではAIに焦點を當てており、データベースの名前をOracle Database 23cからOracle Database 23aiに変更することを決定しました。これは、このリリースの焦點と、リリースされる情勢を反映しています。、のの焦點、情勢反映。

結論：
效果不行

5. Nvidia neva-22b

neva-22b: https://build.nvidia.com/nvidia/neva-22b

在這里插入圖片描述
結論：
沒識別出來

6. Claude 3.5 sonnet 識別圖片中的文字

please identify the text in the picture, response the text only in it's original language.

在這里插入圖片描述

7. Claude 3.5 sonnet 識別 pdf 中表格

Convert the entire table to markdown format, preserving its original language. Include all content from all pages, even if information is repeated across multiple pages. Present the complete table without omitting any sections.

在這里插入圖片描述

8. OpenAI gpt-4o 識別圖片中文字

在這里插入圖片描述

9. OpenAI gpt-4o 識別 pdf 中表格

Please convert the entire table to Markdown format, preserving its original language. Include all content from all pages, even if information is repeated across multiple pages. Present the complete table without omitting any sections, and make sure to include any duplicated information exactly as it appears in the original document.

在這里插入圖片描述
問題點：
表頭被重復打印了

完結！

本文來自互聯網用戶投稿，該文觀點僅代表作者本人，不代表本站立場。本站僅提供信息存儲空間服務，不擁有所有權，不承擔相關法律責任。
如若轉載，請注明出處：http://www.pswp.cn/web/40191.shtml
繁體地址，請注明出處：http://hk.pswp.cn/web/40191.shtml
英文地址，請注明出處：http://en.pswp.cn/web/40191.shtml

如若內容造成侵權/違法違規/事實不符，請聯系多彩編程網進行投訴反饋email:809451989@qq.com，一經查實，立即刪除！