點擊上方“CVer”,選擇加"星標"置頂
重磅干貨,第一時間送達
本文轉載自:CSIG文檔圖像分析與識別專委會
圖4為LOTM結構。LOTM模塊的輸入Proposal Features是在Adaptive-RPN后的共享特征圖上使用Deformable RoI pooling [4] 和雙線性插值得到。經過1*1卷積后,展開兩個平行分支,解耦為水平和和豎直兩個正交方向的輪廓檢測。水平方向分支使用1*k的卷積核水平方向卷積,豎直方向分支則使用k*1的卷積核豎直方向卷積,k是超參數,實驗驗證使用k=3比較好。卷積后的特征圖經過Sigmoid歸一化得到相應方向的熱圖。LOTM使用交叉熵損失分類輪廓邊界點。
Point Re-scoring Algorithm模塊中,先對兩個方向熱圖進行簡單的NMS預處理濾波得到更高置信度的準確表征,然后綜合考慮LOTM輸出的水平和垂直方向上響應,即文本輪廓需同時具有兩個方向的響應,濾除單方向噪聲,從而抑制偽召回。
三、主要實驗結果及可視化效果Table 1. The single-scale results on Total-Text. * indicates the results?from [5]. Ext is the short for external data used in training?stage. y means testing at multi-scale setting. The evaluation protocol?is DetEval.?2?ContourNet論文地址:https://arxiv.org/pdf/2004.04940.pdf
2?ContourNet開源代碼:https://github.com/wangyuxin87/ContourNet
參考文獻[1] Tsung-Yi Lin, Piotr Doll′ar, Ross B. Girshick, Kaiming He,?Bharath Hariharan, and Serge J. Belongie. Feature pyramid?networks for object detection. In CVPR, pages 936–944,?2017.[2] Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun.?Faster r-cnn: Towards real-time object detection with region?proposal networks. In Advances in neural information processing?systems, pages 91–99, 2015.[3] Hamid Rezatofighi, Nathan Tsoi, JunYoung Gwak, Amir?Sadeghian, Ian Reid, and Silvio Savarese. Generalized intersection?over union: A metric and a loss for bounding box?regression. In Proceedings of the IEEE Conference on Computer?Vision and Pattern Recognition, pages 658–666, 2019.[4] Xizhou Zhu, Han Hu, Stephen Lin, and Jifeng Dai. Deformable?convnets v2: More deformable, better results. In?CVPR, 2019.[5] Shangbang Long, Jiaqiang Ruan, Wenjie Zhang, Xin He,?Wenhao Wu, and Cong Yao. Textsnake: A flexible representation?for detecting text of arbitrary shapes. In ECCV,?pages 19–35. Springer, 2018.[6] Yuliang Liu, Lianwen Jin, Shuaitao Zhang, Canjie Luo, Sheng Zhang.Curved scene text detection via transverse and longitudinal sequence connection. Pattern Recognition 90:337–345.[7] Jie Hu, Li Shen, and Gang Sun. Squeeze-and-excitation networks.?In Proceedings of the IEEE conference on computer?vision and pattern recognition, pages 7132–7141, 2018.原文作者:Yuxin Wang, ?Hongtao Xie, ?Zhengjun Zha, ?Mengting Xing, ?Zilong Fu and Yongdong Zhang
撰稿:伍思航 |?編排:高?學審校:殷 飛 |?發布:金連文
免責聲明:(1)本文僅代表撰稿者觀點,撰稿者不一定是原文作者,其個人理解及總結不一定準確及全面,論文完整思想及論點應以原論文為準。(2)本文觀點不代表本公眾號立場。下載
在CVer公眾號后臺回復:CVPR2020,即可下載CVPR 2020所有論文和300+篇代碼開源的論文項目,開源地址如下:
https://github.com/amusi/CVPR2020-Code
重磅!CVer-論文寫作與投稿交流群成立
掃碼添加CVer助手,可申請加入CVer-論文寫作與投稿?微信交流群,目前已滿2000+人,旨在交流頂會(CVPR/ICCV/ECCV/ICML/ICLR/AAAI等)、頂刊(IJCV/TPAMI等)、SCI、EI等寫作與投稿事宜。
同時也可申請加入CVer大群和細分方向技術群,細分方向已涵蓋:目標檢測、圖像分割、目標跟蹤、人臉檢測&識別、OCR、姿態估計、超分辨率、SLAM、醫療影像、Re-ID、GAN、NAS、深度估計、自動駕駛、強化學習、車道線檢測、模型剪枝&壓縮、去噪、去霧、去雨、風格遷移、遙感圖像、行為識別、視頻理解、圖像融合、圖像檢索、論文投稿&交流、PyTorch和TensorFlow等群。
一定要備注:研究方向+地點+學校/公司+昵稱(如論文寫作+上海+上交+卡卡),根據格式備注,可更快被通過且邀請進群
▲長按加微信群
▲長按關注CVer公眾號
整理不易,請給CVer一個在看!