❓

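How can the detection bbox of Mask R-CNN, on which PMTD is based, be a general quadrilateral rather than an axis-aligned rectangle? And how should a typical detection model, not only Mask R-CNN, be modified so that it produces quadrilateral bboxes that are not rectangles?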

Status

Unresolved question

Created

2022/10/26 04:09

Related tags


Notes that could make the question more concrete

None

Questions that must be resolved first in order to resolve this question

None

Writings that contain answers related to this question

p. 2: "Considering that scene texts are with arbitrary orientations, works in [22, 32, 46, 10, 26, 9] make the above methods possible for multi-oriented text detection. RRPN [32] introduces inclined anchors with angle information for arbitrary-oriented text prediction and rotated RoI pooling layer to project arbitrary-oriented proposals to the feature map for a text region classifier. TextBoxes++ [22] improves TextBoxes by regressing horizontal anchors to more general quadrilaterals enclosing oriented texts. It also proposes an efficient cascaded non-maximum suppression for quadrilaterals or rotated rectangles. With dense predictions and one step post processing, EAST [46] and DDR [10] both directly produce the rotated boxes or quadrangles of text at each point in the text region. Recent text spotting methods like FOTS [26] and He et al. [9] show that training text detection and recognition simultaneously could greatly boost detection performance." → Plan: just look into the source code of the methods listed here (RRPN, TextBoxes++, EAST, DDR, FOTS, He et al.).
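
Before reading the actual sources, the TextBoxes++ idea quoted above (regressing a horizontal anchor to a more general quadrilateral) can be sketched roughly as follows. This is a minimal illustration and not the authors' code: the function names `anchor_corners`, `decode_quad`, and `encode_quad` are hypothetical, and the parameterization (one (dx, dy) offset per corner, normalized by the anchor's width and height) is an assumption about how such a regression head might be set up. The point is only that a detector can emit 8 numbers per anchor instead of 4, so the output no longer has to be an axis-aligned rectangle.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Anchor = Tuple[float, float, float, float]  # (cx, cy, w, h), axis-aligned


def anchor_corners(anchor: Anchor) -> List[Point]:
    """Corners of a horizontal anchor, clockwise from the top-left corner."""
    cx, cy, w, h = anchor
    x0, y0 = cx - w / 2, cy - h / 2
    x1, y1 = cx + w / 2, cy + h / 2
    return [(x0, y0), (x1, y0), (x1, y1), (x0, y1)]


def decode_quad(anchor: Anchor, offsets: Sequence[Point]) -> List[Point]:
    """Turn 4 predicted (dx, dy) offsets into quadrilateral corners.

    Assumed parameterization: each corner of the quadrilateral is the
    matching anchor corner shifted by (w * dx, h * dy).
    """
    _, _, w, h = anchor
    return [
        (x + w * dx, y + h * dy)
        for (x, y), (dx, dy) in zip(anchor_corners(anchor), offsets)
    ]


def encode_quad(anchor: Anchor, quad: Sequence[Point]) -> List[Point]:
    """Inverse of decode_quad: regression targets for a ground-truth quad."""
    _, _, w, h = anchor
    return [
        ((qx - x) / w, (qy - y) / h)
        for (x, y), (qx, qy) in zip(anchor_corners(anchor), quad)
    ]


if __name__ == "__main__":
    anchor = (50.0, 50.0, 40.0, 20.0)
    # Hypothetical network output: shift each corner independently.
    offsets = [(0.25, -0.5), (0.25, 0.0), (-0.25, 0.5), (-0.25, 0.0)]
    print(decode_quad(anchor, offsets))
```

With all offsets set to zero the decoded quadrilateral is the anchor itself, so an ordinary rectangle detector is the special case where the four corners are constrained to move together. RRPN takes a different route (anchors that carry an angle), which this sketch does not cover.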