TY - GEN
T1 - An efficient method for text detection in video based on stroke width similarity
AU - Dinh, Viet Cuong
AU - Seong, Soo Chun
AU - Seungwook, Cha
AU - Hanjin, Ryu
AU - Sull, Sanghoon
PY - 2007
Y1 - 2007
N2 - Text appearing in video provides semantic knowledge and significant information for video indexing and retrieval system. This paper proposes an effective method for text detection in video based on the similarity in stroke width of text (which is defined as the distance between two edges of a stroke). From the observation that text regions can be characterized by a dominant fixed stroke width, edge detection with local adaptive thresholds is first devised to keep text- while reducing background-regions. Second, morphological dilation operator with adaptive structuring element size determined by stroke width value is exploited to roughly localize text regions. Finally, to reduce false alarm and refine text location, a new multi-frame refinement method is applied. Experimental results show that the proposed method is not only robust to different levels of background complexity, but also effective to different fonts (size, color) and languages of text.
AB - Text appearing in video provides semantic knowledge and significant information for video indexing and retrieval system. This paper proposes an effective method for text detection in video based on the similarity in stroke width of text (which is defined as the distance between two edges of a stroke). From the observation that text regions can be characterized by a dominant fixed stroke width, edge detection with local adaptive thresholds is first devised to keep text- while reducing background-regions. Second, morphological dilation operator with adaptive structuring element size determined by stroke width value is exploited to roughly localize text regions. Finally, to reduce false alarm and refine text location, a new multi-frame refinement method is applied. Experimental results show that the proposed method is not only robust to different levels of background complexity, but also effective to different fonts (size, color) and languages of text.
UR - http://www.scopus.com/inward/record.url?scp=38349078899&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-76386-4_18
DO - 10.1007/978-3-540-76386-4_18
M3 - Conference contribution
AN - SCOPUS:38349078899
SN - 9783540763857
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 200
EP - 209
BT - Computer Vision - ACCV 2007 - 8th Asian Conference on Computer Vision, Proceedings
PB - Springer Verlag
T2 - 8th Asian Conference on Computer Vision, ACCV 2007
Y2 - 18 November 2007 through 22 November 2007
ER -