3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in curved-text-detection-11
Use these libraries to find curved-text-detection-11 models and implementations
No subtasks available.
A novel Progressive Scale Expansion Network (PSENet) is proposed, which can precisely detect text instances with arbitrary shapes and is effective to split the close text instances, making it easier to use segmentation-based methods to detect arbitrary-shaped text instances.
This work proposes a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes, and significantly outperforms state-of-the-art methods in terms of both accuracy and efficiency.
SegLink, an oriented text detection method to decompose text into two locally detectable elements, namely segments and links, achieves an f-measure of 75.0% on the standard ICDAR 2015 Incidental (Challenge 4) benchmark, outperforming the previous best by a large margin.
A more flexible representation for scene text is proposed, termed as TextSnake, which is able to effectively represent text instances in horizontal, oriented and curved forms and outperforms the baseline on Total-Text by more than 40% in F-measure.
To evaluate its robustness against curved text, DeconvNet is fine-tuned and benchmarked on Total-Text to facilitate a new research direction for the scene text community.
A polygon based curve text detector (CTD) which can directly detect curve text without empirical combination and which can be end-to-end trainable to learn the inherent connection among the position offsets by seamlessly integrating the recurrent transverse and longitudinal offset connection (TLOC).
This study proposes a novel method named sliding line point regression (SLPR) in order to detect arbitrary-shape text in natural scene and achieved competitive results on traditional ICDAR2015 Incidental Scene Text benchmark and curve text detection dataset CTW1500.
MTRNet is a conditional adversarial generative network (cGAN) with an auxiliary mask that makes the cGAN a generic text eraser, but also enables stable training and early convergence on a challenging large-scale synthetic dataset, initially proposed for text detection in real scenes.
An unsupervised Graph Spectral Clustering technique for Moving Object Detection in Event-based data (GSCEventMOD) is presented and it is shown how the optimum number of moving objects can be automatically determined.
Adding a benchmark result helps the community track progress.