In this paper, we define a new paradigm for eight-connection labeling, which employs a general approach to improve neighborhood exploration and minimize the number of memory accesses. First, we exploit and extend the decision table formalism, introducing or-decision tables, in which multiple alternative actions are managed. An automatic procedure to synthesize the optimal decision tree from the decision...
This paper proposes a fast method for correcting slanted text images. The method takes the well-known projection algorithm as its basic principle. In the traditional projection algorithm, the heavy computation required for rotation leads to slow processing. The improved method mainly targets the computational speed problem of the projection algorithm, and this could...
Artificial text detection in video is a challenging pattern recognition problem. Current methods, which are usually based on edges, texture, connected domains, features, or learning, are limited by the size, location, and language of the artificial text in the video. To solve these problems, this paper applies a SOM (Self-Organizing Map) based on supervised learning to video artificial text detection...
Optical character recognition of printed Bangla text is a challenging field. The main difficulties are that there are no precise techniques or algorithms for separating lines, words, and characters from printed Bangla text and for efficiently recognizing the separated characters. In this paper, we introduce new methods to separate lines, words, and characters from printed Bangla...
Document images have been an area of research for a couple of decades because of their potential application to text recognition, line recognition, or any other shape recognition from the image. Text recognition from document images depends heavily on the language of the text itself. English text recognition algorithms have already been developed and are standardized. Some works on Bangla...
Automatic recognition of printed and handwritten documents remains an active area of research. Arabic is one of the languages that present special problems. Arabic is cursive and therefore necessitates a segmentation process to determine the boundaries of a character. Arabic characters consist of multiple disconnected parts. Dots and Diacritics are used in many Arabic characters and can appear above...
Character recognition has been important for several decades. Much research interest is now focused on applying pattern recognition and computer vision algorithms to camera-captured documents to retrieve information from them. This paper presents a novel approach for extracting text in camera-captured images using an edge-based algorithm. Extensive experiments have been carried out on...
Text line segmentation in freestyle handwritten documents remains an open document analysis problem. Curvilinear text lines and small gaps between neighbouring text lines present a challenge to algorithms developed for machine-printed or hand-printed documents. We investigate a general-purpose, knowledge-free method for the automatic detection of text lines based on a stable path approach. Lines affected...
With paper as the medium of information, traditional books, magazines, newspapers, etc. are scanned into images and converted into electronic documents through OCR (optical character recognition) technology, and layout analysis, as an important part of OCR, plays an increasingly large role. This paper presents a Chinese document layout analysis based on non-text images, solve the deformed image...
Detecting text embedded in complex colored document images is a very challenging problem. Text extraction has many potential uses in image searching, document archiving, etc. In this paper, we propose a simple edge-based feature to perform this task. It aims at detecting textual regions in the document and separating them from the graphics portion. The...
The graphical information used in technical degrees is not accessible to visually disabled people, which is why a system is being developed to provide a textual description of the information from diagrams using digital image processing and computer vision techniques. This data is used to create a textual description which will be automatically added to the corresponding figure in order to make it...
This paper presents two new preprocessing techniques for cursive script recognition. Enhanced algorithms for core-region detection and effective uniform slant angle estimation are proposed. Reference lines composed of core-region are usually obtained as the ones surrounding highest density peaks, but are strongly affected by the presence of long horizontal strokes and erratic characters in the word...
Text image binarization is an important step in text image analysis and text understanding systems. Some corrupted regions may remain in the binarization result due to noise such as dust, streaks, shadows and small unwanted objects. In this paper, a novel method based on 3D tensor voting is proposed for enhancing text image binarization. The 3D tensor voting is used to detect corrupted regions by...
The detection of text in video images is an important step towards automatic content-based information indexing and retrieval systems. In this paper, we propose a texture-based method for text detection in complex video images. Taking advantage of the gray-scale invariance of local binary patterns (LBP), we apply a modified LBP operator to extract text features. A polynomial...
A method for detecting text regions in images that combines grayscale decomposition and stroke extraction is proposed. By checking the consistency of the two text features, text-like connected components are grouped together to generate text line regions in the processed image. The method performs well at efficiently detecting text rendered against relatively complex backgrounds.
Captions are text or logos superimposed on videos during a postproduction process. Caption detection in videos is useful for a variety of applications. For many applications, temporal consistency and stability are very important. Most prior work adopts certain post-processing procedures to smooth detected caption bounding boxes over time. Although these approaches mitigate the effect of the...
This paper presents a method to assist the indexing of digitized Syriac manuscripts. Syriac belongs to the Aramaic branch of the Semitic languages; it is written from right to left, intentionally tilted by an angle of approximately 45°. The proposed method is based on a word spotting approach that should locate all the occurrences of a certain query word image. The method is based on a selective sliding...
This paper presents a technique for detecting caption text for indexing purposes. This technique is to be included in a generic indexing system dealing with other semantic concepts. The various object detection algorithms are required to share a common image description which, in our case, is a hierarchical region-based image model. Caption text objects are detected combining texture and geometric...
We propose a fully automatic method for summarizing and indexing unstructured presentation videos based on text extracted from the projected slides. We use changes of text in the slides as a means to segment the video into semantic shots. Unlike previous approaches, our method does not depend on the availability of the electronic source of the slides, but rather extracts and recognizes the text directly...
In this paper, we present a new method to segment text in natural scenes. The method is based on a morphological operator, the Toggle Mapping. The efficiency of the method is illustrated, and the method is compared, according to various criteria, with common state-of-the-art methods. This comparison shows that our method gives better results and is faster than the state of...