The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Communication through web is becoming increasingly popular thanks to wireless and cellular networks. As this awareness spreads far and wide in different countries, significant complexities arise in terms of language and communication means for extracting information on the web. This is particularly true in India where more than fifteen officially recognized language texts and more variations in local...
In this work we propose a method for localizing text regions within scene images consisting of two major stages. In the first stage, a set of potential text regions is extracted from the input image using residual operators (such as ultimate attribute opening and closing). In the second stage a set of features is obtained from each potential text region and this feature set will be later used as an...
We propose a method to detect events and event boundaries in soccer videos by using web-casting texts and audio-visual features. The events and their inaccurate time information given in web-casting texts need to be aligned with the visual content of the video. We overcome this issue by utilizing textual, visual and audio features. Existing methods assume that the time at which the event occurs is...
A novel text detection algorithm based on 2D tensor voting is proposed. Tensor voting is used to extract text line information by exploiting the curve saliency value and curve normal vector at each character. The text line information is useful information to improve the results and reduce the effect of using heuristic rules of region-based methods. The experimental results attained from several natural...
Character recognition has been in importance for several decades. Lot of research interest are now focused on applying pattern recognition and computer vision algorithms on camera captured documents to retrieve information from the documents. This paper presents a novel approach for extracting text in camera captured images using edge based algorithm. Extensive experiments have been carried out on...
Detection of text from documents in which text is embedded in complex colored document images is a very challenging problem. There are a lot of potential uses of text extraction in image searching, archiving documents etc. In this paper, we propose a simple edge based feature to perform this task. It aims at detecting textual regions from the document and separating it from the graphics portion. The...
The graphical information used in technical degrees is not accessible to visually disabled people, which is why a system is being developed to provide a textual description of the information from diagrams using digital image processing and computer vision techniques. This data is used to create a textual description which will be automatically added to the corresponding figure in order to make it...
The detection of texts in video images is an important task towards automatic content-based information indexing and retrieval system. In this paper, we propose a texture-based method for text detection in complex video images. Taking advantage of the desirable characteristic of gray-scale invariance of local binary patterns (LBP), we apply a modified LBP operator to extract feature of texts. A polynomial...
A method of detecting text regions in images which combines grayscale decomposition and stroke extraction is proposed. By checking the consistency of the two text features, text-like connected components are grouped together to generate text line regions in the processed image. It shows good performance on efficiently detecting image text rendered in relatively complex backgrounds.
Robust extraction of text from scene images is essential for successful scene text recognition. Scene images usually have nonuniform illumination, complex background, and text-like objects. In this paper, we propose a text extraction algorithm by combining the adaptive binarization and perceptual color clustering method. Adaptive binarization method can handle gradual illumination changes on character...
Text separation in natural scenes is a crucial step to recognize scene text. Since computational power in a mobile device is limited, current text extraction methods are impractical in real-time devices. We propose efficient text extraction methods by utilizing user's indication. When user simply indicates focus or draws the line on touch screen, the system can extract text in natural scenes efficiently...
Medical image annotation remains a challenging task. Many feature schemes have been experimented with limited success. In this paper, we propose to improve the image categorization prediction through the employment of better feature schemes assessed with feature analysis. A new edge descriptor based on the Canny detector is proposed along with modified MPEG-7 features. Some preliminary results are...
Double boundary text, also called outlined text, problem is presented when the text boundary has different color with background color and text stroke color. This problem, which frequently occurs in natural text of signboards, imposes another obstacle to the proper extraction of text information. In this paper, an efficient method is proposed based on well-known filling algorithms and characteristics...
Recognition techniques for printed and handwritten text in scanned documents are significantly different. In this paper we address the problem of identifying each type. We can list at least four steps: digitalization, preprocessing, feature extraction and decision or classification. A new aspect of our approach is the use of data mining techniques on the decision step. A new set of features extracted...
In a multi-script multi-lingual environment, a document may contain text lines in more than one script/language forms. It is necessary to identify different script regions of the document in order to feed the document to the OCRs of individual language. With this context, this paper proposes to develop a monothetic algorithmic model to identify and separate text lines Telugu, Hindi and English scripts...
Distortion always appears in document images while scanning thick bound volumes. There are two kinds of distortion for the scanned grayscale images, shadow appears at the volumes' spine area, and warping of the words occurs in the shadow. In this paper, a novel text boundary lines based method for efficient restoration of warped scanning Chinese document images is presented. We first detect on which...
With the development of information technology, the number of scanned images is increasing rapidly. There are many important texts in these images. In order to satisfy the need of images viewing, text identification and text retrieval, this paper presents an efficient method for text segmentation. Firstly, localizes the text blocks in scanned image. Secondly, according to its gray/color distribution,...
Texts appearing on compound image are usually classified into scene text and imposed text. Imposed text, like scripts in videos, slogans in advertisements and titles in magazine covers, contains important information. In this paper, a multistep compression method that could preserve imposed text quality in restored image is presented. The method separates imposed text from background and compresses...
This paper describes an approach towards an orientation and skew detection for texts in scanned documents. Before using OCR systems to obtain character information from images, a preprocessing stage, comprising a number of adjustments, has to be performed in order to obtain accurate results. One important operation that has to be considered is the skew correction, or deskewing, of the image, a fault...
This paper discusses the history and current trends of video retrieval, focusing mainly on video segmentation, indexing and search. The objective is to share with the readers how much we have done so far as well as the current trends in the field. Unlike text documents, video contains dynamic information such as audio, motion (object and/or camera motions), etc. Thus, indexing videos for future search...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.