The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Before the image of a document enter an OCR module, it should undergo Preprocessing and Document Layout Analysis steps. Document layout analysis usually comes after preprocessing. Noise removal and skew correction are two major preprocessing operations. Document layout analysis itself is divided into physical and logical layout analysis. Physical layout analysis decomposes the image of a document...
Single document summarization generates summary by extracting the representative sentences from the document. In this paper, we presented a novel technique for summarization of domain-specific text from a single web document that uses statistical and linguistic analysis on the text in a reference corpus and the web document. The proposed summarizer uses the combinational function of Sentence Weight...
Historical manuscripts are considered one of the most imperative human riches and a source of intellectual production. Unfortunately, due to aging effects, multiple noises and deviations are found in the document image. Moreover, challenges for several images of ancient documents show defects of inclinations and curvatures of text lines. These defects arise due to bad storage conditions, or during...
Various kinds of information that is available on a topic electronically has abundantly increased over the past years. It has led the information highway to a situation called “information overload” problem. Automatic text summarization technique mainly addresses this issue by the extraction of a shortened version of information from texts written about the same topic. Several algebraic reduction...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.