The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, query sound-by-example video retrieval framework based on audio concepts is presented. First, audio stream extracted from movies in the database is set into orientation clusters using an unsupervised segmentation technique. Audio signals admit a new proposed particular pretreatment process to distinguish audio concepts. This is used for indexing the video data. Second, the query asked...
This paper describes the Arabic Recognition Competition: Multi-font Multi-size Digitally Represented Text held in the context of the 12th International Conference on Document Analysis and Recognition (ICDAR'2013), during August 25-28, 2013, Washington DC, United States of America. This competition has used the freely available Arabic Printed Text Image (APTI) database. A first edition took place in...
We propose in this work an approach for automatic recognition of printed Arabic text in open vocabulary mode and ultra low resolution (72 dpi). This system is based on Hidden Markov Models using the HTK toolkit. The novelty of our work is in the analysis of three complex fonts presenting strong ligatures: DiwaniLetter, DecoTypeNaskh and DecoTypeThuluth. We propose a feature extraction based on statistical...
This paper describes the Arabic Recognition Competition: Multi-font Multi-size Digitally Represented Text held in the context of the 11$^{th}$ International Conference on Document Analysis and Recognition (ICDAR2011), during September 18-21, 2011, Beijing, China. This first competition used the freely available Arabic Printed Text Image (APTI) database. Several research groups have started using the...
Arabic script presents a challenge complexity and variability for handwriting recognition. The first on line Arabic Database called ADAB is known as a standard benchmark in the ICDAR competition of 2009. This paper describes the Online Arabic handwriting recognition competition held at ICDAR 2011. 3 groups with 5 systems are participating in the competition. The systems were tested on known data (sets...
The selection of the classifier architecture is a very important step in the recognition process. This paper presents a new algorithm for the HMMs architectures optimization: Multi-Models Evolvement using PSO (MME-PSO). The proposed algorithm is applied to an Arabic handwriting recognition system. The recognizer is based on character Hidden Markov Models which can have different architectures. This...
We present in this paper a framework for audio concept identification based on audio stream analysis and binary classifiers encapsulation. The system consists of three stages. The first stage is called the pre-processing level audio, where audio stream is segmented and silence segments are detected. In the second stage, speech, music and environmental sounds are automatically divided and further classified...
In this paper, we propose a new linguistic-based approach called the affixal approach for Arabic word and text image recognition. Most of the existing works in the field integrate the knowledge of the Arabic language in the recognition process in two ways: either in post-recognition using the language of dictionary (dictionary of words) to validate the word hypotheses suggested by the OCR or in the...
In this paper, we propose a strategy of multi-SVM incremental learning system based on Learn++ classifier for detection of predefined events in the video. This strategy is offline and fast in the sense that any new class of event can be learned by the system from very few examples. The extraction and synthesis of suitably video events are used for this purpose. The results showed that the performance...
We present in this paper a new approach of online Arabic handwriting modeling based on the graphemes segmentation. This segmentation rests on the previous detection of baseline. It involves the detection of two types of topologically meaningful points: the backs of the valleys adjoining the baseline and the angular points. The stage of features extraction allows to model the shapes of segmented graphemes...
We present in this paper a new approach for Arabic font recognition. Our proposal is to use a fixed-length sliding window for the feature extraction and to model feature distributions with Gaussian Mixture Models (GMMs). This approach presents a double advantage. First, we do not need to perform a priori segmentation into characters, which is a difficult task for arabic text. Second, we use versatile...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.