The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
A study of educational approaches to higher education computing curricula using publicly available data from a representative sample of 27 public colleges in the State University of New York (SUNY) system is performed. Data and text analysis techniques were applied to a corpus built from course descriptions and listed prerequisites across school, department, school type (2- or 4- year or graduate)...
The K-12 learning space is evolving in both the United States and internationally. Students are given increasingly frequent access to the internet through various platforms such as desktop computers, laptops, tablets, and other mobile devices. Some schools are distributing mobile devices to students in order to facilitate the integration of technology in the classroom. These devices have a web filter...
Now a day's many of crimes are related to financial domain so forensic analysis of such documents is required. Due to digitization many of documents for investigation is faster. If analyzer analyzes the document manually it will time consuming and tedious task so, we follow the approach which will specify the clustering algorithm to document for forensic analysis of seize system which will help the...
Many approaches to solve the problem of scene character recognition utilize local features such as histograms of oriented gradients (HoG), SIFT, Shape Contexts (SC), Geometric Blur (GB), etc. An issue associated with these methods is the ad hoc rasterization of the local features into a single vector which perturbs the global spatial correlations that carry crucial information for recognition. To...
The paper discusses the efficiency of parallel implementation an algorithm of frequency analysis of textual information. The algorithm is implemented as a multithreaded application. Two approaches for job distribution between threads are compared — the text distribution and alphabet distribution. Experiment results shown that both methods can be used for acceleration of the procedure of text frequency...
There are several requirements to the preprocessing of the classified texts. Within the frame of this work importance of these requirements have been analysed.
Documents can be a valuable source of information but often they suffer degradation problems, especially in the case of historical documents, such as strains, background of big variations and uneven illumination, ink seepage, etc. Binarization techniques should be applied to remove the noise and improve the quality of the documents. Collections of historical and old document images care commonly provided...
In this paper we describe the hacker culture by analyzing 25 years of communication on one of the oldest and most renowned hacker websites. For this purpose, we utilize a previously documented text analysis technique [14] which provides an efficient and effective method of producing a quick overview of values underlying any written text. The technique allows for the creation of culture profiles of...
In this paper we propose a neural net based characters recognition scheme for Bangla printed text books. There are a lot of scientific literature, novels, magazines and books etc that are written in Bangla language. More than 400 million people use Bangla language. Most of the library and educational institutions want to keep copy of the books in a digital format. For storing those books in digital...
We describe a basic framework and methodology to convert Bangla Text to Speech. Articulated words are automatically produced from Bangla input text by the methodology from the basic pronunciation of the Bangla words. The single tone syllables are considered as the fundamental units for analysis. The methodology selects phonetic units from uttered vocabulary and then combined the appropriate diphones...
Web is gigantic and being constantly update. Bangla news in web are rapidly grown in the era of information age where each news site has its own different layout and categorization for grouping news. These heterogeneity of layout and categorization can not always satisfy individual user's need. Removing these heterogeneity and classifying the news articles according to user preference is a formidable...
Document layout analysis is necessary process for automated document recognition systems. Document layout analysis identifies, categorizes and labels the semantics of text blocks for meaningful information retrieval from document images. Our primary target document includes various newspaper and magazine pages which are having complex layout without following any static rules. We propose an effective...
We present WebGT, the first web-based system to help users produce ground truth data for document images. This user-friendly software system helps historians and computer scientists collectively annotate historical documents. It supports real time collaboration among remote sites independent of the local operating system and also provides several novel semi-automatic tools that have proven effective...
Social computing is an emerging field, encompassing a wide range of topics. A broad understanding of the major topics involved in social computing is important for both scholars and practitioners. The authors present and analyze the voluminous social computing related studies to date, applying document co-citation analysis, pathfinder networks, core-document analysis, and the Herfindahl-Hirschman...
The Semantic Web is an evolving extension of the World Wide Web in which the semantics of information and services on the web is defined, making it possible for the web to understand and satisfy the requests of people and machines to use the web content. Semantic information processing is used to construct knowledge base at the human level. The most fundamental step in semantic information processing...
The issue of plagiarism is discussed in the context of university education in disciplines related to computing. The focus is therefore mainly on software plagiarism. First, however, a case is made for the claim that the most important reason that plagiarism cannot be tolerated lies in the essence of the concept of a university as it is rooted in the Western cultural tradition. The main contribution...
A multi-agent based Web mining model is designed for the improvement of the efficiency of keywords based search engine. The model divides mining task into several parallel agents which coordinately work together, and the mining efficiency is improved greatly. Evolving from HITS, algorithm named Grabber in the model removes Link Farm pages in the expansion of root set, makes anchor text similarity...
The emotion tendency of sentiment word is divided into two types: static emotion tendency and dynamic emotion tendency. Basic semantic lexicon is static emotion tendency, in the real context, but it is different between static emotion tendency and dynamic emotion tendency. The paper proposes a method based on degree lexicon, negative lexicon and dependence relationship of sentence elements. The experimental...
The idea of text to speech by a computer is an enhancement of the human learning ability. Due to the fact that each person has individual ability of visualization, the receiving of information in the form of voice helps make everything become easier. The objective of this research is to develop computer software that can translate Thai Text to Speech (TTTS). The TTTS consists of four modules, which...
In this paper we present a method for Bangla speech generation from Bangla PDF document. Our main goal is to generate almost natural speech from Bangla PDF document. For this we have proposed a method which performs three major tasks. One is PDF to text conversion, then text to ASCII conversion, and then follows the character and modifier rules while reading text and finally speech generation from...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.