One key step in text mining is text categorization, i.e., grouping texts with the same or similar content so that they can be distinguished from texts with different content. However, traditional word-frequency-based statistical approaches, such as the vector space model (VSM), fail to capture the complex meaning of texts. This paper introduces a domain ontology and constructs a new conceptual vector space model in...
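To illustrate the limitation the abstract attributes to word-frequency approaches, here is a minimal sketch of a plain term-frequency vector space model with cosine similarity. It is not the paper's method; the function names (`tf_vector`, `cosine`) and the sample sentences are hypothetical, chosen only to show that two texts with the same meaning but no shared words receive a similarity of zero.

```python
import math
from collections import Counter


def tf_vector(text):
    """Map a text to a bag-of-words term-frequency vector (whitespace tokens)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical sample texts: the first two are paraphrases with no shared words.
doc_car = tf_vector("the car needs a new engine")
doc_auto = tf_vector("my automobile requires another motor")
doc_same = tf_vector("the car needs a new engine today")

sim_synonyms = cosine(doc_car, doc_auto)  # paraphrase, no word overlap
sim_overlap = cosine(doc_car, doc_same)   # near-identical wording

print(f"paraphrase similarity: {sim_synonyms:.3f}")
print(f"overlap similarity:    {sim_overlap:.3f}")
```

Because the model compares only surface tokens, the paraphrase pair looks unrelated, while the pair with nearly identical wording scores high. Mapping words to shared concepts, as an ontology-based approach would, is one way to close that gap.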
LJParser is a development platform for web search and mining. It is middleware from LING-JOIN Software, a company with over ten years of expertise in natural language understanding and web search. LJParser provides powerful modules including precise multilingual search, new word detection, Chinese word segmentation and POS tagging, language modeling and term translation, text clustering,...
The rapid growth of the Internet has drawn much attention to Internet public opinion; it is important to grasp Internet public opinion promptly and to understand its trends correctly. Text mining plays a fundamental role in categorizing and monitoring Internet public opinion, but processing Internet public opinion is much more difficult than processing pure text because of its semi-structured characteristic...
EWAS is an early-warning prototype that collects and analyzes news items from the European media monitor. Although it currently processes news articles, it can easily be adapted to any other form of text. The data mining functions performed by the system are categorization, clustering, and named entity extraction. The main design concern of the system is scalability, which is achieved by a modular architecture...