Serwis Infona wykorzystuje pliki cookies (ciasteczka). Są to wartości tekstowe, zapamiętywane przez przeglądarkę na urządzeniu użytkownika. Nasz serwis ma dostęp do tych wartości oraz wykorzystuje je do zapamiętania danych dotyczących użytkownika, takich jak np. ustawienia (typu widok ekranu, wybór języka interfejsu), zapamiętanie zalogowania. Korzystanie z serwisu Infona oznacza zgodę na zapis informacji i ich wykorzystanie dla celów korzytania z serwisu. Więcej informacji można znaleźć w Polityce prywatności oraz Regulaminie serwisu. Zamknięcie tego okienka potwierdza zapoznanie się z informacją o plikach cookies, akceptację polityki prywatności i regulaminu oraz sposobu wykorzystywania plików cookies w serwisie. Możesz zmienić ustawienia obsługi cookies w swojej przeglądarce.
Web text mining is a growing research area in data mining. Interestingly, the existing Web text mining algorithms have concentrated on finding frequent patterns while discarding the less frequent ones that may contain outliers. In addition, the domain knowledge in one industry is partly different from that in the others. Whatever they belong to, web texts are analyzed using the same dictionary. This...
Web mining consists of three aspects: Web content mining, Web structure mining, and web usage mining. The most important application of web mining is targeted advertising. Sequential mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. An important application of sequential...
To provide an autonomous learning platform for the students who are interested in web design, the authors of this article and a team of senior undergraduate students established a learning website on the campus network of Tibet Institute for Nationalities. This article first sum up the achievement and problems of the learning website, then proposes a new system framework on topic-based learning websites...
Finding knowledge on the Web has long been a hot research issue. Today the Web has become a popular medium for publishing news and opinion articles, which are important carriers of human knowledge, especially of social knowledge. Developing techniques of automatically collecting and analysing these articles on a large scale is thus desirable. In this paper we propose techniques for searching for events...
Advances in the Internet technologies, specifically Web 2.0, have brought about richer Internet contents and applications. Although recent reports have shown increase of the Internet use by Thai citizens, Web-2.0-based websites in Thailand still do not keep pace very well that would permit full benefits of the technologies to be reaped. This paper presents the design and implementation of a Web 2...
Recent advances in automatic knowledge acquisition methods make it possible to construct massive knowledge bases of semantic relations, containing information potentially unknown to their users. However for certain data mining tasks like finding potential causes of a disease or side-effects of a drug, where missing a small piece of information can have grave consequences, the coverage of automatically...
This paper takes session identification in web log mining as research object, proposes an improved algorithm based on average time threshold value. By calculating the average intervals dynamically among request records in the session, adjusting the time threshold value individually, and compared to the traditional algorithm that defines a uniform threshold value for all users' web pages, the algorithm...
25 Statistical Reports on the Development of Chinese Internet and 5 Survey Reports on the Chinese Network Information Resources published by China Internet Network Information Center are taken for analysis. From the point of view of domain name, web site and web page respectively, the paper conducts detailed analysis on the growth of basic network information resources in China. The statistical analysis...
This paper describes the research about Web data mining using Natural Language Processing. System accepts arbitrary data as input from Web document and then extracts information from the document. A new method to implement Web data mining is proposed in this paper. There are three steps in this system. First, the Web document will be decomposed to paragraph, sentence and phrase level. Second, extract...
In order to improve searching results of Web pages and enhancing Web crawling operation, the Web page clustering based on searching keywords is proposed in this paper, which firstly employed matching degree between Web pages and searching keywords to decide the sequence of showing pages of searching results. Then clustering algorithm was chosen to group pages of searching results according to matching...
The number of Web sites has noticeably increased to roughly 225 million in the last ten years. This means there is a rapid growth of knowledge and information on the Internet. Although search engines can help users to filter their desired information based on key words, the searched result is normally presented in the form of a list, and users have to visit each Web page in order to determine the...
This paper introduces the web mining technology and the application of web mining in the long-distance education platform, points out the process of web mining, discusses the key techniques of personalized long-distance education platform applying web mining technology. The study process of student are analyzed, the structures of teacher model and the structures of student model are given. The method...
With the rapid increase of available information online, especially with the growing popularity of electronic commerce, web data mining is being paid much attention. Combing web data mining and e-commerce has been a hot issue. Following paper focus on how to apply web mining to electronic commerce instead of the plethora or algorithms in data mining. We first introduce the concept, method and process...
The disordered way of the Web information organization has seriously hindered the knowledge sharing and interoperability, this paper presents a knowledge-oriented Web page automatic acquisition system (AKAS2WP). This system includes four core modules, and they are accessing of web pages, text extraction, the management and organizations of the concept and the attribute extraction of the concept. Accessing...
The massive heterogeneous Web information resources flood on the Internet, and the Web pages classified have no relation with each other. In this paper, the topic knowledge repository is built to find the semantic relation between Web pages, and the similar relation and the associated relation are defined to describe the semantic relation, which helps to provide knowledge service for user and other...
A Web information extraction system based on label library is proposed for extracting information from data intensive Web pages in this paper. It downloads dynamic Web pages based on a knowledge database, changes them to XML documents after a preprocessing, mines data regions by using MDR repeated patterns discovery algorithm, recognizes their structure and extracts data from them through a novel...
Topic map is one of the hottest areas in information retrieval field in that it enables a user to navigate and access the documents he needs in an organized manner, rather than browsing through hyperlinks that are generally unstructured and often misleading. This paper proposes a new framework for information retrieval. Through actions of registry, discovery and access in grid information service...
Nowadays Web users are facing the problems of information overload and drowning due to the significant and rapid growth in the amount of information and the number of users. As a result, how to provide Web users with more exactly needed information is becoming a critical issue in Web-based information retrieval and Web applications. In this work, we aim to address improving the performance of Web...
This paper presents Knowledge Puzzle, a tool for knowledge construction from the Web. Its main contribution to Web-based learning is the adaptation of information structure on the Web to cope with the interlinked knowledge structure in the learnerpsilas mind. Self-directed learners will be able to adapt the path of instruction on the Web to their way of thinking, regardless of how the Web content...
One promising application of natural language processing (NLP) research is in the area of information extraction (IE). In this paper, we present work flow of our IE system for the extraction of semantically rich information from the unstructured or semi-structured Chinese web pages. Knowledge engineering approach and automatic training approach are used to extract pattern and built knowledge repository...
Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.