The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Blogosphere is the name associated to universe of all the blog sites. A blog is a website that allows people to write about topics they want to share with others. The ease & simplicity of creating blog posts and their free form and unedited nature have made blogging a happening thing. The blogosphere today contains a large number of posts on virtually every topic of interest. This rich and unique...
This article explores hierarchical clustering and graph analysis for detection groups in Wikis. Both approaches are explored in this work using historical information about Wiki pages. The results shows that this type of analysis can be used to identify people and groups with similar interests and abilities.
This paper addresses clustering of blog users and posts in blogosphere. First, we model blogosphere as a bipartite graph where blog users and posts correspond to nodes of two types and actions on posts performed by blog users corresponds to links. Next, for clustering in blogosphere, we employ LinkClus, a link-based algorithm that finds clusters of nodes in a network effectively and efficiently. For...
The typical task of unsupervised learning is to organize data, for example into clusters, typically disjoint clusters (eg. the K-means algorithm). One would expect (for example) a clustering of books into topics to present overlapping clusters. The situation is even more so in social networks, a source of ever increasing data. Finding the groups or communities in social networks based on interactions...
Since the emergence of BLOG, it not only represents a new network technology, but also means the beginning of a new life style. How to utilize and mine the BLOG content which contains hidden sentiment and real-time update is a big challenge in the data-mining domain. As most of the existing method for network text's topic mining is achieved through clustering text's topic and label which are labeled...
This document aims at the new energy and low-carbon economy studied by scholars as the main line, and describes the cloud concept in the application of such knowledge service through constructions of the database and the collections of network resources. New energy and low-carbon economy are the knowledge cluster widely covered. Many relative literatures have been published, so have many subjects...
Recently, micro-blogging sites such as Twitter have garnered a great deal of interests as an advanced form of blogging, where individuals can share their experiences, thoughts, feelings, etc, in real time. Additionally, mobile device based micro-blogging applications are now enabling the incorporation of extremely precise location information in the form of GPS-based coordinates. With the enormous...
Clustered Web pages, such as blog posts, could be used to improve Web search. In this paper, we propose a folksonomy extending framework using relations in the Blogosphere and demonstrate how the framework could be used to help clustering blog posts. We evaluate the framework by comparing it with content-based folksonomy extending approach. Experiment results show that the framework does help the...
Usually a meaningful Web topic has tens of thousands of comments, especially the hot topics. It is valuable if we congregate the comments into clusters and find out the mainstreams. However, such analysis has two difficulties. First, there is no explicit link relationship between Web comments just like those among Web pages or Blog comments. The other problem is, most of the comments are very short,...
The growth of eLearning systems popularity motivates researchers to study these systems intensively. Users of eLearning systems form social networks through the different activities performed by them (sending emails, reading study materials, chat, taking tests, etc.). This paper focuses on searching of latent social networks from eLearning systems data. This data consists of students activity records...
A important phase in any Web personalization system is transaction identification. Recently a number of researches have been done to incorporate semantics of a website in representation of transactions. Building a hierarchy of concepts manually is time consuming and expensive. In this paper we intend to address these shortcomings. Our contribution is that we introduce a mechanism to automatically...
Traditional approaches to document classification requires labeled data in order to construct reliable and accurate classifiers. Unfortunately, labeled data are seldom available, and often too expensive to obtain. Given a learning task for which training data are not available, abundant labeled data may exist for a different but related domain. One would like to use the related labeled data as auxiliary...
Blog is increasingly becoming an important source of information. Blog community is a kind of a group of bloggers with the same interest and common topics on the Internet. To use blog resources effectively, one important way is to identify blog communities and their members in order to refine the blog circle. In this paper, we first define the blog community and the community center, and then construct...
Wikipedia is nowadays one of the most valuable information resources; nevertheless, its current structure, which has no formal organization, does not allow to always have a useful browsing among topics. Moreover, even though most Wikipedia pages include a "See Also" section for navigating through those articles' related Wikipedia pages, the only references included here are those which authors...
We consider topic detection without any prior knowledge of category structure or possible categories. Keywords are extracted and clustered based on different similarity measures using the induced k-bisecting clustering algorithm. Evaluation on Wikipedia articles shows that clusters of keywords correlate strongly with the Wikipedia categories of the articles. In addition, we find that a distance measure...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.