Algorithms and Models for the Web-Graph

chapter

Modelling and Mining of Networked Information Spaces

William Aiello, Andrei Broder, Jeannette Janssen, Evangelos Milios

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 1-17

In recent years, the emergence of the Web and the dramatic increase in computing, storage and networking capacity has given rise to the concept of networked information spaces. The prime example of a networked information space is the World Wide Web itself. The Web, in its pure form, is a set of hypertext documents, with links in one document pointing to another document.

chapter

Workshop on Algorithms and Models for the Web Graph

William Aiello, Andrei Broder, Jeannette Janssen, Evangelos Milios

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 18-23

For barely a decade now the Web graph (the network formed by Web pages and their hyperlinks) has been the focus of scientific study. In that short a time, this study has made a significant impact on research in physics, computer science and mathematics. It has focussed the attention of the scientific community on all the different kinds of networks that have arisen through technology and human activity;...

chapter

Expansion and Lack Thereof in Randomly Perturbed Graphs

Abraham D. Flaxman

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 24-35

This paper studies the expansion properties of randomly perturbed graphs. These graphs are formed by, for example, adding a random $1{\text{-out}}$ or very sparse Erdős-Rényi graph to an arbitrary connected graph. The central results show that there exists a constant δ such that when any connected n-vertex base graph $\bar{G}$ is perturbed by adding a random 1-out then, with high probability,...

chapter

Web Structure in 2005

Yu Hirate, Shin Kato, Hayato Yamana

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 36-46

The estimated number of static web pages in Oct 2005 was over 20.3 billion, which was determined by multiplying the average number of pages per web server based on the results of three previous studies, 200 pages, by the estimated number of web servers on the Internet, 101.4 million. However, based on the analysis of 8.5 billion web pages that we crawled by Oct. 2005, we estimate the total number...

chapter

Local/Global Phenomena in Geometrically Generated Graphs

Ross M. Richardson

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 47-58

We study a geometric random tree model which is a variant of the FKP model proposed in [1]. We choose vertices v ₁, ..., v _n in some convex body uniformly and fix a point . We then build our tree inductively, where at time t we add an edge from v _t to the vertex in v ₁, ..., v ...

chapter

Approximating PageRank from In-Degree

Santo Fortunato, Marián Boguñá, Alessandro Flammini, Filippo Menczer

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 59-71

PageRank is a key element in the success of search engines, allowing to rank the most important hits in the top screen of results. One key aspect that distinguishes PageRank from other prestige measures such as in-degree is its global nature. From the information provider perspective, this makes it difficult or impossible to predict how their pages will be ranked. Consequently a market has emerged...

chapter

Probabilistic Relation between In-Degree and PageRank

Nelly Litvak, Werner R. W. Scheinhardt, Yana Volkovich

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 72-83

This paper presents a novel stochastic model that explains the relation between power laws of In-Degree and PageRank. PageRank is a popularity measure designed by Google to rank Web pages. We model the relation between PageRank and In-Degree through a stochastic equation, which is inspired by the original definition of PageRank. Using the theory of regular variation and Tauberian theorems, we prove...

chapter

Communities in Large Networks: Identification and Ranking

Martin Olsen

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 84-96

We study the problem of identifying and ranking the members of a community in a very large network with link analysis only, given a set of representatives of the community. We define the concept of a community justified by a formal analysis of a simple model of the evolution of a directed graph. We show that the problem of deciding whether a non trivial community exists is NP complete. Nevertheless,...

chapter

Combating Spamdexing: Incorporating Heuristics in Link-Based Ranking

Tony Abou-Assaleh, Tapajyoti Das

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 97-106

Users typically locate useful Web pages by querying a search engine. However, today’s search engines are seriously threatened by malicious spam pages that attempt to subvert the unbiased searching and ranking services provided by the engines. Given the large fraction of Web traffic originating from search engine referrals and the high potential monetary value of this traffic, it is not surprising...

chapter

Traps and Pitfalls of Topic-Biased PageRank

Paolo Boldi, Roberto Posenato, Massimo Santini, Sebastiano Vigna

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 107-116

We discuss a number of issues in the definition, computation and comparison of PageRank values that have been addressed sparsely in the literature, often with contradictory approaches. We study the difference between weakly and strongly preferential PageRank, which patch the dangling nodes with different distributions, extending analytical formulae known for the strongly preferential case, and corroborating...

chapter

A Scalable Multilevel Algorithm for Graph Clustering and Community Structure Detection

Hristo N. Djidjev

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 117-128

One of the most useful measures of cluster quality is the modularity of the partition, which measures the difference between the number of the edges joining vertices from the same cluster and the expected number of such edges in a random (unstructured) graph. In this paper we show that the problem of finding a partition maximizing the modularity of a given graph G can be reduced to a minimum weighted...

chapter

A Phrase Recommendation Algorithm Based on Query Stream Mining in Web Search Engines

M. Barouni-Ebrahimi, Ali A. Ghorbani

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 129-136

In this paper, a phrase recommender algorithm is proposed that suggests the related frequent phrases to an incomplete user query. The suggested phrases are extracted from past user queries based on the frequency rate of the phrases. A query recommender algorithm called OQD (Online Query Discovery) has also been designed for comparison purposes. Simulation results show the efficiency of the proposed...

chapter

Characterization of Graphs Using Degree Cores

John Healy, Jeannette Janssen, Evangelos Milios, William Aiello

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 137-148

Generative models are often used in modeling real world graphs such as the Web graph in order to better understand the processes through which these graphs are formed. In order to determine if a graph might have been generated by a given model one must compare the features of that graph with those generated by the model. We introduce the concept of a hierarchical degree core tree as a novel way of...

chapter

Web Structure Mining by Isolated Stars

Yushi Uno, Yoshinobu Ota, Akio Uemichi

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 149-156

The link structure of the Web is generally viewed as the webgraph, and web structure mining is a research area that mainly aims to find hidden communities in the Web and so on, by focusing on the webgraph. In this paper, we identify a common frequent substructure by observing the webgraph, and newly define it as an isolated star (i-star). We propose an efficient enumeration algorithm of i-stars, and...

chapter

Representing and Quantifying Rank - Change for the Web Graph

Akrivi Vlachou, Michalis Vazirgiannis, Klaus Berberich

Lecture Notes in Computer Science > Algorithms and Models for the Web-Graph > 157-165

One of the grand research and industrial challenges in recent years is efficient web search, inherently involving the issue of page ranking. In this paper we address the issue of representing and quantifying web ranking trends as a measure of web pages. We study the rank position of a web page among different snapshots of the web graph and propose normalized measures of ranking trends that are comparable...

INFONA - science communication portal

Algorithms and Models for the Web-Graph
Fourth International Workshop, WAW 2006, Banff, Canada, November 30 - December 1, 2006. Revised Papers

Modelling and Mining of Networked Information Spaces

Workshop on Algorithms and Models for the Web Graph

Expansion and Lack Thereof in Randomly Perturbed Graphs

Web Structure in 2005

Local/Global Phenomena in Geometrically Generated Graphs

Approximating PageRank from In-Degree

Probabilistic Relation between In-Degree and PageRank

Communities in Large Networks: Identification and Ranking

Combating Spamdexing: Incorporating Heuristics in Link-Based Ranking

Traps and Pitfalls of Topic-Biased PageRank

A Scalable Multilevel Algorithm for Graph Clustering and Community Structure Detection

A Phrase Recommendation Algorithm Based on Query Stream Mining in Web Search Engines

Characterization of Graphs Using Degree Cores

Web Structure Mining by Isolated Stars

Representing and Quantifying Rank - Change for the Web Graph

Filter options

Publication date

Keywords

INFONA - science communication portal

Algorithms and Models for the Web-Graph Fourth International Workshop, WAW 2006, Banff, Canada, November 30 - December 1, 2006. Revised Papers $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

Algorithms and Models for the Web-Graph
Fourth International Workshop, WAW 2006, Banff, Canada, November 30 - December 1, 2006. Revised Papers