This paper presents a social computing tool that centers on social scientists. In recent years, we have worked with social scientists and cultural anthropologists, learning how they study subjects in social media, what their needs are, and where their interests lie. In the process, we have built a generic platform for collecting data in the blogosphere, tracking blogs of particular interest,...
In this paper we introduce a framework for semi- to fully-automatic discovery and acquisition of bag-of-words style interest profiles from openly accessible Social Web communities. To do so, we construct a semantic taxonomy search tree from the target domain (the domain for which the profiles are acquired), starting with generic concepts at the root down to specific instances at the leaves, then...
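As a rough illustration of the root-to-leaf acquisition idea in the abstract above, the sketch below walks a toy concept taxonomy and collects every concept label into a bag-of-words profile. The `toy_taxonomy` structure and the `collect_profile` helper are hypothetical stand-ins, not the paper's actual tree or algorithm.

```python
# A hypothetical taxonomy: generic concepts at the root, specific instances at leaves.
toy_taxonomy = {
    "music": {
        "genre": {"jazz": {}, "blues": {}},
        "instrument": {"guitar": {}, "saxophone": {}},
    }
}

def collect_profile(tree, profile=None):
    """Walk the tree from generic root concepts down to specific leaf
    instances, adding every concept label to a bag-of-words profile."""
    if profile is None:
        profile = set()
    for concept, children in tree.items():
        profile.add(concept)
        collect_profile(children, profile)
    return profile

print(sorted(collect_profile(toy_taxonomy)))
# ['blues', 'genre', 'guitar', 'instrument', 'jazz', 'music', 'saxophone']
```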
This article investigates the dynamic features of social tagging vocabularies in Delicious, Flickr and YouTube from 2003 to 2008. It analyzes the evolving usage of the most popular tags in each of these three social networks. We find that for different tagging systems, the dynamic features reflect different cognitive processes. At the macro level, tag growth obeys a power-law distribution...
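A power law f ∝ r^(−α) is linear in log-log space, so the exponent α can be estimated with ordinary least squares on the logged data. The sketch below does this for made-up (rank, frequency) tag counts; the `tag_counts` values are illustrative placeholders, not the paper's measurements.

```python
import math

# Hypothetical (rank, frequency) counts for the most popular tags.
tag_counts = [(1, 50000), (2, 26000), (5, 10500), (10, 5200), (50, 1100), (100, 540)]

# log f = log c - alpha * log r, so alpha is the negated slope in log-log space.
xs = [math.log(r) for r, _ in tag_counts]
ys = [math.log(f) for _, f in tag_counts]
n = len(xs)
mx, my = sum(xs) / n, sum(ys) / n
slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
print(f"estimated power-law exponent alpha ~= {-slope:.2f}")
```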
Vertical search engines use focused crawlers as their key component and develop specific algorithms to select web pages relevant to a pre-defined set of topics. Effectively building a semantic pattern for specific topics is therefore extremely important to such search engines. Crawlers are software that traverse the Internet and retrieve web pages by following hyperlinks. Here we propose an...
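One common way a focused crawler scores a fetched page against its pre-defined topic is cosine similarity between the page's term counts and a topic keyword vector. The sketch below shows that generic technique, not the specific algorithm this paper proposes; the sample text and keywords are placeholders.

```python
import math
import re
from collections import Counter

def relevance(page_text, topic_keywords):
    """Cosine similarity between a page's term counts and a topic keyword
    vector: a standard relevance signal for topic-focused crawling."""
    terms = Counter(re.findall(r"[a-z]+", page_text.lower()))
    topic = Counter(k.lower() for k in topic_keywords)
    dot = sum(terms[t] * topic[t] for t in topic)
    norm = (math.sqrt(sum(v * v for v in terms.values()))
            * math.sqrt(sum(v * v for v in topic.values())))
    return dot / norm if norm else 0.0

print(relevance("Jazz guitar lessons and jazz theory", ["jazz", "guitar"]))
```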
A semantic focused crawler is an important part of a semantic vertical search engine. It is receiving increasing attention as a well-founded approach to the problem of locating topical resources on the entire Web. To retrieve documents related to a given topic, in this paper we propose the QBLP algorithm, which enables the crawler to adapt to a changing environment. This feature makes...
Digital library users might not enter a digital library through homepage menus. As a result, digital library owners should consider the visibility of stored PDF documents to search engines. The aim of this research project was to determine to what extent the visibility of these PDF documents can be improved. In a series of empirical experiments, 100 PDF documents stored in digital libraries were identified...
Although intensive research has been performed on P2P network measurement, it is still unknown to what extent the measurement system influences the final measurement results. As an initial study, we investigated the influence of a measurement system on the degree distribution of a P2P network. Theoretical analysis and simulation results suggest an interesting phase-transition phenomenon when...
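To make the measurement-bias point concrete, the sketch below builds a toy random overlay, crawls only a bounded portion of it, and compares the true degree counts with the ones visible to the crawler. The graph model, crawl budget, and all parameters are arbitrary assumptions, not the paper's experimental setup.

```python
import random
from collections import Counter, deque

random.seed(0)

# Toy P2P overlay: a uniform random graph over 1000 peers.
n, m = 1000, 4000
edges = set()
while len(edges) < m:
    a, b = random.sample(range(n), 2)
    edges.add((min(a, b), max(a, b)))
adj = {i: set() for i in range(n)}
for a, b in edges:
    adj[a].add(b)
    adj[b].add(a)

def observed_degrees(start, budget):
    """BFS-style crawl that stops after `budget` discovered peers, mimicking
    a measurement system that only ever sees part of the network."""
    seen, queue = {start}, deque([start])
    while queue and len(seen) < budget:
        node = queue.popleft()
        for nb in adj[node]:
            if nb not in seen:
                seen.add(nb)
                queue.append(nb)
    # Degrees are counted only among crawled peers, biasing the distribution.
    return Counter(len(adj[v] & seen) for v in seen)

print("true degrees:    ", Counter(len(adj[v]) for v in adj).most_common(3))
print("measured degrees:", observed_degrees(0, 200).most_common(3))
```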
The World Wide Web is an interlinked collection of billions of documents formatted using HTML. Ironically, the very size of this collection has become an obstacle to information retrieval. The user has to sift through scores of pages to come upon the information he or she desires. Web crawlers are the heart of search engines: they continuously crawl the Web and find any new web pages...
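The basic crawl loop this abstract describes (fetch a page, extract its hyperlinks, enqueue the unseen ones) can be sketched with the standard library alone. A minimal sketch, assuming a well-behaved seed URL; the seed, page limit, and timeout below are placeholders, and a real crawler would also honor robots.txt and rate limits.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen

class LinkExtractor(HTMLParser):
    """Collect href targets from <a> tags."""
    def __init__(self):
        super().__init__()
        self.links = []
    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

def crawl(seed, max_pages=10):
    """Breadth-first crawl: fetch a page, extract hyperlinks, enqueue the
    ones not seen before -- the loop that keeps discovering new pages."""
    seen, frontier = {seed}, deque([seed])
    while frontier and len(seen) <= max_pages:
        url = frontier.popleft()
        try:
            html = urlopen(url, timeout=5).read().decode("utf-8", "replace")
        except OSError:
            continue  # unreachable page; move on
        parser = LinkExtractor()
        parser.feed(html)
        for link in parser.links:
            absolute = urljoin(url, link)
            if absolute.startswith("http") and absolute not in seen:
                seen.add(absolute)
                frontier.append(absolute)
    return seen

# crawl("https://example.com")  # placeholder seed URL
```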
Distributed Web crawling (DWC) over DHTs has been proposed to solve the bottlenecks of traditional Web crawling. The core of such a system is its fully distributed task-scheduling mechanism, in which the crawlers are treated as peers and the crawlees as resources maintained by the peers. A system model based on the content addressable network (CAN) can further optimize the scheduling...
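In a DHT-based scheduler, each URL hashes into a key space and the peer owning that key region is responsible for crawling it. The sketch below simplifies CAN's d-dimensional coordinate space to a 1-D hash taken modulo the peer count; the peer list is hypothetical and this is not the paper's scheduling model.

```python
import hashlib

PEERS = ["peer-0", "peer-1", "peer-2", "peer-3"]  # hypothetical peer ids

def responsible_peer(url):
    """Map a URL (the 'crawlee') to the peer that owns its key.
    A real CAN partitions a d-dimensional coordinate space among peers;
    hashing into a 1-D space modulo the peer count is a simplification."""
    key = int(hashlib.sha1(url.encode("utf-8")).hexdigest(), 16)
    return PEERS[key % len(PEERS)]

for u in ["http://a.example/x", "http://b.example/y"]:
    print(u, "->", responsible_peer(u))
```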
Websites, identified by their URLs, are large collections of Web pages. Together they form a huge database of heterogeneous information gathered in a distributed fashion. The accumulated information is differentiated on the basis of certain templates, the pages' URLs, and the information they contain. In this research, we concentrate mainly on Web forums. In the current circumstances, a Web crawler crawls all the...
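Differentiating forum pages by URL template can be as simple as matching each URL against a small set of patterns, so a crawler fetches thread pages and skips the rest. The regexes below imitate phpBB-style URLs and are assumptions for illustration; they are not taken from the paper.

```python
import re

# Hypothetical URL templates; real forum software exposes similarly
# regular patterns for boards, threads, and user profiles.
TEMPLATES = {
    "thread": re.compile(r"/viewtopic\.php\?t=\d+"),
    "board":  re.compile(r"/viewforum\.php\?f=\d+"),
    "user":   re.compile(r"/memberlist\.php\?mode=viewprofile"),
}

def classify(url):
    """Differentiate forum pages by matching the URL against known templates."""
    for kind, pattern in TEMPLATES.items():
        if pattern.search(url):
            return kind
    return "other"

print(classify("http://forum.example/viewtopic.php?t=42"))  # -> thread
```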
The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. Crawling the Web quickly and entirely is an expensive, unrealistic goal because of the required hardware and network resources. A focused crawler is an agent that targets a particular topic and visits and gathers only a relevant, narrow Web segment while trying not to waste...
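A focused crawler of the kind this abstract describes typically keeps a best-first frontier: a priority queue ordered by estimated relevance, so only promising links are expanded and irrelevant regions of the Web are never visited. A minimal sketch, assuming the caller supplies the relevance scores; the URLs and scores below are placeholders.

```python
import heapq

class Frontier:
    """Best-first crawl frontier. heapq is a min-heap, so scores are
    negated to pop the highest-relevance URL first."""
    def __init__(self):
        self._heap = []
        self._seen = set()

    def push(self, url, score):
        if url not in self._seen:
            self._seen.add(url)
            heapq.heappush(self._heap, (-score, url))

    def pop(self):
        neg_score, url = heapq.heappop(self._heap)
        return url, -neg_score

frontier = Frontier()
frontier.push("http://a.example", 0.2)  # placeholder scores
frontier.push("http://b.example", 0.9)
print(frontier.pop())  # ('http://b.example', 0.9)
```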