Search results

Items from 1 to 11 out of 11 results

chapter

Research and Analysis of Search Engine Optimization Factors Based on Reverse Engineeing

Cen Zhu, Guixing Wu

2011 Third International Conference on Multimedia Information Networking and Security > 225 - 228

2011 3rd International Conference on Multimedia Information Networking and Security (MINES)

Search engine optimization (SEO) is a process of improving the prominence of a website. Following a reverse engineering approach, in this paper, we study and analyze the key influence factors in the process of web search. We firstly build a system to automatically crawl all factors of 200 thousand web pages. Then we make a content analysis including Page Rank, URL and HTML analysis based on top 20...

chapter

Functional-Based Table Category Identification in Digital Library

Seongchan Kim, Ying Liu

2011 International Conference on Document Analysis and Recognition > 1364 - 1368

2011 International Conference on Document Analysis and Recognition (ICDAR)

Better understanding the document logical components is crucial to many applications, e.g., document classification or data integration. As the development of digital libraries, more people realize the importance of the scientific tables, which contain valuable information concisely. Although tons of previous table works focus on table data extraction, few concrete works on understanding and utilizing...

article

Web Spam Detection: New Classification Features Based on Qualified Link Analysis and Language Models

Lourdes Araujo, Juan Martinez-Romo

IEEE Transactions on Information Forensics and Security > 2010 > 5 > 3 > 581 - 590

Web spam is a serious problem for search engines because the quality of their results can be severely degraded by the presence of this kind of page. In this paper, we present an efficient spam detection system based on a classifier that combines new link-based features with language-model (LM)-based ones. These features are not only related to quantitative data extracted from the Web pages, but also...

chapter

The URL Search Strategy Based on the Content and Link Analysis

Cailan Zhou, Xuan Sun, Hongjie Guo

2009 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2009 International Conference on Computational Intelligence and Software Engineering

The Web information which influences the topic relevance of URL is analyzed based on the research of the search strategy about the crawler. On this basis, a new URL search algorithm based on the content and link analysis is supplied to us. The experimental results show that the algorithm not only can solve the problem of topic isolated island to increase recall, but also can avoid the phenomenon of...

chapter

Research on Social Network Based on Meta-search Engine

Shen Yang, Liu Zi-tao, Luo Cheng, Li Ye

2009 Sixth Web Information Systems and Applications Conference > 179 - 183

2009 Sixth Web Information Systems and Applications Conference (WISA 2009)

In order to solve the problem that we can only collect data from one single data source at some fixed time after mining the keywords in a rather superficial level, and to take full use of the information returned by search engines to construct the social relationship network based on the semantic link of the searched subject, we do the regular research by using the ROST Content Mining System which...

chapter

An intelligent surfer model combining web contents and links based on simultaneous multiple-term query

B. Frikh, A.S. Djanfar, B. Ouhbi

2009 IEEE/ACS International Conference on Computer Systems and Applications > 543 - 545

2009 7th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA-2009)

The PageRank algorithm, proposed by [Page et al., 1998] is used in the Google search engine to improve the results of requests by taking into account the link structure of the Web. PageRank give the same weight to all pages that is the surfer model is proposed using a uniform distribution. Richardson and Domingoshave proposed a more interesting and intelligent surfer model combining the link and content...

chapter

Characterizing comment spam in the blogosphere through content analysis

A. Bhattarai, V. Rus, D. Dasgupta

2009 IEEE Symposium on Computational Intelligence in Cyber Security > 37 - 44

2009 IEEE Symposium on Computational Intelligence in Cyber Security

Spams are no longer limited to emails and Web-pages. The increasing penetration of spam in the form of comments in blogs and social networks has started becoming a nuisance and potential threat. In this work, we explore the challenges posed by this type of spam in the blogosphere with substantial generalization regarding other social media. Thus, we investigate the characteristics of comment spam...

chapter

HAWK: A Focused Crawler with Content and Link Analysis

Xiaoyun Chen, Xin Zhang

2008 IEEE International Conference on e-Business Engineering > 677 - 680

2008 IEEE International Conference on e-Business Engineering

Maintaining currency of search engine indices by exhaustive crawling is rapidly becoming impossible due to the increasing size of the web. Focused crawlers aim to search only the subset of the web related to a specific topic, and offer a potential solution to the problem. But it also has problems. The major problem is how to retrieve the maximal set of relevant and quality pages. To address this problem...

chapter

Extracting Social Network among Various Entities from Chinese News Stories by Content Analysis

Weijie Yang, Ruwei Dai, Xia Cui

2008 32nd Annual IEEE International Computer Software and Applications Conference > 929 - 934

2008 IEEE 32nd International Computer Software and Applications Conference (COMPSAC)

Social networks have recently attracted much attention for their importance to the semantic Web. Several methods exist to extract social networks for people from the Web based on co-occurrence information. This paper proposed a content analysis based method for automatic obtaining social networks among various entities from Chinese event-based news stories. First, the input articles are annotated...

chapter

Automatic New Topic Identification in Search Engine Transaction Logs Using Multiple Linear Regression

S. Ozmutlu, H. Cenk Ozmutlu, A. Spink

Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008) > 140

2008 41st Annual Hawaii International Conference on System Sciences

Content analysis of search engine user queries is an important task for search engine research, and identification of topic changes within a user search session is a key issue in content analysis of search engine user queries. The purpose of this study is to provide automatic new topic identification of search engine query logs, and estimate the effect of statistical characteristics of search engine...

chapter

LET: Towards More Precise Clustering of Search Results

Yi Zhang, Lidong Bing, Yexin Wang, Yan Zhang

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 2 > 385 - 389

2007 International Conference on Fuzzy Systems and Knowledge Discovery

Web users are always distracted by a large number of results returned from search engines. Clustering can efficiently facilitate users' browsing pages of certain topic. However, most traditional clustering methods are based on either content analysis or link analysis alone, which appears unilateral. In this paper, we propose an expanding clustering idea with the reasonable combination of content and...

Filter options

Keywords:
SEARCH ENGINES

Publication date

Set your own date range

Publication type

book (10)
article (1)

Keywords

DATA MINING (7)
INTERNET (5)
LINK ANALYSIS (4)
WEB PAGES (4)
ALGORITHM DESIGN AND ANALYSIS (3)
FEATURE EXTRACTION (3)
SOCIAL NETWORK (3)
APPROXIMATION ALGORITHMS (2)
CONTENT MANAGEMENT (2)
CRAWLERS (2)
ELECTRONIC MAIL (2)
GOOGLE (2)
PREDICTION ALGORITHMS (2)
QUERY PROCESSING (2)
SEARCH ENGINE (2)
SOCIAL NETWORK SERVICES (2)
SOFTWARE ALGORITHMS (2)
UNSOLICITED E-MAIL (2)
WEB PAGE (2)
WEB SITES (2)
ADAPTIVE ALGORITHM (1)
APPROXIMATION METHODS (1)
AUTOMATIC NEW TOPIC IDENTIFICATION (1)
BAIDU (1)
BIOLOGICAL SYSTEM MODELING (1)
BLOGOSPHERE (1)
BLOGS (1)
CHINESE EVENT-BASED NEWS STORIES (1)
CO-OCCURRENCE INFORMATION (1)
COHERENCE (1)
COMMENT SPAM (1)
CONTENT (1)
CONTENT MINING (1)
CONTENT MINING SYSTEM (1)
CRAWLER (1)
CROSS PAGE FRAMEWORK ADAPTIVE ALGORITHM (1)
DIRECTED GRAPH EXPRESSION (1)
DIRECTED GRAPHS (1)
DOCUMENT TABLE (1)
EMAIL (1)
ENGINES (1)
FAST SEARCH ENGINE (1)
FILTERING (1)
FILTRATION (1)
FOCUSED CRAWLER (1)
FUNCTION-BASED CLASSIFICATION (1)
GOOGLE SEARCH ENGINE (1)
HAWK (1)
HIDDEN MARKOV MODELS (1)
HTML (1)
HTML FRAMEWORK CODES INSTABILITY (1)
HYPERLINK (1)
HYPERMEDIA MARKUP LANGUAGES (1)
INDEXING (1)
INFORMATION RETRIEVAL (1)
INFORMATION SERVICES (1)
INSTRUMENTS (1)
INTELLIGENT SURFER MODEL (1)
KEYWORD (1)
KEYWORDS ATTRIBUTE SET (1)
KULLBACK-LEIBLER DIVERGENCE (1)
LANGUAGE MODEL (1)
LANGUAGE MODELS (LMS) (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEXICAL ANALYSIS (1)
LIBRARIES (1)
LINK INTEGRITY (1)
LINK STRUCTURE (1)
MAIN VERB (1)
MAIN VERBS RECOGNITION (1)
MERGING (1)
META SEARCH ENGINE (1)
META-SEARCH ENGINE (1)
METACOMPUTING (1)
METASEARCH (1)
MOROCCAN MINISTRY TOURISM WEB (1)
MULTIPLE LINEAR REGRESSION (1)
NAMED ENTITY (1)
OPTIMIZATION (1)
PAGE MONITORING (1)
PAGERANK ALGORITHM (1)
PATTERN CLASSIFICATION (1)
PATTERN CLUSTERING (1)
PEDIATRICS (1)
PORTABLE DOCUMENT FORMAT (1)
PROBABILISTIC LOGIC (1)
PROBABILITY DISTRIBUTION (1)
PROGRESSIVE ALGORITHM (1)
PROGRESSIVE SEARCH ALGORITHM (1)
QUALIFIED LINK ANALYSIS (1)
QUERY CLUSTERING (1)
QUERY FORMULATION (1)
REAL-TIME NETWORK (1)
REDUNDANCY (1)
REGRESSION ANALYSIS (1)
RELEVANCE FEEDBACK (1)
RELEVANCE MEASURE (1)
REVERSE ENGINEERING (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options