Search results

Items from 1 to 20 out of 43 results

chapter

Digging for Diamonds: Identifying Valuable Web Automation Programs in Repositories

J Jackson, C Scaffidi, K T Stolee

2011 International Conference on Information Science and Applications > 1 - 10

2011 International Conference on Information Science and Applications (ICISA 2011)

Web automation programs offer a means for users to enhance the usability of the web. These programs can be published on a wiki or other repository, thereby making them available for use by other users. However, in addition to programs of broad usefulness to the community at large, these repositories also contain many programs that are unreliable or highly specialized to the needs of very small sub-...

chapter

Content-Based Methods for Predicting Web-Site Demographic Attributes

Santosh Kabbur, Eui-Hong Han, George Karypis

2010 IEEE International Conference on Data Mining > 863 - 868

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

Demographic information plays an important role in gaining valuable insights about a web-site's user-base and is used extensively to target online advertisements and promotions. This paper investigates machine-learning approaches for predicting the demographic attributes of web-sites using information derived from their content and their hyper linked structure and not relying on any information directly...

chapter

Augmenting Chinese Online Video Recommendations by Using Virtual Ratings Predicted by Review Sentiment Classification

Weishi Zhang, Guiguang Ding, Li Chen, Chunping Li

2010 IEEE International Conference on Data Mining Workshops > 1143 - 1150

2010 10th IEEE International Conference on Data Mining Workshops (ICDMW 2010)

In this paper we aim to resolve the recommendation problem by using virtual ratings in online environments when user rating information is not available. In fact, on most current websites, especially Chinese video-sharing ones, traditional purely rating-based collaborative filtering methods are not fully adequate because of the sparsity of rating data. Motivated by...

chapter

Harvesting Web Images for Realistic Facial Expression Recognition

Kaimin Yu, Zhiyong Wang, Li Zhuo, Dagan Feng

2010 International Conference on Digital Image Computing: Techniques and Applications > 516 - 521

2010 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2010)

A large amount of labeled training data is required to develop robust and effective facial expression analysis methods. However, obtaining such data is typically a tedious and time-consuming task that is proportional to the size of the database. Due to the rapid advance of Internet and Web technologies, it is now feasible to collect a tremendous number of images with potential label information at a...

chapter

Using empirical risk minimization to detect community structure in the blogosphere

Jiaxuan Huang, Hongsen Huang

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering > 418 - 421

2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2010)

When dealing with community structure detection in the blogosphere, we face some obstacles. The data in a blog may be updated frequently by its owner, making the whole blogosphere grow very large in a short period of time. It can be very expensive to process such a huge amount of data using traditional methods. Meanwhile, few blogs in the blogosphere can be identified...

chapter

Functionalities for Blog Conversation: An Investigation about the Use of Quote and Reply

A de Miranda Marques, M Pimentel, S Siqueira

2010 Brazilian Symposium on Collaborative Systems - Simposio Brasileiro de Sistemas Colaborativos > 9 - 16

2010 VII Brazilian Symposium on Collaborative Systems (SBSC 2010)

The blog is featured as a communication system for dissemination of information and expression of opinions. Is the blog a system suitable for collaboration? The research presented in this paper investigates the use of citation and response to enable a conversation on the blog. From analysis of the discourse structuring and the possibilities of relationship among participants, were developed some research...

chapter

A Machine Learning Based Language Specific Web Site Crawler

P Tadapak, T Suebchua, A Rungsawang

2010 13th International Conference on Network-Based Information Systems > 155 - 161

13th International Conference on Network-Based Information Systems (NBiS 2010)

We propose an approach for gathering web pages written in a specific language. The approach consists of a language predictor and a web site crawler. The language predictor is a machine learning based component that can learn from an example host graph some characteristics of relevant hosts, and is used to calculate the language degree of a web server whether it has a high probability to serve web...

chapter

Blog extraction with template-independent wrapper

Zhixuan Zhang, Chuang Zhang, Zhiqing Lin, Bo Xiao

2010 2nd IEEE International Conference on Network Infrastructure and Digital Content > 313 - 317

2010 2nd IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2010)

Rich information is contributed to blogs by millions of users all around the world with the development of the blogosphere. However, little work has been done on the study of blog extraction so far. Unlike the traditional template-dependent wrapper, not only blog articles but also the blogroll is extracted with a template-independent wrapper in this paper. In our method, blog extraction is formalized as a machine...

chapter

Measuring Similarity between Sets of Overlapping Clusters

Mark K Goldberg, Mykola Hayvanovych, Malik Magdon-Ismail

2010 IEEE Second International Conference on Social Computing > 303 - 308

2010 IEEE Second International Conference on Social Computing (SocialCom 2010) and the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

The typical task of unsupervised learning is to organize data, for example into clusters, typically disjoint clusters (e.g., the K-means algorithm). One would expect (for example) a clustering of books into topics to present overlapping clusters. The situation is even more so in social networks, a source of ever increasing data. Finding the groups or communities in social networks based on interactions...

chapter

Learning Influence Propagation of Personal Blogs with Content and Network Analyses

Il-Chul Moon, Dongwoo Kim, Yohan Jo, A H Oh

2010 IEEE Second International Conference on Social Computing > 669 - 674

2010 IEEE Second International Conference on Social Computing (SocialCom 2010) and the Second IEEE International Conference on Privacy, Security, Risk and Trust (PASSAT 2010)

Weblogs (blogs) serve as a gateway to a large blog reader population, so blog authors can potentially influence a large reader population by expressing their thoughts and expertise in their blog posts. An important and complex problem, then, is figuring out why and how influence propagates through the blogosphere. While a number of previous studies have looked at the network characteristics of blogs...

chapter

A comparative study of machine learning techniques in blog comments spam filtering

C Romero, M G Valdez, A Alanis

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 7

2010 International Joint Conference on Neural Networks (IJCNN 2010)

In this paper we compare four machine learning techniques for blog comment spam filtering. The machine learning techniques are Naïve Bayes, K-nearest neighbor, neural networks, and support vector machines. For this comparative study we used a blog comment corpus that has been affected by spam, which is our case study in this work. We classify the comments of this blog comment corpus, which...

chapter

YouTubeCat: Learning to categorize wild web videos

Zheshen Wang, Ming Zhao, Yang Song, S Kumar, et al.

2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition > 879 - 886

2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages...

chapter

Web Tracking Site Detection Based on Temporal Link Analysis

Akira Yamada, Hara Masanori, Yutaka Miyake

2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops > 626 - 631

2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops (WAINA 2010)

Web tracking sites or Web bugs are potential but serious threats to users' privacy during Web browsing. Web sites and their associated advertising sites surreptitiously gather the profiles of visitors and possibly abuse or improperly expose them, even if visitors do not provide their profiles consciously. In order to prevent such activities in a corporate network, most companies employ filters that...

chapter

A dynamic web recommender system based on cellular learning automata

Mojdeh Talabeigi, Rana Forsati, Mohammad Reza Meybodi

2010 2nd International Conference on Computer Engineering and Technology > 7 > V7-755 - V7-761

2010 2nd International Conference on Computer Engineering and Technology (ICCET)

Different Web recommendation systems have been proposed to address the problem of information overload on the Internet. They attempt to guide users toward interesting and useful items in a large information space. They anticipate the information needs of on-line users and provide them with recommendations to facilitate and personalize their navigation. There are many approaches to building such systems,...

chapter

Web Mining: Key Accomplishments, Applications and Future Directions

Mahesh Thylore Ramakrishna, Latha Kolal Gowdar, Malatesh Somashekar Havanur, Banur Puttappa Mallikarjuna Swamy

2010 International Conference on Data Storage and Data Engineering > 187 - 191

2010 International Conference on Data Storage and Data Engineering (DSDE 2010)

The World-Wide Web provides every internet citizen with access to an abundance of information, but it becomes increasingly difficult to identify the relevant pieces of information. Research in web mining tries to address this problem by applying techniques from data mining and machine learning to Web data and documents. The Web Mining is an application of Data Mining. Without the internet, life would...

chapter

Web wrapper generation using tree alignment and transfer learning

Yingju Xia, Shu Zhang, Hao Yu

The 2nd International Conference on Software Engineering and Data Mining > 410 - 415

2nd International Conference on Software Engineering and Data Mining (SEDM 2010)

This paper studies wrapper generation for the web pages of forum, blog, and news web sites. More and more web pages are dynamically generated using a common template populated with data from databases. This paper proposes a novel method that uses tree alignment and transfer learning to generate a wrapper from this kind of web page. We present a new tree alignment algorithm to find...

chapter

Emulating XML and Online Algorithms with Sangu

Dongmei Yan

2010 Second International Conference on Computer Modeling and Simulation > 3 > 145 - 148

2010 Second International Conference on Computer Modeling and Simulation (ICCMS 2010)

In recent years, much research has been devoted to the investigation of emulating XML; on the other hand, few have refined the essential unification of Byzantine fault tolerance and write-ahead logging. After years of key research into the World Wide Web, we argue the online algorithms, which embodies the natural principles of machine learning. Sangu, our new methodology for autonomous configurations,...

chapter

Large Scale Relation Acquisition Using Class Dependent Patterns

S. De Saeger, K. Torisawa, J. Kazama, K. Kuroda, et al.

2009 Ninth IEEE International Conference on Data Mining > 764 - 769

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

This paper proposes a minimally supervised method for acquiring high-level semantic relations such as causality and prevention from the Web. Our method learns linguistic patterns that express causality such as "x gave rise to y", and uses them to extract causal noun pairs like (global warming, malaria epidemic) from sentences like "global warming gave rise to a new malaria epidemic". The novelty...

chapter

A Modified System for Weblog Topic Relevance Retrieval

Si Li, Lei Du, Weiran Xu, Jun Guo

2009 Second International Conference on Future Information Technology and Management Engineering > 392 - 395

2009 Second International Conference on Future Information Technology and Management Engineering (FITME 2009)

Weblogs are widely used, and the number of users is increasing rapidly. Weblogs reflect every aspect of society, such as politics, economy, and culture, so research on topic relevance retrieval for weblogs has become necessary. Because the corpus contains a lot of noise and it is usually difficult to obtain an appropriate query, common methods sometimes fail to reach acceptable precision. We design...

chapter

Machine Learning Approaches for Mood Classification of Songs toward Music Search Engine

Trung-Thanh Dang, K. Shirai

2009 International Conference on Knowledge and Systems Engineering > 144 - 149

2009 International Conference on Knowledge and Systems Engineering (KSE 2009)

People often want to listen to music that best fits their current emotion. A grasp of the emotions in songs might be a great help for us to discover music effectively. In this paper, we aimed at automatically classifying the moods of songs based on lyrics and metadata, and proposed several methods for supervised learning of classifiers. In the future, we plan to use automatically identified moods of songs as metadata...

Data set:
ieee
Keywords:
INTERNET
WEB SITES
LEARNING (ARTIFICIAL INTELLIGENCE)
Publication type:
book

Content availability

Available (42)
None (1)

Keywords

DATA MINING (20)
MACHINE LEARNING (16)
INFORMATION RETRIEVAL (10)
INFORMATION SERVICES (9)
TRAINING (9)
WEB PAGES (9)
ACCURACY (6)
FEATURE EXTRACTION (6)
SUPPORT VECTOR MACHINES (6)
WEB SITE (6)
WORLD WIDE WEB (6)
PREDICTIVE MODELS (5)
SEARCH ENGINES (5)
CLASSIFICATION ALGORITHMS (4)
CLUSTERING ALGORITHMS (4)
COMMUNITIES (4)
NATURAL LANGUAGE PROCESSING (4)
PATTERN CLASSIFICATION (4)
PATTERN CLUSTERING (4)
SERVERS (4)
TESTING (4)
TRAINING DATA (4)
WEB MINING (4)
BLOGS (3)
COMPUTERS (3)
DATABASES (3)
INFORMATION FILTERS (3)
OPINION MINING (3)
SENTIMENT ANALYSIS (3)
SOCIAL NETWORKING (ONLINE) (3)
TEXT ANALYSIS (3)
BAYES METHODS (2)
BELIEF NETWORKS (2)
BIOLOGICAL SYSTEM MODELING (2)
BLOGOSPHERE (2)
CLASSIFICATION (2)
COMPUTER SCIENCE (2)
CONDITIONAL RANDOM FIELD (2)
CONFERENCES (2)
CRAWLERS (2)
DATA EXTRACTION (2)
DATA MODELS (2)
DATA PREPROCESSING (2)
DOCUMENT HANDLING (2)
ELECTRONIC COMMERCE (2)
ENTROPY (2)
EQUATIONS (2)
GOOGLE (2)
GRAPH THEORY (2)
HTML (2)
HUMANS (2)
INFORMATION FILTERING (2)
KNOWLEDGE EXTRACTION (2)
LEARNING SYSTEMS (2)
MACHINE LEARNING ALGORITHMS (2)
MANUALS (2)
MATHEMATICAL MODEL (2)
MATRIX ALGEBRA (2)
MEASUREMENT (2)
MOBILE COMMUNICATION (2)
MOBILE COMPUTING (2)
MOBILE HANDSETS (2)
NEURAL NETS (2)
NEURAL NETWORKS (2)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (2)
PREFETCHING (2)
PROBABILITY DISTRIBUTION (2)
RANDOM PROCESSES (2)
RECOMMENDER SYSTEMS (2)
SECURITY OF DATA (2)
SET THEORY (2)
SOCIAL NETWORK (2)
SOCIAL NETWORK SERVICES (2)
STORAGE MANAGEMENT (2)
TEXT CATEGORIZATION (2)
VIDEO RETRIEVAL (2)
WEB BLOG (2)
WEB PAGE (2)
WEB RECOMMENDATION (2)
YOUTUBE (2)
3D DIRECTED GRAPHS (1)
3D GRAPH VISUALISATION (1)
3D GRAPHICS (1)
ACCIDENTS (1)
ACTIVE INFORMATION COLLECTION (1)
ACTIVE INFORMATION FILTERING TECHNOLOGY (1)
ACTIVE INFORMATION PUBLISHING (1)
ACTIVE INFORMATION SERVICE (1)
ACTIVE INFORMATION SERVICE SYSTEM (1)
ACTIVE LEARNING (1)
ACTIVE WEB PAGE PARSING (1)
ACTIVITY PROFILE MONITORING (1)
ADAPTIVE SYSTEMS (1)
ADMISSION CONTROL (1)
ADVERTISING DATA PROCESSING (1)
ADVERTISING WEB SITES (1)
AFFECT ANALYSIS (1)

INFONA - science communication portal
