Search results for: Yu Wang

Items from 1 to 7 out of 7 results

article

Internet Traffic Classification Using Constrained Clustering

Yu Wang, Yang Xiang, Jun Zhang, Wanlei Zhou, et al.

IEEE Transactions on Parallel and Distributed Systems > 2014 > 25 > 11 > 2932 - 2943

Statistics-based Internet traffic classification using machine learning techniques has attracted extensive research interest lately, because of the increasing ineffectiveness of traditional port-based and payload-based approaches. In particular, unsupervised learning, that is, traffic clustering, is very important in real-life applications, where labeled training data are difficult to obtain and new...

chapter

Network traffic clustering using Random Forest proximities

Yu Wang, Yang Xiang, Jun Zhang

2013 IEEE International Conference on Communications (ICC) > 2058 - 2062

ICC 2013 - 2013 IEEE International Conference on Communications

Recent years have seen extensive work on statistics-based network traffic classification using machine learning (ML) techniques. In the particular scenario of learning from unlabeled traffic data, some classic unsupervised clustering algorithms (e.g. K-Means and EM) have been applied, but the reported accuracy is unsatisfactorily low. This paper presents a novel approach for...

chapter

Deep Web Entity Identification Method Based on Improved Jaccard Coefficients

Yu Wang, Ying-hua Li

2009 International Conference on Research Challenges in Computer Science > 112 - 115

2009 International Conference on Research Challenges in Computer Science (ICRCCS 2009)

There are a large number of accessible deep Web sites on the Internet. However, an identical entity may have different representation formats on different Web sites, so entity identification plays a crucial role in deep Web data mining. This paper proposes an entity identification method in the field of Chinese books. First, using improved Jaccard coefficients to calculate similarity of text attributes...

chapter

A clustering algorithm based on latent semantic model

Bu-Yu Wang, Mei-An Li, Yong-Jiang Wang

2009 International Conference on Apperceiving Computing and Intelligence Analysis > 44 - 48

2009 International Conference on Apperceiving Computing and Intelligence Analysis (ICACIA 2009)

In order to precisely obtain information about Chinese persons on the web, and especially to distinguish between namesakes, this paper proposes a clustering algorithm based on a latent semantic model. For every document, it establishes a latent semantic model of a sentence-word matrix based on central distance, central segment, document length, etc., by building a central word library of person attributes. It clusters...

chapter

Mining the hottest topics on Chinese webpage based on the improved k-means partitioning

Yu Wang, Ya-Hui Xi, Liang Wang

2009 International Conference on Machine Learning and Cybernetics > 1 > 255 - 260

2009 Eighth International Conference on Machine Learning and Cybernetics (ICMLC)

This paper presents a new method for mining the hottest topics on Chinese Web pages, based on the improved k-means partitioning algorithm. The dictionary applied to word segmentation is reduced by deleting words which are useless for clustering, and a dictionary tree is created for word segmentation, which improves the speed of word segmentation. Correspondence between...

chapter

Facilitating wrapper generation with page analysis

Bo Wu, Xueqi Cheng, Yu Wang, Gang Zhang, et al.

2009 IEEE International Conference on Intelligence and Security Informatics > 191 - 193

2009 IEEE International Conference on Intelligence and Security Informatics (ISI)

Current approaches for generating wrappers for web page extraction require a huge number of labeled training pages to obtain satisfactory results. On the other hand, the quality of data extracted by fully automatic methods is not reliable. In this paper, we propose a novel method to facilitate wrapper generation by combining wrapper induction and page analysis approaches. In addition...

chapter

Move Statistics-Based Traffic Classifiers Online

Yu Wang, Shun-Zheng Yu

2008 International Conference on Computer Science and Software Engineering > 4 > 721 - 725

2008 International Conference on Computer Science and Software Engineering (CSSE 2008)

A number of recent works have proposed using data mining and machine learning techniques to classify traffic flows based on statistical flow characteristics. Most of these classifiers work offline, since full-flow statistics are not available until a flow is finished. Therefore, it is usually too late to take actions for online deployment. In this paper, we propose a simple and effective technique...

Keywords

DATA MINING (5)
CLUSTERING ALGORITHMS (4)
MACHINE LEARNING (4)
CLUSTERING (2)
IP NETWORKS (2)
NATURAL LANGUAGE PROCESSING (2)
STATISTICAL ANALYSIS (2)
TRAFFIC ANALYSIS (2)
ADAPTATION MODELS (1)
AHP (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ALGORITHMS (1)
ANALYTIC HIERARCHY PROCESS (1)
BOOKS (1)
CENTER WORD DISTANCE (1)
CENTRAL DISTANCE (1)
CENTRAL SEGMENT (1)
CENTRAL WORD LIBRARY (1)
CENTRAL WORD POSITION (1)
CENTRAL WORD SET (1)
CHINESE BOOKS (1)
CHINESE PERSON INFORMATION (1)
CHINESE WEB (1)
CLASSIFICATION ALGORITHMS (1)
CLASSIFICATION TREE ANALYSIS (1)
COMPUTERS (1)
DATA MODELS (1)
DATA STREAM (1)
DATABASES (1)
DECISION MAKING (1)
DECISION TREES (1)
DEEP WEB (1)
DEEP WEB DATA MINING (1)
DEEP WEB ENTITY IDENTIFICATION METHOD (1)
DEEP WEB SITES (1)
DICTIONARIES (1)
DICTIONARY TREE (1)
DOCUMENT HANDLING (1)
DOCUMENT LENGTH (1)
DOCUMENTS CLUSTERING (1)
DYNAMIC-EXTENDING CLUSTERING ALGORITHM (1)
ENTITY IDENTIFICATION (1)
HEURISTIC ALGORITHMS (1)
HOTTEST TOPICS MINING (1)
HUMANS (1)
IDENTIFICATION (1)
IMPROVED JACCARD COEFFICIENTS (1)
INFORMATION ANALYSIS (1)
INFORMATION RETRIEVAL (1)
INFORMATION EXTRACTION (1)
JACCARD COEFFICIENTS (1)
K-MEANS PARTITIONING (1)
LABELED TRAINING PAGES (1)
LATENT SEMANTIC MODEL (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LIBRARIES (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MINING THE HOTTEST TOPICS (1)
NETWORK SECURITY (1)
PAGE ANALYSIS (1)
PARTITIONING ALGORITHMS (1)
PATTERN CLUSTERING (1)
PEDIATRICS (1)
PERSON ATTRIBUTES (1)
PRESSES (1)
RADIO FREQUENCY (1)
SENSITIVITY (1)
SENTENCE-WORD MATRIX (1)
SKELETON (1)
STATISTICAL FLOW (1)
TELECOMMUNICATION TRAFFIC (1)
TEXT ANALYSIS (1)
TEXT ATTRIBUTES (1)
TRAFFIC CLASSIFICATION (1)
TRAFFIC CLASSIFIERS ONLINE (1)
TRAFFIC CONTROL (1)
TRAFFIC FLOWS (1)
TRAFFIC PATTERNS (1)
TRAINING (1)
TRANSDUCERS (1)
TREES (MATHEMATICS) (1)
UNSUPERVISED LEARNING (1)
WEB MINING (1)
WEB PAGE EXTRACTION (1)
WEB PAGES (1)
WORD PROCESSING (1)
WORD SEGMENTATION (1)
WORLD WIDE WEB (1)
WRAPPER (1)
WRAPPER GENERATION (1)

INFONA - science communication portal
