Search results

chapter

Exploring linguistic features for extremist texts detection (on the material of Russian-speaking illegal texts)

Dmitry Devyatkin, Ivan Smirnov, Ananyeva Margarita, Kobozeva Maria, et al.

2017 IEEE International Conference on Intelligence and Security Informatics (ISI) > 188 - 190

In this paper, we present the results of research on automatic extremist text detection. For this purpose, an experimental dataset in Russian was created. Under Russian legislation, we cannot make it publicly available. We compared various classification methods (multinomial naive Bayes, logistic regression, linear SVM, random forest, and gradient boosting) and evaluated the contribution...

chapter

Iterative evolution of feature space in text classification

Liutao Zhao, Yitian Ren, Bo Yan

2015 8th International Congress on Image and Signal Processing (CISP) > 1210 - 1214

Natural language processing is an important part of data mining and matters greatly in the internet age. Feature extraction affects the accuracy of text classification. This paper proposes a method of iterative feature space evolution to optimize the result. By adjusting the extended dictionary and the stop word list, we repeatedly optimize the feature space to obtain a better classifier model. The final...

chapter

Evaluation of text classification techniques for inappropriate web content blocking

Igor Kotenko, Andrey Chechulin, Dmitry Komashinsky

2015 IEEE 8th International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 1 > 412 - 417

This paper addresses the automated categorization of textual information, which can be applied in systems intended to block inappropriate content. An approach to feature selection and construction is proposed. The text mining methods used in the research (Decision Tree classifiers) are analyzed. In addition, the techniques of Web site analysis that provide information in different...

chapter

Classifying Text with Statistically Selected Features to Closely Related Categories

M. Janaki Meena, K.R. Chandran

2009 International Conference on Advances in Recent Technologies in Communication and Computing > 297 - 301

2009 International Conference on Advances in Recent Technologies in Communication and Computing. ARTCom 2009

Text classification continues to be one of the most researched problems due to the continuously increasing amount of electronic documents and digital data. Classifying documents into closely related categories is the most complex task in text categorization. Feature selection is an essential preprocessing step for improving the efficiency and accuracy of text classifiers by removing redundant and...

INFONA - science communication portal
