Search results

Items from 1 to 11 out of 11 results

article

ALDOCX: Detection of Unknown Malicious Microsoft Office Documents Using Designated Active Learning Methods Based on New Structural Feature Extraction Methodology

Nir Nissim, Aviad Cohen, Yuval Elovici

IEEE Transactions on Information Forensics and Security > 2017 > 12 > 3 > 631 - 646

Attackers increasingly take advantage of innocent users who tend to casually open email messages assumed to be benign, carrying malicious documents. Recent targeted attacks aimed at organizations utilize the new Microsoft Word documents (*.docx). Anti-virus software fails to detect new unknown malicious files, including malicious docx files. In this paper, we present ALDOCX, a framework aimed at accurate...

chapter

Flexible document organization by mixing fuzzy and possibilistic clustering algorithms

Nilton V. Carvalho, Solange O. Rezende, Heloisa A. Camargo, Tatiane M. Nogueira

2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) > 790 - 797

2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

A powerful and flexible organization of documents can be obtained by mixing fuzzy and possibilistic clustering. In such organization, documents can belong to more than one cluster simultaneously with different compatibility degrees. Clusters represent topics, which are identified by one or more descriptors extracted by a proposed method. In this manuscript, we investigated whether or not the descriptors...

chapter

Document class recognition using a support vector machine approach

Jassem Mtimet, Hamid Amiri

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) > 161 - 166

2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)

In most document archiving systems, one of the main fields is to identify the category of documents. In most case, determination of the document category in archiving tasks requires the application of classification model, which have had successes in improving documents processing. However, concerns exploding the frequency of use of documents in many office managers have driven increasing interests...

chapter

Use of machine learning in big data analytics for insider threat detection

Michael Mayhew, Michael Atighetchi, Aaron Adler, Rachel Greenstadt

MILCOM 2015 - 2015 IEEE Military Communications Conference > 915 - 922

MILCOM 2015 - 2015 IEEE Military Communications Conference

In current enterprise environments, information is becoming more readily accessible across a wide range of interconnected systems. However, trustworthiness of documents and actors is not explicitly measured, leaving actors unaware of how latest security events may have impacted the trustworthiness of the information being used and the actors involved. This leads to situations where information producers...

chapter

Flexible document organization: Comparing fuzzy and possibilistic approaches

Tatiane M. Nogueira, Solange O. Rezende, Heloisa A. Camargo

2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) > 1 - 8

2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)

System flexibility means the ability of a system to manage imprecise and/or uncertain information. A lot of commercially available Information Retrieval Systems (IRS) address this issue at the level of query formulation. Another way to make the flexibility of an IRS possible is by means of the flexible organization of documents. Such organization can be carried out using clustering algorithms by which...

chapter

Enhancing the Filtering-Out of the Back-to-Front Interference in Color Documents with a Neural Classifier

G F P e Silva, R D Lins, J M Silva, S Banergee, more

2010 20th International Conference on Pattern Recognition > 2415 - 2419

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Back-to-front, show-through, or bleeding are the names given to the interference that appears whenever one writes or prints on both sides of translucent paper. Such interference degrades image binarization and document transcription via OCR. The technical literature presents several algorithms to remove the back-to-front noise, but no algorithm is good enough in all cases. This article presents a...

chapter

The extraction of people's features from documents

Wen Zhou, Yao Li, Ping Yi, Dong Xu, more

2010 The 2nd International Conference on Computer and Automation Engineering (ICCAE) > 4 > 278 - 280

2nd International Conference on Computer and Automation Engineering (ICCAE 2010)

In this paper we describe an approach of people's feature extraction from documents. We give the experiment results and the implement of the system.

chapter

A New Hybrid Farsi Text Summarization Technique Based on Term Co-Occurrence and Conceptual Property of the Text

A. Zamanifar, B. Minaei-Bidgoli, M. Sharifi

2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing > 635 - 639

2008 Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD)

The importance of text summarization grows rapidly as the amount of information increases exponentially. This paper presents a new hybrid summarization technique that combines statistical properties of documents with Farsi linguistic features. The originality of the technique lies on the use of term co-occurrence property of the text. It could detect the number of subjects. The proposed technique...

chapter

Chapter 17: Geometrical versus Non-geometrical Image Categorization Using Horizontal and Vertical Color Features

M.M. Hassan, T. Helmy, M. Sarfraz

2008 3rd International Conference on Geometric Modeling and Imaging > 102 - 107

3rd International Conference on Geometric Modeling and Imaging (GMAI 2008)

Nowadays, with the development of high quality graphical softwares, almost every presentation, in addition to text, contains some kind of images too. According to the presentation needs, different kinds of images are used by the presenters but different kinds of images needs different type of treatments which evolve the image categorization research. In our work we try to categorize images into two...

chapter

Automatic categorization of figures in scientific documents

Prasenjit Mitra, C. Giles, James Wang, Xiaonan Lu

Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '6) > 129 - 138

2006 IEEE/ACM 6th Joint Conference on Digital Libraries

Figures are very important non-textual information contained in scientific documents. Current digital libraries do not provide users tools to retrieve documents based on the information available within the figures. We propose architecture for retrieving documents by integrating figures and other information. The initial step in enabling integrated document search is to categorize figures into a set...

chapter

Research on Information Retrieval System Based on Semantic Web and Multi-Agent

Luo Junwei, Xue Xiao

2010 International Conference on Intelligent Computing and Cognitive Informatics > 207 - 209

2010 International Conference on Intelligent Computing and Cognitive Informatics (ICICCI 2010)

The Semantic Web and Multi-Agent are effective means for constructing information retrieval systems. Despite a great deal of research, a number of challenges still exist before making Semantic Web and agent-based computing a widely accepted in information retrieval practice. In order to solve the problem of "difficult to feedback useful information to users", the paper developed a new information...

Filter options

Keywords:
FEATURE EXTRACTION

Publication date

Set your own date range

Content availability

Available (10)
None (1)

Publication type

book (10)
article (1)

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options