White Paper


Whitepaper Diario 1

MALICIOUS PDF DOCUMENTS DETECTION USING MACHINE LEARNING TECHNIQUES.
A PRACTICAL APPROACH WITH CLOUD COMPUTING APPLICATIONS

This work aims to verify whether using Machine Learning techniques for malware detection in PDF documents with JavaScript embedded could result in an effective way to reinforce traditional solutions like antivirus, sandboxes, etc.

We have developed a base framework for malware detection in PDF files, specially designed for cloud computing services, that allows to analyse documents online without needing the document content itself, thus preserving privacy.

In this paper we will present the comparison results between different supervised machine learning algorithms in malware detection and a overall description of our classification framework.