This is a series of articles about file formats and related security issues. In 2003 I had presented an article in French about this subject at the SSTIC conference: [SSTIC03]. In the following articles I will summarize it in English and provide updated information about common file formats.
Here is a list of all articles and presentations I've published about IT Security so far.
This article describes the OpenDocument file format (ODF), related security issues and useful resources. [WORK IN PROGRESS]
For now, see http://www.decalage.info/opendocument_openxml
This article describes the Microsoft Office Open XML file formats (docx, xlsx, pptx), related security issues and useful resources. [WORK IN PROGRESS]
This article describes the Microsoft Office legacy/binary file formats (doc, xls, ppt), related security issues and useful resources. [WORK IN PROGRESS]
This article describes the PDF file format, related security issues and useful resources. [WORK IN PROGRESS]
This short article shows how ExeFilter can be used to disable JavaScript in PDF files, which is effective against many Adobe Reader exploits discovered in 2009, including the recent zero-day CVE-2009-4324.
OVALdi, also named the OVAL Interpreter, is an open-source tool developed by MITRE to demonstrate how the OVAL language may be used to scan a computer for vulnerabilities. This article provides a few hints about how to use this tool.
pdfid is a Python tool to analyze and sanitize PDF files, written by Didier Stevens. Here is a version that I have slightly modified to be imported as a module (originally for ExeFilter).
Origapy is a Python interface to Origami, a PDF parser written in Ruby. It provides access to pdfclean.rb, in order to sanitize PDF files by disabling all active content (javascript, launch actions, embedded files, etc).