pdfid is a Python tool to analyze and sanitize PDF files, written by Didier Stevens. Here is a version that I have slightly modified to be imported as a module (originally for ExeFilter).
Origapy is a Python interface to Origami, a PDF parser written in Ruby. It provides access to pdfclean.rb, in order to sanitize PDF files by disabling all active content (javascript, launch actions, embedded files, etc).
BlindFTP is a simple and portable tool for file transfers through unidirectional network connections (without acknowledge), for instance an optical network diode.
OleFileIO_PL is a Python module to read Microsoft OLE2 files (also called Structured Storage or Compound Document File Format), such as Microsoft Office documents, Image Composer and FlashPix files, Outlook messages, ... This is an improved version of the OleFileIO module from PIL, the excellent Python Imaging Library v1.1.6 (See: http://www.pythonware.com/products/pil/index.htm), created and maintained by Fredrik Lundh.
ExeFilter is an open-source tool and python framework to filter file formats in e-mails, web pages or files. It detects many common file formats and can remove active content (scripts, macros, etc) according to a configurable policy.