Document processing for investigations
Gransk is a free and open source tool that aims to be a Swiss army knife of document processing and analysis.
Its primary objective is to quikly provide users with insight to their documents during investigations.
It includes a processing engine written in Python and a web interface. Under the hood it uses Apache Tika for
content extraction, Elasticsearch for data indexing, and dfVFS to unpack disk images.
Given a bunch of documents and the question “has there been commited a crime here?”, Gransk will help you with the following:
Pull out all text and metadata, and make it searchable (supporting more than 200 document types)
Organize the documents by metadata, like content type, document authors and email recipients
Highlight names, email addresses and more from text to help guide the investigation