by warren | March 1st, 2010
Well, after a few months of working quietly (and hard) behind the scenes, the project blog is finally being launched. It will be used for news items, pieces on some of the document extraction methods in use, as well as tutorials on getting specific data sets from the database.
Besides the pre-wrapped ‘simple’ data sets that can be downloaded from the website, the database can be queried to generate very detailed dumps of the information Muninn has collected, including ambiguous data that we might be resolved but still need more work. Our first hard drive went out by courier yesterday and we look forward to working with the files.