<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="https://blog.muninn-project.org"  xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>The Muninn Project - DBpedia</title>
 <link>https://blog.muninn-project.org/taxonomy/term/12?language=en</link>
 <description></description>
 <language>en</language>
<item>
 <title>Retrieving Historical Photos of Film Stars using DBpedia</title>
 <link>https://blog.muninn-project.org/node/81?language=en</link>
 <description>&lt;div class=&quot;field field-name-body field-type-text-with-summary field-label-hidden&quot;&gt;&lt;div class=&quot;field-items&quot;&gt;&lt;div class=&quot;field-item even&quot; property=&quot;content:encoded&quot;&gt;&lt;style type=&quot;text/css&quot;&gt;
&lt;!--/*--&gt;&lt;![CDATA[/* &gt;&lt;!--*/

/* This file is used by default by all Scribble documents.
   See also &quot;manual.css&quot;, which is added by default by the
   `scribble/manual` language. */

/* CSS seems backward: List all the classes for which we want a
   particular font, so that the font can be changed in one place.  (It
   would be nicer to reference a font definition from all the places
   that we want it.)

   As you read the rest of the file, remember to double-check here to
   see if any font is set. */

/* Monospace: */
.maincolumn, .refpara, .refelem, .tocset, .stt, .hspace, .refparaleft, .refelemleft {
  font-family: monospace;
}

/* Serif: */
.main, .refcontent, .tocview, .tocsub, .sroman, i {
  font-family: serif;
}

/* Sans-serif: */
.version, .versionNoNav, .ssansserif {
  font-family: sans-serif;
}
.ssansserif {
  font-size: 80%;
  font-weight: bold;
}

/* ---------------------------------------- */

p, .SIntrapara {
  display: block;
  margin: 1em 0;
}

h2 { /* per-page main title */
  margin-top: 0;
}

h3, h4, h5, h6, h7, h8 {
  margin-top: 1.75em;
  margin-bottom: 0.5em;
}

.SSubSubSubSection {
  font-weight: bold;
  font-size: 0.83em; /* should match h5; from HTML 4 reference */
}

/* Needed for browsers like Opera, and eventually for HTML 4 conformance.
   This means that multiple paragraphs in a table element do not have a space
   between them. */
table p {
  margin-top: 0;
  margin-bottom: 0;
}

/* ---------------------------------------- */
/* Main */

body {
  color: black;
  background-color: #ffffff;
}

table td {
  padding-left: 0;
  padding-right: 0;
}

.maincolumn {
  width: 43em;
  margin-right: -40em;
  margin-left: 15em;
}

.main {
  text-align: left;
}

/* ---------------------------------------- */
/* Navigation */

.navsettop, .navsetbottom {
  background-color: #f0f0e0;
  padding: 0.25em 0 0.25em 0;
}

.navsettop {
  margin-bottom: 1.5em;
  border-bottom: 2px solid #e0e0c0;
}

.navsetbottom {
  margin-top: 2em;
  border-top: 2px solid #e0e0c0;
}

.navleft {
  margin-left: 1ex;
  position: relative;
  float: left;
  white-space: nowrap;
}
.navright {
  margin-right: 1ex;
  position: relative;
  float: right;
  white-space: nowrap;
}
.nonavigation {
  color: #e0e0e0;
}

.searchform {
  display: inline;
  margin: 0;
  padding: 0;
}

.nosearchform {
  display: none;
}

.searchbox {
  width: 16em;
  margin: 0px;
  padding: 0px;
  background-color: #eee;
  border: 1px solid #ddd;
  text-align: center;
  vertical-align: middle;
}

#contextindicator {
  position: fixed;
  background-color: #c6f;
  color: #000;
  font-family: monospace;
  font-weight: bold;
  padding: 2px 10px;
  display: none;
  right: 0;
  bottom: 0;
}

/* ---------------------------------------- */
/* Version */

.versionbox {
  position: relative;
  float: right;
  left: 2em;
  height: 0em;
  width: 13em;
  margin: 0em -13em 0em 0em;
}
.version {
  font-size: small;
}
.versionNoNav {
  font-size: xx-small; /* avoid overlap with author */
}

.version:before, .versionNoNav:before {
  content: &quot;Version &quot;;
}

/* ---------------------------------------- */
/* Margin notes */

.refpara, .refelem {
  position: relative;
  float: right;
  left: 2em;
  height: 0em;
  width: 13em;
  margin: 0em -13em 0em 0em;
}

.refpara, .refparaleft {
  top: -1em;
}

.refcolumn {
  background-color: #F5F5DC;
  display: block;
  position: relative;
  width: 13em;
  font-size: 85%;
  border: 0.5em solid #F5F5DC;
  margin: 0 0 0 0;
}

.refcontent {
  margin: 0 0 0 0;
}

.refcontent p {
  margin-top: 0;
  margin-bottom: 0;
}

.refparaleft, .refelemleft {
  position: relative;
  float: left;
  right: 2em;
  height: 0em;
  width: 13em;
  margin: 0em 0em 0em -13em;
}

.refcolumnleft {
  background-color: #F5F5DC;
  display: block;
  position: relative;
  width: 13em;
  font-size: 85%;
  border: 0.5em solid #F5F5DC;
  margin: 0 0 0 0;
}


/* ---------------------------------------- */
/* Table of contents, inline */

.toclink {
  text-decoration: none;
  color: blue;
  font-size: 85%;
}

.toptoclink {
  text-decoration: none;
  color: blue;
  font-weight: bold;
}

/* ---------------------------------------- */
/* Table of contents, left margin */

.tocset {
  position: relative;
  float: left;
  width: 12.5em;
  margin-right: 2em;
}
.tocset td {
  vertical-align: text-top;
}

.tocview {
  text-align: left;
  background-color: #f0f0e0;
}

.tocsub {
  text-align: left;
  margin-top: 0.5em;
  background-color: #f0f0e0;
}

.tocviewlist, .tocsublist {
  margin-left: 0.2em;
  margin-right: 0.2em;
  padding-top: 0.2em;
  padding-bottom: 0.2em;
}
.tocviewlist table {
  font-size: 82%;
}

.tocviewlisttopspace {
  margin-bottom: 1em;
}

.tocviewsublist, .tocviewsublistonly, .tocviewsublisttop, .tocviewsublistbottom {
  margin-left: 0.4em;
  border-left: 1px solid #bbf;
  padding-left: 0.8em;
}
.tocviewsublist {
  margin-bottom: 1em;
}
.tocviewsublist table,
.tocviewsublistonly table,
.tocviewsublisttop table,
.tocviewsublistbottom table {
  font-size: 75%;
}

.tocviewtitle * {
  font-weight: bold;
}

.tocviewlink {
  text-decoration: none;
  color: blue;
}

.tocviewselflink {
  text-decoration: underline;
  color: blue;
}

.tocviewtoggle {
  text-decoration: none;
  color: blue;
  font-size: 75%; /* looks better, and avoids bounce when toggling sub-sections due to font alignments */
}

.tocsublist td {
  padding-left: 1em;
  text-indent: -1em;
}

.tocsublinknumber {
  font-size: 82%;
}

.tocsublink {
  font-size: 82%;
  text-decoration: none;
}

.tocsubseclink {
  font-size: 82%;
  text-decoration: none;
}

.tocsubnonseclink {
  font-size: 82%;
  text-decoration: none;
  padding-left: 0.5em;
}

.tocsubtitle {
  font-size: 82%;
  font-style: italic;
  margin: 0.2em;
}

/* ---------------------------------------- */
/* Some inline styles */

.indexlink {
  text-decoration: none;
}

.nobreak {
  white-space: nowrap;
}

pre { margin-left: 2em; }
blockquote { margin-left: 2em; }

ol          { list-style-type: decimal; }
ol ol       { list-style-type: lower-alpha; }
ol ol ol    { list-style-type: lower-roman; }
ol ol ol ol { list-style-type: upper-alpha; }

.SCodeFlow {
  display: block;
  margin-left: 1em;
  margin-bottom: 0em;
  margin-right: 1em;
  margin-top: 0em;
  white-space: nowrap;  
}

.SVInsetFlow {
  display: block;
  margin-left: 0em;
  margin-bottom: 0em;
  margin-right: 0em;
  margin-top: 0em;
}

.SubFlow {
  display: block;
  margin: 0em;
}

.boxed {
  width: 100%;
  background-color: #E8E8FF;
}

.hspace {
}

.slant {
  font-style: oblique;
}

.badlink {
  text-decoration: underline;
  color: red;
}

.plainlink {
  text-decoration: none;
  color: blue;
}

.techoutside       { text-decoration: underline; color: #b0b0b0; }
.techoutside:hover { text-decoration: underline; color: blue; }

/* .techinside:hover doesn&#039;t work with FF, .techinside:hover&gt;
   .techinside doesn&#039;t work with IE, so use both (and IE doesn&#039;t
   work with inherit in the second one, so use blue directly) */
.techinside                    { color: black; }
.techinside:hover              { color: blue; }
.techoutside:hover&gt;.techinside { color: inherit; }

.SCentered {
  text-align: center;
}

.imageleft {
  float: left;
  margin-right: 0.3em;
}

.Smaller {
  font-size: 82%;
}

.Larger {
  font-size: 122%;
}

/* A hack, inserted to break some Scheme ids: */
.mywbr {
  display: inline-block;
  height: 0;
  width: 0;
  font-size: 1px;
}

.compact li p {
  margin: 0em;
  padding: 0em;
}

.noborder img {
  border: 0;
}

.SAuthorListBox {
  position: relative;
  float: right;
  left: 2em;
  top: -2.5em;
  height: 0em;
  width: 13em;
  margin: 0em -13em 0em 0em;
}
.SAuthorList {
  font-size: 82%;
}
.SAuthorList:before {
  content: &quot;by &quot;;
}
.author {
  display: inline;
  white-space: nowrap;
}

/* print styles : hide the navigation elements */
@media print {
  .tocset,
  .navsettop,
  .navsetbottom { display: none; }
  .maincolumn {
    width: auto;
    margin-right: 13em;
    margin-left: 0;
  }
}
/*--&gt;&lt;!]]&gt;*/
&lt;/style&gt;&lt;style type=&quot;text/css&quot;&gt;
&lt;!--/*--&gt;&lt;![CDATA[/* &gt;&lt;!--*/

/* See the beginning of &quot;scribble.css&quot;. */

/* Monospace: */
.RktIn, .RktRdr, .RktPn, .RktMeta,
.RktMod, .RktKw, .RktVar, .RktSym,
.RktRes, .RktOut, .RktCmt, .RktVal,
.RktBlk {
  font-family: monospace;
  white-space: inherit;
}

/* Serif: */
.inheritedlbl {
  font-family: serif;
}

/* Sans-serif: */
.RBackgroundLabelInner {
  font-family: sans-serif;
}

/* ---------------------------------------- */
/* Inherited methods, left margin */

.inherited {
  width: 100%;
  margin-top: 0.5em;
  text-align: left;
  background-color: #ECF5F5;
}

.inherited td {
  font-size: 82%;
  padding-left: 1em;
  text-indent: -0.8em;
  padding-right: 0.2em;
}

.inheritedlbl {
  font-style: italic;
}

/* ---------------------------------------- */
/* Racket text styles */

.RktIn {
  color: #cc6633;
  background-color: #eeeeee;
}

.RktInBG {
  background-color: #eeeeee;
}

.RktRdr {
}

.RktPn {
  color: #843c24;
}

.RktMeta {
  color: black;
}

.RktMod {
  color: black;
}

.RktOpt {
  color: black;
}

.RktKw {
  color: black;
}

.RktErr {
  color: red;
  font-style: italic;
}

.RktVar {
  color: #262680;
  font-style: italic;
}

.RktSym {
  color: #262680;
}

.RktSymDef { /* used with RktSym at def site */
}

.RktValLink {
  text-decoration: none;
  color: blue;
}

.RktValDef { /* used with RktValLink at def site */
}

.RktModLink {
  text-decoration: none;
  color: blue;
}

.RktStxLink {
  text-decoration: none;
  color: black;
}

.RktStxDef { /* used with RktStxLink at def site */
}

.RktRes {
  color: #0000af;
}

.RktOut {
  color: #960096;
}

.RktCmt {
  color: #c2741f;
}

.RktVal {
  color: #228b22;
}

/* ---------------------------------------- */
/* Some inline styles */

.together {
  width: 100%;
}

.prototype, .argcontract, .RBoxed {
  white-space: nowrap;
}

.prototype td {
  vertical-align: text-top;
}

.RktBlk {
  white-space: inherit;
  text-align: left;
}

.RktBlk tr {
  white-space: inherit;
}

.RktBlk td {
  vertical-align: baseline;
  white-space: inherit;
}

.argcontract td {
  vertical-align: text-top;
}

.highlighted {
  background-color: #ddddff;
}

.defmodule {
  width: 100%;
  background-color: #F5F5DC;
}

.specgrammar {
  float: right;
}

.RBibliography td {
  vertical-align: text-top;
}

.leftindent {
 margin-left: 1em;
 margin-right: 0em;
}

.insetpara {
 margin-left: 1em;
 margin-right: 1em;
}

.Rfilebox {
}

.Rfiletitle {
  text-align: right;
  margin: 0em 0em 0em 0em;
}

.Rfilename {
  border-top: 1px solid #6C8585;
  border-right: 1px solid #6C8585;
  padding-left: 0.5em;
  padding-right: 0.5em;
  background-color: #ECF5F5;
}

.Rfilecontent {
  margin: 0em 0em 0em 0em;
}

.RpackageSpec {
  padding-right: 0.5em;
}

/* ---------------------------------------- */
/* For background labels */

.RBackgroundLabel {
   float: right;
   width: 0px;
   height: 0px;
}

.RBackgroundLabelInner {
   position: relative;
   width: 25em;
   left: -25.5em;
   top: 0px;
   text-align: right;
   color: white;
   z-index: 0;
   font-weight: bold;
}

.RForeground {
   position: relative;
   left: 0px;
   top: 0px;
   z-index: 1;
}

/* ---------------------------------------- */
/* History */

.SHistory {
  font-size: 82%;
}
/*--&gt;&lt;!]]&gt;*/
&lt;/style&gt;&lt;p&gt;This is a follow-up to the previous blog post on retrieving historical art from the Rijksmuseum. Like historical art, film star photos inform us about politics and human culture at particular times throughout history - but there are so many film star photos that it becomes difficult to devote sufficient attention to each individual photo. We can use &lt;a href=&quot;http://dbpedia.org/About&quot;&gt;DBpedia&lt;/a&gt; to retrieve historical photos of film stars and display them in our statistically generated scenes of historical events. &lt;em&gt;We&#039;ll display both film star photos and historical art that best fit the contexts of our statistically generated scenes - and use the context of the scenes that they are placed in to interpret them for their historical significance. &lt;/em&gt;&lt;/p&gt;
&lt;p&gt;
	&lt;img alt=&quot;&quot; src=&quot;http://i.imgur.com/pJjZvQS.png&quot; style=&quot;width: 382px; height: 263px;&quot; /&gt;&lt;/p&gt;
&lt;p&gt;As an example, let’s look at film stars who were active around the time that Anne Frank was alive. We&#039;ll look at film stars who became active 1900-1945, even though Anne Frank was born in 1929. Anne Frank &lt;a href=&quot;http://www.annefrank.org/en/Museum/Collecties/Movie-star-pictures/&quot;&gt;pasted photos of film stars on her wall&lt;/a&gt; who were active before she was born, such as &lt;a href=&quot;http://dbpedia.org/page/Greta_Garbo&quot;&gt;Greta Garbo&lt;/a&gt; who became active in 1920. Here’s a &lt;a href=&quot;http://www.w3.org/TR/2013/REC-sparql11-query-20130321/SPARQL&quot;&gt;SPARQL&lt;/a&gt; query to retrieve film stars who were active during that time period:&lt;/p&gt;
&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; cellspacing=&quot;1&quot; style=&quot;width: 500px;&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;p&gt;&lt;span style=&quot;font-family:courier new,courier,monospace;&quot;&gt;PREFIX wordnet: &amp;lt;&lt;a href=&quot;http://www.w3.org/2006/03/wn/wn20/instances/&amp;gt;&quot;&gt;http://www.w3.org/2006/03/wn/wn20/instances/&amp;gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style=&quot;font-family:courier new,courier,monospace;&quot;&gt;SELECT DISTINCT ?actor ?thumb ?start {&lt;br /&gt;
					  {&lt;br /&gt;
					    ?actor dbpprop:wordnet_type wordnet:synset-actor-noun-1 .&lt;br /&gt;
					  } UNION {&lt;br /&gt;
					    ?actor dbpedia-owl:occupation dbpedia:Actor .&lt;br /&gt;
					  }&lt;br /&gt;
					  ?actor dbpedia-owl:thumbnail ?thumb .&lt;br /&gt;
					  ?actor dbpedia-owl:activeYearsStartYear ?start .&lt;br /&gt;
					  FILTER (?start &amp;gt; &quot;1900-01-01&quot;^^xsd:date)&lt;br /&gt;
					  FILTER (?start &amp;lt; &quot;1945-01-01&quot;^^xsd:date)&lt;br /&gt;
					  FILTER EXISTS {&lt;br /&gt;
					    {&lt;br /&gt;
					      ?film dbpedia-owl:starring ?actor .&lt;br /&gt;
					    } UNION {&lt;br /&gt;
					      ?film dbpprop:starring ?actor .&lt;br /&gt;
					    }&lt;br /&gt;
					  }&lt;br /&gt;
					}&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;We can execute that SPARQL query and retrieve the results by sending a &lt;span style=&quot;font-style: italic&quot;&gt;GET&lt;/span&gt; request to DBpedia’s SPARQL endpoint:&lt;/p&gt;
&lt;table cellpadding=&quot;0&quot; cellspacing=&quot;0&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;p&gt;&lt;span style=&quot;font-family:courier new,courier,monospace;&quot;&gt;&lt;span class=&quot;stt&quot;&gt;&lt;a href=&quot;http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&amp;amp;format=json&amp;amp;timeout=30000&amp;amp;debug=on&amp;amp;query=&quot;&gt;http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&amp;amp;for...&lt;/a&gt;???&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;&lt;span style=&quot;font-style: italic&quot;&gt;Note: we’d set the query parameter to be our SPARQL query.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;Here&#039;s a sample of the results that are returned after executing the SPARQL query:&lt;/p&gt;
&lt;table border=&quot;1&quot; class=&quot;sparql&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;th&gt;
				actor&lt;/th&gt;
&lt;th&gt;
				thumb&lt;/th&gt;
&lt;th&gt;
				start&lt;/th&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
				&lt;a href=&quot;http://dbpedia.org/resource/Amparo_Rivelles&quot;&gt;http://dbpedia.org/resource/Amparo_Rivelles&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&lt;a href=&quot;http://commons.wikimedia.org/wiki/Special:FilePath/Amparo_Rivelles.jpg?width=300&quot;&gt;http://commons.wikimedia.org/wiki/Special:FilePath/Amparo_Rivelles.jpg?width=300&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&quot;1942+02:00&quot;^^&amp;lt;&lt;a href=&quot;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&quot;&gt;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
				&lt;a href=&quot;http://dbpedia.org/resource/Art_Acord&quot;&gt;http://dbpedia.org/resource/Art_Acord&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&lt;a href=&quot;http://commons.wikimedia.org/wiki/Special:FilePath/Art_Acord_Kephren_1917.jpg?width=300&quot;&gt;http://commons.wikimedia.org/wiki/Special:FilePath/Art_Acord_Kephren_1917.jpg?width=300&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&quot;1912+02:00&quot;^^&amp;lt;&lt;a href=&quot;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&quot;&gt;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
				&lt;a href=&quot;http://dbpedia.org/resource/Bobby_Burns&quot;&gt;http://dbpedia.org/resource/Bobby_Burns&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&lt;a href=&quot;http://commons.wikimedia.org/wiki/Special:FilePath/Cuckoo_Comedies_%281919%29_-_Ad_1.jpg?width=300&quot;&gt;http://commons.wikimedia.org/wiki/Special:FilePath/Cuckoo_Comedies_(1919)_-_Ad_1.jpg?width=300&lt;/a&gt;&lt;/td&gt;
&lt;td&gt;
				&quot;1908+02:00&quot;^^&amp;lt;&lt;a href=&quot;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&quot;&gt;http://www.w3.org/2001/XMLSchema#gYear&amp;gt;&lt;/a&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td class=&quot;rtecenter&quot;&gt;
				...&lt;/td&gt;
&lt;td class=&quot;rtecenter&quot;&gt;
				...&lt;/td&gt;
&lt;td class=&quot;rtecenter&quot;&gt;
				...&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;There are too many results to list in the table above. &lt;em&gt;So, how many film stars were active during that time period?&lt;/em&gt;&lt;/p&gt;
&lt;blockquote class=&quot;SCodeFlow&quot;&gt;&lt;table cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;RktBlk&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
					&lt;span class=&quot;stt&quot;&gt;&amp;gt; &lt;/span&gt;&lt;span class=&quot;RktPn&quot;&gt;(&lt;/span&gt;&lt;span class=&quot;RktSym&quot;&gt;film-stars/count&lt;/span&gt;&lt;span class=&quot;hspace&quot;&gt; &lt;/span&gt;&lt;span class=&quot;RktVal&quot;&gt;1900&lt;/span&gt;&lt;span class=&quot;hspace&quot;&gt; &lt;/span&gt;&lt;span class=&quot;RktVal&quot;&gt;1945&lt;/span&gt;&lt;span class=&quot;RktPn&quot;&gt;)&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;p&gt;&lt;span class=&quot;RktRes&quot;&gt;1633&lt;/span&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;/blockquote&gt;
&lt;p&gt;Let’s have a look at a sample of a film star photo that we would place in our statistically generated scenes of historical events:&lt;/p&gt;
&lt;blockquote class=&quot;SCodeFlow&quot;&gt;&lt;table cellpadding=&quot;0&quot; cellspacing=&quot;0&quot; class=&quot;RktBlk&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
					&lt;span class=&quot;stt&quot;&gt;&amp;gt; &lt;/span&gt;&lt;span class=&quot;RktPn&quot;&gt;(&lt;/span&gt;&lt;span class=&quot;RktSym&quot;&gt;film-stars/sample&lt;/span&gt;&lt;span class=&quot;hspace&quot;&gt; &lt;/span&gt;&lt;span class=&quot;RktVal&quot;&gt;1900&lt;/span&gt;&lt;span class=&quot;hspace&quot;&gt; &lt;/span&gt;&lt;span class=&quot;RktVal&quot;&gt;1945&lt;/span&gt;&lt;span class=&quot;RktPn&quot;&gt;)&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;p&gt;&lt;img alt=&quot;image&quot; src=&quot;http://i.imgur.com/gOUQ6fb.png&quot; style=&quot;width: 250px; height: 318px; border-width: 1px; border-style: solid;&quot; /&gt;&lt;/p&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;/blockquote&gt;
&lt;p&gt;There are a couple of problems with retrieving film star photos using DBpedia.&lt;/p&gt;
&lt;p&gt;One problem is that some photos of film stars are modern, and would look out of place in our exhibits. We could check if the photos are greyscale before displaying them, but this might not always be effective. For example, this photo of Philip Ober was taken in 1950, even though we only want film stars who were active between 1900 - 1945; we wouldn&#039;t want photos that were taken after the date when a particular historical scene that we are recreating takes place. There is no property that tells us when the photo of film star was taken.&lt;/p&gt;
&lt;p&gt;Another problem is that there isn’t information available about when some film stars were active, e.g. &lt;a href=&quot;http://dbpedia.org/page/Sonja_Henie&quot;&gt;Sonja Henie&lt;/a&gt; as mentioned in this &lt;a href=&quot;http://www.annefrank.org/en/Museum/Collecties/Movie-star-pictures/&quot;&gt;article&lt;/a&gt; from the Anne Frank House; this means that they are not included in the result of our SPARQL query. It might be possible to use the release dates of the films that actors/actresses starred in as a proxy for information on when they were active.&lt;/p&gt;
&lt;p&gt;We&#039;ve made a Unity3d picture frame asset that retrieves a random film star photo from DBpedia and displays it when the scene that it is placed in is played. The asset is available in &lt;a href=&quot;https://github.com/markfarrell/muninn&quot;&gt;this&lt;/a&gt; repository. Stay tuned for more on statistically generating historical scenes!&lt;/p&gt;
&lt;p&gt; &lt;/p&gt;
&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;form-item form-type-item&quot;&gt;
  &lt;label&gt;Language &lt;/label&gt;
 English
&lt;/div&gt;
&lt;div class=&quot;field field-name-field-tags field-type-taxonomy-term-reference field-label-above&quot;&gt;&lt;div class=&quot;field-label&quot;&gt;Tags:&amp;nbsp;&lt;/div&gt;&lt;div class=&quot;field-items&quot;&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/82?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Film Stars&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item odd&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/12?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;DBpedia&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/46?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;linked open data&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item odd&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/78?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Unity3d&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/79?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Arcadia&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item odd&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/80?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Clojure&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/81?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Racket&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;</description>
 <pubDate>Thu, 29 Jan 2015 14:23:20 +0000</pubDate>
 <dc:creator>m4farrel</dc:creator>
 <guid isPermaLink="false">81 at https://blog.muninn-project.org</guid>
 <comments>https://blog.muninn-project.org/node/81?language=en#comments</comments>
</item>
<item>
 <title>Authority, Statistics and RDF</title>
 <link>https://blog.muninn-project.org/2011/05/119?language=en</link>
 <description>&lt;div class=&quot;field field-name-body field-type-text-with-summary field-label-hidden&quot;&gt;&lt;div class=&quot;field-items&quot;&gt;&lt;div class=&quot;field-item even&quot; property=&quot;content:encoded&quot;&gt;&lt;p&gt;The recent discussion on the &lt;a href=&quot;http://lod-lam.net/&quot; title=&quot;International Linked Open Data in Libraries Archives and Museums Summit&quot;&gt;LOD-LAM&lt;/a&gt; website have prompted me to begin writhing about some of the data available on Muninn and how it is generated. Creating a database from the contents of war archives nearly a century old presents some special challenges, some old some new. Jonathan Rochkind&#039; flippant remark &#039;Sorry Linked Open Data people&#039; was made in jest and drew a lot of responses. But it in the end you can&#039;t say that text-mining is better than linked open data anymore than apples are better than submarines as they don&#039;t perform the same function. Similarly, the problem of interpreting archival contents is central to Muninn&#039;s role as well as the question of what is an authoritative source (at least in the knowledge sense). There is no question that &quot;&lt;a href=&quot;http://bibwild.wordpress.com/2011/05/17/using-wikipedia-as-an-authority-file/&quot;&gt;applying statistical analysis text-mining ‘best guess’ type techniques, provides more relationships than dbpedia alone does&lt;/a&gt;&quot;.&lt;/p&gt;
&lt;!--break--&gt;&lt;p&gt;Generating relationships is easy, since anything can be a relationship, the hard part is figuring out which class of relationship actually means something even before the accuracy of its instances are judged. &lt;a href=&quot;http://dbpedia.org/&quot;&gt;Dbpedia&lt;/a&gt; on the other hand creates its relationships from very specific mark ups within wikipedia. Both the creation of the wikipedia rdf and the wikipedia infoboxes are human-driven and the number of classes is limited by the manpower available. There is still a possibility of errors, someone might have written in Bern as the dbpedia:capital of (dbpedia:populated_place) Germany, but there is no question this is the class that was intended to hold that information. Muninn uses linked open data to publish its processed, cleaned up, results. Behind the curtain, many domain-specific statistical and logical models (including statistical mining)look for particular patterns. However, as opposed to free-running relationship mining these are constrained to the specific linked open data tags that they actually fill. For example, Muninn automatically fills in &lt;a href=&quot;http://xmlns.com/foaf/spec/#term_knows&quot;&gt;foaf:knows&lt;/a&gt; tags if its calculates that two people know each other within a .95 confidence using a statistical &#039;best guess&#039;. But the &lt;a href=&quot;http://www.perceive.net/schemas/20021119/relationship/#friendOf&quot;&gt;rel:friendOf&lt;/a&gt; tag require specific evidence before it is instantiated, such as a record of a communication or an entry in a diary. Most statistical data-mining algorithms won&#039;t understand the relationships between the &lt;a href=&quot;http://en.wikipedia.org/w/index.php?title=Dominion_of_canada&quot;&gt;Dominion of Canada&lt;/a&gt; (oddly it currently redirects to &lt;a href=&quot;http://en.wikipedia.org/wiki/Canada&quot;&gt;Canada&lt;/a&gt; instead of &lt;a href=&quot;http://en.wikipedia.org/wiki/Canada_under_British_Imperial_control&quot;&gt;Canada under Imperial control&lt;/a&gt;) as a location and the &lt;a href=&quot;http://en.wikipedia.org/wiki/Dominion_of_Newfoundland&quot;&gt;Dominion of Newfoundland&lt;/a&gt; as a political entity. They will happily ignore the untidy &#039;Dominion&#039;, &#039;location&#039; and &#039;political&#039; tokens and simply file one as a owl:partOf the other since that is the strongest signal on the graph. It&#039;s a simple, easy to understand, wrong answer that causes all sorts of confusion down the line because it &#039;looks right&#039; but isn&#039;t. Should we should consider wikipedia as an &#039;authority&#039;? It is an outstanding resource for general background data and as a statistical resource to design things like tag clouds, auto tagging html text and creating specialized dictionaries. It does well by the law of averages since the crowd eventually fixes things. This also means that at any given time something will be broken at the page level and that is a problem for an authority. Do you really want your project to catalogue Japan as a rogue legislative element of the European Union because someone messed up a page edit last night? Or watch your reasoner spin madly because there is an edit war on whether Henry Kissinger is_a war criminal? The United States is part of the British Empire right? This is like some of the problems that &lt;a href=&quot;http://www.openstreetmap.org/&quot;&gt;Open Street Maps&lt;/a&gt; is having with bits of countries getting flooded after bad edits. Some data lends itself to statistical or social consensus making for a variety of reasons such as ease of observation. It&#039;s easy to fix a bad restaurant location and it only causes limited nuisances for downstream users. Fixing a bad coastline is a bit harder since few people are willing to go out with a transit to survey it. When possible, Muninn links to the appropriate dbpedia triple using rdf:seeAlso for background documents. It&#039;s not clear that it is a good idea to link Muninn triples with the dbpedia ones using owl:sameAs at this point since we don&#039;t know how authority and stability of triples works. To create a database from the documents requires some interpretation of the information beyond normal indexing or finding aids. Interpretation can range from codifying referencing standards to hard-core detective work. Right now, Muninn focuses on extracting basic facts and trying to link them across documents and entities, cleaning up ranks and names to a point where there is enough of a basic database to add more information. Eventually, there will be enough room for very high level analysis of the data and some interpretation will be done automatically. The nice thing about the Great War is the large number of forms in use, even if not all these are typed since this pre-classifies some of the information for us. Yet, there are many cases where the forms are not responsive to the users needs, so the clerk crosses out fields and replaces them by his own. It will be interesting in the long term to track when and why people do so. Text analysis is a little bit more demanding in that it gives good results when customized to the specific document or text types but that requires some hand holding and hand-coding by a human that needs to be doing too many things at once. There is an experimental text search interface that searches the human readable parts of the linked open data as well as other texts in the collection along with linguistic mood and style. The interface and presentation is still something that is being worked on and comments are welcome. A interesting occurrence is when faulty data is used within a document and is propagated within the organizations&#039; other document, such as impossible dates of birth. The date is obviously wrong, but is still useful as a means of referencing other documents within the archive. It is impossible to trick most databases into storing this information since it fails basic consistency checks, through we are able to force rdf into supporting it through create use of mark-ups. Thus we need to be able to provide information which is clearly wrong from a temporal aspect because it is still useful for linkages purposes.&lt;/p&gt;
&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class=&quot;form-item form-type-item&quot;&gt;
  &lt;label&gt;Language &lt;/label&gt;
 English
&lt;/div&gt;
&lt;div class=&quot;field field-name-field-tags field-type-taxonomy-term-reference field-label-above&quot;&gt;&lt;div class=&quot;field-label&quot;&gt;Tags:&amp;nbsp;&lt;/div&gt;&lt;div class=&quot;field-items&quot;&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/10?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;RDF&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item odd&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/11?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;Relationships&lt;/a&gt;&lt;/div&gt;&lt;div class=&quot;field-item even&quot; rel=&quot;dc:subject&quot;&gt;&lt;a href=&quot;/taxonomy/term/12?language=en&quot; typeof=&quot;skos:Concept&quot; property=&quot;rdfs:label skos:prefLabel&quot; datatype=&quot;&quot;&gt;DBpedia&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;</description>
 <pubDate>Tue, 31 May 2011 23:30:00 +0000</pubDate>
 <dc:creator>warren</dc:creator>
 <guid isPermaLink="false">7 at https://blog.muninn-project.org</guid>
 <comments>https://blog.muninn-project.org/2011/05/119?language=en#comments</comments>
</item>
</channel>
</rss>
