This list is updated on an ongoing basis; please email warren@muninn-project.org with additions and changes.
Stars: The openness rating of the data according to the 5-star deployment scheme suggested by Tim Berners-Lee, the inventor of the Web and initiator of Linked Data.
Triples: The size of the dataset in triples.
URI Working: Whether the data is dereferenceable, i.e., reachable through an identifying URI. Blank or local nodes do not count.
Ontology: Whether the custom ontology, if one is used, is published.
Content negotiation: Is your content retrievable in more than one serialization (RDF/XML, N3, Turtle, etc.) using either file name extensions or HTTP content negotiation?
SPARQL: Is there a publicly reachable SPARQL server that contains your data?
Raw Data Dump: Is your data available as a full data dump?
Void / DCAT: Is a machine-readable VoID and/or DCAT description of the data available?
Docs: Is there a human-readable landing page available documenting the dataset or project?
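
The "URI Working" and "Content negotiation" criteria above can be spot-checked with a short script. The following is a minimal sketch, not the procedure used to compile this list: it sends an HTTP `Accept` header for one serialization and checks whether the server answers with the matching `Content-Type`. The function names, the short keys (`rdf`, `ttl`, `nt`, `n3`), and the example URI are assumptions made for illustration; the media types are the standard registered ones.

```python
import urllib.request

# Registered media types for serializations named in the legend.
# The short keys are hypothetical labels chosen for this sketch.
RDF_MEDIA_TYPES = {
    "rdf": "application/rdf+xml",
    "ttl": "text/turtle",
    "nt": "application/n-triples",
    "n3": "text/n3",
}


def build_request(uri, serialization):
    """Build a GET request asking for one RDF serialization via the Accept header."""
    media_type = RDF_MEDIA_TYPES[serialization]
    return urllib.request.Request(uri, headers={"Accept": media_type})


def negotiates(uri, serialization, opener=urllib.request.urlopen):
    """Return True if the URI dereferences and the server returns the requested type.

    `opener` is injectable so the check can be exercised without network access.
    """
    request = build_request(uri, serialization)
    try:
        with opener(request, timeout=10) as response:
            content_type = response.headers.get("Content-Type", "")
    except OSError:
        # Covers URLError and HTTPError (e.g. 404 or 406): treat as not working.
        return False
    # Ignore parameters such as "; charset=utf-8" when comparing.
    returned = content_type.split(";")[0].strip().lower()
    return returned == RDF_MEDIA_TYPES[serialization]
```

For example, `negotiates("http://example.org/resource/1", "ttl")` (a placeholder URI) would report whether that resource can be retrieved as Turtle. A server that only offers serializations through file name extensions would need a different check, such as requesting the URI with `.ttl` appended.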
Name | Description | Stars | Triples | URI Working? | Ontology | Content Negotiation | SPARQL Server | Raw Data Dump | Void / DCAT | Docs |
---|---|---|---|---|---|---|---|---|---|---|
Muninn WW1 | The Muninn Project is a multidisciplinary, multinational, academic research project investigating millions of records pertaining to the First World War in archives around the world. Our aim is to take archives of digitized documents, extract the written data using massive amounts of computing power, and turn the resulting information into structured databases. These databases will then support further research in a number of different areas. | ⭐⭐⭐⭐⭐ | 30M | Yes | Yes | rdf, nt, ttl | Yes | Yes | Yes | Yes |
CWRC / Orlando | The Orlando Project is an experiment in the integration of text and technology. It has designed and continues to enhance digital tools to harness the power of computers for critical literary and historical research. The project's constantly expanding and improving storehouse of knowledge about women's lives and writings, the Orlando textbase itself, is rendered uniquely searchable and sortable by its encoding in a linked open data format. | ⭐⭐⭐⭐⭐ | 13K | Yes | Yes | rdf, nt, ttl, json | Yes | No | Yes | Yes |
Canlink Thesis | Can Link, a linked data project for Canadian theses, is a proof-of-concept project of the Digital Projects Working Group of the Canadian Linked Data Initiative. Its goal is to increase the discoverability of Canadian theses and dissertations by leveraging the power of linked data to surface unexpected connections and relationships. | ⭐⭐⭐⭐⭐ | 100K | Yes | Yes | rdf, nt, ttl, json, bibtex, ris | Yes | Yes | Yes | Yes |
Out of the Trenches | Partners of the Pan-Canadian Documentary Heritage Network (PCDHN) have developed a “proof-of-concept” to showcase a sampling of the network’s wealth of digital resources using “linked open data” and principles of the semantic web. | ⭐⭐⭐⭐⭐ | 30K | Yes | No | rdf | No | Yes | No | No (Slides, Video) |
| | This dataset includes artifacts in the collection of the Canada Science and Technology Museums Corporation that are currently in the Canada Science and Technology Museum. | ⭐⭐⭐ | 2.8M | Broken | Broken | rdf | No | Yes | No | Yes |
Canadian Subject Headings | Canadian Subject Headings (CSH) is a list of subject headings in the English language, using controlled vocabulary, to access and express the subject content of documents on Canada and Canadian topics. | ⭐⭐⭐ | 173K | Broken | No | rdf | No | Yes | No | Yes |