Semantic Blogging

February 11, 2007 · Posted in Uncategorized  · Posted by Guest: John Breslin

We’ve already seen how Web 2.0 has brought about a paradigm of tagged and commented-upon content: photos, bookmarks, events, videos, and blog posts. Blog posts are usually only tagged on the blog itself by the post creator, using free-text keywords such as “scotland”, “movies”, etc. (unless they are bookmarked and tagged by others using social bookmarking services or personal aggregators like Gregarius). Technorati, the blog search engine, aims to use these keywords to build a “tagged web”. Both tags and hierarchical categorisations of blog posts can be further enriched using the SKOS framework. However, there is often much more to say about a blog post than simply what category it belongs in…

So let’s move on to semantic blogging (some ideas here are from Knud Moeller, who is working on semiBlog). Traditional blogging is aimed at what can be called the “eyeball Web” – i.e. text, images or video content that is targeted mainly at people. Semantic blogging aims to enrich traditional blogging with metadata about the structure (what relates to what and how) and the content (what is this post about – a person, event, book, etc.). In this way, metadata-enriched blogging can be better understood by computers as well as people.

Last time I talked about structured blogging, where microcontent such as microformats is positioned inline in the HTML (and subsequent syndication feeds) and can be rendered via CSS. Structured blogging and semantic blogging do not compete, but rather offer metadata in slightly different ways (using microcontent / microformats and RDF respectively). There are already mechanisms such as GRDDL which can be used to move from one to the other.

So why would one choose to enhance their blogs and posts with semantics? Current blogging offers poor query possibilities (except for searching by keyword or seeing all posts labelled with a particular tag). There is little or no reuse of data offered (apart from copying URLs or text from posts). Some linking of posts is possible via direct HTML links or trackbacks, but again, nothing can be said about the nature of those links (are you agreeing with someone, linking to an interesting post, or are you quoting someone whose blog post is directly in contradiction with your own opinions?). Semantic blogging aims to tackle some of these issues, by facilitating better (i.e. more precise) querying when compared with keyword matching, by providing more reuse possibilities, and by creating “richer” links between blog posts.
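To make the idea of “richer” links concrete, here is a minimal sketch in plain Python. It assumes each link between posts carries a relation type; the relation names (agreesWith, disagreesWith, quotes) and all URLs are invented for illustration and do not come from any particular vocabulary or tool.

```python
from collections import namedtuple

# A plain HTML link only says "post A points at post B".
# Here every link also records *how* the two posts relate.
Link = namedtuple("Link", ["source", "relation", "target"])

# Hypothetical data: the URLs and relation names are made up.
links = [
    Link("http://example.org/aidan/post-1", "agreesWith", "http://example.org/john/post-7"),
    Link("http://example.org/aidan/post-2", "disagreesWith", "http://example.org/john/post-7"),
    Link("http://example.org/knud/post-3", "quotes", "http://example.org/john/post-7"),
]

def posts_related_to(target, relation):
    """Return the posts that link to `target` with the given relation."""
    return [link.source for link in links
            if link.target == target and link.relation == relation]

disagreeing = posts_related_to("http://example.org/john/post-7", "disagreesWith")
print(disagreeing)
```

A keyword search could find every post that mentions John’s post, but it could not tell agreement from disagreement; once the link type is recorded, that distinction becomes a simple filter.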

It is not simply a matter of adding semantics for the sake of creating extra metadata, but rather a case of being able to reuse what data a person already has in their desktop or web space and making the resulting metadata available to others. People are already (sometimes unknowingly) collecting and creating large amounts of structured data on their computers, but this data is often tied into specific applications and locked within a user’s desktop (e.g. contacts in a person’s address book, events in a calendaring application, author and title information in documents, audio metadata in MP3 files). Semantic blogging can be used to “lift” or release this data onto the Web.
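As a rough illustration of “lifting”, the sketch below turns one address book entry into an RDF description in Turtle syntax using the FOAF vocabulary. The shape of the contact record, the function names and the example.org addresses are assumptions made for this example; a real desktop address book would need its own export step first.

```python
FOAF_PREFIX = "@prefix foaf: <http://xmlns.com/foaf/0.1/> ."

def escape(text):
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')

def contact_to_turtle(contact):
    """Serialise one address book entry (a dict) as a FOAF person in Turtle.

    The dict keys (uri, name, email, homepage) are a hypothetical layout.
    """
    name = escape(contact["name"])
    props = [f'    foaf:name "{name}"']
    if contact.get("email"):
        props.append(f"    foaf:mbox <mailto:{contact['email']}>")
    if contact.get("homepage"):
        props.append(f"    foaf:homepage <{contact['homepage']}>")
    lines = [
        FOAF_PREFIX,
        "",
        f"<{contact['uri']}> a foaf:Person ;",
        " ;\n".join(props) + " .",
    ]
    return "\n".join(lines)

# Made-up contact record standing in for an address book entry.
contact = {
    "uri": "http://example.org/people/john",
    "name": "John Example",
    "email": "john@example.org",
}
turtle = contact_to_turtle(contact)
print(turtle)
```

Published alongside a post, a description like this can be read by another person’s software and imported into their own address book, rather than staying locked inside the original application.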

[Image: 20070211a.png]

Looking at the picture on the right, Aidan writes a blog post which he annotates using content from his desktop calendaring and address book applications. He publishes this post onto the Web, and John, reading this post, can reuse the embedded metadata in his own desktop applications.

[Image: 20070211b.png]

The next picture is from a semantic blogging application called semiBlog. In this picture, a semantic blog post is being created by annotating a part of the post text about John with an address book entry that has extra metadata describing John. Once a blog has semantic metadata, it can be used to perform queries such as “which blog posts talk about papers by Stefan Decker?”; it can be used for browsing not only across blogs but also other kinds of discussion methods; or it can be used by blog readers for importing metadata into desktop applications (using the Web as a clipboard).
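The query “which blog posts talk about papers by Stefan Decker?” can be sketched as a two-step lookup over subject, predicate, object statements. This is only an illustration in plain Python: the identifiers, the property names (talksAbout, author, name) and the sample data are invented, and it is not how semiBlog stores or queries its metadata.

```python
# Each statement is a (subject, predicate, object) triple.
# All identifiers and property names here are hypothetical.
triples = {
    ("post:1", "talksAbout", "paper:a"),
    ("post:2", "talksAbout", "paper:b"),
    ("post:3", "talksAbout", "person:other"),
    ("paper:a", "author", "person:stefan"),
    ("paper:b", "author", "person:other"),
    ("person:stefan", "name", "Stefan Decker"),
    ("person:other", "name", "Another Author"),
}

def subjects(predicate, obj):
    """Return every subject that has the given predicate and object."""
    return {s for (s, p, o) in triples if p == predicate and o == obj}

def posts_about_papers_by(author_name):
    """Follow name -> author -> talksAbout to find the matching posts."""
    posts = set()
    for person in subjects("name", author_name):
        for paper in subjects("author", person):
            posts |= subjects("talksAbout", paper)
    return sorted(posts)

result = posts_about_papers_by("Stefan Decker")
print(result)
```

The point of the sketch is that the answer depends on following relationships (post to paper, paper to author) rather than on the author’s name appearing anywhere in the post text, which is what keyword matching would require.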

As well as semiBlog, other semantic blogging systems have been developed by HP, the National Institute of Informatics in Japan, and MIT. But it’s not just blog posts that are being enhanced by structured metadata and semantics – it’s happening in many other Web 2.0 application areas. Wikis such as Wikipedia have contained structured metadata in the form of templates for some time now, and at least twenty “semantic wikis” have also appeared to address a growing need for more structure in wikis. I’ll talk about semantic wikis next time, and in the meantime look forward to your comments…


9 Responses to “Semantic Blogging”

  1. IIA Blog: Semantic Blogging at Cloudlands on February 11th, 2007 2:34 pm

    […] I’ve just published my fourth guest post for the IIA Blog – it’s about Semantic Blogging. I think I only have a few days left in my guest slot so I hope to fit in one or two more posts about wikis and maybe social networks before the end… […]

  2. Michael Hausenblas on February 14th, 2007 9:41 am

    Have you taken RDFa [1] into consideration, yet?

    [1] RDFa Overview

  3. […] John Breslin has a not-long post on the iia blog discussing techniques and benefits of enriching a blog with additional metadata, starting with the “traditional” gains of improved search & navigation. Many of these ideas have been around a few years now, with a lot of groundwork being covered by Steve Cayzer & co. at HP in 2003, with papers and demonstrators etc. […]

  4. Guest: John Breslin on February 14th, 2007 1:59 pm

    Michael – Yes, see Knud’s post here.

  5. Knud Möller on February 14th, 2007 3:19 pm

    Regarding the use of RDFa – John already mentioned my post on confuseddevelopment. I think RDFa would be an excellent solution for adding formal metadata to web pages (e.g. blog posts), and I definitely plan to use it in semiBlog. However, what is holding me back at the moment is the lack of parsers – if someone would add an RDFa parser to Redland, that would be wonderful!

  6. […] semiBlog got a mention in a post about Semantic Blogging on the Irish Internet Association’s Blog. Michael Hausenblas asked about the use of RDFa in a comment there. RDFa integration is still on the agenda, but I wish there were more parsers/extractors around. […]

  7. Michael Hausenblas on February 16th, 2007 11:56 am

    @ Knud: cf.
    for an up-to-date list of RDFa parsers.


    BTW: How about bugging Dave Beckett to go for RDFa ;) ?

  8. iia blog » To Wikis and Beyond on February 16th, 2007 3:05 pm

    […] Finally, just as in the semantic blogging scenario, wikis can enable the Web to be used as a clipboard, by allowing readers to drag structured information from wiki pages into other applications (for example, geographic data about locations on a wiki page could be used to annotate information on an event or a person in your calendar application or address book software respectively). […]

  9. Mark on August 21st, 2011 8:35 pm

    Semantic blogging aims to tackle some of these issues, by facilitating better (i.e. more precise) querying when compared with keyword matching, by providing more reuse possibilities, and by creating “richer” links between blog posts.
