More introduction to RDF and SPARQL – How easy it is to make people believe a lie, and how hard it is to undo that work again

Introduction

I plan to give an introduction to features like COUNT, FILTER REGEX and GROUP BY which are supported by Tracker‘s SPARQL engine. We support more such features but I have to start the introduction somewhere. And overloading people with introductions to all features wont help me much with explaining things.

Since my last introduction to RDF and SPARQL I have added a few relationships and actors to the game.

We have Morrel, Max and Sasha being dogs, Sheeba and Query are cats, Picca is still a parrot, Fred and John are contacts. Fred claims that John is his friend. I changed the ontology to allow friendships between the animals too: Sasha claims that Morrel and Max are her friends. Sheeba claims Query is her friend. John bought Query. Fred being inspired by John decided to also get some pets: Morrel, Sasha and Sheeba.

Ontology

Let’s put this story in Turtle:

<test:Picca> a test:Parrot, test:Pet ;
	test:name "Picca" .

<test:Max> a test:Dog, test:Pet ;
	test:name "Max" .

<test:Morrel> a test:Dog, test:Pet ;
	test:name "Morrel" ;
	test:hasFriend <test:Max> .

<test:Sasha> a test:Dog, test:Pet ;
	test:name "Sasha" ;
	test:hasFriend <test:Morrel> ;
	test:hasFriend <test:Max> .

<test:Sheeba> a test:Cat, test:Pet ;
	test:name "Sheeba" ;
	test:hasFriend <test:Query> .

<test:Query> a test:Cat, test:Pet ;
	test:name "Query" .

<test:John> a test:Contact ;
	test:owns <test:Max> ;
	test:owns <test:Picca> ;
	test:owns <test:Query> ;
	test:name "John" .

<test:Fred> a test:Contact ;
	test:hasFriend <test:John> ;
	test:name "Fred" ;
	test:owns <test:Morrel> ;
	test:owns <test:Sasha> ;
	test:owns <test:Sheeba> .

Querytime!

Let’s first start with all friend relationships:

SELECT ?subject ?friend
WHERE { ?subject test:hasFriend ?friend }

  test:Morrel, test:Max
  test:Sasha, test:Morrel
  test:Sasha, test:Max
  test:Sheeba, test:Query
  test:Fred, test:John

Just counting these is pretty simple. In SPARQL all selectable fields must have a variable name, so we add the “as c” here.

SELECT COUNT (?friend) AS c
WHERE { ?subject test:hasFriend ?friend }

  5

We counted friend relationships, of course. Let’s say we want to count how many friends each subject has. This is a more interesting query than the previous one.

SELECT ?subject COUNT (?friend) AS c
WHERE { ?subject test:hasFriend ?friend }
GROUP BY ?subject

  test:Fred, 1
  test:Morrel, 1
  test:Sasha, 2
  test:Sheeba, 1

Actually, we’re only interested in the human friends:

SELECT ?subject COUNT (?friend) AS c
WHERE { ?subject test:hasFriend ?friend .
        ?friend a test:Contact
} GROUP BY ?subject

  test:Fred, 1

No no, we are only interested in friends that are either cats or dogs:

SELECT ?subject COUNT (?friend) AS c
WHERE { ?subject test:hasFriend ?friend .
       ?friend a ?type .
       FILTER ( ?type = test:Dog || ?type = test:Cat)
} GROUP BY ?subject"

  test:Morrel, 1
  test:Sasha, 2
  test:Sheeba, 1

Now we are only interested in friends that are either a cat or a dog, but whose name starts with a ‘S’.

SELECT ?subject COUNT (?friend) as c
WHERE { ?subject test:hasFriend ?friend ;
                 test:name ?n .
       ?friend a ?type .
       FILTER ( ?type = test:Dog || ?type = test:Cat) .
       FILTER REGEX (?n, '^S', 'i')
} GROUP BY ?subject

  test:Sasha, 2
  test:Sheeba, 1

Conclusions

Should we stop talking about ontologies and start talking about searchboxes and user interfaces instead? Although I certainly agree more UI-stuff is needed, I’m not sure yet. RDF and SPARQL are also about relationships and roles. Not just about matching stuff. Whenever we explain the new Tracker to people, most are stuck with ‘matching’ in their mind. They don’t think about a lot of other use-cases.

Such a search is just one use-case starting point: user entered a random search string and gives zero other meaning about what he needs. Many more situations can be starting points: When I select a contact in a user interface designed to show an archive of messages that he once sent to me, the searchbox becomes much more narrow, much more helpful.

As soon as you have RDF and SPARQL, and with Tracker you do, an application developer can start taking into account relationships between resources: The relationship between a contact in Instant Messaging and the attachments in an E-mail that he as a person has sent to you. Why not combine it with friendship relationships synced from online services?

With a populated store you can make the relationship between a friend who joined you on a trip, and photos of a friend of your friend who suggested the holiday location.

With GeoClue integration we could link his photos up with actual location markers. You’d find these photos that came from the friend of your friend, and we could immediately feed the location markers to the GPS software on your phone.

I really hope application developers have more imagination than just global searchboxes.

And this is just a use-case that is technically already possible with today’s high-end phones.

4 thoughts on “More introduction to RDF and SPARQL”

Zeeshan Ali (Khattak) says:

July 19, 2009 at 4:12 pm

Dude! Take some real example to explain. Hypothetical examples only confuses people. Unless you think that a dog claiming someone to be his/her friend is a real example? :)
pvanhoof says:

July 19, 2009 at 4:55 pm

@Zeeshan: I disagree. Entirely. This works way better than your proposal. Besides, dogs being friends with other dogs, is a perfect example of a relationship, role, resource situation (which is what this is about, if you read it).

Besides, in the Conclusions section I did *exactly* what you asked for.
Alexandre Mazari says:

July 19, 2009 at 8:46 pm

On the user side, I tend to think that the semantical aspect of the underlying store gives benefits more in a browsing experience than in a search one.

On the search aspect, apart from giving new constraints natures for advanced user, i fail to see any real advantages.

The ability to jump from entity to entity, knowing the nature of their connection (written by, owned by, mother of) is the real pervasive UI inovaltion.
A Generic Entity Viewer providing hyperlinks and/or thumbnails to linked entities with a human readable/translatable description of the relation could be, if well designed, a real gain of productivity for the user.
The view could be customized for known entity types.
Imagine it connected to the Semantic Web transparently to the user.

And actually some web-based tools od just that for the Semantic Web using SPARQL and RDF. Some code might be useful there, if not all :)
skierpage says:

July 20, 2009 at 2:06 am

Thanks for the (re-)introduction to SPARQL, it always confuses me with those leading question marks.

I was going to point out that the fairly popular FoaF (“Friend of a friend”) vocabulary captures friendship, but they settled for foaf:knows because they want it to be “intrinsically vague” :-) , and pushed friendOf, antagonistOf, ambivalentOf, etc. into a separate relationship model.

If you’re looking for proposals, I think just do the search against all parts of the triple in all namespaces. (CPUs are fast :-) ) Then print the matching triples as “Sasha hasFriend Morrel” , with tooltips to clarify the namespaces and everything linked to ease exploration. The few semantic explorers I’ve tried that try to expose semantics in the query turn every result into a “facet explorer” and are hard to grasp.

Comments are closed.

M	T	W	T	F	S	S
« May				Aug »
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31