<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Rearchitecting Tracker</title>
	<atom:link href="http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker/feed" rel="self" type="application/rss+xml" />
	<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker</link>
	<description>From the mind of Philip</description>
	<lastBuildDate>Sun, 28 Aug 2011 12:59:07 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
	<item>
		<title>By: ix</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1110</link>
		<dc:creator>ix</dc:creator>
		<pubDate>Sat, 04 Jul 2009 23:15:56 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1110</guid>
		<description>why is there a triple store being invented? Virtuoso or Hexastore (using file:// URIs or even inode integers to survive renames/moves) are certainly &#039;doing one thing well&#039; in that category

also requiring SPARQL and dbus is not condoning fractal decomposition/reuse of the (meta)data. and &#039;crawlers&#039; are universally annoying and certainly not necessary on a local FS. i&#039;m much more interested in an Inotify type solution which updates the indexes whenever files of specific types are created (eg EXIF from JPEGs, HTTP-environment data from browser requests and the RDFa within those documents) and exposes this index data as directories which can be queried using sh and/or python/ruby etc&#039;s stdlib FS tools to compose queries (see the Reiser4 &#039;vision&#039; docs from about oh.. 8,9 years ago?) and of course exposing the basic join/set-intersection primitives in a composable manner rather than requiring the godawful SPARQL syntax  - something that sadly most triplestores don&#039;t do, but the hash-table type dfs tools are providing (whether precalc&#039;d views a la couch or hadoop esque reductions

ive already written much of what i described but it is pure-ruby so i&#039;ve yet offload the heavy bits to the kernel and mess with userspace bridges between FS events and aforementioned stores.

heck, even a decent stab at automated indexing/querying of POSIX eattrs would be fantasticly useful compared to our crude heirchical path treewalking as the only way to list groups of files. one can use URIs for the attribute names after all.

i would really only find this sort of automated indexing useful for certain directories anyways (mail, browser cache, media), and of course it also entails patching apps (browser cache being a few opaque blobs completely unusable without the accompanying VMbeast firepig/webkit/opera and their hodpodge of adhoc SQLite databases bugs the crap out of me, not to mention the complete lack of interopability between said implementations in anything other than completely vain/trivial things like.. CSS ACIDtest compliance). bottling up the web in some mainframepickle track is preventing an evolution to a distributed/decentralized web, and ignoring all the metadata inherent in our files is just silly.
  well at least i can agree with you on that last point!</description>
		<content:encoded><![CDATA[<p>why is there a triple store being invented? Virtuoso or Hexastore (using file:// URIs or even inode integers to survive renames/moves) are certainly &#8216;doing one thing well&#8217; in that category</p>
<p>also requiring SPARQL and dbus is not condoning fractal decomposition/reuse of the (meta)data. and &#8216;crawlers&#8217; are universally annoying and certainly not necessary on a local FS. i&#8217;m much more interested in an Inotify type solution which updates the indexes whenever files of specific types are created (eg EXIF from JPEGs, HTTP-environment data from browser requests and the RDFa within those documents) and exposes this index data as directories which can be queried using sh and/or python/ruby etc&#8217;s stdlib FS tools to compose queries (see the Reiser4 &#8216;vision&#8217; docs from about oh.. 8,9 years ago?) and of course exposing the basic join/set-intersection primitives in a composable manner rather than requiring the godawful SPARQL syntax  &#8211; something that sadly most triplestores don&#8217;t do, but the hash-table type dfs tools are providing (whether precalc&#8217;d views a la couch or hadoop esque reductions</p>
<p>ive already written much of what i described but it is pure-ruby so i&#8217;ve yet offload the heavy bits to the kernel and mess with userspace bridges between FS events and aforementioned stores.</p>
<p>heck, even a decent stab at automated indexing/querying of POSIX eattrs would be fantasticly useful compared to our crude heirchical path treewalking as the only way to list groups of files. one can use URIs for the attribute names after all.</p>
<p>i would really only find this sort of automated indexing useful for certain directories anyways (mail, browser cache, media), and of course it also entails patching apps (browser cache being a few opaque blobs completely unusable without the accompanying VMbeast firepig/webkit/opera and their hodpodge of adhoc SQLite databases bugs the crap out of me, not to mention the complete lack of interopability between said implementations in anything other than completely vain/trivial things like.. CSS ACIDtest compliance). bottling up the web in some mainframepickle track is preventing an evolution to a distributed/decentralized web, and ignoring all the metadata inherent in our files is just silly.<br />
  well at least i can agree with you on that last point!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: juergbi</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1108</link>
		<dc:creator>juergbi</dc:creator>
		<pubDate>Tue, 26 May 2009 09:50:23 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1108</guid>
		<description>@pixelpapst: At the moment, Tracker is only a triple store. However, we are considering adding support for named graphs / quadruple store as it would help in various situations. There is an overhead in disk space and indexing performance, though, so we have to make sure that the advantages of a quad store outweigh the disadvantages.</description>
		<content:encoded><![CDATA[<p>@pixelpapst: At the moment, Tracker is only a triple store. However, we are considering adding support for named graphs / quadruple store as it would help in various situations. There is an overhead in disk space and indexing performance, though, so we have to make sure that the advantages of a quad store outweigh the disadvantages.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pvanhoof</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1109</link>
		<dc:creator>pvanhoof</dc:creator>
		<pubDate>Tue, 26 May 2009 08:03:29 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1109</guid>
		<description>@pixelpapst: We have ideas to store using a quadruple in parallel to our decomposed triple store. Indeed to support things like named graphs (which as you probably know will allow you to store tags while differentiating between who told you about the triple: remote, local, user-set, indexer-set, etc).

We also plan to use this capability for backup purposes, for example. And to give developers a store to write a synchronization framework for.

However, this is long-term planning. We can definitely use your help if you are interested in this specific aspect.

I asked Jürg to reply here too, as the quadruple ideas that we have came from him.</description>
		<content:encoded><![CDATA[<p>@pixelpapst: We have ideas to store using a quadruple in parallel to our decomposed triple store. Indeed to support things like named graphs (which as you probably know will allow you to store tags while differentiating between who told you about the triple: remote, local, user-set, indexer-set, etc).</p>
<p>We also plan to use this capability for backup purposes, for example. And to give developers a store to write a synchronization framework for.</p>
<p>However, this is long-term planning. We can definitely use your help if you are interested in this specific aspect.</p>
<p>I asked Jürg to reply here too, as the quadruple ideas that we have came from him.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: drago01</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1113</link>
		<dc:creator>drago01</dc:creator>
		<pubDate>Tue, 26 May 2009 06:14:54 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1113</guid>
		<description>The main thing that makes tracker useless to me is the missing ability to search for partial filenames see: http://bugzilla.gnome.org/show_bug.cgi?id=583676

Is any work being done on the indexer to improve this?</description>
		<content:encoded><![CDATA[<p>The main thing that makes tracker useless to me is the missing ability to search for partial filenames see: <a href="http://bugzilla.gnome.org/show_bug.cgi?id=583676" rel="nofollow">http://bugzilla.gnome.org/show_bug.cgi?id=583676</a></p>
<p>Is any work being done on the indexer to improve this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pixelpapst</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1107</link>
		<dc:creator>pixelpapst</dc:creator>
		<pubDate>Mon, 25 May 2009 23:23:44 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1107</guid>
		<description>Hi Philip,
one thing I&#039;m still not quite sure about: will tracker become a RDF triple store, or a quadruple store, also answering the question &quot;who told me about this triple&quot; ?
What I&#039;m aiming at: how will tracker differentiate between e.g. tags that are user-set and tags that were found by the indexer (and can be overwritten) ?
Can this information somehow be preserved in the turte-formatted cache files on external media that you described before ? (For this use case, it might be enough to just remember the specific cache file as source, though.)</description>
		<content:encoded><![CDATA[<p>Hi Philip,<br />
one thing I&#8217;m still not quite sure about: will tracker become a RDF triple store, or a quadruple store, also answering the question &#8220;who told me about this triple&#8221; ?<br />
What I&#8217;m aiming at: how will tracker differentiate between e.g. tags that are user-set and tags that were found by the indexer (and can be overwritten) ?<br />
Can this information somehow be preserved in the turte-formatted cache files on external media that you described before ? (For this use case, it might be enough to just remember the specific cache file as source, though.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pvanhoof</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1111</link>
		<dc:creator>pvanhoof</dc:creator>
		<pubDate>Mon, 25 May 2009 20:34:42 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1111</guid>
		<description>@tretle: right (and that&#039;s up to distributors). They can for example make a Turtle file and import this right after package install. We&#039;re still working on a backup function (which will also emit a turtle file). You can already import Turtle files.</description>
		<content:encoded><![CDATA[<p>@tretle: right (and that&#8217;s up to distributors). They can for example make a Turtle file and import this right after package install. We&#8217;re still working on a backup function (which will also emit a turtle file). You can already import Turtle files.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: tretle</title>
		<link>http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1112</link>
		<dc:creator>tretle</dc:creator>
		<pubDate>Mon, 25 May 2009 20:03:23 +0000</pubDate>
		<guid isPermaLink="false">http://pvanhoof.be/blog/index.php/2009/05/25/rearchitecting-tracker#comment-1112</guid>
		<description>Hi, I have an idea but im not sure if its a good one. It involves packagers for distributions.
If tracker is installed as default by a distributer then shouldnt the distributor create an initial index right before the final build goes live?
Then the time it would need to create an index of the system when a user tries tracker for the first time on their fresh install would be reduced right?</description>
		<content:encoded><![CDATA[<p>Hi, I have an idea but im not sure if its a good one. It involves packagers for distributions.<br />
If tracker is installed as default by a distributer then shouldnt the distributor create an initial index right before the final build goes live?<br />
Then the time it would need to create an index of the system when a user tries tracker for the first time on their fresh install would be reduced right?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

