[Xesam] Metadata Storage Daemon

Wed Jan 16 05:19:52 PST 2008

On Wed, 2008-01-16 at 14:02 +0100, Sebastian Trüg wrote:
> On Wednesday 16 January 2008 10:21:18 Kevin Kubasik wrote:
> > OK, well the obvious agreement is a need for time/change tracking, I
> > added a dbus signal called on inserts and a method to get all new
> > triples since a specified timestamp. As for file monitoring, while a
> > Gnome-wide service would be nice, I think that it is outside the scope
> > of a metadata daemon (personally, open to more discussion on this).
> >
> > I think that a rudimentary triple store (roughly like what I have
> > produced here) is a great _base_ for what we are all more or less
> > talking about. I think that the pushes for more searching/indexing
> > capabilities of the data here are missing the point, this is more a
> > simple storage engine. Powerful desktop search engines like Beagle and
> > Tracker can now both index the same stored metadata.
> 
> IMHO the indexing should be part of the store. And the search engines should 
> then use the store to query the data. Thus, we would have these components:
> 
> * Indexer (or better: analyser)
>   analyses files and writes the data into the store
> 
> * Store
>   Simple data store for triples (or quadruples) with a proper RDF API (like   
>   Soprano fx ;) for advanced queries and a simpler API to perform stuff like:
>   - getAllProperties( uri resource )
>   - setProperty( uri resource, uri property, value )
>   and so forth which handle time stamping and meta-meta-data updating
>   automatically.
>   This store also indexes the data and provides a query API which can be used 
>   by search engines. This query API is low level and not intended for the 
>   end user (I would opt for SPARQL here but I think you know that ;)
> 
> * Search client
>   Creates queries to the store from user queries.
>   (This is what has been described already in XesamQueryLanguage)
>   "Final" search clients would then be using this service for queries.
>   Thus, searching means three steps: 
>      user GUI -> search client service -> Store
> 
> * File watch service
>   Watches file systems for changes and updates the metadata accordingly.
> 
> I think it is important to keep the data in one place here. There is no reason 
> to keep separate stores and indexes for data from file analysers and from 
> user input (like tags) or any other application that likes to store 
> something.
> 

agree totally (except for explicit exposure of rdf semantics/sparql)

in tracker we store user/app defined metadata in a separate db but
sqlite allows you to construct a vitrual database which amalgamates
several sqlite db files to create a single virtual db. Where data is
stored is an implementation detail but obviously one place (or virtual
place) is more practical.

Its a good idea to separate expendable metadata from the indexer and
precious user/app defined ones to prevent any mishaps. Alternatively
backing up and restoring precious data can also be used in addition to a
primary store.

Having everything in one physical place with no backup is probably a bit
dangerous IMO

jamie