[Search-l] more than just interoperability
Nitin Borwankar
nitin at borwankar.com
Sun Jun 3 19:59:07 UTC 2007
jer wrote:
>> So, here it is: Getting data from existent social bookmarking
>> services may be an option we should consider. Think of it -
>> aggregating data from del.icio.us <http://del.icio.us>,
>> stumbleupon, etc. Now, I can't imagine how we'd get Yahoo to
>> give us the data from del.icio.us <http://del.icio.us>, but maybe
>> there are other providers who would be willing to do this. Or
>> perhaps we look at paying them for it, at least enough to cover
>> their bandwidth and other overhead.
>>
>> Anybody got an ideas around this type of thing?
>>
>>
>> Yeah.. its a good way to find the actual interest of the people thru
>> social book marking, digg and many other social websites.. But it all
>> matters whether they are ready to release data open to such open
>> source search projects..
>
>
> For the most part, all of those sites and all of that data *is* open,
> it just needs to be intelligently crawled and indexed. They're great
> seed sites for keeping a crawler fresh.
>
> Sure it would be nice to have it in a more digestible form, but it's
> all there already :)
>
> Jer
Hi All,
I did some work for a university professor who is trying to tackle the
problem that publishers of technical periodicals own the bibliography
citations in articles. However individual researchers own the
bibliographies of their own publications, so by aggregating the
bibliographies of individual researchers one can build an alternate open
source of bibliography data. Apply the same principle to del.ici.ous,
stumbleupon etc data.
Individuals who have data on these services can individually and
voluntarily copy their data out of those systems and into any public
aggregation
of such data. Now that Yahoo has BBAuth - a single-login authentication
service, one could build a single web page where such a volunteer
individual could go and authorize the download of their del.ici.ous data
into their account(s) on any other web service(s).
There is no need to crawl the web pages and get into the arms race of IP
blocking etc. that will naturally come up.
The bigger picture here is that we as individuals own our own data and
we should not let it be captive on web applications, rather we should be
able to aggregate it wherever we choose - and if we should choose to do
so we should be able to very simply push a few buttons and have *our own
data* transferred between web applications.
Nitin Borwankar.
>------------------------------------------------------------------------
>
>_______________________________________________
>Search-l mailing list
>Search-l at wikia.com
>http://lists.wikia.com/mailman/listinfo/search-l
>Change options or unsubscribe: http://lists.wikia.com/mailman/options/search-l
>
--
Nitin Borwankar
http://walruscarpenter.wordpress.com Of shoes and ships and sealing wax of cabbages and kings
http://greener.com Find, Learn, Act .... Greener, the search engine for the planet
http://tagschema.com Implementation of tag database applications
nitin at borwankar.com
510-872-7066
More information about the Search-l
mailing list