<br><br><div><span class="gmail_quote">On 6/1/07, <b class="gmail_sendername">Aerik Sylvan</b> <<a href="mailto:aerik@thesylvans.com">aerik@thesylvans.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
I've been thinking about something that is kind of tangential to this - one of the things we've discussed is getting a large amount of proactive human data - tags, or something like them. It would take a really large number of tags (or whatever) to be really useful. Hopefully something like millions of websites each tagged by hundreds of people with at least several tags. So a dataset of perhaps a billion records is easy to imagine.
<br><br>But, it's not easy to accumulate or process. Processing it is a technical hurdle which will be fun to tackle, but accumulating the data is a whole other matter.<br><br>So, here it is: Getting data from existent social bookmarking services may be an option we should consider. Think of it - aggregating data from
<a href="http://del.icio.us" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">del.icio.us</a>, stumbleupon, etc. Now, I can't imagine how we'd get Yahoo to give us the data from <a href="http://del.icio.us" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
del.icio.us</a>, but maybe there are other providers who would be willing to do this. Or perhaps we look at paying them for it, at least enough to cover their bandwidth and other overhead.
<br><br>Anybody got an ideas around this type of thing?</blockquote><div><br>Yeah.. its a good way to find the actual interest of the people thru social book marking, digg and many other social websites.. But it all matters whether they are ready to release data open to such open source search projects.. Its always possible to take those data for a small sum or even for free if we have more user base and many gets involved in the project.
<br><br>But as you mentioned, <a href="http://del.icio.us">del.icio.us</a>, stumbleupon.. they must be very open to support this project and must think about getting money with a open source business model instead of being more private and having deals with big heads around..
<br><br>Thats the difficult part i guess... <br><br>According to me, Web 2.0 is not that open and users are getting caged with the some closed social networking sites.. <br><br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<span class="sg">Aerik<br>
</span><br>_______________________________________________<br>Search-l mailing list<br><a onclick="return top.js.OpenExtLink(window,event,this)" href="mailto:Search-l@wikia.com">Search-l@wikia.com</a><br><a onclick="return top.js.OpenExtLink(window,event,this)" href="http://lists.wikia.com/mailman/listinfo/search-l" target="_blank">
http://lists.wikia.com/mailman/listinfo/search-l</a><br>Change options or unsubscribe: <a onclick="return top.js.OpenExtLink(window,event,this)" href="http://lists.wikia.com/mailman/options/search-l" target="_blank">http://lists.wikia.com/mailman/options/search-l
</a><br></blockquote></div><br><br clear="all"><br>-- <br>Pushparajan V<br><a href="http://www.vprajan.org">http://www.vprajan.org</a><br>- - - - - - - - <br>Know me: <a href="http://www.hackerkey.com/decrypt.php?hackerkey=v4sw57BCHJUY$hw3/5ln2pr6AFOPSck3ma4u7FLMSw7DTWXm6l6FGIKLRSU$i862NLJ0CAe6$t3b4en4a23Ns3MSr9g5AGO">
http://www.hackerkey.com/decrypt.php?hackerkey=v4sw57BCHJUY$hw3/5ln2pr6AFOPSck3ma4u7FLMSw7DTWXm6l6FGIKLRSU$i862NLJ0CAe6$t3b4en4a23Ns3MSr9g5AGO</a><br>- - - - - - - -