<div dir="ltr"><div><div>Hello sir!<br></div><br>Thank you for your kind reply. I had looked at the GitHub repository before, but the code is not well commented. I am still reading through it to understand the algorithm and identify the redundant work in it. That is why I asked whether you have any other document or information that could help me understand it better. It's okay if there is no documentation available!<br><br></div><div>Sir, one more query: I am about to write a proposal for this project. How difficult would it be to write the code for my algorithm from scratch, or would it be better to reuse the previous code and correct it?<br></div><div><br></div>Thanks<br><div><div><br></div></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Mar 20, 2016 at 7:21 PM, James Aylett <span dir="ltr"><<a href="mailto:james-xapian@tartarus.org" target="_blank">james-xapian@tartarus.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><span class="">On Sun, Mar 20, 2016 at 01:02:03PM +0530, MURTUZA BOHRA wrote:<br>
<br>
> I am interested in the clustering of search results idea. I know clustering<br>
> techniques from the perspective of machine learning, and I am figuring out<br>
> a way to apply a machine learning technique to clustering search<br>
> results. Five months ago I did a project on LSI (latent semantic indexing and<br>
> ranking the search results); from that experience, given the TF-IDF matrix,<br>
> clustering can be done quickly and efficiently, which addresses the problem<br>
> in the GSoC-2016 project.<br>
<br>
</span>Hi, Murtuza! It certainly sounds like you have the right background to<br>
tackle this project.<br>
<span class=""><br>
> But I cannot find any document on how it was implemented in GSoC-2010;<br>
> that would help me understand the issues that were not taken care of<br>
> previously. Please help me access that document or the part of the code<br>
> where the actual clustering algorithm is implemented.<br>
<br>
</span>There was no 2010 GSoC work; there was some in 2014, but the project<br>
was unsuccessful and I believe got no further than an untested KMeans<br>
implementation.<br>
<br>
The earlier implementation (which wasn't part of GSoC) is available in<br>
the svn/clustering branch, as noted in the project description. Any<br>
documentation is likely to be code comments; I don't believe there's<br>
an internal architecture document for it. You can browse it on github<br>
(<a href="https://github.com/xapian/xapian/tree/svn/clustering" rel="noreferrer" target="_blank">https://github.com/xapian/xapian/tree/svn/clustering</a>), but it's<br>
probably easier just to clone it locally and look around.<br>
<span class="HOEnZb"><font color="#888888"><br>
J<br>
<br>
--<br>
James Aylett, occasional trouble-maker<br>
<a href="http://xapian.org" rel="noreferrer" target="_blank">xapian.org</a><br>
</font></span></blockquote></div><br></div>