[Xapian-devel] [GSoC 2014] About "Clustering of Search Results"
Chi Liu
liuchi09 at gmail.com
Sun Mar 16 22:13:28 GMT 2014
Hello,
I have submitted my proposal on GSoC.
But I have little idea about the timeline. Many things are difficult to be
determined.
Cheers,
Liu Chi
2014-03-11 21:33 GMT+08:00 Olly Betts <olly at survex.com>:
> On Tue, Mar 11, 2014 at 10:11:31AM +0800, Chi Liu wrote:
> > Thank you for your patient explanation about the project. My
> > understanding about the project "Clustering of Search Results" is that
> > we mainly focus on processing speed of the existing code.
>
> We need something which can cluster larger result sets faster than the
> current code. Speeding up the existing code might be the best way to do
> that, but we could start again. If we start again, I'd suggest it would
> be prudent to try to understand why the previous attempt didn't succeed.
> We don't want to end up repeating that.
>
> > By "find new approaches" I mean trying other known clustering algorithms.
>
> OK - that's fine then.
>
> > What I am concerned is whether the low efficiency is caused by
> > improper algorithm. I am reading the existing clustering branch code
> > and have not completely finished yet. I might be able to talk more
> > about existing code in my application of GSoC. But now, I really can
> > not comment before fully understanding exiting code.
>
> Sure.
>
> > My idea about measure clustering effectiveness is that when we trying
> > other known clustering algorithms, we can use the old clustering
> > result as a baseline. If the difference of clustering results is
> > acceptable and new clustering algorithm has high efficiency, we may
> > find a better approach. I will give more details about this in my
> > application of GSoC.
>
> Great.
>
> Cheers,
> Olly
>
--
Chi Liu
+86-15210624786
Undergraduate Student
Team of Search Engine and Web Mining
School of Electronic Engineering and Computer Science
Peking University, Beijing, 100871, P.R.China
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20140317/121a8af8/attachment.html>
More information about the Xapian-devel
mailing list