[Xapian-devel] GSOC 2011 : Weighting Schemes

Sumith Matharage sumith.matharage at gmail.com
Mon Mar 21 00:49:07 GMT 2011


Hi Olly,

At the moment no specific questions, since the project description itself
gives all the necessary information very clearly. I just thought of having a
look about different DFR schemes and BM25 scheme, to refresh and improve my
knowledge about those. Since I have closely worked with TF-IDF model
recently, thought of study more about DFR schemes, BM25 and  compare each
other to understand their pros and cons.

At the same time since I haven't used Xapian before, thought of
get familiarize with that too.

What is your input on that? Any suggestions to improve my knowledge in this
project?

Also, just thought of briefing you about myself. I graduated from University
of Moratuwa, Sri Lanka, Faculty of Engineering specializing in the field of
Computer Science and Engineering (http://www.cse.mrt.ac.lk/). I topped
the engineering batch (batch of more than 500 students) with a 4.06 GPA (out
 of 4.20). Also I was awarded the Gold Medal for the best Computer Science
and Engineering student in 2007. After that I joined the industry as a
software engineer and have more than 2 years of industry experience as a
software engineer. In 2009, I was awarded a scholarship by Monash university
to carry out my PhD studies and currently I am working in the field of Text
Weighting and Text Mining techniques to optimize text clustering results.

Thank you very much for your input.

Cheers,
Sumith

On Sat, Mar 19, 2011 at 9:41 PM, Olly Betts <olly at survex.com> wrote:

> On Sat, Mar 19, 2011 at 08:24:21PM +1100, Sumith Matharage wrote:
> > I'm Sumith, a postgraduate student in Monash university. I'm working in
> the
> > area of Text weighting schemes and Text Mining. When I'm going through
> the
> > GSOC project list, I felt interested in the 'Weighting Schemes' project.
> At
> > the moment, I have worked with different weighting schemes as TF-IDF and
> > would love to join and contribute with my ideas in this project.
>
> OK, sounds like you're well qualified on the theory side there then.
> Did you have any particular questions?
>
> Cheers,
>     Olly
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20110321/72eb3fb9/attachment.htm>


More information about the Xapian-devel mailing list