<span class="Apple-style-span" style="border-collapse: collapse; font-family: arial, sans-serif; font-size: 13px; ">Hi Olly,<div><br></div><div>At the moment no specific questions, since the project description itself gives all the necessary information very clearly. I just thought of having a look about different DFR schemes and BM25 scheme, to refresh and improve my knowledge about those. Since I have closely worked with TF-IDF model recently, thought of study more about DFR schemes, BM25 and compare each other to understand their pros and cons. </div>
<div><br></div><div>At the same time since I haven't used Xapian before, thought of get familiarize with that too. </div><div><br></div><div>What is your input on that? Any suggestions to improve my knowledge in this project? </div>
<div><br></div><div>Also, just thought of briefing you about myself. I graduated from University of Moratuwa, Sri Lanka, Faculty of Engineering specializing in the field of Computer Science and Engineering (<a href="http://www.cse.mrt.ac.lk/" target="_blank" style="color: rgb(17, 65, 112); ">http://www.cse.mrt.ac.lk/</a>). I topped the engineering batch (batch of more than 500 students) with a 4.06 GPA (out of 4.20). Also I was awarded the Gold Medal for the best Computer Science and Engineering student in 2007. After that I joined the industry as a software engineer and have more than 2 years of industry experience as a software engineer. In 2009, I was awarded a scholarship by Monash university to carry out my PhD studies and currently I am working in the field of Text Weighting and Text Mining techniques to optimize text clustering results.</div>
<div><br></div><div>Thank you very much for your input.</div><div><br></div><div>Cheers,</div><div>Sumith </div></span><br><div class="gmail_quote">On Sat, Mar 19, 2011 at 9:41 PM, Olly Betts <span dir="ltr"><<a href="mailto:olly@survex.com">olly@survex.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;"><div><div></div><div class="h5">On Sat, Mar 19, 2011 at 08:24:21PM +1100, Sumith Matharage wrote:<br>
> I'm Sumith, a postgraduate student in Monash university. I'm working in the<br>
> area of Text weighting schemes and Text Mining. When I'm going through the<br>
> GSOC project list, I felt interested in the 'Weighting Schemes' project. At<br>
> the moment, I have worked with different weighting schemes as TF-IDF and<br>
> would love to join and contribute with my ideas in this project.<br>
<br>
</div></div>OK, sounds like you're well qualified on the theory side there then.<br>
Did you have any particular questions?<br>
<br>
Cheers,<br>
<font color="#888888"> Olly<br>
</font></blockquote></div><br>