[Xapian-discuss] bigrams search speed and index documents

Ying Liu liux0395 at umn.edu
Thu Nov 5 23:52:38 GMT 2009


Hi Olly,

> I don't see how that would make any difference to search speed - the database
> will contain the same terms.
>   
It's very interesting to find out that the size of the generated 
database won't changed by the way how I build the index. Like you said, 
the database will contain the same terms, and it won't affect the 
searching speed. I have found the way to speed up the searching speed.

Thank you !
Ying

>   
>> Can I index a file directly instead of  TermGenerator?
>>     
>
> You can just call Document::add_term() and/or Document::add_posting() directly
> instead of generating a string to feed to TermGenerator.  That would be an
> easier and more efficient approach I think.
>
>   
>> In a previous post,  
>> http://lists.xapian.org/pipermail/xapian-discuss/2009-April/006626.html,  
>> it mentioned  tuning XAPIAN_FLUSH_THRESHOLD. How to do this to speed up  
>> the searching speed?
>>     
>
> XAPIAN_FLUSH_THRESHOLD only affects indexing.  It can slightly change where
> posting lists chunk boundaries are, and the internal layout of blocks in the
> Btree, which may indirectly affect search speed, but there's no direct effect
> on searching.
>
> Cheers,
>     Olly
>   




More information about the Xapian-discuss mailing list