[Xapian-discuss] bigrams search speed and index documents
Ying Liu
liux0395 at umn.edu
Thu Nov 5 23:52:38 GMT 2009
Hi Olly,
> I don't see how that would make any difference to search speed - the database
> will contain the same terms.
>
It's interesting to find that the size of the generated
database doesn't change with the way I build the index. As you said,
the database will contain the same terms, so it won't affect
search speed. I have since found a way to speed up searching.
Thank you!
Ying
>
>> Can I index a file directly instead of using TermGenerator?
>>
>
> You can just call Document::add_term() and/or Document::add_posting() directly
> instead of generating a string to feed to TermGenerator. That would be an
> easier and more efficient approach I think.
>
>
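A minimal sketch of the direct approach Olly describes, written against the Python bindings, where xapian.Document exposes add_term() and add_posting(). The helper index_bigrams(), the "_" separator, the 1-based positions and the RecordingDocument stand-in are all assumptions made for illustration; the stand-in only exists so the snippet runs without Xapian installed.

```python
class RecordingDocument:
    """Hypothetical stand-in with the same add_posting() call shape as
    xapian.Document, so the sketch can run without Xapian installed."""

    def __init__(self):
        self.postings = []

    def add_posting(self, term, position, wdfinc=1):
        # Record what would have been added to the real document.
        self.postings.append((term, position, wdfinc))


def index_bigrams(doc, tokens, sep="_"):
    """Add every adjacent token pair to doc as a single term.

    Each bigram is positioned at the (1-based) index of its first token.
    Returns the number of bigram postings added.
    """
    count = 0
    for pos, (left, right) in enumerate(zip(tokens, tokens[1:]), start=1):
        doc.add_posting(left + sep + right, pos)
        count += 1
    return count


# With Xapian installed, use: doc = xapian.Document()
doc = RecordingDocument()
added = index_bigrams(doc, "the quick brown fox".split())
```

If positional information is not needed, doc.add_term(term) could be called in place of add_posting(); either way no intermediate string has to be built and re-parsed by TermGenerator.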
>> In a previous post,
>> http://lists.xapian.org/pipermail/xapian-discuss/2009-April/006626.html,
>> it mentions tuning XAPIAN_FLUSH_THRESHOLD. How do I do this to speed up
>> searching?
>>
>
> XAPIAN_FLUSH_THRESHOLD only affects indexing. It can slightly change where
> posting list chunk boundaries are, and the internal layout of blocks in the
> Btree, which may indirectly affect search speed, but there's no direct effect
> on searching.
>
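For reference, XAPIAN_FLUSH_THRESHOLD is an environment variable, so one way to tune it for an indexing run is to set it in the indexer's environment before the writable database is opened. The sketch below does that from Python; the helper name set_flush_threshold() and the value 50000 are arbitrary choices for illustration, and the assumption is that the variable must be in place before the database is opened for writing.

```python
import os


def set_flush_threshold(n_docs):
    """Set XAPIAN_FLUSH_THRESHOLD for this process and return the stored value.

    Assumption: this is called before the writable database is opened,
    so the indexer sees the new value.
    """
    os.environ["XAPIAN_FLUSH_THRESHOLD"] = str(int(n_docs))
    return os.environ["XAPIAN_FLUSH_THRESHOLD"]


# 50000 is an arbitrary illustrative value, not a recommendation.
value = set_flush_threshold(50000)

# With Xapian installed, the database would be opened afterwards, e.g.:
# db = xapian.WritableDatabase("index_dir", xapian.DB_CREATE_OR_OPEN)
```

As noted above, this only changes how often pending changes are flushed while indexing; it is not a search-time setting.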
> Cheers,
> Olly
>