[Xapian-discuss] TermGenerator question for the single quote character
James Aylett
james-xapian at tartarus.org
Wed Apr 8 17:28:14 BST 2009
On Wed, Apr 08, 2009 at 12:11:05PM -0400, tata 668 wrote:
> 2. Then I add the words I found at the end of the original text to
> index. Every added word is separated by a custom word delimiter
> (something like "||DEL||") to ensure two added words, side by side,
> wouldn't be found as a phrase.
You could instead increase the termpos between each of them, leaving a
gap so that adjacent added words can't match as a phrase, and you don't
end up with junk terms in your database.
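
A minimal sketch of the termpos-gap idea, using a toy positional index
in plain Python rather than Xapian itself. The sample words, the GAP
value and the helper names are all made up for illustration. In Xapian
the likely counterpart is calling TermGenerator's increase_termpos()
between the added words.

```python
from collections import defaultdict

GAP = 100  # assumed gap size; it only needs to exceed any phrase window


def index_words(postings, words, start_pos, gap=0):
    """Record each word's position; return the last position used."""
    pos = start_pos
    for word in words:
        pos += 1 + gap  # skip `gap` positions before every word
        postings[word].append(pos)
    return pos


def phrase_match(postings, first, second):
    """True if `second` sits at the position right after `first`."""
    following = set(postings.get(second, []))
    return any(p + 1 in following for p in postings.get(first, []))


postings = defaultdict(list)
# Original text: indexed at consecutive positions.
end = index_words(postings, ["cela", "m'excite"], start_pos=0)
# Added words (hypothetical): each one is preceded by a gap, so no
# delimiter term is needed to keep them from forming a phrase.
index_words(postings, ["excite", "aime"], start_pos=end, gap=GAP)

print(phrase_match(postings, "cela", "m'excite"))
print(phrase_match(postings, "excite", "aime"))
```

The original text still matches as a phrase, while the two added words
do not, and nothing like "||DEL||" ends up in the index.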
Of course, you might want to enable phrase searching across the added
words and the original text, in which case you want to (say) index
"excite" at the same termpos as "m'excite". That would require some
more smarts in your indexing, however, and probably can't be done with
preprocessing.
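
A rough sketch of the same-termpos idea, again as a toy positional index
in plain Python and not Xapian's API. The whitespace tokenizer, the
apostrophe rule and the helper names are assumptions for illustration
only. With Xapian this would probably mean adding postings yourself
(for example with Document's add_posting()) instead of relying on
TermGenerator alone.

```python
from collections import defaultdict


def index_with_variants(text):
    """Map term -> positions; the part after an apostrophe shares
    the position of the full token it came from."""
    postings = defaultdict(list)
    for pos, token in enumerate(text.lower().split(), start=1):
        postings[token].append(pos)
        _, sep, tail = token.partition("'")
        if sep and tail:
            postings[tail].append(pos)  # same position as the full token
    return postings


def phrase_match(postings, words):
    """True if the words occur at consecutive positions."""
    return any(
        all(start + i in postings.get(w, []) for i, w in enumerate(words))
        for start in postings.get(words[0], [])
    )


postings = index_with_variants("cela m'excite beaucoup")

print(phrase_match(postings, ["cela", "excite", "beaucoup"]))
print(phrase_match(postings, ["cela", "m'excite", "beaucoup"]))
```

Because both forms occupy one position, a phrase query matches whichever
form the user types.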
J
--
James Aylett
talktorex.co.uk - xapian.org - uncertaintydivision.org