[Xapian-discuss] TermGenerator question for the single quote character

James Aylett james-xapian at tartarus.org
Wed Apr 8 17:28:14 BST 2009


On Wed, Apr 08, 2009 at 12:11:05PM -0400, tata 668 wrote:

> 2. Then I add the words I found at the end of the original text to 
> index. Every added word is separated by a custom word delimiter 
> (something like "||DEL||") to ensure two added words, side by side, 
> wouldn't be found as a phrase.

You could instead increase the termpos (term position) between each of
them, so that adjacent added words can't match as a phrase and you
don't end up with junk delimiter terms in your database.
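
A minimal sketch of that idea, in Python for brevity. The helper names
(spaced_postings, add_to_document) and the whitespace-free word list are
assumptions made for illustration, not part of Xapian. The gap of 100 is
chosen to be larger than any realistic phrase or NEAR window; as far as
I know TermGenerator::increase_termpos() does the same job if you are
indexing through a TermGenerator.

```python
def spaced_postings(words, start_pos, gap=100):
    """Return (term, position) pairs with `gap` positions between words.

    `start_pos` is the last position used by the original text. Because
    consecutive added words end up `gap` positions apart, a phrase query
    cannot match two of them as neighbours.
    """
    postings = []
    pos = start_pos
    for word in words:
        pos += gap  # jump ahead instead of inserting a delimiter term
        postings.append((word.lower(), pos))
    return postings


def add_to_document(doc, postings):
    """Feed the pairs to a document object.

    `doc` is assumed to be a xapian.Document (or anything else with an
    add_posting(term, position) method); xapian is not imported here so
    the sketch runs without the bindings installed.
    """
    for term, pos in postings:
        doc.add_posting(term, pos)
```

With the original text ending at position 10,
spaced_postings(["excite", "aime"], 10) places "excite" at 110 and
"aime" at 210, and no "||DEL||" term is ever written to the database.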

Of course, you might want to enable phrase searching across these
things, in which case you want to (say) index "excite" at the same
termpos as "m'excite". That would require some more smarts in your
indexing, however, and probably can't be done with preprocessing.
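
Roughly what I mean by "more smarts", as a sketch only. It assumes you
tokenize the text yourself (here naively, on whitespace, which is much
cruder than what TermGenerator does) and that splitting at the first
apostrophe is good enough for your data. The function name is made up
for this example.

```python
def postings_with_elision_variants(text, start_pos=0):
    """Return (term, position) pairs for `text`.

    Each whitespace-separated token gets the next position. If a token
    contains an apostrophe, the part after the first apostrophe is also
    emitted at the SAME position, so "m'excite" and "excite" share a
    termpos and either form can take part in a phrase match.
    """
    postings = []
    pos = start_pos
    for token in text.lower().split():
        pos += 1
        postings.append((token, pos))
        _head, sep, tail = token.partition("'")
        if sep and tail:
            # Extra term at the same position as the full token.
            postings.append((tail, pos))
    return postings
```

Each pair could then be handed to Document::add_posting(term, termpos).
For "cela m'excite beaucoup" the terms "m'excite" and "excite" both sit
at position 2, so a phrase search for "excite beaucoup" can still match.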

J

-- 
  James Aylett

  talktorex.co.uk - xapian.org - uncertaintydivision.org


