[Xapian-discuss] Total matches found and how to disable stemming.

Kevin SoftDev kevin.softdev at gmail.com
Thu Mar 9 23:14:38 GMT 2006


Thanks, we are doing much better when adding [--stemmer=none] option to the
scriptindex. All the words are indexed and able to search as they appear in
the orginal content. Very good! This way we can easly avoid stemming words
that are not of English origin.

#!/bin/sh
#---------------------------------------------------------#
# This script create a Xapian index
#---------------------------------------------------------#
echo -n "Xapian Index Start: "
date
echo "The pid of this process is $$"
#---------------------------------------------------------#
echo "Retrieving data from database"
DBUSER=user DBPASSWORD=password /usr/local/bin/dbi2omega myDatabase myTable
> myData.dat
rm -f /indexPath/*
echo "Indexing "
/usr/local/bin/scriptindex --stemmer=none /indexPath indexscript myData.dat
#---------------------------------------------------------#
echo -n "Xapian Index End: "
date



On 3/9/06, James Aylett <james-xapian at tartarus.org> wrote:
>
> On Thu, Mar 09, 2006 at 06:35:13PM +0000, Olly Betts wrote:
>
> > > search for hiking does not return any results:
> > > http://nitra.net/cgi-bin/hladaj.cgi?a=q&q=hiking&c=sk
> > >
> > > search for hike return result including hiking:
> > > http://nitra.net/cgi-bin/hladaj.cgi?a=q&q=hike&c=sk
> >
> > That hike matches hiking, but hiking doesn't strongly suggests that
> > stemming is happening at index time.  So you need to fix that.
>
> The result matching 'hike' contains the word 'hike' in the page
> text. I'd suggest that no stemming is happening, but that the HTML
> <title>...</title> isn't being indexed.
>
> J
>
> --
>
> /--------------------------------------------------------------------------\
> James Aylett                                                  xapian.org
> james at tartarus.org                               uncertaintydivision.org
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.tartarus.org/pipermail/xapian-discuss/attachments/20060309/2cf036ed/attachment-0001.htm


More information about the Xapian-discuss mailing list