[Xapian-tickets] [Xapian] #225: Spelling algorithm should consider frequency and not just edit-distance
Xapian
nobody at xapian.org
Tue Jan 6 00:39:28 GMT 2009
#225: Spelling algorithm should consider frequency and not just edit-distance
-------------------------+--------------------------------------------------
Reporter: philipn | Owner: olly
Type: defect | Status: assigned
Priority: normal | Milestone: 1.1.1
Component: Library API | Version: SVN trunk
Severity: normal | Resolution:
Keywords: | Blockedby:
Platform: All | Blocking:
-------------------------+--------------------------------------------------
Changes (by olly):
* status: new => assigned
* milestone: => 1.1.1
Old description:
> As described here:
> http://lists.tartarus.org/pipermail/xapian-discuss/2008-January/005104.html
>
> If the spelling correction algorithm considered frequency and
> edit-distance (using some reasonable heuristic) we would see
> dramatically better results. The current spelling algorithm will only
> correct words that never appear in the spelling index.
New description:
As described here:
http://thread.gmane.org/gmane.comp.search.xapian.general/5740/focus=5743

If the spelling correction algorithm considered frequency and
edit-distance (using some reasonable heuristic) we would see
dramatically better results. The current spelling algorithm will only
correct words that never appear in the spelling index.
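
To make the request concrete, here is a minimal sketch of one possible
"reasonable heuristic". It is not Xapian's code or API: the names
edit_distance, suggest, freqs, max_distance and penalty are all
hypothetical, and the scoring formula is only an assumption chosen for
illustration. The point it shows is that the queried word competes with
its neighbours at distance 0, so a word that is present in the index but
rare can still lose to a much more frequent word nearby.

```python
from math import log


def edit_distance(a, b):
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # delete from a
                           cur[j - 1] + 1,       # insert into a
                           prev[j - 1] + cost))  # substitute or match
        prev = cur
    return prev[-1]


def suggest(word, freqs, max_distance=2, penalty=2.0):
    """Return the best-scoring correction, or None to leave the word alone.

    freqs is a hypothetical {term: frequency} mapping standing in for the
    spelling index. The score log(1 + freq) - penalty * distance is an
    assumed heuristic, not the one used by any real library.
    """
    best, best_score = None, float("-inf")
    for cand, freq in freqs.items():
        d = edit_distance(word, cand)
        if d > max_distance:
            continue
        score = log(1 + freq) - penalty * d
        if score > best_score:
            best, best_score = cand, score
    # If the word itself scored best, no correction is offered.
    return None if best == word else best
```

With freqs = {"the": 10000, "teh": 2}, suggest("teh", freqs) offers "the"
even though "teh" is itself indexed, while suggest("the", freqs) offers
nothing. The penalty weight would need tuning against real data.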
--
Comment:
It would be good to improve this during the 1.1.x series.
[Changed link to list discussion to point to gmane for easier browsing of
the thread.]
--
Ticket URL: <http://trac.xapian.org/ticket/225#comment:4>
Xapian <http://xapian.org/>