[Xapian-tickets] [Xapian] #225: Spelling algorithm should consider frequency and not just edit-distance

Xapian nobody at xapian.org
Tue Jan 6 00:39:28 GMT 2009


#225: Spelling algorithm should consider frequency and not just edit-distance
-------------------------+--------------------------------------------------
 Reporter:  philipn      |        Owner:  olly     
     Type:  defect       |       Status:  assigned 
 Priority:  normal       |    Milestone:  1.1.1    
Component:  Library API  |      Version:  SVN trunk
 Severity:  normal       |   Resolution:           
 Keywords:               |    Blockedby:           
 Platform:  All          |     Blocking:           
-------------------------+--------------------------------------------------
Changes (by olly):

  * status:  new => assigned
  * milestone:  => 1.1.1


Old description:

> As described here:
> http://lists.tartarus.org/pipermail/xapian-discuss/2008-January/005104.html
>
> If the spelling correction algorithm considered frequency and
> edit-distance (using some reasonable heuristic) we would see
> dramatically better results. The current spelling algorithm will only
> correct words that never appear in the spelling index.

New description:

 As described here:
 http://thread.gmane.org/gmane.comp.search.xapian.general/5740/focus=5743

 If the spelling correction algorithm considered frequency and
 edit-distance (using some reasonable heuristic) we would see
 dramatically better results. The current spelling algorithm will only
 correct words that never appear in the spelling index.
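
 To illustrate the kind of heuristic the description asks for, here is a
 minimal Python sketch. It is not Xapian's implementation: the scoring
 formula (log frequency minus a per-edit penalty), the function names
 `edit_distance` and `suggest`, the `penalty` and `max_distance` defaults,
 and the sample frequencies are all assumptions chosen for illustration.
 The point it shows is that a word already present in the index can still
 be corrected when a nearby word is much more frequent.

```python
import math


def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute), two-row DP."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def suggest(word, freqs, max_distance=2, penalty=2.0):
    """Return a better-scoring neighbour of `word`, or None to keep it.

    Hypothetical heuristic: score = log(1 + frequency) - penalty * distance.
    The typed word competes at distance 0, so a rare word that is already
    in the index can lose to a much more frequent word close to it.
    """
    best = None
    best_score = math.log1p(freqs.get(word, 0))
    for candidate, freq in freqs.items():
        if candidate == word:
            continue
        distance = edit_distance(word, candidate)
        if distance > max_distance:
            continue
        score = math.log1p(freq) - penalty * distance
        if score > best_score:
            best, best_score = candidate, score
    return best


# Made-up spelling index: "serch" was indexed once as a typo.
FREQS = {"search": 1200, "serch": 1, "perch": 30}

print(suggest("serch", FREQS))   # search
print(suggest("search", FREQS))  # None
```

 With these sample numbers, "serch" is corrected to "search" even though
 it appears in the index, while "search" itself is left alone. A real
 implementation would need to tune the trade-off between frequency and
 distance and avoid scanning every term.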

--

Comment:

 It would be good to improve this during the 1.1.x series.

 [Changed link to list discussion to point to gmane for easier browsing of
 the thread.]

-- 
Ticket URL: <http://trac.xapian.org/ticket/225#comment:4>
Xapian <http://xapian.org/>


