[Xapian-devel] GSoC - Improving Spelling Correction
hitandyliu at gmail.com
Mon Apr 11 08:45:25 BST 2011
I’m a student in Peking University, China, my name is Xiaobing Liu, I’m
interest in Spelling correction project.
I think Xapian’s Spelling correction is based on edit distance with trusted
dictionary. I have some questions about the spelling correction algorithm.
1. The Spelling correction algorithm depends on which dictionary, the
dictionary in inverted index or other trusted dictionary?
2. A simple approach to get the correction words is traversing the
dictionary and ranking for each word, when the dictionary is huge, this
approach should be quite time consuming, how Xapian solve this problem
3. Since search engine always suggest only the top one correction for
spelling, improving the aspect “only a single suggested correction is
available via the API” may be not so useful.
Any discussion is appreciated.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Xapian-devel