[Xapian-tickets] [Xapian] #741: "Empty termnames aren't allowed" by indexing text in Arabic

Xapian nobody at xapian.org
Fri Nov 25 06:02:16 GMT 2016


#741: "Empty termnames aren't allowed" by indexing text in Arabic
-------------------------+-----------------------------
 Reporter:  Kelson       |             Owner:  olly
     Type:  defect       |            Status:  assigned
 Priority:  normal       |         Milestone:  1.4.2
Component:  Library API  |           Version:  1.4.1
 Severity:  normal       |        Resolution:
 Keywords:               |        Blocked By:
 Blocking:               |  Operating System:  Linux
-------------------------+-----------------------------

Comment (by Kelson):

 @Olly Do you mean I can safely ignore "Xapian::InvalidArgumentError"
 exceptions here?

 We use the stemmer of Xapian, here is the corresponding piece of code:

 {{{
 /* Build ICU Local object to retrieve ISO-639 language code (from
    ISO-639-3) */
 icu::Locale *languageLocale = new icu::Locale(language.c_str());

 /* Configuring language base steemming */
 try {
    this->stemmer = Xapian::Stem(languageLocale->getLanguage());
    this->indexer.set_stemmer(this->stemmer);
    this->indexer.set_stemming_strategy(Xapian::TermGenerator::STEM_ALL);
 } catch (...) {
    std::cout << "No steemming for language '" <<
 languageLocale->getLanguage() << "'" << std::endl;
 }
 }}}

--
Ticket URL: <https://trac.xapian.org/ticket/741#comment:2>
Xapian <https://xapian.org/>
Xapian



More information about the Xapian-tickets mailing list