[Xapian-discuss] auto stopwords
    Sam Liddicott 
    sam at liddicott.com
       
    Wed Dec 15 09:43:55 GMT 2004
    
    
  
As well as within-document-frequency and within-index-frequency, is there 
any benefit in keeping the not-in-document-frequency, i.e. the number of 
documents that do NOT contain a given term?
It would provide an at-a-glance view of how many documents a term might 
select, and it could be good for auto-stopword selection by showing how 
useless a term is as a document selector.
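
To make the idea concrete, here is a rough sketch in plain Python (not Xapian code). It assumes the not-in-document count is simply the total number of documents minus the number of documents containing the term, so it may not need to be stored separately. The function names, the whitespace tokenising and the `max_fraction` threshold are all made up for illustration.

```python
from collections import Counter


def document_frequencies(docs):
    """Count, for each term, how many documents contain it at least once."""
    df = Counter()
    for text in docs:
        # set() so a term repeated within one document is counted once
        df.update(set(text.lower().split()))
    return df


def not_in_document_count(docs, term):
    """Number of documents that do NOT contain the term (hypothetical helper)."""
    return len(docs) - document_frequencies(docs)[term]


def auto_stopwords(docs, max_fraction=0.8):
    """Terms present in more than max_fraction of documents (assumed cutoff)."""
    n = len(docs)
    df = document_frequencies(docs)
    return sorted(term for term, count in df.items() if count / n > max_fraction)


docs = ["the cat sat", "the dog ran", "the bird flew", "a cat ran"]
stopwords = auto_stopwords(docs, max_fraction=0.7)
```

A term that is missing from very few documents selects almost everything, which is what would make it a stopword candidate under this scheme.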
Just an idea
Sam