[Xapian-discuss] Updating existing documents

Richard Boulton richard at lemurconsulting.com
Mon May 18 17:24:18 BST 2009


2009/5/18 Luis Zarrabeitia <kyrie at uh.cu>:
> Another newbie, related question:
>  How can you get the ID of the of the document to replace, given the new
>  document?
>
> The OP's problem seems to be that when he crawls the second time, he is
> indexing the same documents (and thus, they appear twice in the database).
> Thus, the second time he finds the document, he must know the ID that was
> assigned the first time (I recently had a similar situation[1], where I was
> trying to use document titles - this case seems to be similar, only with URLs
> instead of titles). Should the OP (or I) keep an external mapping URL->doc_id
> (and be careful with xapian-compacts), or is there a better way?

See http://trac.xapian.org/wiki/FAQ/UniqueIds

-- 
Richard



More information about the Xapian-discuss mailing list