[Xapian-discuss] Indexing text files with Xapian

Justine Demeyer justine.demeyer at gmail.com
Tue Oct 21 08:47:12 BST 2008


Thanks for your answer!!

You said : " once you've read a file, you can pass its contents to Xapian to
index and then later search the database to find matching files."

How do I pass the content of a file to Xapian? Word by word?

Thank you.

On Sat, Oct 18, 2008 at 09:02:58PM +0200, Justine Demeyer wrote:
> > So, I'm on Ubuntu and I use the Java Bindings. I have to index a large
> > amount of text files and then, make some search on them. So, my question
> is
> > : is it possible to "ask" to xapian to index the content of all files and
> > ask to him to return which files has a certain word in its content??
>
> The Xapian API doesn't provide any functionality to recurse or iterate
> directories, or open files and load their contents (such APIs already
> exist, so why reinvent the wheel?)  But once you've read a file, you can
> pass its contents to Xapian to index and then later search the database
> to find matching files.
>
> Or just use the omindex program from Omega to do the indexing, and your
> own Java code to search.
>
> > I suppose it is possible but I don't know how to do that. Is there
> someone
> > who can help me, give me some indications, give me some examples, some
> > documentation??
>
> The C++ API documentation is the best reference, except that method
> names are mapped to match Java conventions (to get_mset -> getMSet).
> The Java wrappers are lagging a bit currently, because they're
> hand-coded JNI which is painful to update.  I've been slowly working
> on some new wrappers based on SWIG, but haven't had time to work
> on them for a while I'm afraid.
>
> There are some Java examples in the source code - you can find them
> online here:
>
>
> http://trac.xapian.org/browser/trunk/xapian-bindings/java/org/xapian/examples
>
> Cheers,
>     Olly
>


More information about the Xapian-discuss mailing list