[Xapian-discuss] More than one Index?

John Wards j.wards at sportnetwork.net
Mon Jan 30 14:28:35 GMT 2006


On Mon, 2006-01-30 at 14:16 +0000, James Aylett wrote:
> > Now in terms of database size what difference would this make to the
> > database? The description is at least 5 times the length of the address
> > index and I have 650,000 documents... would I have to multiply the
> > database size by 5 to get an idea? That is quite scary, as the database
> > is 1.1 gig in size at the moment.
> 
> If you're worried about database sizes, you could drop positional data
> unless you need it (phrase queries, for instance).

How do I do that then?

> How big is your source data? That's about 18k per document, which is
> fairly large unless I misunderstand your application...

My scriptindex input file is the following:

id: field=id
address: field=caption
address: index
countyid: boolean=XL
price: field
image: field
description: field=sample
link: field=url
underoffer: boolean=XUNDER
sold: boolean=XSOLD
added: field date=unix
range: boolean=XRANGE
propid: field
sortprice: field value=1

The only large part would be description but that is likely to be no
more than 300 words, probably a lot less on average.

Here is an ls -al of my database dir:

-rw-r--r--  1 root  wheel        420 Jan 28 18:49 value_baseB
-rw-r--r--  1 root  wheel  259375104 Jan 28 18:49 position_DB
-rw-r--r--  1 root  wheel       3978 Jan 28 18:49 position_baseA
-rw-r--r--  1 root  wheel   85467136 Jan 28 18:49 postlist_DB
-rw-r--r--  1 root  wheel       3978 Jan 28 18:49 position_baseB
-rw-r--r--  1 root  wheel       1321 Jan 28 18:49 postlist_baseA
-rw-r--r--  1 root  wheel  623452160 Jan 28 18:49 record_DB
-rw-r--r--  1 root  wheel       1322 Jan 28 18:49 postlist_baseB
-rw-r--r--  1 root  wheel  164724736 Jan 28 18:48 termlist_DB
-rw-r--r--  1 root  wheel       9532 Jan 28 18:48 record_baseB
-rw-r--r--  1 root  wheel       9531 Jan 28 18:48 record_baseA
-rw-r--r--  1 root  wheel       2532 Jan 28 18:48 termlist_baseB
-rw-r--r--  1 root  wheel   26394624 Jan 28 18:48 value_DB
-rw-r--r--  1 root  wheel       2505 Jan 28 18:48 termlist_baseA
-rw-r--r--  1 root  wheel         10 Jan 28 18:47 meta
-rw-r--r--  1 root  wheel        403 Jan 28 18:47 value_baseA
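
As a rough check on where the space goes, the table sizes in the listing
above can be added up. This is only a minimal Python sketch, not anything
that ships with Xapian: the byte counts are copied from the listing, the
650,000 document count is the figure quoted earlier, and the names
table_sizes, DOC_COUNT and share are made up for illustration. The small
*_baseA, *_baseB and meta files are ignored, since they total only a few
tens of kilobytes.

```python
# Sizes in bytes, copied from the ls -al listing (the *_DB files only).
table_sizes = {
    "position_DB": 259375104,
    "postlist_DB": 85467136,
    "record_DB": 623452160,
    "termlist_DB": 164724736,
    "value_DB": 26394624,
}

# Document count as quoted in the thread (assumed to still be accurate).
DOC_COUNT = 650_000

total = sum(table_sizes.values())


def share(name):
    """Return the fraction of the total size taken by one table."""
    return table_sizes[name] / total


# Print the tables from largest to smallest with their share of the total.
for name, size in sorted(table_sizes.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:12s} {size / 1e6:8.1f} MB  {100 * share(name):5.1f}%")

print(f"total        {total / 1e6:8.1f} MB")
print(f"per document {total / DOC_COUNT:8.0f} bytes")
```

By this arithmetic the tables come to roughly 1.16 GB, or about 1.8 KB per
document. record_DB takes a little over half of that and position_DB a
little over a fifth, so dropping positional data alone would probably save
somewhere around a fifth of the current size.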

I am in a FreeBSD jail and pay per extra gig of space, and I always keep
an exact copy of my database for testing etc., so that would be an extra
10 gig or so, which my bank balance can't cope with just now.

Thanks
John



