[Xapian-discuss] More than one Index?
John Wards
j.wards at sportnetwork.net
Mon Jan 30 14:28:35 GMT 2006
On Mon, 2006-01-30 at 14:16 +0000, James Aylett wrote:
> > Now in terms of database size what difference would this make to the
> > database? The description is at least 5 times the length of the address
> > index and I have 650,000 documents...would I have to times the database
> > by 5 to get an idea? Which is quite scary as the database is 1.1 gig in
> > size at the moment.
>
> If you're worried about database sizes, you could drop positional data
> unless you need it (phrase queries, for instance).
How do I do that then?
> How big is your source data? That's about 18k per document, which is
> fairly large unless I misunderstand your application...
My scriptindex input file is the following:
id: field=id
address: field=caption
address: index
countyid: boolean=XL
price: field
image: field
description: field=sample
link: field=url
underoffer: boolean=XUNDER
sold: boolean=XSOLD
added: field date=unix
range: boolean=XRANGE
propid: field
sortprice: field value=1
The only large part would be description but that is likely to be no
more than 300 words, probably a lot less on average.
Here is a ls -al of my database dir
-rw-r--r-- 1 root wheel 420 Jan 28 18:49 value_baseB
-rw-r--r-- 1 root wheel 259375104 Jan 28 18:49 position_DB
-rw-r--r-- 1 root wheel 3978 Jan 28 18:49 position_baseA
-rw-r--r-- 1 root wheel 85467136 Jan 28 18:49 postlist_DB
-rw-r--r-- 1 root wheel 3978 Jan 28 18:49 position_baseB
-rw-r--r-- 1 root wheel 1321 Jan 28 18:49 postlist_baseA
-rw-r--r-- 1 root wheel 623452160 Jan 28 18:49 record_DB
-rw-r--r-- 1 root wheel 1322 Jan 28 18:49 postlist_baseB
-rw-r--r-- 1 root wheel 164724736 Jan 28 18:48 termlist_DB
-rw-r--r-- 1 root wheel 9532 Jan 28 18:48 record_baseB
-rw-r--r-- 1 root wheel 9531 Jan 28 18:48 record_baseA
-rw-r--r-- 1 root wheel 2532 Jan 28 18:48 termlist_baseB
-rw-r--r-- 1 root wheel 26394624 Jan 28 18:48 value_DB
-rw-r--r-- 1 root wheel 2505 Jan 28 18:48 termlist_baseA
-rw-r--r-- 1 root wheel 10 Jan 28 18:47 meta
-rw-r--r-- 1 root wheel 403 Jan 28 18:47 value_baseA
I am on a FreeBSD jail and pay per extra Gig of space...and always have
a exact copy of my database for testing etc..so that would be an extra
10 gigish which my bank balance can't cope with just now.
Thanks
John
More information about the Xapian-discuss
mailing list