#x27;::jsonb)
Rows Removed by Index Recheck: 4360296
Heap Blocks: exact=37031 lossy=872059
Hmm, looks like too small work_mem because lossy heap block count is too big.
--
Teodor Sigaev E-mail: teo...@sigaev.ru
Hm, I don't think having the compile/link lines be hidden up is
acceptable. Many times we need to debug some compile problem, and the
output is mandatory.
+1
Although it could be fixed by
VERBOSE=1 make
--
Teodor Sigaev E-mail: teo...@siga
Teodor Sigaev wrote:
I tried it on FreeBSD 64-bit, 16Gb, SSD, Core i7
( ./configure && gmake all; ) 168,99s user 15,46s system 97% cpu 3:09,61 total
( cmake . && gmake all; ) 75,11s user 11,34s system 100% cpu 1:26,30 total
( CFLAGS='-O2' cmake . && gmake
h
optimization level does cmake buld use by default? Which compiler does it take?
It's not obvious, because cmake build hides actual compiler command line.
Yury, pls, return back check target...
--
Teodor Sigaev
chunks.
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org
toast_decompress_datum
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http
lement some search method which decompress
some chunks of jsonb.
Could you send to me an example of that jsonb?
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general m
uses FTS with Chinese language.
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
ues
to an array, but that doesn't seem to work with prefix searching. Any pointers
would be much appreciated!
Thanks,
Albert
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via
Great, thank you!
I assume this one goes into 8.4.3, right?
Yeah, or apply patch
http://anoncvs.postgresql.org/cvsweb.cgi/pgsql/src/backend/access/gin/ginscan.c?r1=1.25&r2=1.26
--
Teodor Sigaev E-mail: teo...@sigae
can's strategy.
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.post
Thank you for the report, will see on this weekend
Vyacheslav Kalinin wrote:
Hello,
Basically, I started testing prefix matching in FTS and got into
troubles. Self-contained example follows:
--
Teodor Sigaev E-mail: teo...@sigaev.ru
ppose.
Will you apply this, or do you want me to?
I'm not able to provide a good error message in good English :(
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-gener
ntryInsert. ginEntryInsert is called
during pending list cleanup.
Patch removes checking of TOAST_INDEX_TARGET and use checking only by
GinMaxItemSize which is greater than TOAST_INDEX_TARGET. All size's check is now
in GinFormTuple.
--
Te
it seems that I should reduce the Fill Factor of some FTS indexes, but
what is the default ?
> The other index methods use fillfactor in different but roughly
analogous ways;
> the default fillfactor varies between methods
Actually, GIN doesn't use it.
--
Te
I tried to create an index including all of the fields I query on to
see if that would work, but I get an error the the index row is too
large:
=> create index master_index on source_listings(geo_lat, geo_lon,
price, bedrooms, region, city, listing_type, to_tsvector('english',
full_listing), p
char" issue? Does this affect the old contrib/tsearch2 code?
Checked - No, that was improvement for 8.3 :).
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-ge
n't mind, I can raise my
points, and see if anything useful comes out of it.
If you see bug/mistake/suspicious point, please, don't be quiet
Also, about that patch - it doesn't seem to apply cleanly to 8.4,
perhaps that file has changed too much (I based my 'review' ab
Tom Lane wrote:
Teodor Sigaev writes:
I reproduced the bug with a help of Grzegorz's point for 64-bit box.
Hmm, seems it's not so much a "64 bit" error as a "signed vs unsigned
char" issue?
Yes, but I don't understand why it worked in 32-bit box.
ld be some safety limit for lenght of str,
It's a C-string
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make change
I reproduced the bug with a help of Grzegorz's point for 64-bit box. So, patch
is attached and I'm going to commit it
--
Teodor Sigaev E-mail: teo...@sigaev.ru
WWW: http://www.sigaev.ru/
*** src/backe
tgres
BACKEND_PID and type run in gdb, next, execute CREATE DICTIONARY and type bt in gdb
Teodor Sigaev wrote:
Could you provide a backtrace? Do you use unchanged norwegian.stop file?
I'm not able to reproduce the bug - postgres just works.
Tommy Gildseth wrote:
While trying to create a new dict
Could you provide a backtrace? Do you use unchanged norwegian.stop file?
I'm not able to reproduce the bug - postgres just works.
Tommy Gildseth wrote:
While trying to create a new dictionary for use with PostgreSQL text
search, I get a segfault. My Postgres version is 8.3.5
--
Teodor S
No matter if I drop the trigger that update agg content and the fact
that I'm just updating d, postgresql will update the index?
Yes, due to MVCC. Update of row could produce new version (tuple) and new
version should be index as old one.
--
Teodor Sigaev
ns to develop fast_insert_gin patch which now in review
process.
Ivan Sergio Borgonovo wrote:
I've a table that contain a tsvector that is indexed (gin) and
triggers to update the tsvector that should then update the index.
--
Teodor Sigaev E-mail: teo...
select * from FT1IDX_trigger_create();
commit;
function are used since I've 2 triggers actually that I drop and
create.
Is there anything wrong in the above to make this update so slow on
a 2x Xeon 3.2GHz 4GbRAM and a RAID1 [sic] I know it is slow on write.
--
Teodor Sigaev
es are not
acceptable for french locale :(
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes
ting of your db
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.po
) effective_cache_size
2) The query
3) Output of EXPLAIN ANALYZE of query
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make
dex at once. Do you know why that's happening?
I suppose, if maintenance_work_mem is rather big then all data of index
accumulates in memory and so it writes at disk at once. With that test's options
size of index is equal to 40Mb.
--
Teodor Sigaev
We could extend IndexBuildHeapScan's API to support that, but I'm
not quite convinced that this is the issue.
That extension might be useful for bitmap index too to simplify index creation
process.
--
Teodor Sigaev E-mail: [EMAIL
X fooidx ON foo USING gin (v);
LOG: checkpoints are occurring too frequently (16 seconds apart)
HINT: Consider increasing the configuration parameter "checkpoint_segments".
CREATE INDEX
Time: 14557,750 ms
Again, near to the same time.
--
Teodor Sigaev
GIN's build algorithm could use bulk insert of ItemPointers if and only if they
should be inserted on rightmost page (exact piece of code - dataPlaceToPage() in
gindatapage.c, lines 407-427)
I'm not following. Rightmost page of what --- it can't be the whole
index, can it, or the case would ha
sequence scan's history.
GIN's build algorithm could use bulk insert of ItemPointers if and only if they
should be inserted on rightmost page (exact piece of code - dataPlaceToPage() in
gindatapage.c, lines 407-427)
Is any way to force table's scan from t
s any [alnum]+-[alnum]+ as a hyphenated word, but 8.3 treats
[digit]+-[digit]+ as two separated numbers.
So, you can play around pre-process texts before indexing or have a look on
regex dictionary (http://vo.astronet.ru/arxiv/dict_regex.html)
--
Teodor Sigae
20 hours to find the fix Teodor, Kudos !
Nothing for the pride :(, my bug.
Due to the importance of the fix, will we see very soon a 8.3.5 ?
Don't known, see discussion. I think, that will make sense.
--
Teodor Sigaev E-mail: [EMAIL PROT
Fixed, patch attached.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
diff -c -r src.orig/backend/access/gist/gistget.c
src/backend/access/gist/gistget.c
*** src.orig/backend/access/gist
nt on table1
(cost=0.00..8.27 rows=1 width=0) (actual time=0.011..0.408 rows=98
loops=1)
Total runtime: 0.477 ms
(3 rows)
--
Regards,
Sergey Konoplev
--
PostgreSQL articles in english & russian
http://gray-hemp.blogspot.com/search/label/postgresql/
--
Teodor Sigaev
8 rows=640 width=2)
Recheck Cond: (t = 1)
-> Bitmap Index Scan on qqidx (cost=0.00..19.46 rows=640 width=0)
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-genera
. Will see.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://www.postgresql.or
It *may* be because I'm using psql 8.0.3 and not the latest version (but
I'm stucked with that version), i'm just hoping that one of you have met
Upgrade to 8.0.17 - there was a several fixes in ISpell code.
--
Teodor Sigaev E-mail: [
tateAction into
table(s) with some caching for first step) , but it's complex to operate and
it's needed to prove correctness of changes in states before its become in use.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
made as module, I think.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http
SELECT id FROM document WHERE to_tsvector('english',text) @@
plainto_tsquery('english','despite this');
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
setext). Method should mark a needed
words/parts/lexemes etc.
4 ts_headline glues fragments into text and returns that.
We need a parser's headline method because only parser knows all about its
lexemes.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
ry per
word. So, if your language can work with C-structures then you can use that
language with tsearch with more or less performance pay. PL/pgSQL hasn't this
capability.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
writing my own stemmer in PL/pgSQL or in C as a postgres
function.
Yes, of course, you can develop your dictionary (-ies) and parser. Dut only in
C, because they are critical for performance.
--
Teodor Sigaev E-mail: [
I just stumbled on http://www.cs.purdue.edu/spgist/ which seems like
exactly what I need.
It doesn't work with 8.2 and up, because since 8.2 index should take care about
concurrent access itself and that implementation doesn't do it.
--
Teodor Sigaev
/gevel/). GiST-related functions in this
module is invented to help to developers, not for production use, so they
acquire exclusive lock on index.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http
sion of existing ts_headline.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To make changes to your subscription:
http://ww
.
In any case, the patch was developed for contrib version of tsearch.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--
Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
To ma
html#TEXTSEARCH-STATISTICS
For versions before 8.3 just use stat() function instead of ts_stat().
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(e
x27;ISIMO# tonto tont'isimo
O > -O, 'ISIMA# tonto tont'isima
Current implementation doesn't accept any character in ending except alpha ones.
i think 'I.. word is not correct for ispell,
this should be one Í letter
That's right, but you should convert di
Fixes are committed to CVS, hope, they will help you.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 9: In
I tryed to reproduce the bug but without success.
Could you provide a dump of text column?
Hannes Dorbath wrote:
Crash happens about 7 minutes after issuing the UPDATE statement with
current CVS HEAD. The table has around 5 million rows. It's always
reproducible.
--
Teodor S
led.
!>
Hmm, can you provide backtrace?
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 3: Have you ch
My dictionary at 506 line had:
Where do you take this file? And what is encdoing/locale setting of your db?
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of
pm');
to_tsvector
---
'httpd-2.2.3-5.src.rpm':1
Why don't I get: 'httpd', 'src', 'rpm', 'httpd-2.2.3-5.src.rpm' ?
Is this a bug or design?
Thank you!
Bob
Regards,
a cube (a data type
from the postgres cube extension module).
I traced the error to the gistUserPicksplit
function in the
gistsplit.c ... I surmise that this method is called whenever a page
split is necessary.
So, I know when this error occurs, but I don't know why.
Thoughts anyone?
C
ow to user
void page.
Is a fix being worked on?
If a fix is forthcoming, will it be available in the 8.2 series or only 8.3+?
Possibly, full fix in 8.4. But I will not promise.
8.3 will have protection from queries which doesn't match anything.
--
Teodor Sigaev
ted. Thank for your report and testing.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 9: In versions below 8.0, t
Ooops. Patch doesn't apply cleanly. New version.
Attached patch fixes that deadlock bug too. And, previous version of my
patch has a mistake which is observable on CREATE INDEX .. USING GIN query.
--
Teodor Sigaev E-mail: [EMAIL PROT
So,
splits occur rather rare.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 3: Have you checked our extensive
"ring" of buffers for VACUUM. Nevertheless, it's a possible
scenario in 8.2.
Attached patch fixes that deadlock bug too. And, previous version of my patch
has a mistake which is observable on CREATE INDEX .. USING GIN query.
--
Teodor Sigaev
Found a reason: if parent page is fully backuped after child's split then
forgetIncompleteSplit() isn't called at all.
Hope, attached patch fix that. Pls, test it.
PS I'm going away for weekend, so I'll not be online until Monday.
--
Teodor Sigaev
ck on buffer's lock. So, right
now I investigate the problem.
Neither GIST nor B-tree seems to throw an error in corresponding
locations also, so the potential for not being able to track this is
high. I'd want to throw errors in those locations also.
Agreed, I'll add more ch
vide more details about
your test?
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster
ry and this is effecting the
Warm Standby.
Hmm. I found that gin_xlog_cleanup doesn't reset incomplete_splits list. Is it
possible reason of bug?
Attached patch fixes it.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
I found out that using 'simple' instead of 'default' when using
to_tsvector() does excactly that, but I don't know how to change my
triggers and indexes to keep doing the same (using 'simple').
Suppose, your database is initialized with C locale. So, just mark
simple configuration as default:
free space map settings
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 1: if posting/reading through Usenet,
PostgreSQL
---(end of broadcast)---
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to [EMAIL PROTECTED] so that your
message can get through to the mailing list cleanly
--
Teo
Pls, check your steps or say me where I'm wrong :)
If you still have a problems, I can solve it if I'll have access to your
developer server...
% cd PGSQL_SRC
% zcat ~/tmp/tsearch_snowball_82-20070504.gz| patch -p0
% cd contrib/tsearch2
% gmake && su -c 'gmake install' && gmake installcheck
% c
pgsql encodings.
by the caller?
Yes, of course.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 9: In
you should be able to index the way you want. In contrib there a module
"cube" which does similar to what you want to 3D, extending it to 12D
shouldn't be too hard...
contrib/cube module implements N dimensional cube representation
--
Teodor Sigaev
Sorry, no - I tested on CVS HEAD, so dll isn't compatible :(
Wait a bit for 8.2.4
richardcraig wrote:
Teodor
As a non-C windows user (yes - throw stones at me :) ) Do you have a fixed
dll for this patch that I can try?
Thanks
Richard
Teodor Sigaev-2 wrote:
Solved, see attached pat
Fixed. Thanks for the report.
Anyway, just to signal that tsearch2 crashes if SELECT is not
granted to pg_ts_dict (other tables give a proper error message when
not GRANTed).On
--
Teodor Sigaev E-mail: [EMAIL PROTECTED
e and
Snowball doesn't use version mark or something similar. So, downloaded Snowball
core and stemmers in different time may be incompatible :(.
Our tsearch_core patch (moving tsearch into core of pgsql) solves that problem -
it contains all possible snowball stemmers.
--
Teo
nowball is out). It should return 'voyage' (=travel) instead of
'voyageuse' (=female traveler)
That's now what I want ; i want to use snowball to stem French words.
I'm going to make a debug build and try to debug it, but if anyone
can help
Solved, see attached patch. I had found old Celeron-300 box and install Windows
on it, and it was very slow :)
Nope, same result with this patch.
Thank you.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW
non-breakable space (0xa0) and that commit
assumes any character with C locale and multibyte encoding and > 0x7f is alpha.
To check theory, pls, apply attached patch.
If so, I'm confused, we can not assume that 0xa0 is a space symbol in any
multibyte encoding, even
: any data older than one month (which doesn't
change) with GIN index and new data with GiST. And one time per month moves data
from GiST to GIN.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW:
lions. Bigger collections
require engines like a google.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP
8.2 has fully rewritten text parser based on POSIX is* functions.
Thomas Pundt wrote:
On Wednesday 21 March 2007 14:25, Teodor Sigaev wrote:
| I can't reproduce your problem, but I have not Windows box, can anybody
| reproduce that?
just a guess in the wild; I once had a similar phenome
;test text');
to_tsvector
---
'test':1 'text':2
(1 row)
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)
No, the first X aren't more important, but being able to determine
word proximity is very important for partial phrase matching and
ranking. The closer the words, the "better" the match, all else being
equal.
exactly
---(end of broadcast)---
TIP
ranking purpose
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
---(end of broadcast)---
TIP 6: explain analyze is your friend
to_tsvector() could as well return the character number or a byte
pointer, I could see advantages for both. But the word number makes
little sense to me.
Word number is used only in ranking functions. If you don't need a ranking than
you could safely strip positional information.
--
T
tsquery. Small description of
hlparsetext is placed at
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/docs/HOWTO-parser-tsearch2.html
near the end. Description of HLWORD struct is some out of day, sorry.
--
Teodor Sigaev E-mail: [
Use GIN index instead of GiST
I have a table of books, with 120 registers. I have created an GIST
index over the title and subtitle,
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru
intarray. My question is whether I still should use intarray for
indexing (if yes then either I should use GIST or GIN) or maybe GIN
index is faster than GIST+intarray / GIN+intarray.
Yes, with intarray you can use GiST/GIN indexes which you wish
--
Teodor Sigaev
waiting socket to write,
so, may be there is symmetrical problem with read? Or pgwin32_select() is used
for waiting write too?
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru
called on null input;
# select isvoid( plainto_tsquery('the & any') );
NOTICE: query contains only stopword(s) or doesn't contain lexeme(s), ignored
isvoid
t
(1 row)
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
-
1
(1 row)
contrib_regression=# select numnode( plainto_tsquery('long table') );
numnode
-----
3
(1 row)
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
-
e configuration is not saved correctly?
Best regards
Manuel ...
---(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
Oh, I see. So, only 2.0.1 and I can't change that for 8.2 branch. :(
Hannes Dorbath wrote:
On 21.12.2006 18:32, Teodor Sigaev wrote:
Are you trying to convert openoffice (myspell) format to ispell with
help of my2ispell?
Yes:
http://groups.google.com/group/pgsql.general/browse_thread/t
Hmm, 2.0.1. But what's the difference? I don't watch changes in OpenOffice
hardly.
Hannes Dorbath wrote:
What version of OpenOffice MySpell dictionaries is supposed to work with
TSearch in 8.2?
The format used till OpenOffice 2.0.1 or the format starting from 2.0.2?
--
Teo
oking for.
IIRC I had TSearch2 with my `oldFormat' files working on an older
8.2-dev-snapshot.
Thanks for any hint.
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
--(end of broadcast)---
TIP 2: Don't 'kill -9' the postmaster
--
Teodor Sigaev E-mail: [EMAIL PROTECTED]
WWW: http://www.sigaev.ru/
he files seem to be ok and are UTF-8 encoded.
Best regards
Manuel
---(end of broadcast)---
TIP 4: Have you searched our list archives?
http://archives.postgresql.org/
--
Teodor Sigaev
performance
for queries like
select * from a,b where a.f = b.f or ( a.f is null and b.f is null)
NULL support is fast in MS SQL because MS SQL doesn't follow SQL standard: index
in MS SQL believes that (NULL = NULL) is true.
--
Teodor Sigaev E-mail: [EMAIL PROT
1 - 100 of 176 matches
Mail list logo