Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Francisco Olarte
Hi Florents: On Wed, 6 Jul 2022 at 12:35, Florents Tselai wrote: > 50M+ rows and iirc pg_relation_size was north of 80GB or so. Somebody already mentioned pg_table_size, but anyway this gives you 1.6Kb per row, which I would suspect is extremely low given your pdf content and the 1M truncation y

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Alvaro Herrera
On 2022-Jul-06, Florents Tselai wrote: > Actually, I monitored my disk usage and it was **definitely** working as > It had already eaten up an additional 30% of my disk capacity. Adding a column like this requires creating a second copy of the table, copying all the contents from the old table (

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Florents Tselai
> On 6 Jul 2022, at 1:35 PM, Florents Tselai wrote: > > > >> On 6 Jul 2022, at 1:11 PM, Francisco Olarte wrote: >> >> On Wed, 6 Jul 2022 at 11:55, Florents Tselai >> wrote: >>> Also, fwiw looking at top the CPU% and MEM% activity, looks like it does >>> data crunching work. >> ... >

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Florents Tselai
> On 6 Jul 2022, at 1:11 PM, Francisco Olarte wrote: > > On Wed, 6 Jul 2022 at 11:55, Florents Tselai > wrote: >> Also, fwiw looking at top the CPU% and MEM% activity, looks like it does >> data crunching work. > ... On 06.07.22 10:42, Florents Tselai wrote: > I have a beefy server

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Francisco Olarte
On Wed, 6 Jul 2022 at 11:55, Florents Tselai wrote: > Also, fwiw looking at top the CPU% and MEM% activity, looks like it does data > crunching work. ... > >> On 06.07.22 10:42, Florents Tselai wrote: > >>> I have a beefy server (40+ worker processes , 40GB+ shared buffers) and a > >>> table hol

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Florents Tselai
Also, fwiw looking at top the CPU% and MEM% activity, looks like it does data crunching work. > On 6 Jul 2022, at 12:48 PM, Florents Tselai wrote: > > > >> On 6 Jul 2022, at 12:38 PM, Peter Eisentraut >> wrote: >> >> On 06.07.22 10:42, Florents Tselai wrote: >>> I have a beefy server (40+

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Florents Tselai
> On 6 Jul 2022, at 12:38 PM, Peter Eisentraut > wrote: > > On 06.07.22 10:42, Florents Tselai wrote: >> I have a beefy server (40+ worker processes , 40GB+ shared buffers) and a >> table holding (key text, text text,) of around 50M rows. >> These are text fields extracted from 4-5 page pdfs

Re: ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Peter Eisentraut
On 06.07.22 10:42, Florents Tselai wrote: I have a beefy server (40+ worker processes , 40GB+ shared buffers) and a table holding (key text, text text,) of around 50M rows. These are text fields extracted from 4-5 page pdfs each. I’m adding the following generated col to keep up with tsvectors

ADD COLUMN ts tsvector GENERATED too slow

2022-07-06 Thread Florents Tselai
I have a beefy server (40+ worker processes , 40GB+ shared buffers) and a table holding (key text, text text,) of around 50M rows. These are text fields extracted from 4-5 page pdfs each. I’m adding the following generated col to keep up with tsvectors ALTER TABLE docs_text ADD COLUMN ts tsvect