Hi Florents:
On Wed, 6 Jul 2022 at 12:35, Florents Tselai wrote:
> 50M+ rows and iirc pg_relation_size was north of 80GB or so.
Somebody already mentioned pg_table_size, but anyway this gives you
1.6 kB per row, which I would suspect is extremely low given your pdf
content and the 1M truncation y
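
For reference, the per-row figure above is just the reported relation size divided by the row count. A minimal sketch of that arithmetic, assuming decimal gigabytes and treating the "80GB" and "50M rows" from the quoted message as round numbers (the function name is made up for illustration):

```python
def bytes_per_row(relation_bytes: int, row_count: int) -> float:
    """Average on-disk bytes per row for a given relation size."""
    if row_count <= 0:
        raise ValueError("row_count must be positive")
    return relation_bytes / row_count


# Round figures taken from the quoted message; both are approximations.
relation_bytes = 80 * 10**9   # "north of 80GB", assuming decimal GB
row_count = 50 * 10**6        # "50M+ rows"

avg = bytes_per_row(relation_bytes, row_count)
print(f"{avg / 1000:.1f} kB per row")
```

Note that pg_relation_size reports only the main fork of the table, while pg_table_size also includes TOAST storage, which is presumably where most of a long text column ends up. That may be why the number looks low.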
On 2022-Jul-06, Florents Tselai wrote:
> Actually, I monitored my disk usage and it was **definitely** working as
> It had already eaten up an additional 30% of my disk capacity.
Adding a column like this requires creating a second copy of the table,
copying all the contents from the old table (
> On 6 Jul 2022, at 1:35 PM, Florents Tselai wrote:
>
>
>
>> On 6 Jul 2022, at 1:11 PM, Francisco Olarte wrote:
>>
>> On Wed, 6 Jul 2022 at 11:55, Florents Tselai
>> wrote:
>>> Also, fwiw looking at top the CPU% and MEM% activity, looks like it does
>>> data crunching work.
>> ...
>
> On 6 Jul 2022, at 1:11 PM, Francisco Olarte wrote:
>
> On Wed, 6 Jul 2022 at 11:55, Florents Tselai
> wrote:
>> Also, fwiw looking at top the CPU% and MEM% activity, looks like it does
>> data crunching work.
> ...
On 06.07.22 10:42, Florents Tselai wrote:
> I have a beefy server
On Wed, 6 Jul 2022 at 11:55, Florents Tselai wrote:
> Also, fwiw looking at top the CPU% and MEM% activity, looks like it does data
> crunching work.
...
> >> On 06.07.22 10:42, Florents Tselai wrote:
> >>> I have a beefy server (40+ worker processes, 40GB+ shared buffers) and a
> >>> table hol
Also, fwiw looking at top the CPU% and MEM% activity, looks like it does data
crunching work.
> On 6 Jul 2022, at 12:48 PM, Florents Tselai wrote:
>
>
>
>> On 6 Jul 2022, at 12:38 PM, Peter Eisentraut
>> wrote:
>>
>> On 06.07.22 10:42, Florents Tselai wrote:
>>> I have a beefy server (40+
> On 6 Jul 2022, at 12:38 PM, Peter Eisentraut
> wrote:
>
> On 06.07.22 10:42, Florents Tselai wrote:
>> I have a beefy server (40+ worker processes, 40GB+ shared buffers) and a
>> table holding (key text, text text) of around 50M rows.
>> These are text fields extracted from 4-5 page pdfs
On 06.07.22 10:42, Florents Tselai wrote:
I have a beefy server (40+ worker processes, 40GB+ shared buffers) and a table
holding (key text, text text) of around 50M rows.
These are text fields extracted from 4-5 page pdfs each.
I’m adding the following generated col to keep up with tsvectors