pg_read_file() takes byte-offset and length as arguments, but we don't check the result text with pg_verify_mbstr(). Should pg_read_file() return bytea instead of text or adding some codes to verify the input? Only superusers are allowed to use the function, but it is still dangerous.
If we leave the result in text type and add verifier, we also need to consider how to handle multi-byte text. Offset and length should not split one multi-byte character. We can assume the offset as a correct boundary if we can trust users, but no one knows correct length before the function call. An idea is to have binary and text versions of pg_read_file: * pg_read_binary_file(filename, offset, length) : bytea * pg_read_text_file(filename, offset) : ROW( text, nextline_offset ) -- it returns the next line starting with 'offset'. but such changes could bring on compatibility problems. Comments, better ideas? Regards, --- ITAGAKI Takahiro NTT Open Source Software Center -- Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-hackers