Have encountered an intriguing issue processing a table with a large
number of rows. The process is to loop over the table processing each
row executing the Apache AGE cypher function over each row
individually.The looping is necessary because of encountering a limit on
the number of rows that the cypher function would process.
The process terminates unexpectedly with the following message. Notable
that it runs for quite some time before termination.:
SQL Error [42703]: ERROR: could not find rte for
a01a724103fbb3d059b8387bf043dbc8
Where: PL/pgSQL function
analysis.create_trips(text,text,text,text,text,text,integer,text,integer)
line 5 at EXECUTE
Of note the the string refers to a value in the field service_key.
The first instance of the service_key when ordered is in row 7741 shown
below.
*row_num* *service_key* *service_id* *trip_id* *trip_headsign*
*route_id* *direction_id* *shape_id* *wheelchair_accessible*
7741 a01a724103fbb3d059b8387bf043dbc8 FR 307 Gungahlin Pl X1 0
1002 1
The database version is PostgreSQL 15.4 (Debian 15.4-2.pgdg120+1) on
x86_64-pc-linux-gnu, compiled by gcc (Debian 12.2.0-14) 12.2.0, 64-bit.
The attributes of the table allservices.trips are as follows:
* Total size with indexes: 60 MB
* Number of rows: 231,131
This is the function definition and the process to load the table
allservices.trips into the Apache AGE graph schema.
Any assistance in refining the process to ensure completion welcome.
Regards
Matt.
SELECT create_graph('transport_network');
CREATE OR REPLACE FUNCTION analysis.create_trips
(graph_name text,
service_key text,service_id text, trip_id text, trip_headsign text,
route_id text, direction_id int, shape_id text, wheelchair_accessible int)
returns text
LANGUAGE plpgsql
VOLATILE
as $trips$
declare
nodename text := graph_name || '.' || 'trips';
BEGIN
execute
format ('select * from cypher(''%1$s'', $$match (v:routes {id: %6$s})
create(v)-[:USES]->
(t:trips
{service_key: %2$s, service_id: %3$s, id: %4$s, headsign: %5$s,
route_id: %6$s, direction_id: %7$s, shape_id: %8$s,
wheelchair_accessible: %9$s})$$) as (t agtype);',
quote_ident(graph_name),
quote_ident(service_key),quote_ident(service_id),
quote_ident(trip_id),quote_ident(trip_headsign),
quote_ident(route_id),to_char(direction_id,'9'),quote_ident(shape_id),to_char(wheelchair_accessible,'9'));
return nodename;
END
$trips$
;
select create_vlabel('transport_network','trips');
do $$
declare temprow record;
graph_name text:='transport_network';
counter integer := 0 ;
begin
for temprow in select service_key, service_id, trip_id from
allservices.trips
order by service_key,trip_id
loop
counter := counter+1; -- Prevent replication of row
perform
analysis.create_trips
(graph_name,
a.service_key, a.service_id,
a.trip_id, a.trip_headsign,
a.route_id, a.direction_id, a.shape_id,
a.wheelchair_accessible)
from
(select row_number() over (order by service_key,trip_id) as row_num,
service_key, service_id,
trip_id, trip_headsign,
route_id, direction_id, shape_id,
coalesce(wheelchair_accessible,0) as wheelchair_accessible from
allservices.trips) a
where a.row_num=counter
;
end loop;
end; $$;