Hi Vince,

The deltaQuery is only used to fetch the IDs of any changed items. DIH then runs the deltaImportQuery once for each of those IDs to fetch the document itself, using a WHERE clause of the form "WHERE id = '${dataimporter.delta.id}'". Your entity doesn't define a deltaImportQuery, which is what the repeated WARN in solr.log is complaining about.

(You can make it use compound PKs if necessary, and you might want a deletedPkQuery too, to identify docs to remove.)
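
As a rough sketch of what that could look like (untested against Teradata; "cowboys" and "cowboy_table" are made-up stand-ins for your redacted entity and table names, everything else is copied from your config), the entity with a deltaImportQuery added might be something like:

```xml
<!-- Sketch only, not tested against Teradata.
     "cowboys" and "cowboy_table" are hypothetical stand-ins for the redacted names. -->
<entity name="cowboys"
        pk="id"
        query="SELECT id, updated_at FROM Name1.cowboy_table
               WHERE updated_at between '2023-01-01 00:00:00' and '2023-01-01 00:30:00'"
        deltaQuery="SELECT id FROM Name1.cowboy_table
                    WHERE updated_at &gt;= cast('${dih.last_index_time}' AS TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY"
        deltaImportQuery="SELECT id, updated_at FROM Name1.cowboy_table
                          WHERE id = '${dataimporter.delta.id}'">
  <field column="id" name="id"/>
  <field column="updated_at" name="updated_at"/>
</entity>
```

The deltaQuery only needs to return the pk column. As far as I know, the name after "delta." has to match the column name as the JDBC driver returns it, so if the placeholder resolves to nothing it is worth checking the case of "id".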
Hope that helps!

Andy

On Fri, 1 Dec 2023 at 10:41, Vince McMahon <sippingonesandze...@gmail.com> wrote:

> I need help, please.
>
> I keep looking at the Configuration, the full import works using the
> entity "query". I have tested the SELECT statement from deltaQuery in a
> Teradata terminal and that works.
>
> The following teradata query returns 18 rows from the Teradata terminal.
> SELECT id, updated_at FROM Name1.<cowboyTable> WHERE update_at >=
> cast(*'2023-12-01 09:46:10'* AS TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS')
> - INTERVAL '8' DAY;
>
> The differences I can think of between the full import "query" and the
> "deltaQuery" are those characters after the WHERE update_at ...
>
> So, if there is any attribute Solr delta import is looking for, it would
> had been the variable dih.last_index_time or remaining teradata
> keywords/syntax is confusing the Solr parser.
>
> the dih.last_index_time is from the file dataimport.properties, which has:
> #Fri Dec 01 09:46:15 UTC 2023
> last_index_time=2023-12-01 09\:46\:10
> <cowboysEntity>.last_index_time=2023-12-01 09\:46\:10
>
>
> Configuration (db-data-config.xml)
> <dataConfig>
>   <dataSource type="JdbcDataSource"
>               driver="com.teradata.jdbc.TeraDriver"
>               url="jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8"
>               user="<user>"
>               password="<pwd>"/>
>   <document>
>     <entity name ="<cowboysEntity>"
>             pk ="id"
>             query ="SELECT id, updated_at FROM
> Name1.<cowboyTable> WHERE updated_at between '2023-01-01 00:00:00' and
> '2023-01-01 00:30:00' "
>             deltaQuery ="SELECT id, updated_at FROM
> Name1.<cowboyTable> WHERE updated_at *>= cast('${dih.last_index_time}' AS
> TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY *">
>       <field column="id" name="id"/>
>       <field column="updated_at" name="updated_at"/>
>     </entity>
>   </document>
> </dataConfig>
>
> On Fri, Dec 1, 2023 at 4:57 AM Vince McMahon <sippingonesandze...@gmail.com> wrote:
>
> > Hi,
> >
> > Could someone please help me with the
> > what Solr is looking in deltaQuery
> > of Delta Import ?
> >
> > I have got 18 rows fetched from the database, but none of the row is
> > processed as document.
> >
> > There are the "WARN" In the solr.log, but, what attribute is not
> > specified? It seems important because the Raw Debug-Response shows 18 rows
> > fetched and are empty/no processed documents. Thanks.
> >
> > "verbose-output": [
> >   "entity:<cowboysEntity>",
> >   [ "document#1", [] ],
> >   "entity:<cowboysEntity>",
> >   [ "document#1", [] ],
> >   ...
> >
> >
> > solr.log
> > 2023-12-01 09:24:38.325 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DataImporter Loading DIH Configuration: db-data-config.xml
> > 2023-12-01 09:24:38.328 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DataImporter Data Configuration loaded successfully
> > 2023-12-01 09:24:38.328 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DataImporter Starting Delta Import
> > 2023-12-01 09:24:38.335 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.SimplePropertiesWriter Read dataimport.properties
> > 2023-12-01 09:24:38.336 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Starting delta collection.
> > 2023-12-01 09:24:38.338 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Running ModifiedRowKey() for Entity: <cowboysEntity>
> > 2023-12-01 09:24:38.339 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.JdbcDataSource Creating a connection for entity <cowboysEntity> with URL: jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8
> > 2023-12-01 09:24:39.101 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.JdbcDataSource Time taken for getConnection(): 762
> > 2023-12-01 09:24:39.695 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Completed ModifiedRowKey for Entity: <cowboysEntity> rows obtained: 18
> > 2023-12-01 09:24:39.695 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Completed DeletedRowKey for Entity: <cowboysEntity> rows obtained : 0
> > 2023-12-01 09:24:39.695 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Completed parentDeltaQuery for Entity: <cowboysEntity>
> > 2023-12-01 09:24:39.696 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:39.940 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:40.138 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:40.337 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:40.532 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:40.728 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:40.924 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:41.121 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:41.322 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:41.524 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:41.724 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:41.928 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:42.129 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:42.350 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:42.551 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:42.758 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:42.955 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:43.155 WARN (qtp548482954-31) [ x:p6] o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not specified for entity : <cowboysEntity>
> > 2023-12-01 09:24:43.361 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Delta Import completed successfully
> > 2023-12-01 09:24:43.361 INFO (qtp548482954-31) [ x:p6] o.a.s.h.d.DocBuilder Time taken = 0:0:5.26
> > 2023-12-01 09:24:43.361 INFO (qtp548482954-31) [ x:p6] o.a.s.u.p.LogUpdateProcessorFactory [p6] webapp=/solr path=/dataimport params={core=p6&debug=true&indent=on&commit=true&name=dataimport&clean=false&wt=json&command=delta-import&_=1701422662358&verbose=true}{} 0 5036
> > 2023-12-01 09:24:43.605 INFO (qtp548482954-30) [ x:p6] o.a.s.c.S.Request [p6] webapp=/solr path=/dataimport params={indent=on&wt=json&command=status&_=1701422662358} status=0 QTime=0
> >
> >
> > Configuration (db-data-config.xml)
> > <dataConfig>
> >   <dataSource type="JdbcDataSource"
> >               driver="com.teradata.jdbc.TeraDriver"
> >               url="jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8"
> >               user="<user>"
> >               password="<pwd>"/>
> >   <document>
> >     <entity name ="<cowboysEntity>"
> >             pk ="id"
> >             query ="SELECT id, updated_at FROM
> > Name1.<cowboyTable> WHERE updated_at between '2023-01-01 00:00:00' and
> > '2023-01-01 00:30:00' "
> >             deltaQuery ="SELECT id, updated_at FROM
> > Name1.<cowboyTable> WHERE updated_at >= cast('${dih.last_index_time}' AS
> > TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY ">
> >       <field column="id" name="id"/>
> >       <field column="updated_at" name="updated_at"/>
> >     </entity>
> >   </document>
> > </dataConfig>
> >
> >
> > In manage-schema file I have
> > <field name="id" type="string" indexed="true" stored="true" required="true" multiValued="false" />
> > <field name="update_at" type="string" indexed="true" stored="true" multiValued="false"/>
> >
> > Raw Debug-Response
> > {
> >   "responseHeader": {
> >     "status": 0,
> >     "QTime": 5266
> >   },
> >   "initArgs": [
> >     "defaults",
> >     [
> >       "config",
> >       "db-data-config.xml"
> >     ]
> >   ],
> >   "command": "delta-import",
> >   "mode": "debug",
> >   "documents": [],
> >   "verbose-output": [
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ],
> >     "entity:<cowboysEntity>",
> >     [ "document#1", [] ]
> >   ],
> >   "status": "idle",
> >   "importResponse": "",
> >   "statusMessages": {
> >     "Total Requests made to DataSource": "19",
> >     "Total Rows Fetched": "18",
> >     "Total Documents Processed": "0",
> >     "Total Documents Skipped": "0",
> >     "Delta Dump started": "2023-12-01 09:24:38",
> >     "Identifying Delta": "2023-12-01 09:24:38",
> >     "Deltas Obtained": "2023-12-01 09:24:39",
> >     "Building documents": "2023-12-01 09:24:39",
> >     "Total Changed Documents": "18",
> >     "Time taken": "0:0:5.26"
> >   }
> > }