Andy, you rock!  You've made my day.  It works after adding the
deltaImportQuery with the WHERE id = ... clause to the entity section.
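
For the archive, here is a minimal sketch of the entity with that attribute
added. It assumes the deltaImportQuery selects the same columns as "query";
the exact SELECT list is an illustration, not something confirmed in this
thread. The angle-bracket names are the same placeholders used in the config
quoted below and must be replaced with real identifiers, since a raw < is
not valid inside an XML attribute value.

```xml
<entity name="<cowboysEntity>"
        pk="id"
        query="SELECT id, updated_at FROM Name1.<cowboyTable>
               WHERE updated_at between '2023-01-01 00:00:00' and '2023-01-01 00:30:00'"
        deltaQuery="SELECT id, updated_at FROM Name1.<cowboyTable>
                    WHERE updated_at >= cast('${dih.last_index_time}' AS TIMESTAMP(0)
                    FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY"
        deltaImportQuery="SELECT id, updated_at FROM Name1.<cowboyTable>
                          WHERE id = '${dataimporter.delta.id}'">
  <!-- deltaQuery returns the changed IDs; deltaImportQuery then runs once per ID -->
  <field column="id" name="id"/>
  <field column="updated_at" name="updated_at"/>
</entity>
```

With 18 changed rows, this would be expected to issue one deltaQuery plus 18
deltaImportQuery calls, which lines up with the 19 data source requests in
the debug response quoted below.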

BTW, would you have a pointer on when to use the variable prefix dataimporter
vs. dih, as in '${dataimporter.delta.id}'?  I see some other places online use
dih, for example dih.last_index_time.

Thanks again.


On Fri, Dec 1, 2023 at 9:44 AM Andy Webb <andyw...@apache.org> wrote:

> hi Vince,
>
> The deltaQuery is used to fetch the IDs of any changed items - DIH then
> runs the deltaImportQuery to fetch the document for each of those IDs,
> using a WHERE clause of the form "WHERE id = '${dataimporter.delta.id}'".
> (You can make it use compound PKs if necessary - and you might want a
> deletedPkQuery too to identify docs to remove.)
>
> hope that helps!
> Andy
>
> On Fri, 1 Dec 2023 at 10:41, Vince McMahon <sippingonesandze...@gmail.com>
> wrote:
>
> > I need help, please.
> >
> > I keep looking at the configuration. The full import works using the
> > entity "query", and I have tested the SELECT statement from deltaQuery in
> > a Teradata terminal, where it also works.
> >
> > The following Teradata query returns 18 rows from the Teradata terminal.
> > SELECT id, updated_at FROM  Name1.<cowboyTable>  WHERE updated_at >=
> > cast('2023-12-01 09:46:10' AS TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS')
> > - INTERVAL '8' DAY;
> >
> > The only difference I can think of between the full import "query" and the
> > "deltaQuery" is the part after WHERE updated_at ...
> >
> > So, if Solr delta import is looking for some attribute, it would have
> > been the variable dih.last_index_time, or else the remaining Teradata
> > keywords/syntax are confusing the Solr parser.
> >
> > The dih.last_index_time comes from the file dataimport.properties, which
> > has:
> > #Fri Dec 01 09:46:15 UTC 2023
> > last_index_time=2023-12-01 09\:46\:10
> > <cowboysEntity>.last_index_time=2023-12-01 09\:46\:10
> >
> >
> > Configuration (db-data-config.xml)
> > <dataConfig>
> >   <dataSource type="JdbcDataSource"
> >               driver="com.teradata.jdbc.TeraDriver"
> >               url=
> > "jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8"
> >               user="<user>"
> >               password="<pwd>"/>
> >   <document>
> >     <entity name            ="<cowboysEntity>"
> >             pk              ="id"
> >             query           ="SELECT id, updated_at FROM
> > Name1.<cowboyTable> WHERE updated_at between '2023-01-01 00:00:00' and
> > '2023-01-01 00:30:00' "
> >             deltaQuery      ="SELECT id, updated_at FROM
> > Name1.<cowboyTable> WHERE updated_at >= cast('${dih.last_index_time}' AS
> > TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY ">
> >       <field column="id" name="id"/>
> >       <field column="updated_at" name="updated_at"/>
> >     </entity>
> >   </document>
> > </dataConfig>
> >
> > On Fri, Dec 1, 2023 at 4:57 AM Vince McMahon <sippingonesandze...@gmail.com>
> > wrote:
> >
> > > Hi,
> > >
> > > Could someone please help me understand what Solr is looking for in the
> > > deltaQuery of a Delta Import?
> > >
> > > I have 18 rows fetched from the database, but none of the rows is
> > > processed as a document.
> > >
> > > There are "WARN" entries in solr.log, but which attribute is not
> > > specified? It seems important because the Raw Debug-Response shows 18
> > > rows fetched and no processed documents. Thanks.
> > >
> > >   "verbose-output": [
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     ...
> > >
> > >
> > > solr.log
> > > 2023-12-01 09:24:38.325 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DataImporter Loading DIH Configuration: db-data-config.xml
> > > 2023-12-01 09:24:38.328 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DataImporter Data Configuration loaded successfully
> > > 2023-12-01 09:24:38.328 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DataImporter Starting Delta Import
> > > 2023-12-01 09:24:38.335 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SimplePropertiesWriter Read dataimport.properties
> > > 2023-12-01 09:24:38.336 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Starting delta collection.
> > > 2023-12-01 09:24:38.338 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Running ModifiedRowKey() for Entity:
> > > <cowboysEntity>
> > > 2023-12-01 09:24:38.339 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.JdbcDataSource Creating a connection for entity
> > > <cowboysEntity> with URL:
> > > jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8
> > > 2023-12-01 09:24:39.101 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.JdbcDataSource Time taken for getConnection(): 762
> > > 2023-12-01 09:24:39.695 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Completed ModifiedRowKey for Entity:
> > > <cowboysEntity> rows obtained: 18
> > > 2023-12-01 09:24:39.695 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Completed DeletedRowKey for Entity:
> > > <cowboysEntity> rows obtained : 0
> > > 2023-12-01 09:24:39.695 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Completed parentDeltaQuery for Entity:
> > > <cowboysEntity>
> > > 2023-12-01 09:24:39.696 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:39.940 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:40.138 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:40.337 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:40.532 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:40.728 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:40.924 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:41.121 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:41.322 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:41.524 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:41.724 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:41.928 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:42.129 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:42.350 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:42.551 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:42.758 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:42.955 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:43.155 WARN  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.SqlEntityProcessor 'deltaImportQuery' attribute is not
> > > specified for entity : <cowboysEntity>
> > > 2023-12-01 09:24:43.361 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Delta Import completed successfully
> > > 2023-12-01 09:24:43.361 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.h.d.DocBuilder Time taken = 0:0:5.26
> > > 2023-12-01 09:24:43.361 INFO  (qtp548482954-31) [   x:p6]
> > > o.a.s.u.p.LogUpdateProcessorFactory [p6]  webapp=/solr path=/dataimport
> > > params={core=p6&debug=true&indent=on&commit=true&name=dataimport
> > > &clean=false&wt=json&command=delta-import&_=1701422662358&verbose=true}{}
> > > 0 5036
> > > 2023-12-01 09:24:43.605 INFO  (qtp548482954-30) [   x:p6]
> > > o.a.s.c.S.Request [p6]  webapp=/solr path=/dataimport
> > > params={indent=on&wt=json&command=status&_=1701422662358} status=0 QTime=0
> > >
> > >
> > >
> > > Configuration (db-data-config.xml)
> > > <dataConfig>
> > >   <dataSource type="JdbcDataSource"
> > >               driver="com.teradata.jdbc.TeraDriver"
> > >               url=
> > > "jdbc:teradata://<teradataServer>/database=Name1,tmode=TERA,charset=UTF8"
> > >               user="<user>"
> > >               password="<pwd>"/>
> > >   <document>
> > >     <entity name            ="<cowboysEntity>"
> > >             pk              ="id"
> > >             query           ="SELECT id, updated_at FROM
> > > Name1.<cowboyTable> WHERE updated_at between '2023-01-01 00:00:00' and
> > > '2023-01-01 00:30:00' "
> > >             deltaQuery      ="SELECT id, updated_at FROM
> > > Name1.<cowboyTable> WHERE updated_at >= cast('${dih.last_index_time}' AS
> > > TIMESTAMP(0) FORMAT 'YYYY-MM-DDbHH:MI:SS') - INTERVAL '8' DAY ">
> > >       <field column="id" name="id"/>
> > >       <field column="updated_at" name="updated_at"/>
> > >     </entity>
> > >   </document>
> > > </dataConfig>
> > >
> > >
> > > In the managed-schema file I have
> > >     <field name="id" type="string" indexed="true" stored="true"
> > > required="true" multiValued="false" />
> > >     <field name="update_at" type="string" indexed="true" stored="true"
> > > multiValued="false"/>
> > >
> > > Raw Debug-Response
> > > {
> > >   "responseHeader": {
> > >     "status": 0,
> > >     "QTime": 5266
> > >   },
> > >   "initArgs": [
> > >     "defaults",
> > >     [
> > >       "config",
> > >       "db-data-config.xml"
> > >     ]
> > >   ],
> > >   "command": "delta-import",
> > >   "mode": "debug",
> > >   "documents": [],
> > >   "verbose-output": [
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ],
> > >     "entity:<cowboysEntity>",
> > >     [
> > >       "document#1",
> > >       []
> > >     ]
> > >   ],
> > >   "status": "idle",
> > >   "importResponse": "",
> > >   "statusMessages": {
> > >     "Total Requests made to DataSource": "19",
> > >     "Total Rows Fetched": "18",
> > >     "Total Documents Processed": "0",
> > >     "Total Documents Skipped": "0",
> > >     "Delta Dump started": "2023-12-01 09:24:38",
> > >     "Identifying Delta": "2023-12-01 09:24:38",
> > >     "Deltas Obtained": "2023-12-01 09:24:39",
> > >     "Building documents": "2023-12-01 09:24:39",
> > >     "Total Changed Documents": "18",
> > >     "Time taken": "0:0:5.26"
> > >   }
> > > }
> > >
> > >
> > >
> > >
> > >
> >
>
