menna224 commented on issue #8018:
URL: https://github.com/apache/hudi/issues/8018#issuecomment-1455154156

   thank you @kazdy  for your reply. I tried to pass the schema this way to the 
config you mentioned but I get an error, I am not totally aware how to pass it 
can u please help?
   
   ```
   schema = "user_id: string, firstname: string, operation: string, timestamp: 
double"
    'hoodie.write.schema':schema
   
   ```
   got the following error:
   ```
   Caused by: org.apache.avro.SchemaParseException: 
com.fasterxml.jackson.core.JsonParseException: Unrecognized token 'user_id': 
was expecting (JSON String, Number, Array, Object or token 'null', 'true' or 
'false')
    at [Source: (String)"user_id: string, firstname: string, operation: string, 
timestamp: double"; line: 1, column: 8]
   ```
   seems like I shouldn't pass it as a string, but I couldn't get from the doc 
how I should pass it.
   
   Also, regarding your part when U mentioned "missing columns -> add missing 
columns to match current table schema"
   
   did u need to add extra logic in your code or by default the missing cols 
were added? or it's just by adding 
'hoodie.datasource.write.reconcile.schema':"true"?
   
   and if the missing cols were added wo extra logic in your code, were u using 
pyspark+glue? or what did u use exactly? thanks
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to