Re: Issue with serializable XML

2022-03-06 Thread Greg Silverman
= [XMLcleaner(x).xmlstring for x in df_text] > > df[‘TEXT_FIELD’] = cleaned > > > > Best, > > John > > > > *From: *Greg Silverman > *Date: *Sunday, March 6, 2022 at 5:10 PM > *To: *jrcas...@medicine.wisc.edu.invalid > > *Cc: *dev@ctakes.apache.org &g

Re: Issue with serializable XML

2022-03-06 Thread JOHN R CASKEY
To: jrcas...@medicine.wisc.edu.invalid Cc: dev@ctakes.apache.org Subject: Re: Issue with serializable XML Hi John, I thought I did. I'm using a pandas dataframe and passing it through this: files['note_text'] = files['note_text'].apply(lambda x: x.replace('[^\x0

Re: Issue with serializable XML

2022-03-06 Thread Greg Silverman
W-Madison > jrcas...@wisc.edu > > From: Greg Silverman > Sent: Sunday, March 6, 2022 12:40:00 PM > To: dev@ctakes.apache.org > Subject: Issue with serializable XML > > Got the error during processing of a large set of documents about mid-way

Re: Issue with serializable XML

2022-03-06 Thread JOHN R CASKEY
Sent: Sunday, March 6, 2022 12:40:00 PM To: dev@ctakes.apache.org Subject: Issue with serializable XML Got the error during processing of a large set of documents about mid-way through: org.xml.sax.SAXParseException: Trying to serialize non-XML 1.0 character: , 0x1c I encountered this once before

Issue with serializable XML

2022-03-06 Thread Greg Silverman
Got the error during processing of a large set of documents about mid-way through: org.xml.sax.SAXParseException: Trying to serialize non-XML 1.0 character: , 0x1c I encountered this once before, but I don't remember what the fix was. Running apache-ctakes-4.0.1-SNAPSHOT. Thanks! Greg-- -- Gre