= [XMLcleaner(x).xmlstring for x in df_text]
>
> df['TEXT_FIELD'] = cleaned
>
>
>
> Best,
>
> John
>
>
>
> From: Greg Silverman
> Date: Sunday, March 6, 2022 at 5:10 PM
> To: jrcas...@medicine.wisc.edu.invalid
> Cc: dev@ctakes.apache.org
To: jrcas...@medicine.wisc.edu.invalid
Cc: dev@ctakes.apache.org
Subject: Re: Issue with serializable XML
Hi John,
I thought I did. I'm using a pandas dataframe and passing it through this:
files['note_text'] = files['note_text'].apply(lambda x:
x.replace('[^\x0
W-Madison
> jrcas...@wisc.edu
>
> From: Greg Silverman
> Sent: Sunday, March 6, 2022 12:40:00 PM
> To: dev@ctakes.apache.org
> Subject: Issue with serializable XML
>
> Got the error during processing of a large set of documents about mid-way
Sent: Sunday, March 6, 2022 12:40:00 PM
To: dev@ctakes.apache.org
Subject: Issue with serializable XML
Got the error during processing of a large set of documents about mid-way
through:
org.xml.sax.SAXParseException: Trying to serialize non-XML 1.0 character: ,
0x1c
I encountered this once before, but I don't remember what the fix was.
Running apache-ctakes-4.0.1-SNAPSHOT.
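For reference, 0x1c is the ASCII file separator control character, which falls outside the character set that XML 1.0 allows. Below is a minimal sketch of one way to strip such characters, assuming the note text can be pre-cleaned in Python before it is handed to cTAKES. The helper name strip_invalid_xml_chars is hypothetical and not part of cTAKES or pandas.

```python
import re

# Characters allowed by XML 1.0: tab, LF, CR, and the ranges
# U+0020-U+D7FF, U+E000-U+FFFD, U+10000-U+10FFFF.
# The negated class below matches everything outside that set.
_INVALID_XML10 = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def strip_invalid_xml_chars(text: str) -> str:
    """Hypothetical helper: drop characters that XML 1.0 cannot represent."""
    return _INVALID_XML10.sub("", text)


# Example: the 0x1c control character is removed, tab and newline are kept.
note = "BP 120/80\x1c measured\tat rest\n"
print(repr(strip_invalid_xml_chars(note)))
```

Note that Python's built-in str.replace treats its first argument as a literal string, not a regular expression, so a character-class pattern passed to it will not match anything. With a pandas column, the same compiled pattern could be applied through .apply(strip_invalid_xml_chars), or the pattern string could be passed to Series.str.replace with regex=True.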
Thanks!
Greg
--
Gre