https://bz.apache.org/bugzilla/show_bug.cgi?id=57031
--- Comment #12 from Tim Allison <[email protected]> --- Thank you, Dominik. Makes sense to wait. Will do. I'm also leery of changing the xml parser without serious testing. I just finished downloading and adding lots of doc[xm] files with your CommonCrawlDocumentDownload code. Will run regression testing on that corpus in addition to the few we had in our regular govdocx1+othercommoncrawl corpus. -- You are receiving this mail because: You are the assignee for the bug. --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
