Re: Chardet oddity

2024-10-25 Thread Albert-Jan Roskam via Python-list
On Oct 24, 2024 17:51, Roland Mueller via Python-list wrote: On Wed, Oct 23, 2024 at 20:11, Albert-Jan Roskam via Python-list (python-list@python.org) wrote: > Today I used chardet.detect in the REPL and it returned windows-1252 (incorrect, because it la

Re: Chardet oddity

2024-10-24 Thread Roland Mueller via Python-list
On Wed, Oct 23, 2024 at 20:11, Albert-Jan Roskam via Python-list (python-list@python.org) wrote: > Today I used chardet.detect in the REPL and it returned windows-1252 (incorrect, because it later resulted in a UnicodeDecodeError). When I ran chardet as a script (which uses Unive

Re: Chardet oddity

2024-10-24 Thread Mark Bourne via Python-list
Albert-Jan Roskam wrote: > Today I used chardet.detect in the REPL and it returned windows-1252 (incorrect, because it later resulted in a UnicodeDecodeError). When I ran chardet as a script (which uses UniversalDetector) this returned MacRoman. Isn't chardet.detect the correct

Chardet oddity

2024-10-23 Thread Albert-Jan Roskam via Python-list
Today I used chardet.detect in the REPL and it returned windows-1252 (incorrect, because it later resulted in a UnicodeDecodeError). When I ran chardet as a script (which uses UniversalDetector) this returned MacRoman. Isn't chardet.detect the correct way? I've used this method many
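
For readers following the thread, here is a minimal sketch of why a windows-1252 guess can end in a UnicodeDecodeError while MacRoman does not. It uses only the standard library codecs, not chardet itself. The helper name decode_with_fallback and the sample bytes are hypothetical, made up for illustration; they are not from the original post.

```python
from typing import Iterable, Tuple


def decode_with_fallback(data: bytes, candidates: Iterable[str]) -> Tuple[str, str]:
    """Return (text, encoding) for the first candidate that decodes cleanly."""
    for encoding in candidates:
        try:
            return data.decode(encoding), encoding
        except UnicodeDecodeError:
            # This candidate cannot represent some byte; try the next one.
            continue
    raise ValueError("none of the candidate encodings could decode the data")


# Hypothetical sample: byte 0x81 is undefined in Python's cp1252 codec,
# but is a valid character in mac_roman.
sample = b"caf\x8e \x81ngstr\x9am"
text, used = decode_with_fallback(sample, ["cp1252", "mac_roman"])
print(used, repr(text))
```

A clean decode only shows that the bytes are legal in that encoding, not that the guess is right: a single-byte codec such as mac_roman accepts nearly any input, so a detector's answer is still worth checking against the expected text.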