On 2015-10-12 01:56, Victor Hooi wrote:
Hi,
I'm attempting to parse MongoDB loglines.
The formatting of these loglines could best be described as JSON-like...
For example - arrays
Anyhow, say I had the following logline snippet:
{ Global: { acquireCount: { r: 2, w: 2 } }, Database: { acquireCount: { w:
2 } }, Collection: { acquireCount: { w: 1 } }, oplog: { acquireCount: { w: 1 }
} }
This won't parse with json.loads() - the main issues is the missing quotation marks
(") around the strings.
My question, is there a more lenient, or relaxed JSON parser available for
Python, that will try to do a best-efforts parsing of non-spec JSON?
Have you tried first adding the quotes using the re module?
>>> import json, re
>>> line = '{ Global: { acquireCount: { r: 2, w: 2 } }, Database: {
acquireCount: { w: 2 } }, Collection: { acquireCount: { w: 1 } }, oplog:
{ acquireCount: { w: 1 } } }'
>>> json.loads(re.sub(r'(\w+)', r'"\1"', line))
--
https://mail.python.org/mailman/listinfo/python-list