[jira] [Commented] (SOLR-16448) State.json serialized with indent size 2, this takes lots of extra bytes for large collection

Shawn Heisey (Jira) Tue, 13 Dec 2022 20:04:06 -0800


    [ 
https://issues.apache.org/jira/browse/SOLR-16448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17646910#comment-17646910
 ]


Shawn Heisey commented on SOLR-16448:
-------------------------------------

[~krisden] Maybe get rid of ALL the white space.  I think json is smaller than 
just about any format that's not binary, if it is completely minified.  Noble's 
gzip idea sounds promising as an additional step.  The nice thing with the 
current format is that the tree in ZK is completely human readable ... so I 
think the admin UI needs a way to parse whatever we finally decide to use back 
into a pretty human-readable format.

[~noble.paul] Wow.  Hard to imagine a cluster that large.  The costs for 
building and running that cluster must be quite high.  Does ZK handle the 
binary gzip data OK?  I'd be worried about it needing to be base64 encoded or 
something.

> State.json serialized with indent size 2, this takes lots of extra bytes for 
> large collection 
> ----------------------------------------------------------------------------------------------
>
>                 Key: SOLR-16448
>                 URL: https://issues.apache.org/jira/browse/SOLR-16448
>             Project: Solr
>          Issue Type: Improvement
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: SolrCloud
>    Affects Versions: 9.1
>            Reporter: Hitesh Khamesra
>            Assignee: Noble Paul
>            Priority: Major
>
> For large collection (#shards > 2000), state.json file size becomes more than 
> 1 mb. Solr serialized that with indent size 2. By reducing indent size to 0, 
> we can reduce size by more than 20%. That will be very efficient.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@solr.apache.org
For additional commands, e-mail: issues-h...@solr.apache.org

[jira] [Commented] (SOLR-16448) State.json serialized with indent size 2, this takes lots of extra bytes for large collection

Reply via email to