[ 
https://issues.apache.org/jira/browse/SOLR-5819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13921506#comment-13921506
 ] 

Steve Rowe commented on SOLR-5819:
----------------------------------

I extracted the images from the PDF using {{mutool extract}}, then generated a 
list of their unique MD5 hashes.  There are only 92 unique images.

{noformat}
$ mutool extract ../apache-solr-ref-guide-4.7.pdf
$ for a in *.png ; do md5 -q "$a" ; done | sort -u > unique.images.md5.txt
$ wc -l unique.images.md5.txt
92
{noformat}
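
To put the unique count in context, it can help to see the total number of extracted image files next to it. The following is only a rough sketch, not what was run above: the function names {{hash_file}} and {{count_images}} are made up for illustration, and it assumes the extracted images are {{*.png}} files in a single directory. It falls back to {{md5 -q}} (BSD/macOS) when GNU {{md5sum}} is not available.

```shell
#!/bin/sh
# Hypothetical helpers for illustration; not part of mutool or Solr.

# Print the MD5 hash of one file, using whichever tool is installed:
# md5sum on GNU systems, md5 -q on BSD/macOS.
hash_file() {
  if command -v md5sum >/dev/null 2>&1; then
    md5sum < "$1" | cut -d' ' -f1
  else
    md5 -q "$1"
  fi
}

# Print "<total files> <unique hashes>" for the *.png files in a directory
# (defaults to the current directory).
count_images() {
  dir="${1:-.}"
  for f in "$dir"/*.png; do
    [ -f "$f" ] || continue   # skip the unexpanded glob when nothing matches
    hash_file "$f"
  done | sort | uniq -c |
    awk '{ total += $1; unique++ } END { printf "%d %d\n", total, unique }'
}
```

Running {{count_images .}} in the directory where {{mutool extract}} wrote its output prints the two numbers on one line. A total far above the unique count would suggest the same images are embedded many times.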

> Investigate & reduce size of ref-guide PDF
> ------------------------------------------
>
>                 Key: SOLR-5819
>                 URL: https://issues.apache.org/jira/browse/SOLR-5819
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Hoss Man
>
> As noted on the solr-user mailing list in response to the ANNOUNCE about the 
> 4.7 ref guide, the 4.4, 4.5 & 4.6 PDF files were all under 5MB, 
> but the 4.7 PDF was 30MB.
> Opening this issue to track efforts to reduce the size.



