[ 
https://issues.apache.org/jira/browse/COMDEV-295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17688199#comment-17688199
 ] 

Andrew Musselman commented on COMDEV-295:
-----------------------------------------

It's still referred to in the readme fwiw, maybe that needs more TLC

{{crontab -l -u www-data: (Not currently stored in puppet)
# m h   dom mon dow   command
00 4,12,20 * * * cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
parsepmcs.py
00 01 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
mailglomper2.py
00 09 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
readjira.py
10 00 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
reportingcycles.py
20 00 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
pmcdates.py
30 00 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
bugzillastats.py
50 00 * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
health.py
32  * * * *      cd /var/www/reporter.apache.org/scripts && ./python3logger.sh 
readchecker.py
*/15 * * * *     pgrep -f "gunicorn3.*wsgi:app" || (cd 
/var/www/reporter.apache.org/scripts && gunicorn3 -b 127.0.0.1 -w 6 -D wsgi:app 
&& echo "restarting gunicorn")}}

> Consider whether to extract mod_mbox statistics locally
> -------------------------------------------------------
>
>                 Key: COMDEV-295
>                 URL: https://issues.apache.org/jira/browse/COMDEV-295
>             Project: Community Development
>          Issue Type: Improvement
>          Components: Reporter Tool
>            Reporter: Sebb
>            Priority: Major
>
> At present the reporter tool uses the mailglomper script to extract 
> information about the number and timing of mails sent to each list.
> To do this, it has to download the entire mailbox, even though it only needs 
> the ^From_ lines.
> Whilst the code attempts to avoid unnecessary downloads (by checking dates 
> and sizes), for busy mbox files there can be a lot of network traffic.
> It might make sense for some initial data extraction to be done locally and 
> cached in a file.
> This could be done as part of updating the mod_mbox indexes.
> For example, the script could extract just the ^From_ lines into a separate 
> file and store that alongside the .mbox file. Mailglomper would read the much 
> abbreviated file instead.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For additional commands, e-mail: dev-h...@community.apache.org

Reply via email to