janhoy opened a new issue, #9: URL: https://github.com/apache/solr-orbit-workloads/issues/9
Port the OSB `eventdata` workload. ~20M documents / ~15 GB of Apache access logs. High-throughput append-only indexing benchmark with realistic log-shaped documents — useful for measuring update handler and commit performance. > ⚠️ **Prerequisite:** Verify dataset licence is ASF-compatible before starting. ## Tasks - Confirm dataset licence - Convert OSB workload using `solr-orbit convert-workload` - Define operations: bulk indexing, basic log search, status-code faceting - Add 1k sample corpus for test-mode - Check whether any operations belong in `common_operations/` rather than this workload **Depends on:** apache/solr-orbit-workloads#3 (ASF dataset hosting must be resolved before corpus files can be finalised) ## References - OSB workload: https://github.com/opensearch-project/opensearch-benchmark-workloads/tree/main/eventdata - Creating workloads: https://github.com/apache/solr-orbit/blob/main/CREATE_WORKLOAD_GUIDE.md -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
