janhoy opened a new issue, #9:
URL: https://github.com/apache/solr-orbit-workloads/issues/9

   Port the OSB (OpenSearch Benchmark) `eventdata` workload: ~20M documents / 
~15 GB of Apache access logs. It is a high-throughput, append-only indexing 
benchmark with realistic log-shaped documents, useful for measuring update 
handler and commit performance.
   
   > ⚠️ **Prerequisite:** Verify dataset licence is ASF-compatible before 
starting.
   
   ## Tasks
   - Confirm dataset licence
   - Convert OSB workload using `solr-orbit convert-workload`
   - Define operations: bulk indexing, basic log search, status-code faceting
   - Add a 1k-document sample corpus for test-mode
   - Check whether any operations belong in `common_operations/` rather than 
this workload
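
A minimal sketch of how the sample-corpus task above could be approached, assuming the converted corpus is a newline-delimited JSON file with one document per line (as OSB corpora usually are). The function name `write_sample_corpus` and any file names passed to it are hypothetical and not part of `solr-orbit`; the naming and location that test-mode expects should be taken from the workload guide.

```python
from pathlib import Path


def write_sample_corpus(source: Path, target: Path, limit: int = 1000) -> int:
    """Copy the first `limit` non-empty lines of `source` into `target`.

    Assumes one JSON document per line; lines are copied verbatim and
    are not parsed or validated. Returns the number of lines written.
    """
    written = 0
    with source.open("r", encoding="utf-8") as src, \
            target.open("w", encoding="utf-8") as dst:
        for line in src:
            if written >= limit:
                break
            if not line.strip():
                # Skip blank lines so the sample holds exactly `limit` documents.
                continue
            # Make sure every document ends with a newline, including the last.
            dst.write(line if line.endswith("\n") else line + "\n")
            written += 1
    return written
```

Taking the head of the file keeps the sample deterministic across runs. If the full corpus is ordered by timestamp, the first 1k documents cover only a narrow time window, which may or may not matter for the search operations.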
   
   **Depends on:** apache/solr-orbit-workloads#3 (ASF dataset hosting must be 
resolved before corpus files can be finalised)
   
   ## References
   - OSB workload: https://github.com/opensearch-project/opensearch-benchmark-workloads/tree/main/eventdata
   - Creating workloads: https://github.com/apache/solr-orbit/blob/main/CREATE_WORKLOAD_GUIDE.md


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

