Thanks for Penghui and Dianjin,

Project Name: Query compacted data of topic by Pulsar SQL

Project Description:

The Pulsar SQL is a SQL query engine for Pulsar topics data analytics based on 
the Trino, users could get data from Pulsar topics by SQL scripts. The topic 
compaction is used to compress messages which has a key, after compaction, the 
topic will only maintain the latest value of the key.

Currently, the Pulsar didn't support querying compacted data, it's a useful 
improvement for Pulsar SQL. Some key points for implementing this feature are 
understanding the topic compaction process, how to split topic data(compacted 
part and un-compacted part) to generate query tasks, and reading data of the 
compacted ledger.

Difficulty Level:
- [ ] Basic
- [x] Advanced

Project Validation Items:
Item 1: Understand and try to use topic compaction and the Pulsar SQL component.
Item 2: Prepare a design for this feature.
Item 3: Start coding.
Item 4: Add unit and integration tests.
Item 5: Add doc for this feature.

Project Mentor:
Your Name: Ran Gao
Your Email: r...@apache.org
Your Apache ID: rgao


On 2022/04/24 13:23:18 PengHui Li wrote:
> Hi all,
> 
> Open Source Promotion Plan, OSPP[1] is open for project registration,
> like Google Summer of Code. This Plan aims to encourage college students
> to participate in open-source projects and make contributions.
> 
> The project idea is open from 4/21, 2022 to 5/9, 2022(UTC+8).
> Apache Pulsar ended up with 6 projects seats. If you have great ideas,
> please copy the following template and fill in the fields.
> Once you have completed your project ideas, you can reply to this email.
> We will help you register for the program. Note that One mentor can
> mentor one project at a time and both mentors and students can
> receive financial awards for completed projects. Only Pulsar committers
> can be the mentors as asked by the OSPP.
> 
> The project can include feature catchup, ecological tool construction,
> new features of the pulsar and so on.
> 
> Welcome to join the Pulsar
> slack channel: #ospp-summer for asking any questions about OSPP.
> 
> [1] https://summer-ospp.ac.cn
> 
> Thanks,
> Penghui
> 
> ----------------------------------------
> Template
> 
> Project Name:
> Project Description: (at most 1000 words)
> Difficulty Level:
> - [ ] Basic
> - [ ] Advanced
> Project Validation Items:
> Item 1:__________________________
> Item 2:__________________________
> Item 3:__________________________
> …
> Project Mentor:
> Your Name:
> Your Email:
> Your Apache ID:
> 

Reply via email to