Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-04-30 Thread Aditya
Hi Danny, I’ve completed the implementation of the OpenAI text embedding handler and have opened a PR for it. Could you please take a look when you get a chance? Thanks, Aditya >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-04-30 Thread Aditya
https://github.com/apache/beam/pull/34700 On Sat, 26 Apr, 2025, 04:57 Aditya, wrote: > Hi Danny, > > I’ve completed the implementation of the OpenAI text embedding handler and > have opened a PR for it. > Could you please take a look when you get a chance? > > Thanks, > Aditya > >>

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-04-18 Thread Aditya
**Subject:** Help with OpenAI Embedding Handler Output Type Hint Warning Hi Danny, I’m running into a warning related to the `OpenAITextEmbeddings` handler in Apache Beam and could use your guidance. Here’s the warning I’m seeing: ``` WARNING:root:This output type hint will be ignored and not

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-04-10 Thread Aditya
Thanks for the clarification. On Tue, Apr 8, 2025 at 12:41 AM Danny McCormick wrote: > Yeah, that would be best - thanks! > > On Sun, Apr 6, 2025 at 6:34 AM Aditya wrote: > >> *Subject:* Question Regarding OpenAI Embedding Implementation >> >> Hi Danny, >> >> I have a quick question: >> >> Shou

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-04-10 Thread Danny McCormick via dev
Yeah, that would be best - thanks! On Sun, Apr 6, 2025 at 6:34 AM Aditya wrote: > *Subject:* Question Regarding OpenAI Embedding Implementation > > Hi Danny, > > I have a quick question: > > Should the OpenAI embedding implementation handle Chunk objects from the > RAG framework similarly to how

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-24 Thread Aditya
*Subject:* Proposal Submission for GSoC Hi Danny, I have submitted my proposal. Thank you for your help throughout the process! I appreciate your feedback and support. I hope my proposal gets selected so we can work together. Looking forward to your thoughts. Best regards, Aditya >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-24 Thread Danny McCormick via dev
Thanks - I left a few comments, overall it seems like a good proposal though. Thanks, Danny On Mon, Mar 24, 2025 at 3:20 PM Aditya wrote: > *Subject:* Additional Submission for GSoC > > Hi Danny, > > Please take a look at this as well. > > I will be submitting this to GSoC as they require a PDF

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-24 Thread Aditya
https://docs.google.com/document/d/1TI8edsp-Q2dee1fXWQr9-04KGxaAetu98CLzJmo1jJA/edit?usp=sharing On Mon, Mar 24, 2025 at 11:40 PM Aditya wrote: > *Subject:* Google Sheet Proposal Update > > Hi Danny, > > Thank you for your guidance. > > I have shared the Google Sheet proposal for your review. I

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-24 Thread Danny McCormick via dev
> Should I submit the proposal as a GitHub link or a PDF? Either is fine for submitting the proposal. > I’d also love any tips you have on improving my proposal. Have you shared it as a google doc like mentioned earlier? Thanks, Danny On Sun, Mar 23, 2025 at 8:37 AM Aditya wrote: > Subject:

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-21 Thread Danny McCormick via dev
Hey Aditya, I would recommend sharing this as a google doc or something which allows people to leave comments on it if you'd like feedback. At a high level, the proposal generally looks reasonable and well written to me, though! Thanks, Danny On Wed, Mar 19, 2025 at 5:10 PM Aditya wrote: > *Sub

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-17 Thread Danny McCormick via dev
You can email dev-unsubscr...@beam.apache.org. https://beam.apache.org/community/contact-us/#:~:text=After%20you%20subscribe%2C%20you'll,%40beam.apache.org. On Mon, Mar 17, 2025 at 10:55 AM A Gardner wrote: > Hi > > Need to unsubscribe, my gmail is rather full - how do I achieve this? > > Alex

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-17 Thread A Gardner
Hi Need to unsubscribe, my gmail is rather full - how do I achieve this? Alex On Mon, 17 Mar 2025 at 13:18, Danny McCormick via dev wrote: > Hey Aditya, there is not necessarily a single set of benchmarks which we > can use to evaluate an IO, and defining exactly what/how we should be > measur

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-17 Thread Danny McCormick via dev
Hey Aditya, there is not necessarily a single set of benchmarks which we can use to evaluate an IO, and defining exactly what/how we should be measuring completeness and performance is part of the work to be done here. I think this is a good thing for you to try to initially define in your project

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-16 Thread Aditya
*Subject:* Inquiry on Success Metrics for Pinecone and Tecton I/O Connectors (GSoC Proposal) Hi Danny, I hope you're doing well. As I prepare my GSoC proposal, I am seeking guidance on defining success metrics for the I/O connectors we plan to develop for Pinecone and Tecton. Specifically, I wou

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-12 Thread Aditya
*Subject:* Clarification on Sink and Source Handler Implementation Hi Danny, I need one more clarification—am I required to implement the sink and source handlers for Pinecone and the others only in Python? Just wanted to confirm. Thanks, Aditya On Thu, Mar 13, 2025 at 1:51 AM Aditya wrote: >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-12 Thread Aditya
Thanks for the clarification. On Thu, Mar 13, 2025 at 2:17 AM Danny McCormick wrote: > Yeah, that probably makes the most sense for most vector DBs and feature > stores. > > Thanks, > Danny > > On Wed, Mar 12, 2025 at 4:25 PM Aditya wrote: > >> *Subject:* Clarification on Sink and Source Handle

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-12 Thread Danny McCormick via dev
Yeah, that probably makes the most sense for most vector DBs and feature stores. Thanks, Danny On Wed, Mar 12, 2025 at 4:25 PM Aditya wrote: > *Subject:* Clarification on Sink and Source Handler Implementation > > Hi Danny, > > I need one more clarification—am I required to implement the sink a

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-05 Thread Aditya
Thanks, I will! On Thu, 6 Mar, 2025, 01:38 Danny McCormick, wrote: > Sure, you're welcome to try working on it, it would just be outside of the > scope of GSOC. Regardless, you are always welcome to make contributions to > Beam :) > > Thanks, > Danny > > On Wed, Mar 5, 2025 at 2:44 PM Aditya wr

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-05 Thread Danny McCormick via dev
Sure, you're welcome to try working on it, it would just be outside of the scope of GSOC. Regardless, you are always welcome to make contributions to Beam :) Thanks, Danny On Wed, Mar 5, 2025 at 2:44 PM Aditya wrote: > *Subject:* Implementation of OpenAI Embeddings Before GSoC > > Dear Sir, > >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-05 Thread Aditya
*Subject:* Implementation of OpenAI Embeddings Before GSoC Dear Sir, Before the GSoC program begins, I would like to start working on the implementation of OpenAI embeddings. Would that be acceptable? Looking forward to your guidance. Best regards, Aditya >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-05 Thread Aditya
Ok sure On Tue, 4 Mar, 2025, 21:40 Danny McCormick, wrote: > I generally agree that this would be good to add (along with something for > Anthropic and maybe others). I think it is not necessarily within the scope > of this project, though, so I would not recommend including it as an early > ite

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-04 Thread Danny McCormick via dev
I generally agree that this would be good to add (along with something for Anthropic and maybe others). I think it is not necessarily within the scope of this project, though, so I would not recommend including it as an early item in a project proposal (it could be a nice to have if there's time at

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-03 Thread Aditya
One more thing—there are two implementations of embedding in Apache Beam: Vertex AI and Hugging Face. OpenAI embeddings should also be added Thanks Aditya On Tue, Mar 4, 2025 at 1:42 AM Danny McCormick wrote: > Hey Aditya, > > I don't think there is a very well defined priority order. I;ll note t

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-03 Thread Danny McCormick via dev
Hey Aditya, I don't think there is a very well defined priority order. I;ll note that we already have enrichment handlers for Feast/Vertex AI for reading/enriching data with lookups to those systems, so I'd probably say the following prioritization makes sense: - Sink for Vertex/Feast (finish wha

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-03-02 Thread Aditya
Sir, I have a question regarding the implementation of the I/O connector for Pinecone and Tecton. Should it be developed in Java or Python? Pinecone provides an official Python client library but does not have one for Java. However, most of Apache Beam’s existing I/O connectors are written in Jav

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-27 Thread Aditya
Ok, thanks for the clarification. On Thu, Feb 27, 2025 at 1:43 AM Danny McCormick wrote: > Where available it is usually simpler to use the client libraries. > > Thanks, > Danny > > On Wed, Feb 26, 2025 at 6:06 AM Aditya wrote: > >> Sir one more thing >> >>> Should we use only client library or

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-26 Thread Danny McCormick via dev
Where available it is usually simpler to use the client libraries. Thanks, Danny On Wed, Feb 26, 2025 at 6:06 AM Aditya wrote: > Sir one more thing > >> Should we use only client library or api or both >

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-26 Thread Aditya
Sir one more thing > Should we use only client library or api or both

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-25 Thread Danny McCormick via dev
Sure, I have added you. Thanks, Danny On Mon, Feb 24, 2025 at 12:21 PM Aditya wrote: > Thanks for the reply. > > Can i ask something > Can I join slack communication channel of beam > > On Mon, 24 Feb, 2025, 22:44 Danny McCormick, > wrote: > >> Hey Aditya, glad to hear that you are interested

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-24 Thread Aditya
Thanks for the reply. Can i ask something Can I join slack communication channel of beam On Mon, 24 Feb, 2025, 22:44 Danny McCormick, wrote: > Hey Aditya, glad to hear that you are interested in this project. I've > tried to answer your questions below: > > > What are the key technical challeng

Re: Inquiry About GSoC Project - Beam ML Vector DB/Feature Store Integrations

2025-02-24 Thread Danny McCormick via dev
Hey Aditya, glad to hear that you are interested in this project. I've tried to answer your questions below: > What are the key technical challenges in integrating Beam with Pinecone and Tecton? The main challenges will be around understanding how those systems (and other similar systems) work, h