Re: [PR] Move gettingStarted into newcomers audience (comdev-site)
iagotb commented on PR #133:
URL: https://github.com/apache/comdev-site/pull/133#issuecomment-1754744377

Hi @rbowen

Just letting you know that the Getting Started page is not working at [https://community.apache.org/newcomers/gettingStarted.html](https://community.apache.org/newcomers/gettingStarted.html)

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For queries about this service, please contact Infrastructure at: us...@infra.apache.org
For additional commands, e-mail: dev-h...@community.apache.org
Tika parser not parsing email content
Hi team,

I have been using the Tika parser to parse a few text files, and it worked fine until I ran into an issue: it cannot parse a text file that contains email/message content. That is, if the text file contains terms such as 'From: ', 'To: ', or 'Sent: ', the parser fails to parse the text correctly. In my case, the parser drops lines from the text file, and only a single line remains out of 40.

Here is a snippet of the text file as an example:

> *Some text here 1.*
> *Some text here 2.*
> *Some text here 3.*
> *Original Message-*
> *From: some_m...@abc.com *
> *Sent: Thursday, October 31, 2019 9:52 AM*
> *To: Some person, (The XYZ group)*
> *Subject: RE: Mr. Random person phone call: MESSAGE*
> *Hi,*
> *I am available now to receive the call.*
> *Some text here 4.*
> *Some text here 5.*
> *Some text here 6.*

The Tika parser reduces the above text to a single line:

> *Subject: RE: Mr. Random person phone call: MESSAGE*

Note that this happens in versions later than Tika 1.19; version 1.19 parses the contents perfectly fine.

Could you please help me understand the issue or suggest a path forward? That would be very helpful.

Thanks in advance.
-Kashif
Re: Tika parser not parsing email content
Hi,

On Tue, Oct 10, 2023 at 12:52 PM Kashif Khan wrote:
> ...I have been working on the Tika parser to parse a few text files and it has
> been working fine until I have come to an issue...

This list is for general community-related discussions. For Tika, you'll have to ask on their own list; see https://tika.apache.org/mail-lists.html

-Bertrand
Building a chatbot over the ASF community website
Hello,

I am sorry, but today at the Lightning talks I wanted to show how to build, in 2 minutes, a chatbot that is able to answer questions about the ASF community website. I didn't know that at the Lightning talks it is possible to show your laptop.

I will put the demo in a git repo as soon as possible and share it here. To run the demo you can use a laptop, and then we can run it on a VM somewhere at some point.

Stay tuned
Enrico
Approval for project "BoF" get-togethers?
Hello ComDev,

I'm the Apache Solr PMC chair and I have some branding/trademark questions pertaining to policies around event organization and the ASF rules on such. I've read:

[1] Policy for Event names using Apache marks: https://www.apache.org/foundation/marks/events.html#events
[2] Approval of small Apache-related events: https://community.apache.org/events/small-events.html

Questions:

* At ASF Community-over-Code, if someone organizes a Birds of a Feather for Solr and it gets onto the event schedule, should it be necessary to get the Solr PMC's approval beforehand? Would it matter whether the person who arranged it is a PMC member themselves? Please explain the answer with a rationale based on the current policy. It's unclear whether the BoF *itself* is a "small Apache-related event", or whether the fact that it's at an ASF ticketed conference overrides that, because then the policy wouldn't apply at all (nothing is "3rd party").

* If such a BoF were to be organized at a non-Apache conference (e.g. Berlin Buzzwords), presumably Solr PMC permission is needed as specified by [2]. An unclear aspect of the policy is what the "event" is -- is it the entire conference, or could it be the proposed BoF talk as well, even though it takes place as part of another event? If we're only looking at the BoF/talk itself, would it be "3rd party" if the primary speaker is a PMC member? The text at https://www.apache.org/foundation/marks/resources (search for "third party") seems to contrast PMC members & committers in a way that implies they are *not* third party.

~ David Smiley
Apache Lucene/Solr Search Developer
http://www.linkedin.com/in/davidwsmiley