Re: [PR] Move gettingStarted into newcomers audience (comdev-site)

2023-10-10 Thread via GitHub


iagotb commented on PR #133:
URL: https://github.com/apache/comdev-site/pull/133#issuecomment-1754744377

   Hi @rbowen 
   Just letting you know that the getting started page is not working at
[https://community.apache.org/newcomers/gettingStarted.html](https://community.apache.org/newcomers/gettingStarted.html)
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@community.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For additional commands, e-mail: dev-h...@community.apache.org



Tika parser not parsing email content

2023-10-10 Thread Kashif Khan
Hi team,
I have been using the Tika parser to parse a few text files, and it worked
fine until I hit an issue: it cannot parse a text file that contains
'email/message contents'.
That is, if the text file contains any of the terms 'From: ', 'To: ', or
'Sent: ', Tika fails to parse the text correctly.
In my case, the parser drops most of the lines: only a single line remains
out of 40.

I am sharing a snippet of the text file for an example:

>
> *Some text here 1.*
> *Some text here 2.*
> *Some text here 3.*
> *Original Message-*
> *From: some_m...@abc.com *
> *Sent: Thursday, October 31, 2019 9:52 AM*
> *To: Some person, (The XYZ group)*
> *Subject: RE: Mr. Random person phone call: MESSAGE*
> *Hi,*
> *I am available now to receive the call.*
> *Some text here 4.*
> *Some text here 5.*
> *Some text here 6.*


The Tika parser reduces the above text to a single line, shown below:

> *Subject: RE: Mr. Random person phone call: MESSAGE*


Note that this happens in versions later than Tika 1.19; version 1.19
parses the contents perfectly fine.

Could you please help me understand the issue, or suggest a path forward?
That would be very helpful.

Thanks in advance.
-Kashif


Re: Tika parser not parsing email content

2023-10-10 Thread Bertrand Delacretaz
Hi,

On Tue, Oct 10, 2023 at 12:52 PM Kashif Khan wrote:
> ...I have been working on the Tika parser to parse a few text files and it has
> been working fine until I have come to an issue...

This list is for general community-related discussions. For Tika,
you'll have to ask on their own list, see
https://tika.apache.org/mail-lists.html

-Bertrand




Building a chatbot over the ASF community website

2023-10-10 Thread Enrico Olivelli
Hello,
I am sorry, but today at the Lightning talks I wanted to show how to build,
in 2 minutes, a chatbot that is able to answer questions about the ASF
community website.

I didn't know that at the Lightning talks it is possible to show your
laptop.

I will put up the demo in a git repo as soon as possible and I will share
it here.

To run the demo you can use a laptop, and then we can run it on a VM
somewhere at some point.

Stay tuned
Enrico


Approval for project "BoF" get-togethers ?

2023-10-10 Thread David Smiley
Hello ComDev,

I'm the Apache Solr PMC chair and I have some branding/trademark questions
pertaining to policies around event organization and the related ASF rules.

I've read:
[1] Policy for Event names using Apache marks:
https://www.apache.org/foundation/marks/events.html#events
[2] Approval of small Apache-related events:
https://community.apache.org/events/small-events.html

Question:
* At ASF Community-over-Code, if someone organizes a Birds of a Feather for
Solr and it gets onto the event schedule, is it necessary to get the
Solr PMC's approval beforehand?  Would it matter whether the person who
arranged it is a PMC member or not?  Please explain the answer with a
rationale based on the current policy.  It's unclear if the BoF
*itself* is a "small Apache-related event" or if the fact that it's at an
ASF ticketed conference overrides that, because then the policy wouldn't
apply at all (nothing is "3rd party").

* If such a BoF were to be organized at a non-Apache conference (e.g.
Berlin Buzzwords), presumably Solr PMC permission is needed as specified by
[2].

An unclear aspect of the policy is what the "event" is -- is it the entire
conference or could it be the proposed BoF talk as well, even though it's
composed as part of another event?  If we're only looking at the BoF/talk
itself, then would it be "3rd party" if the primary speaker is a PMC
member?  The text at https://www.apache.org/foundation/marks/resources
(search for "third party") seems to contrast PMC members & committers in a
way to imply they are *not* third party.

~ David Smiley
Apache Lucene/Solr Search Developer
http://www.linkedin.com/in/davidwsmiley