solr basic authentication

2022-06-27 Thread Anchal Sharma2
Hi All ,

Is there any standard tool for encrypting -decrypting the password present in 
security.json ?
"credentials":{"solr":"encrypted password​ present here"}},

I  want to change the default password present in the security.json and then 
upload the json with updated password to our  zookeeper .We are using basic 
authentication on solr v8.11.1 and zookeeper 3.5.8  .

Thank you
Anchal Sharma


[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022

2022-06-27 Thread Gavin McDonald
 To all committers and non-committers.

This is a final call to apply for travel/hotel assistance to get to and
stay in New Orleans
for ApacheCon 2022.

Applications have been extended by one week and so the application deadline
is now the 8th July 2022.

The rest of this email is a copy of what has been sent out previously.

We will be supporting ApacheCon North America in New Orleans, Louisiana,
on October 3rd through 6th, 2022.

TAC exists to help those that would like to attend ApacheCon events, but
are unable to do so for financial reasons. This year, We are supporting
both committers and non-committers involved with projects at the
Apache Software Foundation, or open source projects in general.

For more info on this year's applications and qualifying criteria, please
visit the TAC website at http://www.apache.org/travel/
Applications have been extended until the 8th of July 2022.

Important: Applicants have until the closing date above to submit their
applications (which should contain as much supporting material as required
to efficiently and accurately process their request), this will enable TAC
to announce successful awards shortly afterwards.

As usual, TAC expects to deal with a range of applications from a diverse
range of backgrounds. We therefore encourage (as always) anyone thinking
about sending in an application to do so ASAP.

Why should you attend as a TAC recipient? We encourage you to read stories
from
past recipients at https://apache.org/travel/stories/ . Also note that
previous TAC recipients have gone on to become Committers, PMC Members, ASF
Members, Directors of the ASF Board and Infrastructure Staff members.
Others have gone from Committer to full time Open Source Developers!

How far can you go! - Let TAC help get you there.


===

Gavin McDonald on behalf of the Travel Assistance Committee.


Re: SolrCloud on Red Hat OpenShift Service on AWS

2022-06-27 Thread Vincenzo D'Amore
Ping

On Fri, 24 Jun 2022 at 15:27, Vincenzo D'Amore  wrote:

> Hi all,
>
> I'm guessing if anyone has already deployed SolrCloud on Red Hat OpenShift
> Service on AWS.
> Can we run the Solr official image on ROSA out of the box or do we need to
> do some customization?
> Honestly I would prefer to run SolrCloud in a plain Kubernetes cluster
> deployment, but my customers are asking me to do it with ROSA.
>
> Do you know any drawbacks?
> Any thoughts or suggestions will be appreciated.
>
> Best regards,
> Vincenzo
>
>
> --
> Vincenzo D'Amore
>
> --
Vincenzo D'Amore


Re: solr basic authentication

2022-06-27 Thread Srijan
Have you looked into this?

https://www.planetcobalt.net/sdb/solr_password_hash.shtml

On Mon, Jun 27, 2022 at 3:08 AM Anchal Sharma2  wrote:

> Hi All ,
>
> Is there any standard tool for encrypting -decrypting the password present
> in security.json ?
> "credentials":{"solr":"encrypted password​ present here"}},
>
> I  want to change the default password present in the security.json and
> then upload the json with updated password to our  zookeeper .We are using
> basic authentication on solr v8.11.1 and zookeeper 3.5.8  .
>
> Thank you
> Anchal Sharma
>


Re: Semantic Knowledge Graph theoric question

2022-06-27 Thread Alessandro Benedetti
Hi Davide,
I assume that "abstracts_gene_pubtator_annotation_ids" just contains
un-tokenized id, so I don't think there is a matter of tokenization,
shingles etc.
What we want is a single ID to be a single term in the index.

If you want to debug the relatedness calculation, take a look here :
org.apache.solr.search.facet.RelatednessAgg#computeRelatedness
The formula you mentioned is ok, but I would recommend remote
debugging Solr and putting some breakpoints there to investigate if
something doesn't look right.

Let me know!

--
*Alessandro Benedetti*
CEO @ Sease Ltd.
*Apache Lucene/Solr Committer*
*Apache Solr PMC Member*

e-mail: a.benede...@sease.io


*Sease* - Information Retrieval Applied
Consulting | Training | Open Source

Website: Sease.io 
LinkedIn  | Twitter
 | Youtube
 | Github



On Wed, 22 Jun 2022 at 08:37, Danilo Tomasoni  wrote:

> Hello Dave, first of all thank you for your answer.
>
> I need to clarify that I've used separate (and quite good) NER  algorithms
> offline and the results were imported to solr.
>
> Unfortunately the approach that you suggest using the morelikethis
> functionality is not suitable for my needs since I need to discover
> statistically significative relations between NER entities, while MLT will
> give me NER entities "similar" to the ones I'm looking for, as far as I
> understand.
>
> Anyone knows why the relatedness is high even if the foreground (and even
> background) popularity is 0?
>
> Danilo Tomasoni
>
> Fondazione The Microsoft Research - University of Trento Centre for
> Computational and Systems Biology (COSBI)
> Piazza Manifattura 1,  38068 Rovereto (TN), Italy
> tomas...@cosbi.eu<
> https://webmail.cosbi.eu/owa/redir.aspx?C=VNXi3_8-qSZTBi-FPvMwmwSB3IhCOjY8nuCBIfcNIs_5SgD-zNPWCA..&URL=mailto%3acalabro%40cosbi.eu
> >
> http://www.cosbi.eu<
> https://webmail.cosbi.eu/owa/redir.aspx?C=CkilyF54_imtLHzZqF1gCGvmYXjsnf4bzGynd8OXm__5SgD-zNPWCA..&URL=http%3a%2f%2fwww.cosbi.eu%2f
> >
>
> As for the European General Data Protection Regulation 2016/679 on the
> protection of natural persons with regard to the processing of personal
> data, we inform you that all the data we possess are object of treatment in
> the respect of the normative provided for by the cited GDPR.
> It is your right to be informed on which of your data are used and how;
> you may ask for their correction, cancellation or you may oppose to their
> use by written request sent by recorded delivery to The Microsoft Research
> – University of Trento Centre for Computational and Systems Biology Scarl,
> Piazza Manifattura 1, 38068 Rovereto (TN), Italy.
> P Please don't print this e-mail unless you really need to
> 
> Da: Dave 
> Inviato: martedì 21 giugno 2022 19:51
> A: users@solr.apache.org 
> Oggetto: Re: Semantic Knowledge Graph theoric question
>
> [CAUTION: EXTERNAL SENDER]
> [Please check correspondence between Sender Display Name and Sender Email
> Address before clicking on any link or opening attachments]
>
>
> Two hints. The ner from solr isn’t very good, and the relatedness function
> is iffy at best.
>
> I would take a different approach. Get the ner data as you have it now and
> use shingles to make a better formed complete index using stop words then
> use the mlt mech to see if it’s better.   If it is, great if not it’s just
> an idea.
>
>
> > On Jun 21, 2022, at 12:02 PM, Danilo Tomasoni  wrote:
> >
> > Hello all,
> > I'm experimenting with the SKG features available through json.facet API
> in solr 8.11 to discover semantic relations between medical text
> pre-annotated with NER algorithms.
> > I store the NER annotations, annotation id, span ecc in separate solr
> fields, to keep text clean.
> >
> > The first results looks promising but I found a behaviour that surprises
> me.
> > To give a bit of context I'm looking for covid-related papers with a
> standard query (q parameter)
> > Then I set my foreground query to be a set of keywords in OR related to
> the mithochondria, and the background query is set to *.
> >
> > Then the json.facet parameters are like
> >
> > "json.facet": {
> >"gene":{
> >  "type": "terms",
> >  "field": "abstracts_gene_pubtator_annotation_ids",
> >  "sort": { "r1": "desc" },
> >  "limit": 3,
> >  "facet": {
> >"r1" : "relatedness($fore,$back)"
> >}
> >  }
> >}
> > This should give gene stored in abstracts_gene_pubtator_annotation_ids
> that are more likely to occur in mitochondrial papers.
> > Running a test query gives me this surprising result
> >
> > ...
> >"gene": {
> >  "buckets": [
> >{
> >  "val": "3091",
> >  "count": 1,
> >  "rtitles1": {
> >"relatedness": 0.55649,
> >"foreg

Delete by Id in solr cloud

2022-06-27 Thread Satya Nand
Hi,

I have an 8 shards collection, where I am using *compositeId* routing
with *router.field
*(a field named parentglUsrId). The unique Id of the collection is a
different field *displayid*.

I am trying a delete by id operation where I pass a list of displayids to
delete. I observed that no documents are being deleted. when I checked the
logs I found that the deletion request for an Id might not go to the
correct shard and perform a request on some other shard that was not
hosting this Id. This might be due to solr trying to find the shard based
on the hash of displayid but my sharding is done on the basis of
parentglUsrId.


is there anything I am missing? Because it seems like a simple operation.
what do I need to do to broadcast a delete by id request to all shards so
relevant id can be deleted on each shard?