Re: Issues when indexing PDF files

2015-12-17 Thread Charlie Hull
hat comes together with Solr. What could be the reason that causes this? Regards, Edwin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Many patterns against many sentences, storing all results

2016-01-06 Thread Charlie Hull
s. The smaller-scale prototype was done with postgres full text searching, but that can't do exact phrase matching or other more sophisticated searches, so it's out. Thanks very much Will -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile

XJoin, a way to use external data sources with Solr

2016-01-29 Thread Charlie Hull
em useful: http://www.flax.co.uk/blog/2016/01/25/xjoin-solr-part-1-filtering-using-price-discount-data/ http://www.flax.co.uk/blog/2016/01/29/xjoin-solr-part-2-click-example/ We're very interested in other use cases - one that occurs to us is security filtering. Cheers Charlie -- Ch

Re: Tutorial or Code Samples to explain how to Write Solr Plugins

2016-02-03 Thread Charlie Hull
Here's one we wrote recently for indexing ontologies with Solr as part of the BioSolr project: https://github.com/flaxsearch/BioSolr/tree/master/ontology/solr and a presentation on how it works (explained in the second half of the talk) https://www.youtube.com/watch?v=v1qKNX_axdI - hope this helps!

Re: How is Tika used with Solr

2016-02-10 Thread Charlie Hull
or does it link with Tika JARs directly? If it links in directly, are there known issues with Solr integrated with Tika because of Tika issues? Thanks Steve -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Reverse Eningeer Query For a Given Result Set?

2016-02-18 Thread Charlie Hull
ut exactly what differences you can tolerate. HTH, Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: What search metrics are useful?

2016-02-25 Thread Charlie Hull
ere a video or presentation on search metrics that would be useful? -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Indexing Twitter - Hypothetical

2016-03-04 Thread Charlie Hull
ot enough recent results is an example. I highly doubt that a single SolrCloud is the best answer here. Maybe one cloud for each month and a lot of external logic? - Toke Eskildsen -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: What is the best way to index 15 million documents of total size 425 GB?

2016-03-04 Thread Charlie Hull
ldsen, State and University Library, Denmark -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Relevancy for "tablet"

2016-03-09 Thread Charlie Hull
s to adjust - I would recommend you take a look at Doug Turnbull and John Berryman's new book 'Relevant Search' (available on MEAP at Manning Publications) which is an excellent take on this. In short, you need a sensible methodology for tuning relevance, otherwise it can easily be

Re: using data from external service in Solr: value source or auxiliary core?

2016-03-14 Thread Charlie Hull
-external-service-in-Solr-value-source-or-auxiliary-core-tp4263334.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Verifying - SOLR Cloud replaces load balancer?

2016-04-19 Thread Charlie Hull
pology, this can increase performance if you are returning large amounts of data - many or large fields or many documents. Cheers Tom -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Running out of disk space for Solr, a proposed solution

2016-04-21 Thread Charlie Hull
targetted at Solr: https://github.com/flaxsearch/harahachibu There's a blog post explaining how and why we built it at http://www.flax.co.uk/blog/2016/04/21/running-disk-space-elasticsearch-solr/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 m

Re: Use faceted search to drill down in hierarchical structure and omit node data outside current selection

2015-07-29 Thread Charlie Hull

Re: Please answer my question on StackOverflow ... "Best approach to guarantee commits in SOLR"

2015-08-26 Thread Charlie Hull
h engines and designed for this use. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Charlie Hull
Think about your query pattern when you decide how to shard. If most of your queries are for recent articles, then some shards will be loaded far more than others. Here's a rather old blog post we wrote on the subject (actually based on Xapian, another open source search engine, but the con

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-25 Thread Charlie Hull
works well for indexing both English and Chinese (Bilingual) documents, or do we need tokenizers that are customised for Chinese (e.g. HMMChineseTokenizerFactory)? Regards, Edwin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-25 Thread Charlie Hull
Paoding? Solr v4.6 I believe. Charlie Regards, Edwin On 25 September 2015 at 16:43, Charlie Hull wrote: On 23/09/2015 16:23, Alexandre Rafalovitch wrote: You may find the following articles interesting: http://discovery-grindstone.blogspot.ca/2014/01/searching-in-solr-analyzing-results

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-30 Thread Charlie Hull
well it has been a little painful. Have you tried to use HMMChineseTokenizer and JiebaTokenizer as well? I don't think so. Charlie Regards, Edwin On 25 September 2015 at 18:46, Charlie Hull wrote: On 25/09/2015 11:43, Zheng Lin Edwin Yeo wrote: Hi Charlie, Thanks for your

Re: Can StandardTokenizerFactory works well for Chinese and English (Bilingual)?

2015-09-30 Thread Charlie Hull
On 30 September 2015 at 16:20, Charlie Hull wrote: On 30/09/2015 04:09, Zheng Lin Edwin Yeo wrote: Hi Charlie, Hi, I've checked that Paoding's code is written for Solr 3 and Solr 4 versions. It is not written for Solr 5, thus I was unable to use it in my Solr 5.x version. I'

Re: Facet queries blow out the filterCache

2015-10-02 Thread Charlie Hull
estion is, why does this facet query have anything to do with the filterCache? This causes a huge amount of filterCache churn with no apparent benefit. -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> -- Sincerely yours Mikhail

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
/libs/previewgen It uses a headless version of Open Office under the hood to generate thumbnail previews for various common file types, plus some ImageMagick for PDF, all wrapped up in Python. Bear in mind this is 6 years old so some updating might be required! Cheers Charlie -- Charlie Hull Flax -

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
er feeling, but of course, it requires a community approach. Yes...and in an enterprise situation, this will depend on users spending time working on enhancing content, which is a battle seldom won :) Charlie Maybe both are needed if there's an infinite budget... Paul Charlie Hull <mai

Bioinformatics search event in Cambridge UK Feb 3rd & 4th 2016

2015-10-14 Thread Charlie Hull
It's linked to our project BioSolr which is developing Solr features for bioinformaticians such as ontology indexers, JOINs with external data and faceting improvements (although we're hoping they're also of general use). Cheers Charlie -- Charlie Hull Flax - Open Source Enterp

Re: DIH parallel processing

2015-10-15 Thread Charlie Hull
r updates. There are lots of great examples of high-performance indexing code available e.g.: http://bryanbende.com/development/2014/08/16/indexing-wikipedia-with-apache-solr/ Best Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767

Re: Solr node removed from zookeeper

2015-10-28 Thread Charlie Hull
tion I'm sorry for the long post. Thank you, Andrei -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-node-removed-from-zookeeper-tp4236931.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search te

Re: Closing Windows CMD kills Solr

2015-10-29 Thread Charlie Hull
HTH Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr Search: Access Control / Role based security

2015-11-18 Thread Charlie Hull
rests of the night, What immortal hand or eye Could frame thy fearful symmetry?" William Blake - Songs of Experience -1794 England -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: NRT vs Redis for Dynamic Data in SOLR (like counts, viewcounts, etc) -

2015-12-15 Thread Charlie Hull
ent. -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Geo Aggregations and Search Alerts in Solr

2015-02-24 Thread Charlie Hull
of Lucene. More news very soon! There are a couple of videos on that page that will explain further. We suspect our approach is considerably faster than the Percolator, and it's on the list to benchmark the two. Cheers Charlie Thank you. -- Charlie Hull Flax - Open Source Enterpris

Re: Creating facets based on the content field

2015-03-23 Thread Charlie Hull
w stored in core0's "content" field. I wonder how you create facets based on the field's contents, if you don't know in advance what it contains (unless you have compiled a list of frequently-occurring words in the PDFs, after reading them.) Many thanks. Philippe -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Indexing PDF and MS Office files

2015-04-16 Thread Charlie Hull
lusive use of the intended recipient. If you receive this e-mail in error please delete it from your system immediately and notify us either by e-mail or telephone. You should not copy, forward or otherwise disclose the content of the e-mail. The views expressed in this communication may no

Re: Solr + RDF = SolRDF

2015-04-28 Thread Charlie Hull
tion is warmly welcome ;) Best, Andrea [1] http://andreagazzarini.blogspot.it/2014/12/a-solr-rdf-store-and-sparql-endpoint-in.html [2] http://andreagazzarini.blogspot.it/2015/04/rdf-faceting-with-apache-solr-solrdf.html [3] https://github.com/agazzarini/SolRDF/wiki/Faceted%20search -- Charlie Hull

Re: Deleting Fields

2015-06-01 Thread Charlie Hull
flax.co.uk/blog/2011/06/24/how-to-remove-a-stored-field-in-lucene/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
//lucene.472066.n3.nabble.com/Merging-Sets-of-Data-from-Two-Different-Sources-tp4211166.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
ng-Sets-of-Data-from-Two-Different-Sources-tp4211166p4211169.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Merging Sets of Data from Two Different Sources

2015-06-11 Thread Charlie Hull
om/Merging-Sets-of-Data-from-Two-Different-Sources-tp4211166p4211172.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Can I instruct the Tika Entity Processor to skip the first page using the DIH?

2015-07-09 Thread Charlie Hull
: http://lucene.472066.n3.nabble.com/Can-I-instruct-the-Tika-Entity-Processor-to-skip-the-first-page-using-the-DIH-tp4216373.html Sent from the Solr - User mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr 5 options

2015-07-15 Thread Charlie Hull
e schema to include that field with the guessed fieldType. It's sort of like the managed schema, except it's managed automatically instead of by the admin. I personally would not want Solr to guess on the schema, I would want to explicitly define Solr's behavior ... but not everyone does things the same way that I do. Thanks, Shawn -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: [scottchu] What kind of configuration to use for this size of news data?

2016-05-11 Thread Charlie Hull
articles data? (Our business has this possibility of needs.) Yes, I guess so, but why copy it when you could just search it with a filter for the paper types? I'd like to hear and use some well suggestion and experiences. Thanks in advance and best regards. Scott Chu @ 2016/5/11 11:26 G

Re: [scottchu] What kind of configuration to use for this size ofnews data?

2016-05-11 Thread Charlie Hull
/11 (Wed) - Original Message - From: Charlie Hull To: solr-user@lucene.apache.org CC: Date: 2016/5/11 (Wed) 16:21 Subject: Re: [scottchu] What kind of configuration to use for this size ofnews data? On 11/05/2016 04:27, scott.chu wrote: Fix some typos, add some words and resend same question

Re: dtSearch parser & Introduction

2016-05-13 Thread Charlie Hull
for my stuff--do I add it to an existing subdirectory or create a new package? I think it'd be great if I could get a bare-bones example of a parser so that I can modify it--perhaps even keeping it in a separate Java project. Don't feel like you have to answer all of my questions--an ans

Re: dtSearch parser & Introduction

2016-05-13 Thread Charlie Hull
On 13/05/2016 10:41, Charlie Hull wrote: On 12/05/2016 23:50, Brandon Miller wrote: Hello, all! I'm a BloombergBNA employee and need to obtain/write a dtSearch parser for solr (and probably a bunch of other things a little later). I've looked at the available parsers and though

Re: Hierarchial Support - Solr

2016-05-19 Thread Charlie Hull
| | 1 | 13 | 131 | | 1 | 13 | 132 | | 1 | 13 | 133 | -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Add a new field dynamically to each of the result docs and sort on it

2016-06-01 Thread Charlie Hull
you are dealing with "X", you are assuming "Y" will help you, and you are asking about "Y" without giving more details about the "X" so that we can understand the full issue. Perhaps the best solution doesn't involve "Y" at all? See Also: http:

Re: Bypassing ExtractingRequestHandler

2016-06-10 Thread Charlie Hull
entries to keep up with the pulse of the project? Thanks, Justin -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr more like this

2016-07-06 Thread Charlie Hull
d pdf file and get mlt result. Can I do this? If Solr hasn't indexed a PDF file, it can't work out it's 'like this'. So I'd say, no, you can't. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr and Drupal

2016-08-10 Thread Charlie Hull
defaults being set in the Solr configuration files. This is a generic issue when Solr or another search engine is embedded in another product - the people doing the embedding may not know enough about search to do it right. In any case, you'll probably be fine, but do be aware. Cheers

Three Lucene hackdays coming soon

2016-08-24 Thread Charlie Hull
ry to achieve! Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Apache SOLR Search Errors ?

2016-09-06 Thread Charlie Hull
r mailing list archive at Nabble.com. -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Hackday next month

2016-09-21 Thread Charlie Hull
join us. http://www.meetup.com/New-England-Search-Technologies-NEST-Group/events/233492535/ Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Hackday next month

2016-09-22 Thread Charlie Hull
ut others but it certainly would work much better for me. -Anshum On Wed, Sep 21, 2016 at 2:18 PM Charlie Hull wrote: Hi all, If you're coming to Lucene Revolution next month in Boston, we're running a Lucene-focused hackday (Lucene, Solr, Elasticsearch) kindly hosted by BA Ins

Re: (ANNOUNCEMENT) Solr Examples reading group

2016-10-05 Thread Charlie Hull
trong opinions about Solr examples, feel free to reach out directly and share them via email or in person. The opinions do not have to be positive, though having them constructive would be nice. :-) Newsletter and resources for Solr beginners and intermediates: http://www.solr-start.com/

Re: Writing Solr Custom Components

2016-10-05 Thread Charlie Hull
h-ontology-indexing/ Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

London Lucene Hackday is now running

2016-10-07 Thread Charlie Hull
Hi all, We're running a Lucene hackday in London - you can follow along with Twitter using hashtag #LuceneSolrLondon and see what we're doing on Github at https://github.com/flaxsearch/london-hackday-2016 - as the README shows we're currently looking at: 1. A Browser-driven explorer for Lu

Re: London Lucene Hackday is now running

2016-10-07 Thread Charlie Hull
g reports out of Jira exports. On 7 Oct 2016 4:52 PM, "Charlie Hull" wrote: Hi all, We're running a Lucene hackday in London - you can follow along with Twitter using hashtag #LuceneSolrLondon and see what we're

Re: London Lucene Hackday is now running

2016-10-12 Thread Charlie Hull
day, October 7, 2016 5:24 PM, Charlie Hull wrote: Yes I'll blog about it and we'll try and get as much as possible captured in the Github folder. If you've got ideas for Tuesday please could you add them to that event's Meetup page? Cheers

Re: London Lucene Hackday is now running

2016-10-21 Thread Charlie Hull
On 07/10/2016 10:52, Charlie Hull wrote: Hi all, We're running a Lucene hackday in London - you can follow along with Twitter using hashtag #LuceneSolrLondon and see what we're doing on Github at https://github.com/flaxsearch/london-hackday-2016 - as the README shows we're curre

Re: Solr 6.4. Can't index MS Visio vsdx files

2017-07-04 Thread Charlie Hull
ktrace. You'll probably get more helpful answers over on POI. Sorry, I can't help with this... Best, Tim P.S. 3.1. ooxml-schemas-1.3.jar instead of poi-ooxml-schemas-3.15.jar You shouldn't need both. Ooxml-schemas-1.3.jar should be a super set of poi-ooxml-schem

Re: Need guidance for distributing data base on date interval in a collection

2017-07-18 Thread Charlie Hull
Hi, You should also consider how you should shard for best performance: for example, if most of your queries are for recent documents, you could end up with them all hitting only one shard. Here's an old blog we wrote on this subject (it mentions another open source engine, Xapian, but ignore that

Re: Debug Queries field explaination

2017-07-20 Thread Charlie Hull
d what these exactly mean. You might take a look at Open Source Connections' excellent Splainer tool - www.splainer.io Cheers Charlie Thanks. --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700

Re: Architecture: How to commercialise a Solr based application?

2017-08-07 Thread Charlie Hull
. Thank you, Paul --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: MongoDb vs Solr

2017-08-07 Thread Charlie Hull
omi Francesco --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr staying constant on popularity indexes

2017-10-10 Thread Charlie Hull
"just supply a few search terms"-scenario. - Toke Eskildsen, Royal Danish Library --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: E-Commerce Search: tf-idf, tie-break and boolean model

2017-10-17 Thread Charlie Hull
.com/Solr-User-f472068.html --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Ways to weight SOLR search results without development

2017-10-24 Thread Charlie Hull
ecipient(s), you are hereby notified that any dissemination of this communication is strictly prohibited. If you have received this communication in error, please erase all copies of the message and its attachments and notify us immediately. -- Charlie Hull Flax - Open Source Enterprise

Re: Fwd: configuring Solr with Tesseract

2017-11-06 Thread Charlie Hull
g to need to do some coding I think - Python would be a good choice - to feed your source files to Tika for OCR and extraction, and then the resulting text to Solr for indexing. Cheers Charlie Thank you very much -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118

Re: Quick Query about

2017-11-09 Thread Charlie Hull
cess the PDF files from another server. Kind regards, Karan -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)87

Re: solr expertise

2017-11-16 Thread Charlie Hull
re in etc. Best Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Does apache solr stores the file?

2017-12-07 Thread Charlie Hull
ibed above is the usual way to deal with this situation. Best Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Any Insights SOLR Rank tuning tool

2017-12-14 Thread Charlie Hull
re both sets of search results) is essential. Remember that you might even have to replicate some 'wrong' behaviour of the old engine as people are used to it! Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr/lucene "planet" + recommendations for blogs to follow

2016-11-23 Thread Charlie Hull
Hi all, We also blog about various Solr topics at www.flax.co.uk/blog and also run the London Lucene/Solr Meetup. I'd encourage you to attend a Meetup if you can find one locally, they're great places to hear about Solr projects and meet others working in the field. Alex & others' efforts in creati

Re: The state of Solr 5. Is it in maintenance mode only?

2016-11-29 Thread Charlie Hull
the 3.x days. No changes were committed for SOLR-2242. The changes for SOLR-6348 were committed to 5.2 and 6.0. I have updated the fix versions in the older issue to match. The versions should probably all be removed, but I am not sure what our general rule is for duplicates. Thanks, Shawn -- C

Re: [ANN] InvisibleQueriesRequestHandler

2016-12-05 Thread Charlie Hull
system using Solr is doing (e.g. Hybris, Drupal...) so to be able to run multiple searches in Solr itself is very useful. Nice one! Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Stop Solr Node (in distress)?

2016-12-20 Thread Charlie Hull
-- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: SolrCloud different score for same document on different replicas.

2017-01-05 Thread Charlie Hull
ckday (see item 3) https://github.com/flaxsearch/london-hackday-2016 I'm not sure there is a way to get a homogenous score - this patch tries to keep you connected to the same replica during a session so you don't see results jumping over pagination. Cheers Charlie -- Charlie Hul

Re: A tool to quickly browse Solr documents ?

2017-01-24 Thread Charlie Hull
er: https://github.com/flaxsearch/marple Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: How to combine third party search data as top results ?

2017-02-01 Thread Charlie Hull
, allowing you to use results from an external system with Solr. Here are two blog posts about it: http://www.flax.co.uk/blog/2016/01/25/xjoin-solr-part-1-filtering-using-price-discount-data/ http://www.flax.co.uk/blog/2016/01/29/xjoin-solr-part-2-click-example/ Cheers Charlie -- Charlie Hull

Re: Upserting doc fields from a SearchComponent

2017-02-01 Thread Charlie Hull
some experience in personalizing a search response in the Solr guts. Best Ugo -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Announcing Marple, a RESTful API & GUI for inspecting Lucene indexes

2017-02-24 Thread Charlie Hull
uch a work in progress (we started it at the Lucene hackday we ran in London last autumn) so contributions, bug reports & feature requests very welcome! We'll also be talking about it at the next London Lucene/Solr Meetup on March 23rd. Best Charlie -- Charlie Hull Flax - Open Sou

Re: Announcing Marple, a RESTful API & GUI for inspecting Lucene indexes

2017-02-24 Thread Charlie Hull
On 24/02/2017 17:24, Charlie Hull wrote: Hi all, Very pleased to announce the first release of Marple, an open source tool for inspecting Lucene indexes. We've blogged about it here: http://www.flax.co.uk/blog/2017/02/24/release-1-0-marple-lucene-index-detective/ which contains links t

Re: minimal solrconfig example

2017-03-02 Thread Charlie Hull
ethod of hacking large chunks of it out and seeing what breaks what. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Urgent - Solr indexing is taking hours and dashboard page is not getting rendered at all :(

2017-03-09 Thread Charlie Hull
d also of course ask Hybris support for help. Charlie Inline image 1 -- Regards Gaurav Srivastava -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Partial Match with DF

2017-03-16 Thread Charlie Hull
Hi Mark, Open Source Connections' excellent www.splainer.io might also be useful to help you break down exactly what your query is doing. Cheers Charlie P.S. planning a blog soon listing 'useful Solr tools' On 16 March 2017 at 14:39, Mark Johnson wrote: You're right! The fields I'm sea

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Charlie Hull
tell me how can I remove all kind of caching from solr/lucene ? Hi Nilesh, The operating system will surely still be caching disk reads? Charlie Thanks, Nilesh kamani -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web

Re: Poll: Master-Slave or SolrCloud?

2017-04-25 Thread Charlie Hull
his? See https://twitter.com/sematext/status/854927627748036608 Thanks, Otis -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull

Re: Poll: Master-Slave or SolrCloud?

2017-04-28 Thread Charlie Hull
perhaps that's a good signal for Solr development/priorities in terms of ZK hiding/automating/embedding/something... Otis -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ On Tue, Apr 25, 2017 at 4:50 AM, Cha

Re: Solr Web Crawler - Robots.txt

2017-06-02 Thread Charlie Hull
cene.apache.org/solr which I am guessing allows it. I was also wondering how to find the name of the crawler bin/post uses. --- This email has been checked for viruses by AVG. http://www.avg.com -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334

Re: OCR - Saving multi-term position

2014-07-03 Thread Charlie Hull
') but the approach is certainly sound. The following talk from Lucene Revolution is about this kind of thing: http://www.youtube.com/watch?v=rmRCsrJp2A8 Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Any Solr consultants available??

2014-07-25 Thread Charlie Hull
wsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Re: Solr vs ElasticSearch

2014-08-01 Thread Charlie Hull
On 01/08/2014 06:43, Alexandre Rafalovitch wrote: Maybe Charlie Hull can answer that: https://twitter.com/FlaxSearch/status/494859596117602304 . He seems to think that - at least in some cases - Solr is faster. I'll try to expand on the tweet. Firstly, this is a totally unscien

Re: Solr vs ElasticSearch

2014-08-01 Thread Charlie Hull
onal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 On Fri, Aug 1, 2014 at 3:44 PM, Charlie Hull wrote: On 01/08/2014 06:43, Alexandre Rafalovitch wrote: Maybe Ch

Re: Revolution writeup

2013-11-26 Thread Charlie Hull
there, mostly though. Sorry if I missed your talk -- I'm hoping to catch up when the videos are posted... http://blog.safariflow.com/2013/11/25/this-revolution-will-be-televised/ -Mike Sokolov -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobil

Re: Converting Fast ESP queries(fql) to Solr query syntax

2013-11-28 Thread Charlie Hull
h FAST ESP. Our presentation is here if you want to find out more: http://www.lucenerevolution.org/2013/Lucene-Solr-Revolution-2013-Dublin-Presentations#Charlie_Hull Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk

Introducing Luwak for high-performance stored Lucene queries

2013-12-06 Thread Charlie Hull
media monitoring applications but it could equally be useful for categorisation, classification etc. It's currently based on a fork of Lucene (details supplied) but hopefully it'll work with release versions soon. Feedback is very welcome! Cheers Charlie -- Charlie Hull Fla

Re: Introducing Luwak for high-performance stored Lucene queries

2013-12-06 Thread Charlie Hull
s C Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/ On Fri, Dec 6, 2013 at 9:29 AM, Charlie Hull wrote: Hi all, We've now released the library we mentioned in our presentation at Lucene Revolution: https://github.com/f

Re: Solr hanging when extracting a some broken .doc files

2013-12-18 Thread Charlie Hull
PDF files, yes there are such horrors as 3000 page PDFs!). We usually run it in an external process so it can be watched and killed if necessary. Cheers Charlie -- Charlie Hull Flax - Open Source Enterprise Search tel/fax: +44 (0)8700 118334 mobile: +44 (0)7767 825828 web: www.flax.co.uk
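The "external process that can be watched and killed" idea in the snippet above could be sketched roughly as follows. This is only an illustration, not the code Flax actually used: the helper name `run_with_timeout` and the example extractor command are assumptions made up for this sketch.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_seconds):
    """Run cmd as a separate process and give up if it takes too long.

    Returns (ok, output). ok is False when the process timed out or
    exited with a non-zero status; output is whatever it wrote to stdout.
    """
    try:
        # subprocess.run kills the child and waits for it when the
        # timeout expires, then raises TimeoutExpired.
        result = subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout_seconds
        )
    except subprocess.TimeoutExpired:
        return False, ""
    return result.returncode == 0, result.stdout


if __name__ == "__main__":
    # Hypothetical usage: the jar path and file name are placeholders.
    # ok, text = run_with_timeout(
    #     ["java", "-jar", "tika-app.jar", "--text", "broken.doc"], 60
    # )
    ok, text = run_with_timeout([sys.executable, "-c", "print('hello')"], 10)
    print(ok, text.strip())
```

The point of doing it this way is that a document which makes the extractor hang only costs you one child process, and the indexer itself carries on with the next file.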

Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Charlie Hull
linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Wed, Dec 18, 2013 at 3:47 PM, Charlie Hull wrote: On 17/12/2013 15:29, Augusto Camarotti wrote: Hi guys,

Re: Zookeeper as Service

2014-01-09 Thread Charlie Hull
ch may mean any error messages end up in a black hole, with you simply getting unhelpful 'service failed to start' error messages from Windows itself if something goes wrong. The 'working directory' is another thing that needs careful setting up. Cheers Charlie
