[ https://issues.apache.org/jira/browse/LUCENE-5205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073323#comment-14073323 ]
ASF GitHub Bot commented on LUCENE-5205:
----------------------------------------
GitHub user tballison opened a pull request:
https://github.com/apache/lucene-solr/pull/68
Lucene5205
LUCENE-5205
1) merge from trunk
2) roll in March 10, 2014 LUCENE-5205 patch for improved stopword handling
3) roll in SOLR-5410
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tballison/lucene-solr lucene5205
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/lucene-solr/pull/68.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #68
----
commit 3687d27902c3d993291a9f169f1c4a338c417327
Author: Uwe Schindler <[email protected]>
Date: 2014-06-11T17:50:45Z
SOLR-5940: post.jar reports back detailed error in case of error responses
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601970
13f79535-47bb-0310-9956-ffa450edef68
commit 4f2da71473619def348518402cc567f429047cc0
Author: Joel Bernstein <[email protected]>
Date: 2014-06-11T19:35:19Z
SOLR-6150: Improving AnalyticsMergeStrategyTest
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1601997
13f79535-47bb-0310-9956-ffa450edef68
commit 109c4c47679a193ac3ca3a4a449d759dbad59725
Author: shalin Shekhar Mangar <[email protected]>
Date: 2014-06-12T11:18:33Z
SOLR-6056: Don't publish recovery state until recovery runs to avoid
overwhelming the overseer state queue
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602123
13f79535-47bb-0310-9956-ffa450edef68
commit d553138492454798b9abeff7e610f0e8f3ddfb8b
Author: Michael McCandless <[email protected]>
Date: 2014-06-12T11:54:20Z
fix typo
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602131
13f79535-47bb-0310-9956-ffa450edef68
commit 4dd3197621324234e77e741fd843c4d76df07719
Author: Noble Paul <[email protected]>
Date: 2014-06-12T12:18:21Z
SOLR-6048 the assert was not really failing the test
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602138
13f79535-47bb-0310-9956-ffa450edef68
commit 2cdb0941446628663849f56ffbe4b42c62d00e0c
Author: Shai Erera <[email protected]>
Date: 2014-06-12T12:26:20Z
add comments to clarify code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602140
13f79535-47bb-0310-9956-ffa450edef68
commit d1274853919c1c9867e8e71117ff1303b6cc8816
Author: shalin Shekhar Mangar <[email protected]>
Date: 2014-06-12T15:45:08Z
Fix typo, rf is actually 3 in code
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602210
13f79535-47bb-0310-9956-ffa450edef68
commit 0b9f7edd3109467052137004d36abb7f793e5835
Author: Robert Muir <[email protected]>
Date: 2014-06-12T19:40:36Z
LUCENE-5748: Add SORTED_NUMERIC docvalues type
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602277
13f79535-47bb-0310-9956-ffa450edef68
commit e2f2c2fdaa77b4c17f6922fb9c5e25b02563855a
Author: Uwe Schindler <[email protected]>
Date: 2014-06-13T08:54:20Z
LUCENE-5754: Allow "$" as part of variable and function names in
expressions module
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602344
13f79535-47bb-0310-9956-ffa450edef68
commit 40137f9162350a6281e0d3fba99898fd66be28b2
Author: Adrien Grand <[email protected]>
Date: 2014-06-13T11:39:43Z
LUCENE-5695: DocIdSet implements Accountable.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602387
13f79535-47bb-0310-9956-ffa450edef68
commit ccf0a812d1644e70b33157d5c33b34e78889f327
Author: Simon Willnauer <[email protected]>
Date: 2014-06-13T11:41:19Z
LUCENE-5756: Implement Accountable from IndexWriter
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602388
13f79535-47bb-0310-9956-ffa450edef68
commit 0114c4e7292aa261996688b4f0813622d3ff99b3
Author: Simon Willnauer <[email protected]>
Date: 2014-06-13T11:49:54Z
Add Import Layout Table to idea codestyle
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602389
13f79535-47bb-0310-9956-ffa450edef68
commit 0e92dc55e6293c26c020550742e2272547589df7
Author: Robert Muir <[email protected]>
Date: 2014-06-13T20:41:17Z
LUCENE-5757: move RamUsageEstimator reflector to test-framework
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602515
13f79535-47bb-0310-9956-ffa450edef68
commit 912e74424411c9055371924f403c0f66535c3066
Author: Chris M. Hostetter <[email protected]>
Date: 2014-06-13T21:15:50Z
SOLR-5426: Fixed a bug in ReverseWildCardFilter that could cause
InvalidTokenOffsetsException when highlighting
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602525
13f79535-47bb-0310-9956-ffa450edef68
commit e9cb1382808cdd8f04dd837ce7fc473ed1e4a0b2
Author: Robert Muir <[email protected]>
Date: 2014-06-13T21:55:20Z
LUCENE-5760: Speed up BufferedIndexInput.randomAccessSlice
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602530
13f79535-47bb-0310-9956-ffa450edef68
commit 2df9ee28785e35e6e102b647b29e79660c10083f
Author: shalin Shekhar Mangar <[email protected]>
Date: 2014-06-14T14:46:10Z
SOLR-6161: SolrDispatchFilter should throw java.lang.Error back even if
wrapped in another exception
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602590
13f79535-47bb-0310-9956-ffa450edef68
commit 0b567b02453725acb47ed451b518a6103cc7a898
Author: shalin Shekhar Mangar <[email protected]>
Date: 2014-06-14T14:57:30Z
SOLR-6153: ReplicationHandler backup response format should contain backup
name
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602592
13f79535-47bb-0310-9956-ffa450edef68
commit 2ca52f0e5949f85427616ee2d3c6dfb836a80ff8
Author: Alan Woodward <[email protected]>
Date: 2014-06-16T09:17:23Z
SOLR-6169: Properly remove deprecated CoreAdminHandler handleAlias action
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602825
13f79535-47bb-0310-9956-ffa450edef68
commit 9fe4454fdd1160a4df87b3f65512a00bf822c5d1
Author: Robert Muir <[email protected]>
Date: 2014-06-16T11:07:19Z
LUCENE-5762: Disable old codecs as much as possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602845
13f79535-47bb-0310-9956-ffa450edef68
commit 4525f7ab6e5289a1f5357a577b1fb5dca7f6d46f
Author: Dawid Weiss <[email protected]>
Date: 2014-06-16T12:17:57Z
SOLR-6151: Intermittent TestReplicationHandlerBackup failures.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602854
13f79535-47bb-0310-9956-ffa450edef68
commit 7c506afc3bda28d056389e7679588cf8e6be0b94
Author: David Wayne Smiley <[email protected]>
Date: 2014-06-16T12:38:43Z
LUCENE-5648: (NumberRangePrefixTree) Bug-fix in initIter optimization.
Re-index required.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602857
13f79535-47bb-0310-9956-ffa450edef68
commit 730c70a59d26f2bc5aedc474c317c277b4c14b9e
Author: Robert Muir <[email protected]>
Date: 2014-06-16T13:14:01Z
LUCENE-5761: Remove DiskDocValuesFormat
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602862
13f79535-47bb-0310-9956-ffa450edef68
commit a55cbab679d5a88dbb36be7bf08d4dd27d39f602
Author: Adrien Grand <[email protected]>
Date: 2014-06-16T14:22:02Z
LUCENE-5759: Add PackedInts.unsignedBitsRequired.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602873
13f79535-47bb-0310-9956-ffa450edef68
commit c412a51bc386cb8636a96daaa4bf7ed4c63101bd
Author: Adrien Grand <[email protected]>
Date: 2014-06-16T14:25:12Z
LUCENE-5764: Add tests to DocIdSet.ramBytesUsed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602876
13f79535-47bb-0310-9956-ffa450edef68
commit 83c64d82de99a9180bb03f7648aca10412a513eb
Author: Adrien Grand <[email protected]>
Date: 2014-06-16T14:41:15Z
LUCENE-5765: Add tests to OrdinalMap.ramBytesUsed.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602880
13f79535-47bb-0310-9956-ffa450edef68
commit a28e98b142418347d260662e1b8c14f86536f9d5
Author: Timothy Potter <[email protected]>
Date: 2014-06-16T16:44:15Z
SOLR-6157: Added some logging and re-opened the socket proxy to try to
figure out why this test is hanging; reenabling temporarily to see if these
changes help diagnose the cause of the hang.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602924
13f79535-47bb-0310-9956-ffa450edef68
commit 5664fdd77a785b1c620527e1ecebb6659c457580
Author: Timothy Potter <[email protected]>
Date: 2014-06-16T18:28:07Z
SOLR-6015: Moving change note to 4.9 bugfix section.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602950
13f79535-47bb-0310-9956-ffa450edef68
commit f233dd7698ea7f2134e86bc696868f7116d6140e
Author: Timothy Potter <[email protected]>
Date: 2014-06-16T18:36:08Z
SOLR-5956: Use getInstanceDir instead of getRawInstanceDir as that was
causing issues when trying to create a backup of an index on Linux when
solr.solr.home is a symbolic link.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602953
13f79535-47bb-0310-9956-ffa450edef68
commit c1e8bb94e81eaa3c3188e81966ce311b624c0e96
Author: Adrien Grand <[email protected]>
Date: 2014-06-16T20:33:44Z
LUCENE-5767: OrdinalMap optimizations.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1602997
13f79535-47bb-0310-9956-ffa450edef68
commit 2b985b580148ecb9f969a918c715f6488d92577b
Author: Robert Muir <[email protected]>
Date: 2014-06-16T21:56:05Z
LUCENE-5768: hunspell condition checks with character classes were buggy
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1603007
13f79535-47bb-0310-9956-ffa450edef68
----
> [PATCH] SpanQueryParser with recursion, analysis and syntax very similar to
> classic QueryParser
> -----------------------------------------------------------------------------------------------
>
> Key: LUCENE-5205
> URL: https://issues.apache.org/jira/browse/LUCENE-5205
> Project: Lucene - Core
> Issue Type: Improvement
> Components: core/queryparser
> Reporter: Tim Allison
> Labels: patch
> Fix For: 4.9
>
> Attachments: LUCENE-5205-cleanup-tests.patch,
> LUCENE-5205-date-pkg-prvt.patch, LUCENE-5205.patch.gz, LUCENE-5205.patch.gz,
> LUCENE-5205_dateTestReInitPkgPrvt.patch,
> LUCENE-5205_improve_stop_word_handling.patch,
> LUCENE-5205_smallTestMods.patch, LUCENE_5205.patch,
> SpanQueryParser_v1.patch.gz, patch.txt
>
>
> This parser extends QueryParserBase and includes functionality from:
> * Classic QueryParser: most of its syntax.
> * SurroundQueryParser: recursive parsing for "near" and "not" clauses.
> * ComplexPhraseQueryParser: can handle "near" queries that include multiterms
> (wildcard, fuzzy, regex, prefix).
> * AnalyzingQueryParser: has an option to analyze multiterms.
> At a high level, a first-pass BooleanQuery/field parser runs, and then a
> span query parser handles all terminal nodes and phrases.
> Same as classic syntax:
> * term: test
> * fuzzy: roam~0.8, roam~2
> * wildcard: te?t, test*, t*st
> * regex: /[mb]oat/
> * phrase: "jakarta apache"
> * phrase with slop: "jakarta apache"~3
> * default "or" clause: jakarta apache
> * grouping "or" clause: (jakarta apache)
> * boolean and +/-: (lucene OR apache) NOT jakarta; +lucene +apache -jakarta
> * multiple fields: title:lucene author:hatcher
>
> Main additions in SpanQueryParser syntax vs. classic syntax:
> * Can require "in order" for phrases with slop with the ~> operator:
> "jakarta apache"~>3
> * Can specify "not near": "fever bieber"!~3,10 ::
> find "fever" but not if "bieber" appears within 3 words before or 10
> words after it.
> * Fully recursive phrasal queries with [ and ]; as in: [[jakarta
> apache]~3 lucene]~>4 ::
> find "jakarta" within 3 words of "apache", and that hit has to be within
> four words before "lucene"
> * Can also use [] for single level phrasal queries instead of " as in:
> [jakarta apache]
> * Can use "or grouping" clauses in phrasal queries: "apache (lucene solr)"~3
> :: find "apache" and then either "lucene" or "solr" within three words.
> * Can use multiterms in phrasal queries: "jakarta~1 ap*che"~2
> * Did I mention full recursion: [[jakarta~1 ap*che]~2 (solr~
> /l[ou]+[cs][en]+/)]~10 :: Find something like "jakarta" within two
> words of "ap*che" and that hit has to be within ten words of something like
> "solr" or that "lucene" regex.
> * Can require at least x number of hits at boolean level: apache AND (lucene
> solr tika)~2
> * Can use negative only query: -jakarta :: Find all docs that don't contain
> "jakarta"
> * Can use an edit distance > 2 for fuzzy query via SlowFuzzyQuery (beware of
> potential performance issues!).
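
The "not near" clause in the list above can be illustrated with a small Python sketch over token positions. This is only an illustration of the described behaviour, not the parser's implementation, which works with Lucene span queries rather than token lists. The function name `not_near`, the whitespace tokenization, and the inclusive window boundaries are all assumptions made for this example.

```python
def not_near(tokens, include, exclude, pre, post):
    """Return positions of `include` that have no `exclude` token within
    `pre` positions before or `post` positions after.

    Hypothetical helper: boundaries are treated as inclusive, which is an
    assumption and may differ from how the real parser counts distance.
    """
    exclude_positions = [i for i, tok in enumerate(tokens) if tok == exclude]
    hits = []
    for i, tok in enumerate(tokens):
        if tok != include:
            continue
        # Blocked if an excluded token sits in [i - pre, i) or (i, i + post].
        blocked = any(
            i - pre <= j < i or i < j <= i + post
            for j in exclude_positions
        )
        if not blocked:
            hits.append(i)
    return hits


def fever_without_bieber(text):
    """Rough analogue of "fever bieber"!~3,10 on whitespace-split text."""
    return not_near(text.lower().split(), "fever", "bieber", pre=3, post=10)
```

Under these assumptions, `fever_without_bieber("bieber has a fever")` finds nothing because "bieber" is three words before "fever", while `fever_without_bieber("bieber x y z fever")` keeps the hit because "bieber" is four words before it.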
> Trivial additions:
> * Can specify prefix length in fuzzy queries: jakarta~1,2 (edit distance = 1,
> prefix length = 2)
> * Can specify Optimal String Alignment (OSA) vs. Levenshtein for distance
> <= 2: jakarta~1 (OSA) vs. jakarta~>1 (Levenshtein)
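
For readers unfamiliar with the two distance measures named above, the following Python sketch shows how they differ: OSA counts swapping two adjacent characters as one edit, while plain Levenshtein counts it as two. The code is a textbook dynamic-programming version written for illustration, not what Lucene runs internally. The helper `fuzzy_match` and its treatment of the prefix (exact match on the first `prefix_length` characters, distance computed on the remainder) are assumptions for this example.

```python
def levenshtein(a, b):
    """Classic Levenshtein distance: insert, delete, substitute."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost))
        prev = cur
    return prev[-1]


def osa(a, b):
    """Optimal String Alignment: Levenshtein plus adjacent transposition."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i
    for j in range(cols):
        d[0][j] = j
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1, d[i - 1][j - 1] + cost)
            # An adjacent swap ("ar" <-> "ra") costs a single edit.
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def fuzzy_match(term, candidate, max_edits, prefix_length=0, transpositions=True):
    """Hypothetical matcher: exact prefix, then a bounded edit distance."""
    if term[:prefix_length] != candidate[:prefix_length]:
        return False
    distance = osa if transpositions else levenshtein
    return distance(term[prefix_length:], candidate[prefix_length:]) <= max_edits
```

With these definitions, "jakrata" is within one OSA edit of "jakarta" but two Levenshtein edits away, so under the assumptions above it would be accepted by something like jakarta~1,2 and rejected by jakarta~>1,2.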
> This parser can be very useful for concordance tasks (see also LUCENE-5317
> and LUCENE-5318) and for analytical search.
> Until LUCENE-2878 is closed, this might have a use for fans of SpanQuery.
> Most of the documentation is in the javadoc for SpanQueryParser.
> Any and all feedback is welcome. Thank you.
--
This message was sent by Atlassian JIRA
(v6.2#6252)