[
https://issues.apache.org/jira/browse/SOLR-12593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16602589#comment-16602589
]
David Smiley commented on SOLR-12593:
-------------------------------------
For example follow the tutorial/example in the ref guide on the feature in
which you post a PDF doc that will undoubtedly contain at least one data
metadata attribute. Any way, nevermind as I've been trying this out. I
rescind my proposal to change the config of /extract to explicitly add
processor=parse-date. Dates are just one data type; conceptually there's the
same issue with numbers and yet this content handler doesn't have special
considerations for them. I think a tip in the ref guide would be enough to
suggest using the parse\- related URPs to ensure dates and numbers are handled
properly.
I think the ref guide tutorial on this feature should be migrated from using
the "techproducts" example config to the "schemaless" example config (which is
really the default config). The techproducts config doesn't even have this URP
set up and I don't think it makes sense at all to use that config for this
feature. I'll post an update.
> Remove date parsing functionality from extraction contrib
> ---------------------------------------------------------
>
> Key: SOLR-12593
> URL: https://issues.apache.org/jira/browse/SOLR-12593
> Project: Solr
> Issue Type: Improvement
> Security Level: Public(Default Security Level. Issues are Public)
> Components: contrib - Solr Cell (Tika extraction)
> Reporter: David Smiley
> Assignee: David Smiley
> Priority: Major
> Fix For: master (8.0)
>
> Time Spent: 3h 50m
> Remaining Estimate: 0h
>
> The date parsing functionality in the extraction contrib is obsoleted by
> equivalent functionality in ParseDateFieldUpdateProcessorFactory. It should
> be removed. We should add documentation within this part of the ref guide on
> how to accomplish the same (and test it).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]