Hello, I have a problem with the way magnolia(5.3.12) is indexing page content. It is indexing not just page content but css classes content too. Which brings me to results that are not accurate
the search index config: <SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex"> <param name="path" value="${wsp.home}/index" /> <param name="useCompoundFile" value="true" /> <param name="minMergeDocs" value="100" /> <param name="volatileIdleTime" value="3" /> <param name="maxMergeDocs" value="100000" /> <param name="mergeFactor" value="10" /> <param name="maxFieldLength" value="10000" /> <param name="bufferSize" value="10" /> <param name="cacheSize" value="1000" /> <param name="queryClass" value="org.apache.jackrabbit.core.query.QueryImpl" /> <param name="respectDocumentOrder" value="true" /> <param name="resultFetchSize" value="2147483647" /> <param name="extractorPoolSize" value="3" /> <param name="extractorTimeout" value="100" /> <param name="extractorBackLogSize" value="100" /> <param name="enableConsistencyCheck" value="false" /> <param name="forceConsistencyCheck" value="false" /> <param name="autoRepair" value="false" /> <param name="onWorkspaceInconsistency" value="log" /> </SearchIndex> the sql: QUERY_PATTERN = "select * from nt:base where jcr:path like ''{0}/%'' and contains(*, ''{1}'') order by jcr:path"; Is there something I am missing so that I can search only in actual content and not in all the HTML page? Thanks, Roxana -- Context is everything: http://forum.magnolia-cms.com/forum/thread.html?threadId=3393d9ea-35de-457d-b0e9-60345ec9e5a9 ---------------------------------------------------------------- For list details, see http://www.magnolia-cms.com/community/mailing-lists.html Alternatively, use our forums: http://forum.magnolia-cms.com/ To unsubscribe, E-mail to: <user-list-unsubscr...@magnolia-cms.com> ----------------------------------------------------------------