2017-10-14 13:02:48 +02:00
.. _configuration_options_index:
Indexing
========
Holds settings regarding the indexing, e.g. of TYPO3 records, to search services.
Configured as::
plugin {
tx_searchcore {
settings {
indexing {
identifier {
indexer = FullyQualifiedClassname
// the settings
}
}
}
}
}
Where `` identifier `` is up to you, but should match table names to make :ref: `TcaIndexer` work.
The following settings are available. For each setting its documented which indexer consumes it.
.. _rootLineBlacklist:
rootLineBlacklist
-----------------
2017-10-29 12:25:25 +01:00
Used by: :ref: `TcaIndexer` , :ref: `PagesIndexer` .
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Defines a blacklist of page uids. Records below any of these pages, or subpages, are not
indexed. This allows you to define areas that should not be indexed.
The page attribute *No Search* is also taken into account to prevent indexing records from only one
page without recursion.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Contains a comma separated list of page uids. Spaces are trimmed.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
plugin.tx_searchcore.settings.indexing.<identifier>.rootLineBlacklist = 3, 10, 100
2017-10-14 13:02:48 +02:00
Also it's possible to define some behaviour for the different document types. In context of TYPO3
tables are used as document types 1:1. It's possible to configure different tables. The following
options are available:
.. _additionalWhereClause:
additionalWhereClause
---------------------
2017-10-29 12:25:25 +01:00
Used by: :ref: `TcaIndexer` , :ref: `PagesIndexer` .
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Add additional SQL to where clauses to determine indexable records from the table. This way you
can exclude specific records like `` tt_content `` records with specific `` CType `` values or
something else. E.g. you can add a new field to the table to exclude records from indexing.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
plugin.tx_searchcore.settings.indexing.<identifier>.additionalWhereClause = tt_content.CType NOT IN ('gridelements_pi1', 'list', 'div', 'menu')
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
.. attention ::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Make sure to prefix all fields with the corresponding table name. The selection from
database will contain joins and can lead to SQL errors if a field exists in multiple tables.
2017-10-14 13:02:48 +02:00
.. _abstractFields:
abstractFields
--------------
2017-10-29 12:25:25 +01:00
Used by: :ref: `PagesIndexer` .
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Define which field should be used to provide the auto generated field "search_abstract".
The fields have to exist in the record to be indexed. Therefore fields like `` content `` are also
possible.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
# As last fallback we use the content of the page
plugin.tx_searchcore.settings.indexing.<identifier>.abstractFields := addToList(content)
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Default::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
abstract, description, bodytext
2017-10-14 13:02:48 +02:00
.. _mapping:
mapping
-------
2017-10-29 12:25:25 +01:00
Used by: Elasticsearch connection while indexing.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Define mapping for Elasticsearch, have a look at the official docs: https://www.elastic.co/guide/en/elasticsearch/reference/5.2/mapping.html
You are able to define the mapping for each property / columns.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
plugin.tx_searchcore.settings.indexing.tt_content.mapping {
CType {
type = keyword
2017-10-14 13:02:48 +02:00
}
2017-10-29 12:25:25 +01:00
}
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
The above example will define the `` CType `` field of `` tt_content `` as `` type: keyword `` . This
makes building a facet possible.
2017-10-14 13:02:48 +02:00
.. _index:
index
-----
2017-10-29 12:25:25 +01:00
Used by: Elasticsearch connection while indexing.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Define index for Elasticsearch, have a look at the official docs: https://www.elastic.co/guide/en/elasticsearch/reference/5.2/indices-create-index.html
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
plugin.tx_searchcore.settings.indexing.tt_content.index {
analysis {
analyzer {
ngram4 {
type = custom
tokenizer = ngram4
char_filter = html_strip
filter = lowercase, asciifolding
2017-10-14 13:02:48 +02:00
}
2017-10-29 12:25:25 +01:00
}
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
tokenizer {
ngram4 {
type = ngram
min_gram = 4
max_gram = 4
2017-10-14 13:02:48 +02:00
}
}
}
2017-10-29 12:25:25 +01:00
}
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
`` char_filter `` and `` filter `` are a comma separated list of options.
2017-10-14 13:02:48 +02:00
.. _dataProcessing:
dataProcessing
--------------
2017-10-29 12:25:25 +01:00
Used by: All connections while indexing.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Configure modifications on each document before sending it to the configured connection. Same as
provided by TYPO3 for :ref: `t3tsref:cobj-fluidtemplate` through
:ref: `t3tsref:cobj-fluidtemplate-properties-dataprocessing` .
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
All processors are applied in configured order. Allowing to work with already processed data.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Example::
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
plugin.tx_searchcore.settings.indexing.tt_content.dataProcessing {
1 = Codappix\SearchCore\DataProcessing\CopyToProcessor
1 {
to = search_spellcheck
}
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
2 = Codappix\SearchCore\DataProcessing\CopyToProcessor
2 {
to = search_all
2017-10-14 13:02:48 +02:00
}
2017-10-29 12:25:25 +01:00
}
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
The above example will copy all existing fields to the field `` search_spellcheck `` . Afterwards
all fields, including `` search_spellcheck `` will be copied to `` search_all `` .
E.g. used to index all information into a field for :ref: `spellchecking` or searching with
different :ref: `mapping` .
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
The following Processor are available:
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
.. toctree ::
:maxdepth: 1
:glob:
2017-10-20 16:36:26 +02:00
2017-10-29 12:25:25 +01:00
dataProcessing/CopyToProcessor
2017-11-08 20:20:37 +01:00
dataProcessing/RemoveProcessor
2017-10-29 12:25:25 +01:00
dataProcessing/GeoPointProcessor
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
The following Processor are planned:
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
`` Codappix\SearchCore\DataProcessing\ReplaceProcessor ``
Will execute a search and replace on configured fields.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
`` Codappix\SearchCore\DataProcessing\RootLevelProcessor ``
Will attach the root level to the record.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
`` Codappix\SearchCore\DataProcessing\ChannelProcessor ``
Will add a configurable channel to the record, e.g. if you have different areas in your
website like "products" and "infos".
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
`` Codappix\SearchCore\DataProcessing\RelationResolverProcessor ``
Resolves all relations using the TCA.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
Of course you are able to provide further processors. Just implement
`` Codappix\SearchCore\DataProcessing\ProcessorInterface `` and use the FQCN (=Fully qualified
class name) as done in the examples above.
2017-10-14 13:02:48 +02:00
2017-10-29 12:25:25 +01:00
By implementing also the same interface as necessary for TYPO3
:ref: `t3tsref:cobj-fluidtemplate-properties-dataprocessing` , you are able to reuse the same code
also for Fluid to prepare the same record fetched from DB for your fluid.