Elasticsearch with Czech analyzer
Docker image: openlobby/openlobby-es-czech
Batteries included:
- ICU Analysis Plugin
- Czech Hunspell dictionaries (from OpenOffice)
X-Pack plugin is removed.
Usage
Create Index with these settings:
{
'settings': {
'analysis': {
'filter': {
'czech_stop': {
'type': 'stop',
'stopwords': '_czech_',
},
'czech_stemmer': {
'type': 'stemmer',
'language': 'czech',
},
'cs_CZ': {
'type': 'hunspell',
'locale': 'cs_CZ',
'dedup': True,
},
'czech_synonym': {
'type': 'synonym',
'synonyms_path': 'analysis/cs_CZ/synonym.txt',
},
},
'analyzer': {
'czech': {
'tokenizer': 'standard',
'filter': [
'icu_folding',
'lowercase',
'czech_synonym',
'czech_stop',
'czech_stemmer',
'cs_CZ',
]
}
}
}
}
}
You can use 'czech'
analyzer on text fields now.
There is custom dictionary of synonyms included. You can adjust it for your needs or remove it from analyzer settings.
Build
If you don't want to use pre-built container from
openlobby/openlobby-es-czech
you can build it locally: make build
Run for local development
You need Docker Compose.
- Run on port
9200
:make run
- Stop:
make stop
- Stop and/or remove all data:
make destroy