Skip to content
Snippets Groups Projects
Select Git revision
  • master default protected
1 result

openlobby-es-czech

  • Clone with SSH
  • Clone with HTTPS
  • Jan Bednařík's avatar
    jan.bednarik authored
    deb45420
    History

    Elasticsearch with Czech analyzer

    Batteries included:

    • ICU Analysis Plugin
    • Czech Hunspell dictionaries (from OpenOffice)

    X-Pack plugin is removed.

    Usage

    Create your index with these settings:

    {
        'settings': {
            'analysis': {
                'filter': {
                    'czech_stop': {
                        'type': 'stop',
                        'stopwords': '_czech_',
                    },
                    'czech_stemmer': {
                        'type': 'stemmer',
                        'language': 'czech',
                    },
                    'cs_CZ': {
                        'type': 'hunspell',
                        'locale': 'cs_CZ',
                        'dedup': True,
                    }
                },
                'analyzer': {
                    'czech': {
                        'tokenizer': 'standard',
                        'filter': [
                            'icu_folding',
                            'lowercase',
                            'czech_stop',
                            'czech_stemmer',
                            'cs_CZ',
                        ]
                    }
                }
            }
        }
    }

    You can use 'czech' analyzer on text fields now.

    Build

    Build docker container: make build

    Run for local development

    Run Elasticsearch on port 9200: make run Stop Elasticsearch container: make stop

    Stop and/or remove volume with data: make destroy