Class WAAnalyzer

  • All Implemented Interfaces:
    Closeable, AutoCloseable

    public final class WAAnalyzer
    extends org.apache.lucene.analysis.Analyzer
    Extends lucene Analyzer : prunes from a tokens stream all the meaningless words and prunes all the special characters.
    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.lucene.analysis.Analyzer

        org.apache.lucene.analysis.Analyzer.ReuseStrategy, org.apache.lucene.analysis.Analyzer.TokenStreamComponents
    • Field Summary

      • Fields inherited from class org.apache.lucene.analysis.Analyzer

        GLOBAL_REUSE_STRATEGY, PER_FIELD_REUSE_STRATEGY
    • Method Detail

      • getAnalyzer

        public static org.apache.lucene.analysis.Analyzer getAnalyzer​(String language)
        Returns the analyzer to be used with texts of the given language. The analyzers are cached.
        Parameters:
        language - the ISO 631-1 code of the language
        Returns:
        an analyser of a text written in the given language.
      • createComponents

        protected org.apache.lucene.analysis.Analyzer.TokenStreamComponents createComponents​(String s)
        Returns a tokens stream built on top of the given reader.
        Specified by:
        createComponents in class org.apache.lucene.analysis.Analyzer
      • getLanguage

        public String getLanguage()