Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-2051

analysis.jsp is incorrect for protWords etc

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.1, 4.0-ALPHA
    • None
    • Admin UI
    • None

    Description

      Analysis.jsp gives the incorrect results if you use "protwords.txt" or "stemdict.txt" or the like.

      This is because this is now implemented with KeywordAttribute (so you can easily override any stemmer etc).

      For example, if your schema had "foobars" in protwords.txt, analysis.jsp would show it being stemmed to "foobar", even though this doesnt actually happen.

      The problem is that this jsp is downconverting the entire tokenstream to Token in between processing, so it silently discards KeywordAttribute and you get the wrong result.

      Note: this issue isnt about displaying other attributes such as KeywordAttribute (which would be a new feature). Its about not throwing them away so that the analysis actually represents what happens.

      Attachments

        1. dynamic-AttributeSource.patch
          6 kB
          Uwe Schindler
        2. SOLR-2051.patch
          9 kB
          Robert Muir
        3. SOLR-2051.patch
          8 kB
          Robert Muir
        4. SOLR-2051.patch
          8 kB
          Robert Muir

        Activity

          People

            Unassigned Unassigned
            rcmuir Robert Muir
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: