Le 4 juil. 2012 à 05:47, savitha sundaramurthy a écrit :
This is due to
the multi-lingual document (one document in four languages).
The multilinguality is, I think, on top of Savitha's priority.
The embedded solr server running on the machine doesnt have ability to
handle multiple languages at this time. So it's returnng weird results. I
am working on it. Yes, paul its on my top priority and researching more
about it.
Savitha,
this description reminds me of an old problem I had: having a whole infrastructure with
fixed analyzers. At the time I had simply written an analyzers that used thread-local to
recap the language and fork to the right analyzer at the tokenStream method. However, I
think there is more modern to this: MultiplexingAnalyzer or something like this. The
classics seem to indicated that using a suffix with the language is useful (and allows the
administrator to configure which analyzer is used for which language).
Paul