/apache-solr-non-ascii-characters
Apache Solr non-ASCII characters
This day I had a problem while indexing French and Russian content with Apache Solr: French and Russian characters where corrupted. The solution was very simple: I added URIEncoding="UTF-8" attribute...
Nov 29, 2012
~1 min read
This day I had a problem while indexing French and Russian content with Apache Solr: French and Russian characters where corrupted.
The solution was very simple: I added
URIEncoding="UTF-8"
attribute to the Connector tag which runs Solr in Tomcat's server.xml. Something like
<Connector port="8080" protocol="HTTP/1.1"
connectionTimeout="20000"
redirectPort="8443" URIEncoding="UTF-8" />
Then Tomcat has to be restarted :-)