// /apache-solr-non-ascii-characters

Apache Solr non-ASCII characters

This day I had a problem while indexing French and Russian content with Apache Solr: French and Russian characters where corrupted. The solution was very simple: I added URIEncoding="UTF-8" attribute...

This day I had a problem while indexing French and Russian content with Apache Solr: French and Russian characters where corrupted.

The solution was very simple: I added
URIEncoding="UTF-8"

attribute to the Connector tag which runs Solr in Tomcat's server.xml. Something like
<Connector port="8080" protocol="HTTP/1.1" 
connectionTimeout="20000"
redirectPort="8443" URIEncoding="UTF-8" />

Then Tomcat has to be restarted :-)
IT-Digest AI Assistant