How to quantify number of errors due to character encoding?

76 views Asked by At

I have a data process where strings are passed through many applications/databases with different character encodings. Some of the steps along the way are UTF-8 some aren't but the end database where I access that data is an oracle database that uses Windows-1252.

I am trying to convince my database administers that this is an issue but am having difficulty quantifying the magnitude of the issue. Is there a way to take the strings that are stored on my database and count the number of strings that contain at least one character that is showing up differently than it would have if the database used encoding UTF-8?

0

There are 0 answers