I have speech transcriptions with lots of special characters in a column in a dataframe, like so:
">like I don't understand< sorry like how old's your mom¿"
"°ye[a:h]°"
"°I don't know°"
When I read in the dataframe using read.table, I get the following output, where several funny new characters have incorrectly been inserted:
Output in R:
">like I don't understand< sorry like how old's your mom¿"
"°ye[a:h]°"
"°I don't know°"
How can I fix this issue?
You can specify the encoding while importing, or set it after importing the data.
Option 1
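A minimal sketch of what this could look like, assuming the file is saved as UTF-8. The temporary file, the column name `text`, and the tab separator are made up for illustration; substitute your own path and settings.

```r
# Hypothetical stand-in for the real transcription file, written as UTF-8 bytes.
path <- tempfile(fileext = ".txt")
lines <- c("text",
           "\"\u00b0ye[a:h]\u00b0\"",
           "\"\u00b0I don't know\u00b0\"")
con <- file(path, open = "wb")
writeLines(enc2utf8(lines), con, useBytes = TRUE)
close(con)

# Declare the encoding at import time so the strings are marked as UTF-8.
# quote = "\"" keeps the apostrophe in "don't" from being treated as a quote.
df <- read.table(path, header = TRUE, sep = "\t", quote = "\"",
                 encoding = "UTF-8", stringsAsFactors = FALSE)

print(df$text)
```

If the strings should be converted to the session's native encoding rather than just marked, read.table also accepts a `fileEncoding` argument.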
Option 2
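A minimal sketch of what this could look like, assuming the column holds UTF-8 bytes that R has declared as Latin-1. The data frame and the column name `text` are made up for illustration, and the wrong declaration is simulated on purpose.

```r
# Hypothetical data frame; the string itself is valid UTF-8.
df <- data.frame(text = "\u00b0ye[a:h]\u00b0", stringsAsFactors = FALSE)

# Simulate the problem: the same bytes are declared as Latin-1,
# so each non-ASCII character shows up as two characters.
Encoding(df$text) <- "latin1"

# Fix after importing: re-declare the column as UTF-8.
# This only changes the declared encoding, not the underlying bytes.
Encoding(df$text) <- "UTF-8"

print(df$text)
```

This only helps when the bytes really are UTF-8. If the file is in another encoding, `iconv()` can convert the column instead.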