I have a folder named train that contains four subfolders: economy, sports, magazine and health. Each subfolder contains txt files, and my goal is to read all of the training data in them.

from sklearn.datasets import load_files

news_data = load_files(r"C:\train")

I used this code to read all the files in the folder, and it does load all the data. The problem is that the txt files contain Turkish characters such as "ü" and "ö". When I print news_data, I see strings like "Roma\x92da", which should be "Roma'da", and "d\xf6rt", which should be "dört". How can I solve this problem?
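For reference, here is a minimal sketch of what seems to be going on, assuming the files were saved in the Windows Turkish code page (cp1254). That encoding is a guess based on the bytes \x92 and \xf6, not something confirmed for these files. By default load_files returns raw bytes; it also accepts an encoding argument that decodes the text while loading. The helper names decode_news and load_news are hypothetical, used only for illustration.

```python
def decode_news(raw: bytes, encoding: str = "cp1254") -> str:
    """Decode raw bytes read from a txt file (cp1254 is an assumption)."""
    return raw.decode(encoding)


def load_news(path: str, encoding: str = "cp1254"):
    """Load the train folder and decode each file while reading it.

    Requires scikit-learn; imported here so the decoding demo below
    runs without it.
    """
    from sklearn.datasets import load_files

    # With encoding set, the loaded data holds str objects instead of bytes.
    return load_files(path, encoding=encoding)


# The two byte strings from the printed output, decoded by hand.
roma = decode_news(b"Roma\x92da")  # "Roma’da" (typographic apostrophe, U+2019)
dort = decode_news(b"d\xf6rt")     # "dört"
```

If the guess is right, load_news(r"C:\train") should give readable text. Note that \x92 decodes to the typographic apostrophe ’ rather than a plain ', so a replace step may still be needed if the straight quote is required.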

desertnaut
gbu

0 Answers