0

I use the requests library to scrape some data from a webpage. This webpage contains special characters as unicode characters: '>' as '\u003e' for example.

A problem I have however is that python does not seem to treat the scraped unicode characters as unicode strings, but rather as regular strings starting with a backslash, i.e '\\u003e' instead of '\u003e'. Therefore when I try to unescape the unicode string I just get '\u003e' instead of '>'.

Does anyone have any idea of how to fix this?

hantan
  • 1
  • 2

0 Answers0