0

Is there a tool that can detect which language a piece of text is written in?

For example, if I input an English sentence, it should suggest that the string is probably English.

I think I need one of the following:

  1. Python language recognition library.
  2. Web service that provides such functionality.

Can anyone help?

xiaohan2012

2 Answers

2

I have used the GuessLanguage module fairly extensively. It detects 100+ languages and works really well on long pieces of text. English detection is very good; I am not sure about other languages. It performs poorly on short sentences or tweets, because it was designed for detection on full articles.

My company (social media analytics) is currently looking for a replacement package as well and is exploring NLTK as an option.

JiminyCricket
1

Have you tried NLTK? See this SO question for further information.
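One approach often used with NLTK is to count how many of each language's stopwords appear in the text and pick the language with the most hits. Below is a rough sketch of that idea, not the method from the linked question. To keep it self-contained it uses tiny hand-picked word sets (an assumption for illustration); in practice you would probably load fuller lists, for example from NLTK's stopwords corpus. The name `guess_by_stopwords` is made up for this example.

```python
import re

# Tiny hand-picked stopword sets, used here only for illustration.
# These are an assumption; real lists would be much larger.
STOPWORDS = {
    "english": {"the", "and", "is", "of", "to", "in", "that", "it", "this", "a"},
    "french": {"le", "la", "les", "et", "est", "de", "un", "une", "que", "dans"},
    "german": {"der", "die", "das", "und", "ist", "nicht", "ein", "eine", "zu", "mit"},
    "spanish": {"el", "los", "y", "es", "de", "que", "en", "un", "una", "por"},
}


def guess_by_stopwords(text):
    """Return the language whose stopwords overlap most with the text,
    or None if no stopword from any list appears."""
    # Lowercase and keep runs of letters only (Unicode-aware).
    words = set(re.findall(r"[^\W\d_]+", text.lower()))
    # Score each language by the number of distinct stopwords found.
    scores = {lang: len(words & stops) for lang, stops in STOPWORDS.items()}
    best = max(scores, key=scores.get)  # ties go to the first language listed
    return best if scores[best] > 0 else None
```

Calling `guess_by_stopwords("This is a sentence in the English language.")` should pick `"english"`. Like most detectors, this gets less reliable as the input gets shorter, since a short string may contain few or no stopwords.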

There is also a web service from Xerox that claims to do what you want.

Community
Ocaso Protal