Questions tagged [computational-linguistics]

A branch of science that uses computers and mathematical methods to construct and investigate linguistic theory. Its technological and algorithmic implementation is called NLP.

A branch of science that uses computers and mathematical methods to construct and investigate linguistic theory. Its technological and algorithmic implementation is called Natural Language Processing (NLP) .

554 questions
10
votes
3 answers

How to test if a string of words is a grammatical sentence

Is there a way to test to see if a string of words forms a complete sentence? For example: The dog jumped over the fence == Good The cat square seven the triangle == BAD I was thinking the type of words (verb, noun, etc.) and order of the words…
Steven Smethurst
  • 211
  • 1
  • 2
  • 6
5
votes
2 answers

What value does lexical density add to analysis?

I came across the concept of lexical density while reading "Embracing a New Creed: Lexical Patterning and the Encoding of Ideology"[1] by Oliver Mason and Rhiannon Platt and was wondering what practical benefit it is for linguistic analysis. [1]…
swasheck
  • 201
  • 1
  • 5
4
votes
1 answer

How to calculate the co-occurrence between two words in a window of text?

I want to build a keyword extractor based on the TextRank model as explained in RMPT04. But I don't understand how to calculate the co-occurrence between two words in a window of text explained in the point 3.1. Moreover, is a corpus necessary?
Arnold Paye
  • 49
  • 1
  • 1
  • 2
4
votes
2 answers

NLP techniques on semantic similarity with different sentence construction

"S1. I lived in Paris for many years. I, therefore, know many places in Paris"; "S2. Having lived in Paris for long, I know several places there". Can NLP techniques infer that S1 and S2 are similar semantically? If yes, which one (references)?
Sanjay
  • 41
  • 3
4
votes
2 answers

Obtaining the stem of a word

I wonder if you know a computational method to obtain the stem of any English word. By stem I mean the part of the word which is never affected by plurals, temporal forms, and so on. For example, stem("cars") = car; stem("children") = child; stem…
user14662
  • 41
  • 1
4
votes
1 answer

What are good (state of the art?) methods for automatic grammar correction?

I'm just beginning to study automatic grammar correction, and was wondering what good methods are for this problem. I have a system that will select spelling candidates for each term, individually, and was considering the following approach for…
Dylon
  • 141
  • 2
4
votes
1 answer

How do we measure the degree of similarity between words?

Consider these words: reminder - remainder They are very similar in sound and letters. Yet they are far from each other in meaning. In computational linguistics, do we have algorithms to measure the amount of similarity between words?
Saeed Neamati
  • 1,365
  • 1
  • 10
  • 24
3
votes
3 answers

How to determine whether a given sentence is demanding an answer or is providing some information?

I tried with basic thing like whether question starts with who/what/.. but there are a lot many sentences which do not start with interrogative words but still demands an answer like "hotels in singapore". I have boiled down the logic that the…
3
votes
1 answer

Software to measure F-Score (formality) in English

Francis Heylighen and Jean-Marc Dewaele defined a measure of language formality called "F-Measure" in a paper published in 1999 titled "Formality of Language: definition, measurement and behavioral determinants". Is there software available that…
Eric R. Rath
  • 131
  • 2
3
votes
2 answers

Is there a Machine Comprehension system integrating several different linguistic constructions?

Given the following linguistics (and some non-linguistic) constructions: entailment, implicature, Strawson entailment, semantic underspecification, discourse representation theory, rhetorical structure theory and description logic. My question is:…
R. S.
  • 33
  • 2
3
votes
1 answer

Vocabulary List From word2vec and GloVe

Is there a way I can access just the vocabulary list of pre-trained vectors for word2vec and GloVe? I do not need the entire n-dimensional embeddings.
Adam_G
  • 576
  • 3
  • 16
3
votes
3 answers

Compare 2 english text corpora

I have 2 English text corpora. One is people talking about topic "A" while other is people talking about topic "B". From a language point of view - the way people express themselves on topic "A" is different from topic "B". I want to understand and…
Anuj Gupta
  • 131
  • 2
3
votes
2 answers

What are the main differences between machine learning and classic approaches to natural language processing problems?

What types of NLP problems are best suited to machine learning and which are best solved by more classical approaches (e.g. syntactic and semantic analysis?)
Sol
  • 191
  • 5
3
votes
2 answers

context-free grammar

How we could create a context-free grammar that generates sentences of arbitrary length like: the cat died. the cat the dog chased died. the cat the dog the rat bit chased died. the cat the dog the rat the elephant admired bit chased died. this is…
liza
  • 71
  • 4
2
votes
0 answers

How to find the characteristics of a bunch of word Clusters?

My Motivations I'm trying to learn German and realized there's a confounding fact with the structure of German: every noun has a gender which seems unrelated to the noun itself in many cases. Unlike languages such as English, each noun has a…
yash
  • 121
  • 2
1
2 3 4