Highest Voted Questions - Artificial Intelligence Stack Exchange

8

votes

3 answers

Have capsule neural networks been used to NLP problems?

The capsule neural network seems to be a good solution for problems that involve hierarchies. For example, a face is composed of eyes, a nose and ears; a hand is made of fingers, nails, and a palm; and a human is composed of a face and hands. Many…

asked Dec 16 '17 at 18:51

Ahti Ahde

278
2
7

8

votes

2 answers

How to design a neural network that gets the author name of a piece of art as input?

I'm building a neural net to predict the value of a piece of art with a wide range of inputs (size, art medium, etc.) and I would like to include the author as an input as well (it is often a huge factor in the value of a single piece of art). My…

asked Nov 24 '17 at 13:56

Vince Britz

81
1

8

votes

2 answers

Where to publish a first article in Deep Reinforcement Learning?

What would be examples of journals that are good for a first publication in the field of Deep Reinforcement Learning? I am in the process of writing about the research results of DQN-related algorithms. I have 3 requirements - it should be indexed…

asked Nov 07 '17 at 09:02

Evalds Urtans

377
3
9

8

votes

3 answers

What are the differences between machine learning, pattern recognition and data mining?

I know a little about these subjects. I found them similar to each other. Can anybody explain the differences between them?

asked Oct 22 '17 at 14:34

user10314

91
1
2

8

votes

2 answers

Can someone help me understand the intuition behind the query, key and value matrices in the transformer architecture?

I have been working mechanically with transformers, hoping that with time clarity about what the query, key, and value matrices represent will develop; but I am still lost. Would greatly benefit from a simplified explanation.

asked Jul 24 '23 at 00:08

Chinmay

511
13

8

votes

2 answers

Are there any algorithms that can measure the concept similarity between texts?

Are there any algorithms (or software libraries) that can be used to detect the similarity of concepts in text, regardless of articulation, grammar, synonyms, etc.? For example, these phrases: Outside, it is warm. Outside, it is hot. Outside, it is…

asked Sep 26 '17 at 01:06

omega

189
1

8

votes

3 answers

How do I know if my backpropagation is implemented correctly?

I'm working on an implementation of the backpropagation algorithm for a simple neural network, which predicts a probability of survival (1 or 0). However, I can't get it above 80%, no matter how much I try to set the right hyperparameters. I suspect…

asked Sep 03 '17 at 12:22

Damian Matkowski

83
4

8

votes

2 answers

Is GPT-4 based on GPT-3 or was it trained from the scratch?

To me it looks like GPT-4 is based on GPT-3. On the other hand, there were rumors that training of GPT-3 was done with errors, but re-train was impossible due to the costs.

asked Mar 16 '23 at 17:44

Anixx

321
1
11

8

votes

4 answers

OpenAI ChatGPT gives a network error at long responds. How can I fix it?

When OpenAI's ChatGPT replies with a very long answer, it will return a network error. When you check the network console, the POST request will fail with a ERR_HTTP2_PROTOCOL_ERROR: The time of the "crash" is around 1 minute.

asked Dec 11 '22 at 22:20

Lars Flieger

197
1
1
5

8

votes

1 answer

What is the difference between logic-based and rule-based AI?

I always thought rule-based was synonymous with logic-based AI. Logic has axioms and rules of inference, whereas rule-based AI has a knowledge base (essentially, axioms) and if-then rules to create new knowledge (essentially inference rules). But in…

asked Aug 09 '17 at 01:39

samlaf

221
2
6

8

votes

2 answers

How should I handle action selection in the terminal state when implementing SARSA?

I recently started learning about reinforcement learning. Currently, I am trying to implement the SARSA algorithm. However, I do not know how to deal with $Q(s', a')$, when $s'$ is the terminal state. First, there is no action to choose from in this…

asked Aug 04 '17 at 13:45

Hai Nguyen

562
4
14

8

votes

2 answers

Examples of uses of artificial intelligence at work

I'm going to give a talk, and I'm preparing the material. The purpose of the conversation is to convince companies in my region that it is possible to apply artificial intelligence in solving everyday business problems. I would like some examples to…

applications

asked Jul 21 '17 at 20:36

José Henrique Luckmann

207
1
6

8

votes

1 answer

What do symmetric weights mean and how does it make backpropagation biologically implausible?

I was reading a paper on alternatives to backpropagation as a learning algorithm in neural networks. In this paper, the author talks about the disadvantages of backpropagation, and one of the disadvantages stated is that backpropagation requires…

asked May 08 '22 at 12:53

0jas

83
4

8

votes

2 answers

Why is the perplexity a good evaluation metric for chatbots?

A few papers I have come across say that BLEU is not an appropriate evaluation metric for chatbots, so they use the perplexity. First of all, what is perplexity? How to calculate it? And why is perplexity a good evaluation metric for chatbots?

asked Jun 01 '17 at 02:50

RuiZhang1993

89
2

8

votes

1 answer

What are other ways of handling invalid actions in scenarios where all rewards are either 0 (best reward) or negative?

I created an OpenAI Gym environment, and I would like to check the performance of the agent from OpenAI Baselines DQN approach on it. In my environment, the best possible outcome for the agent is 0 - the robot needs zero non-necessary resources to…

asked May 29 '17 at 09:02

AlexGuevara

263
1
8

Most Popular