Most Popular
1500 questions
8
votes
3 answers
Have capsule neural networks been applied to NLP problems?
The capsule neural network seems to be a good solution for problems that involve hierarchies. For example, a face is composed of eyes, a nose and ears; a hand is made of fingers, nails, and a palm; and a human is composed of a face and hands.
Many…
Ahti Ahde
8
votes
2 answers
How to design a neural network that gets the author name of a piece of art as input?
I'm building a neural net to predict the value of a piece of art with a wide range of inputs (size, art medium, etc.) and I would like to include the author as an input as well (it is often a huge factor in the value of a single piece of art).
My…
Vince Britz
8
votes
2 answers
Where to publish a first article in Deep Reinforcement Learning?
What would be examples of journals that are good for a first publication in the field of Deep Reinforcement Learning?
I am in the process of writing about the research results of DQN-related algorithms.
I have 3 requirements - it should be indexed…
Evalds Urtans
8
votes
3 answers
What are the differences between machine learning, pattern recognition and data mining?
I know a little about these subjects, and they seem similar to each other. Can anybody explain the differences between them?
user10314
8
votes
2 answers
Can someone help me understand the intuition behind the query, key and value matrices in the transformer architecture?
I have been working mechanically with transformers, hoping that clarity about what the query, key, and value matrices represent would develop over time, but I am still lost. I would greatly benefit from a simplified explanation.
Chinmay
8
votes
2 answers
Are there any algorithms that can measure the concept similarity between texts?
Are there any algorithms (or software libraries) that can be used to detect the similarity of concepts in text, regardless of articulation, grammar, synonyms, etc.?
For example, these phrases:
Outside, it is warm.
Outside, it is hot.
Outside, it is…
omega
8
votes
3 answers
How do I know if my backpropagation is implemented correctly?
I'm working on an implementation of the backpropagation algorithm for a simple neural network, which predicts a probability of survival (1 or 0).
However, I can't get it above 80%, no matter how much I try to set the right hyperparameters. I suspect…
Damian Matkowski
8
votes
2 answers
Is GPT-4 based on GPT-3 or was it trained from scratch?
To me it looks like GPT-4 is based on GPT-3.
On the other hand, there were rumors that GPT-3 was trained with errors, but that retraining it was impossible because of the cost.
Anixx
8
votes
4 answers
OpenAI's ChatGPT gives a network error on long responses. How can I fix it?
When OpenAI's ChatGPT replies with a very long answer, it returns a network error. In the browser's network console, the POST request fails with an ERR_HTTP2_PROTOCOL_ERROR.
The "crash" occurs after about 1 minute.
Lars Flieger
8
votes
1 answer
What is the difference between logic-based and rule-based AI?
I always thought rule-based was synonymous with logic-based AI. Logic has axioms and rules of inference, whereas rule-based AI has a knowledge base (essentially, axioms) and if-then rules to create new knowledge (essentially inference rules).
But in…
samlaf
8
votes
2 answers
How should I handle action selection in the terminal state when implementing SARSA?
I recently started learning about reinforcement learning. Currently, I am trying to implement the SARSA algorithm. However, I do not know how to deal with $Q(s', a')$, when $s'$ is the terminal state. First, there is no action to choose from in this…
Hai Nguyen
8
votes
2 answers
Examples of uses of artificial intelligence at work
I'm going to give a talk, and I'm preparing the material. The purpose of the talk is to convince companies in my region that artificial intelligence can be applied to everyday business problems.
I would like some examples to…
José Henrique Luckmann
8
votes
1 answer
What do symmetric weights mean and how do they make backpropagation biologically implausible?
I was reading a paper on alternatives to backpropagation as a learning algorithm in neural networks. In this paper, the author talks about the disadvantages of backpropagation, and one of the disadvantages stated is that backpropagation requires…
0jas
8
votes
2 answers
Why is perplexity a good evaluation metric for chatbots?
A few papers I have come across say that BLEU is not an appropriate evaluation metric for chatbots, so they use perplexity instead.
First of all, what is perplexity? How is it calculated? And why is perplexity a good evaluation metric for chatbots?
RuiZhang1993
8
votes
1 answer
What are other ways of handling invalid actions in scenarios where all rewards are either 0 (best reward) or negative?
I created an OpenAI Gym environment, and I would like to check how the DQN agent from OpenAI Baselines performs on it.
In my environment, the best possible outcome for the agent is 0 - the robot needs zero unnecessary resources to…
AlexGuevara