Most Popular

1500 questions
8
votes
3 answers

Have capsule neural networks been used to NLP problems?

The capsule neural network seems to be a good solution for problems that involve hierarchies. For example, a face is composed of eyes, a nose and ears; a hand is made of fingers, nails, and a palm; and a human is composed of a face and hands. Many…
Ahti Ahde
  • 278
  • 2
  • 7
8
votes
2 answers

How to design a neural network that gets the author name of a piece of art as input?

I'm building a neural net to predict the value of a piece of art with a wide range of inputs (size, art medium, etc.) and I would like to include the author as an input as well (it is often a huge factor in the value of a single piece of art). My…
8
votes
2 answers

Where to publish a first article in Deep Reinforcement Learning?

What would be examples of journals that are good for a first publication in the field of Deep Reinforcement Learning? I am in the process of writing about the research results of DQN-related algorithms. I have 3 requirements - it should be indexed…
Evalds Urtans
  • 377
  • 3
  • 9
8
votes
3 answers

What are the differences between machine learning, pattern recognition and data mining?

I know a little about these subjects. I found them similar to each other. Can anybody explain the differences between them?
user10314
  • 91
  • 1
  • 2
8
votes
2 answers

Can someone help me understand the intuition behind the query, key and value matrices in the transformer architecture?

I have been working mechanically with transformers, hoping that with time clarity about what the query, key, and value matrices represent will develop; but I am still lost. Would greatly benefit from a simplified explanation.
Chinmay
  • 511
  • 13
8
votes
2 answers

Are there any algorithms that can measure the concept similarity between texts?

Are there any algorithms (or software libraries) that can be used to detect the similarity of concepts in text, regardless of articulation, grammar, synonyms, etc.? For example, these phrases: Outside, it is warm. Outside, it is hot. Outside, it is…
omega
  • 189
  • 1
8
votes
3 answers

How do I know if my backpropagation is implemented correctly?

I'm working on an implementation of the backpropagation algorithm for a simple neural network, which predicts a probability of survival (1 or 0). However, I can't get it above 80%, no matter how much I try to set the right hyperparameters. I suspect…
8
votes
2 answers

Is GPT-4 based on GPT-3 or was it trained from the scratch?

To me it looks like GPT-4 is based on GPT-3. On the other hand, there were rumors that training of GPT-3 was done with errors, but re-train was impossible due to the costs.
Anixx
  • 321
  • 1
  • 11
8
votes
4 answers

OpenAI ChatGPT gives a network error at long responds. How can I fix it?

When OpenAI's ChatGPT replies with a very long answer, it will return a network error. When you check the network console, the POST request will fail with a ERR_HTTP2_PROTOCOL_ERROR: The time of the "crash" is around 1 minute.
Lars Flieger
  • 197
  • 1
  • 1
  • 5
8
votes
1 answer

What is the difference between logic-based and rule-based AI?

I always thought rule-based was synonymous with logic-based AI. Logic has axioms and rules of inference, whereas rule-based AI has a knowledge base (essentially, axioms) and if-then rules to create new knowledge (essentially inference rules). But in…
samlaf
  • 221
  • 2
  • 6
8
votes
2 answers

How should I handle action selection in the terminal state when implementing SARSA?

I recently started learning about reinforcement learning. Currently, I am trying to implement the SARSA algorithm. However, I do not know how to deal with $Q(s', a')$, when $s'$ is the terminal state. First, there is no action to choose from in this…
Hai Nguyen
  • 562
  • 4
  • 14
8
votes
2 answers

Examples of uses of artificial intelligence at work

I'm going to give a talk, and I'm preparing the material. The purpose of the conversation is to convince companies in my region that it is possible to apply artificial intelligence in solving everyday business problems. I would like some examples to…
8
votes
1 answer

What do symmetric weights mean and how does it make backpropagation biologically implausible?

I was reading a paper on alternatives to backpropagation as a learning algorithm in neural networks. In this paper, the author talks about the disadvantages of backpropagation, and one of the disadvantages stated is that backpropagation requires…
0jas
  • 83
  • 4
8
votes
2 answers

Why is the perplexity a good evaluation metric for chatbots?

A few papers I have come across say that BLEU is not an appropriate evaluation metric for chatbots, so they use the perplexity. First of all, what is perplexity? How to calculate it? And why is perplexity a good evaluation metric for chatbots?
8
votes
1 answer

What are other ways of handling invalid actions in scenarios where all rewards are either 0 (best reward) or negative?

I created an OpenAI Gym environment, and I would like to check the performance of the agent from OpenAI Baselines DQN approach on it. In my environment, the best possible outcome for the agent is 0 - the robot needs zero non-necessary resources to…
AlexGuevara
  • 263
  • 1
  • 8