Most Popular

1500 questions
6
votes
3 answers

Is random initialization of the weights the only choice to break the symmetry?

My knowledge Suppose you have a layer that is fully connected, and that each neuron performs an operation like a = g(w^T * x + b) were a is the output of the neuron, x the input, g our generic activation function, and finally w and b our…
gvgramazio
  • 706
  • 2
  • 8
  • 19
6
votes
2 answers

Why are GRU and LSTM better than standard RNNs?

It seems that older RNNs have a limitation for their use cases and have been outperformed by other recurrent architectures, such as the LSTM and GRU.
6
votes
1 answer

Issue with simple game AI

A few months ago I made a simple game that is similar to the dinosaur game in Google Chrome - you jump over obstacles, or don't jump over levitating obstacles, and jump to collect bitcoins, which can be placed at 5 different heights. I used a very…
6
votes
2 answers

Why does self-playing tic-tac-toe not become perfect?

I trained a DQN that learns tic-tac-toe by playing against itself with a reward of -1/0/+1 for a loss/draw/win. Every 500 episodes, I test the progress by letting it play some episodes (also 500) against a random player. As shown in the picture…
user3877351
  • 91
  • 1
  • 5
6
votes
0 answers

Speciation in NEAT - Advantages of keeping stable number of species

I found several methods for setting the compatibility distance in NEAT: some normalize it, some don't, some automatically adjust it. In a few tests I am running, using normalized static compatibility distance, the number of species increase very…
kuma
  • 341
  • 1
  • 10
6
votes
3 answers

How to use computer vision to find corners of a soccer field based on location coordinates?

I want to use computer vision to allow my robot to detect the corners of a soccer field based on its current position. Matlab has a detectHarrisFeatures feature, but I believe it is only for 2D mapping. The approach that I want to try is to collect…
aa1
  • 163
  • 3
6
votes
2 answers

What is the current research in artificial intelligence in the field of data compression?

What is the current research in artificial intelligence and machine learning in the field of data compression? I have done my research on the PAQ series of compressors, some of which use neural networks for context mixing.
user15736
  • 71
  • 2
6
votes
2 answers

What is the difference between on and off-policy deterministic actor-critic?

In the paper Deterministic Policy Gradient Algorithms, I am really confused about chapter 4.1 and 4.2 which is "On and off-policy Deterministic Actor-Critic". I don't know what's the difference between two algorithms. I only noticed that the…
fish_tree
  • 247
  • 1
  • 6
6
votes
1 answer

What if the more fit parent has fewer nodes compared to the other, will the disjoint and excess genes be discarded?

In the paper Efficient Evolution of Neural Network Topologies (2002), the authors say Genes that do not match are inherited from the more fit parent What if the more fit parent has fewer nodes compared to the other, will the disjoint/excess genes…
Neil Nahid
  • 91
  • 3
6
votes
1 answer

When to expand and when to simulate in Monte Carlo Tree Search?

In Monte Carlo Tree Search (MCTS), we start at root node $R$. Then we select some leaf node $L$. And we expand $L$ by one or more child nodes and simulate from the child to the end of the game. When should we expand and when should we simulate in…
Soroush
  • 98
  • 1
  • 6
6
votes
1 answer

How do you encode a chess move in a neural network?

In a neural network for chess (or checkers), the output is a piece or square on the board and an end position. How would one encode this? As far as I can see choosing a starting square is 8x8=64 outputs and an ending square is 8x8=64 outputs. So the…
zooby
  • 2,206
  • 1
  • 13
  • 22
6
votes
0 answers

Is a mathematical formula a form of intelligence?

Warning: This question takes us into VALIS territory, but I wouldn't underestimate the profundity of that particular philosopher. There is a non-AI definition of intelligence which is simply "information" (see definition 2.3). If that information…
DukeZhou
  • 6,227
  • 5
  • 25
  • 53
6
votes
3 answers

Has anybody tried unsupervised deep learning from youtube videos?

YouTube has a huge amount of videos, many of which also containing various spoken languages. This would presumably provide something like the data that a "challenged" baby would experience - "challenged" meaning a baby without arms or legs…
Wolphram jonny
  • 284
  • 2
  • 14
6
votes
2 answers

Two data classes for a convolutional neural network, can one have a LOT more images for training than the other?

I have two classes in the training set: one that has images with a feature and the other of images without that feature. Can there be a LOT more images with "no feature" so I can fit in all possible false positives?
Vasya T
  • 69
  • 1
6
votes
2 answers

Are leaf nodes included in the calculation of average branching factor for search trees?

In the search tree below, there are 11 nodes, 5 of which are leaves. There are 10 branches. Is the average branching factor given by 10/6, or 10/11? Are leaves included in the calculation? Intuitively, I would think not, since we are interested in…
Data
  • 161
  • 3