Most Popular
1500 questions
16
votes
8 answers
How to classify data which is spiral in shape?
I have been messing around in tensorflow playground. One of the input data sets is a spiral. No matter what input parameters I choose, no matter how wide and deep the neural network I make, I cannot fit the spiral. How do data scientists fit data of…
Souradeep Nanda
- 263
- 1
- 2
- 7
16
votes
4 answers
What exactly is a hidden state in an LSTM and RNN?
I'm working on a project, where we use an encoder-decoder architecture. We decided to use an LSTM for both the encoder and decoder due to its hidden states. In my specific case, the hidden state of the encoder is passed to the decoder, and this…
user8714896
- 797
- 1
- 6
- 24
16
votes
1 answer
Will parameter sweeping on one split of data followed by cross validation discover the right hyperparameters?
Let's call our dataset splits train/test/evaluate. We're in a situation where we require months of data. So we prefer to use the evaluation dataset as infrequently as possible to avoid polluting our results. Instead, we do 10 fold cross validation…
Philipp Cannons
- 161
- 6
16
votes
2 answers
What is the difference between graph convolution in the spatial vs spectral domain?
I've been reading different papers regarding graph convolution and it seems that they come into two flavors: spatial and spectral. From what I can see the main difference between the two approaches is that for spatial you're directly multiplying the…
razvanc92
- 1,128
- 1
- 9
- 18
16
votes
2 answers
How can I automate the choice of the architecture of a neural network for an arbitrary problem?
Assume that I want to solve an issue with a neural network that either I can't fit to existing architectures (perceptron, Konohen, etc) or I'm simply not aware of the existence of those or I'm unable to understand their mechanics and I rely on my…
Zoltán Schmidt
- 633
- 7
- 14
16
votes
1 answer
How to stay a up-to-date researcher in ML/RL community?
As a student who wants to work on machine learning, I would like to know how it is possible to start my studies and how to follow it to stay up-to-date. For example, I am willing to work on RL and MAB problems, but there are huge literatures on…
Amin
- 481
- 2
- 12
16
votes
3 answers
Is the optimal policy always stochastic if the environment is also stochastic?
Is the optimal policy always stochastic (that is, a map from states to a probability distribution over actions) if the environment is also stochastic?
Intuitively, if the environment is deterministic (that is, if the agent is in a state $s$ and…
nbro
- 40,472
- 12
- 105
- 192
15
votes
1 answer
Why does the policy network in AlphaZero work?
In AlphaZero, the policy network (or head of the network) maps game states to a distribution of the likelihood of taking each action. This distribution covers all possible actions from that state.
How is such a network possible? The possible actions…
chessprogrammer
- 2,890
- 2
- 15
- 26
15
votes
4 answers
What does "stationary" mean in the context of reinforcement learning?
I think I've seen the expressions "stationary data", "stationary dynamics" and "stationary policy", among others, in the context of reinforcement learning. What does it mean? I think stationary policy means that the policy does not depend on time,…
Paula Vega
- 428
- 4
- 8
15
votes
3 answers
Does Monte Carlo tree search qualify as machine learning?
To the best of my understanding, the Monte Carlo tree search (MCTS) algorithm is an alternative to minimax for searching a tree of nodes. It works by choosing a move (generally, the one with the highest chance of being the best), and then performing…
Inertial Ignorance
- 511
- 3
- 14
15
votes
2 answers
What is the difference between artificial intelligence and computational intelligence?
Having analyzed and reviewed a certain amount of articles and questions, apparently, the expression computational intelligence (CI) is not used consistently and it is still unclear the relationship between CI and artificial intelligence…
quintumnia
- 1,183
- 2
- 10
- 34
15
votes
3 answers
What is a "trajectory" in reinforcement learning?
I'm now learning about reinforcement learning, but I just found the word "trajectory" in this answer.
However, I'm not sure what it means. I read a few books on the Reinforcement Learning but none of them mentioned it. Usually these introductionary…
Blaszard
- 1,037
- 3
- 11
- 25
15
votes
4 answers
Can some one help me understand this paragraph from Nvidia's progressive GAN paper?
In the paper Progressive growing of gans for improved quality, stability, and variation (ICLR, 2018) by Nvidia researchers, the authors write
Furthermore, we observe that mode collapses traditionally
plaguing GANs tend to happen very quickly, over…
Inkplay_
- 421
- 4
- 8
15
votes
1 answer
What is the fringe in the context of search algorithms?
What is the fringe in the context of search algorithms?
tahasozgen
- 287
- 1
- 2
- 7
15
votes
1 answer
Who first coined the term Artificial Intelligence?
Who first coined the term Artificial Intelligence? Is there a published research paper that first used that term?
WilliamKF
- 2,513
- 1
- 25
- 31