Most Popular

1500 questions
8
votes
1 answer

Why are documents kept separated when training a text classifier?

Most of the literature considers text classification as the classification of documents. When using the bag-of-words and Bayesian classification, they usually use the statistic TF-IDF, where TF normalizes the word count with the number of words per…
freesoul
  • 246
  • 1
  • 5
8
votes
1 answer

What is the relationship between these two taxonomies for machine learning with neural networks?

Could you please let me know which of the following classification of Neural Network's learning algorithm is correct? The first one classifies it into: supervised, unsupervised and reinforcement learning. However, the second one provides a…
ebrahimi
  • 195
  • 5
8
votes
1 answer

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders? I cannot find any resources for that. Is it safe to assume that, since it works for other DNNs, it will also make sense to use it and will offer benefits on…
Glrs
  • 231
  • 3
  • 8
8
votes
3 answers

How to model inhibitory synapses in the artificial neuron?

In the brain, some synapses are stimulating and some inhibiting. In the case of artificial neural networks, ReLU erases that property, since in the brain inhibition doesn't correspond to a 0 output, but, more precisely, to a negative input. In the…
Ziemo
  • 223
  • 1
  • 7
8
votes
2 answers

5 years later, are maxout networks dead, and why?

Maxout networks were a simple yet brilliant idea of Goodfellow et al. from 2013 to max feature maps to get a universal approximator of convex activations. The design was tailored for use in conjunction with dropout (then recently introduced) and…
user209974
  • 191
  • 1
  • 1
8
votes
2 answers

Can LSTM neural networks be sped up by a GPU?

I am training LSTM neural networks with Keras on a small mobile GPU. The speed on the GPU is slower than on the CPU. I found some articles that say that it is hard to train LSTMs (and, in general, RNNs) on GPUs because the training cannot be…
Dieshe
  • 289
  • 1
  • 2
  • 6
8
votes
5 answers

Is the smartest robot more clever than the stupidest human?

Most humans are not good at chess. They can't write symphonies. They don't read novels. They aren't good athletes. They aren't good at logical reasoning. Most of us just get up. Go to work in a factory or farm or something. Follow simple…
zooby
  • 2,206
  • 1
  • 13
  • 22
8
votes
3 answers

What are the state-of-the-art approaches for detecting the most important "visual attention" area of an image?

I'm trying to detect the visual attention area in a given image and crop the image into that area. For instance, given an image of any size and a rectangle of say $L \times W$ dimension as an input, I would like to crop the image to the most…
Mary
  • 973
  • 6
  • 13
8
votes
2 answers

What are the real world uses for SAT solvers?

Why somebody would use SAT solvers (Boolean satisfiability problem) to solve their real world problems? Are there any examples of the real uses of this model?
kenorb
  • 10,483
  • 3
  • 44
  • 94
8
votes
2 answers

What is experience replay in laymen's terms?

I've been reading Google's DeepMind Atari paper and I'm trying to understand the concept of "experience replay". Experience replay comes up in a lot of other reinforcement learning papers (particularly, the AlphaGo paper), so I want to understand…
user491626
  • 241
  • 1
  • 4
8
votes
2 answers

Why does a one-layer hidden network get more robust to poor initialization with growing number of hidden neurons?

In a nutshell: I want to understand why a one hidden layer neural network converges to a good minimum more reliably when a larger number of hidden neurons is used. Below a more detailed explanation of my experiment: I am working on a simple 2D…
Chrigi
  • 181
  • 5
8
votes
1 answer

What is an intuitive explanation of how Google's AutoML works?

I recently read that Google has developed a new AI that anyone can upload data to and it will instantly generate models, i.e. an image recognition model based on that data. Can someone explain to me in a detailed and intuitive manner how this AI…
Seth Simba
  • 1,186
  • 1
  • 11
  • 29
8
votes
3 answers

How can 3 same size CNN layers in different ordering output different receptive field from the input layer?

Below is a quote from CS231n: Prefer a stack of small filter CONV to one large receptive field CONV layer. Suppose that you stack three 3x3 CONV layers on top of each other (with non-linearities in between, of course). In this arrangement, each…
Inkplay_
  • 421
  • 4
  • 8
8
votes
2 answers

Using neural network to recognise patterns in matrices

I am trying to develop a neural network which can identify design features in CAD models (i.e. slots, bosses, holes, pockets, steps). The input data I intend to use for the network is a n x n matrix (where n is the number of faces in the CAD model).…
Darren Taggart
  • 169
  • 1
  • 4
8
votes
6 answers

Is there a limit to the increase of intelligence?

Some argue that humans are somewhere along the middle of the intelligence spectrum, some say that we are only at the very beginning of the spectrum and there's so much more potential ahead. Is there a limit to the increase of intelligence? Could it…
Pre-alpha
  • 91
  • 4