Highest Voted Questions - Artificial Intelligence Stack Exchange

8

votes

1 answer

Why are documents kept separated when training a text classifier?

Most of the literature treats text classification as the classification of documents. When using bag-of-words and Bayesian classification, authors usually use the TF-IDF statistic, where TF normalizes the word count by the number of words per…

asked Jul 24 '18 at 23:03

freesoul

246
1
5

8

votes

1 answer

What is the relationship between these two taxonomies for machine learning with neural networks?

Could you please let me know which of the following classifications of neural network learning algorithms is correct? The first one classifies them into supervised, unsupervised and reinforcement learning. However, the second one provides a…

asked Jul 24 '18 at 16:31

ebrahimi

195
5

8

votes

1 answer

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders?

Does it make sense to use batch normalization in deep (stacked) or sparse auto-encoders? I cannot find any resources for that. Is it safe to assume that, since it works for other DNNs, it will also make sense to use it and will offer benefits on…

asked Jul 23 '18 at 09:39

Glrs

231
3
8

8

votes

3 answers

How to model inhibitory synapses in the artificial neuron?

In the brain, some synapses are stimulating and some inhibiting. In the case of artificial neural networks, ReLU erases that property, since in the brain inhibition doesn't correspond to a 0 output, but, more precisely, to a negative input. In the…

asked Jul 20 '18 at 08:54

Ziemo

223
1
7

8

votes

2 answers

5 years later, are maxout networks dead, and why?

Maxout networks were a simple yet brilliant idea of Goodfellow et al. from 2013 to max feature maps to get a universal approximator of convex activations. The design was tailored for use in conjunction with dropout (then recently introduced) and…

asked Jul 10 '18 at 21:35

user209974

191
1
1

8

votes

2 answers

Can LSTM neural networks be sped up by a GPU?

I am training LSTM neural networks with Keras on a small mobile GPU. Training is slower on the GPU than on the CPU. I found some articles that say that it is hard to train LSTMs (and, in general, RNNs) on GPUs because the training cannot be…

asked Jul 09 '18 at 04:55

Dieshe

289
1
2
6

8

votes

5 answers

Is the smartest robot more clever than the stupidest human?

Most humans are not good at chess. They can't write symphonies. They don't read novels. They aren't good athletes. They aren't good at logical reasoning. Most of us just get up. Go to work in a factory or farm or something. Follow simple…

asked Jul 04 '18 at 19:52

zooby

2,206
1
13
22

8

votes

3 answers

What are the state-of-the-art approaches for detecting the most important "visual attention" area of an image?

I'm trying to detect the visual attention area in a given image and crop the image to that area. For instance, given an image of any size and a rectangle of, say, $L \times W$ dimensions as input, I would like to crop the image to the most…

asked Jun 15 '18 at 14:32

Mary

973
6
13

8

votes

2 answers

What are the real world uses for SAT solvers?

Why would somebody use SAT solvers (for the Boolean satisfiability problem) to solve real-world problems? Are there any examples of real uses of this model?

asked Aug 02 '16 at 16:31

kenorb

10,483
3
44
94

8

votes

2 answers

What is experience replay in layman's terms?

I've been reading Google's DeepMind Atari paper and I'm trying to understand the concept of "experience replay". Experience replay comes up in a lot of other reinforcement learning papers (particularly, the AlphaGo paper), so I want to understand…

asked May 30 '18 at 19:09

user491626

241
1
4

8

votes

2 answers

Why does a one-hidden-layer network become more robust to poor initialization as the number of hidden neurons grows?

In a nutshell: I want to understand why a one-hidden-layer neural network converges to a good minimum more reliably when a larger number of hidden neurons is used. Below is a more detailed explanation of my experiment: I am working on a simple 2D…

asked Apr 05 '18 at 08:59

Chrigi

181
5

8

votes

1 answer

What is an intuitive explanation of how Google's AutoML works?

I recently read that Google has developed a new AI that anyone can upload data to and it will instantly generate models, e.g. an image recognition model based on that data. Can someone explain to me in a detailed and intuitive manner how this AI…

asked Jan 29 '18 at 06:50

Seth Simba

1,186
1
11
29

8

votes

3 answers

How can three same-size CNN layers in a different ordering produce a different receptive field on the input layer?

Below is a quote from CS231n: Prefer a stack of small filter CONV to one large receptive field CONV layer. Suppose that you stack three 3x3 CONV layers on top of each other (with non-linearities in between, of course). In this arrangement, each…

asked Jan 23 '18 at 18:45

Inkplay_

421
4
8
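
The CS231n quote in the question above concerns how the receptive field grows when small filters are stacked. As a rough check of that arithmetic, here is a minimal sketch; the function name `receptive_field` and the `(kernel_size, stride)` layer description are assumptions made for illustration, not something taken from the question or the course notes. It applies the usual recurrence in which each layer widens the field by `(kernel_size - 1)` times the current spacing between neighbouring units.

```python
from typing import Iterable, Tuple


def receptive_field(layers: Iterable[Tuple[int, int]]) -> int:
    """Return the receptive-field width (in input pixels) of one output unit.

    `layers` is a sequence of hypothetical (kernel_size, stride) pairs,
    ordered from the input towards the output. Dilation is not modelled.
    """
    field = 1  # a single input pixel sees only itself
    jump = 1   # spacing, in input pixels, between neighbouring units
    for kernel_size, stride in layers:
        field += (kernel_size - 1) * jump  # widen by the kernel's extra reach
        jump *= stride                     # strides spread units further apart
    return field


three_small = receptive_field([(3, 1), (3, 1), (3, 1)])
one_large = receptive_field([(7, 1)])
print(three_small, one_large)  # 7 7
```

Under these assumptions, three stacked 3x3 layers with stride 1 cover the same 7x7 input window as a single 7x7 layer, while passing through more non-linearities.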

8

votes

2 answers

Using a neural network to recognise patterns in matrices

I am trying to develop a neural network which can identify design features in CAD models (e.g. slots, bosses, holes, pockets, steps). The input data I intend to use for the network is an n x n matrix (where n is the number of faces in the CAD model).…

asked Jan 22 '18 at 21:40

Darren Taggart

169
1
4

8

votes

6 answers

Is there a limit to the increase of intelligence?

Some argue that humans are somewhere along the middle of the intelligence spectrum, some say that we are only at the very beginning of the spectrum and there's so much more potential ahead. Is there a limit to the increase of intelligence? Could it…

asked Jan 13 '18 at 15:02

Pre-alpha

91
4
