Highest Voted Questions - Artificial Intelligence Stack Exchange

10

votes

1 answer

Can Q-learning be used in a POMDP?

Can Q-learning (and SARSA) be directly used in a Partially Observable Markov Decision Process (POMDP)? If not, why not? My intuition is that the policies learned will be terrible because of partial observability. Are there ways to transform these…

asked Apr 03 '19 at 02:40

drerD

298
2
6

10

votes

1 answer

Are decision tree learning algorithms deterministic?

Are decision tree learning algorithms deterministic? Given a fixed dataset, do they always produce a tree with the same structure? What about the random forest?

asked Apr 01 '19 at 06:29

mallea

203
2
5

10

votes

2 answers

What's the difference between hyperbolic tangent and sigmoid neurons?

Two common activation functions used in deep learning are the hyperbolic tangent function and the sigmoid activation function. I understand that the hyperbolic tangent is just a rescaling and translation of the sigmoid function: $\tanh(z) =…
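
For readers skimming this excerpt, the rescaling it alludes to can be checked numerically. The sketch below uses the standard identity $\tanh(z) = 2\sigma(2z) - 1$, which is a textbook fact rather than a reconstruction of the truncated formula in the question. The function names `sigmoid` and `tanh_via_sigmoid` are illustrative only.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic sigmoid, 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))


def tanh_via_sigmoid(z: float) -> float:
    """tanh written as a scaled and shifted sigmoid: 2 * sigmoid(2z) - 1."""
    return 2.0 * sigmoid(2.0 * z) - 1.0


# Compare against the library tanh on a few sample points.
for z in (-3.0, -0.5, 0.0, 0.5, 3.0):
    assert math.isclose(math.tanh(z), tanh_via_sigmoid(z), abs_tol=1e-12)
```

The two functions therefore have the same shape; the practical difference is the output range, (0, 1) for the sigmoid versus (-1, 1) for tanh.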

asked Aug 02 '16 at 19:02

bpachev

410
4
11

10

votes

4 answers

How is it that AI can become biased, and what are the proposals to mitigate this?

This is not meant to be negative or a joke; rather, I am looking for a productive solution concerning AI development, engineering, and its impact on human life. Lately, in my Google searches, the AI model keeps auto-filling the ending of my searches…

asked Feb 15 '19 at 19:30

P.S.

203
1
6

10

votes

3 answers

How to use CNN for making predictions on non-image data?

I have a dataset which I have loaded as a data frame in Python. It consists of 21392 rows (the data instances; each row is one sample) and 1972 columns (the features). The last column, i.e. column 1972, has string-type labels (14 different categories…

asked Feb 07 '19 at 22:33

JChat

235
1
2
8

10

votes

3 answers

Should I choose a model with the smallest loss or highest accuracy?

I have two machine learning models (both LSTMs) that give different results on the validation set (~100 samples). Model A: accuracy ~91%, loss ~0.01. Model B: accuracy ~83%, loss ~0.003. The size and the speed of both models are almost the…
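
A small illustration of how the two metrics can disagree, assuming the loss is binary cross-entropy (the question does not say which loss was used). The probabilities below are made up for the example and are not the asker's data: the hypothetical model A is always right but barely confident, while the hypothetical model B is very confident on most samples and wrong on one.

```python
import math


def accuracy(labels, probs, threshold=0.5):
    """Fraction of samples whose thresholded prediction matches the label."""
    hits = sum((p >= threshold) == bool(y) for y, p in zip(labels, probs))
    return hits / len(labels)


def cross_entropy(labels, probs):
    """Mean binary cross-entropy; probs are predicted P(label == 1)."""
    total = 0.0
    for y, p in zip(labels, probs):
        total -= math.log(p) if y == 1 else math.log(1.0 - p)
    return total / len(labels)


labels = [1, 1, 1, 1]
probs_a = [0.55, 0.55, 0.55, 0.55]  # hypothetical: all correct, low confidence
probs_b = [0.99, 0.99, 0.99, 0.45]  # hypothetical: one miss, high confidence

acc_a, loss_a = accuracy(labels, probs_a), cross_entropy(labels, probs_a)
acc_b, loss_b = accuracy(labels, probs_b), cross_entropy(labels, probs_b)
print(f"A: accuracy={acc_a:.2f} loss={loss_a:.3f}")
print(f"B: accuracy={acc_b:.2f} loss={loss_b:.3f}")
```

Here A has the higher accuracy and B has the lower loss, the same pattern as in the question: accuracy only looks at which side of the threshold a prediction falls on, whereas the loss also rewards confidence.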

asked Feb 07 '19 at 06:32

malioboro

2,819
3
21
47

10

votes

2 answers

Can artificial intelligence be thought of as optimization?

In this video an expert says, "One way of thinking about what intelligence is [specifically with regard to artificial intelligence], is as an optimization process." Can intelligence always be thought of as an optimization process, and can artificial…

asked Aug 02 '16 at 17:56

dynrepsys

1,363
11
22

10

votes

3 answers

Is the expert system still in use today?

In my country, the Expert System class is mandatory, if you want to take the AI specialization in most universities. In class, I learned how to make a rule-based system, forward chaining, backward chaining, Prolog, etc. However, I have read…

asked Feb 02 '19 at 23:24

malioboro

2,819
3
21
47

9

votes

1 answer

What happens when I mix activation functions?

There are several activation functions, such as ReLU, sigmoid or $\tanh$. What happens when I mix activation functions? I recently found that Google has developed the Swish activation function, which is $x \cdot \text{sigmoid}(x)$. By altering activation function can it…
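
As a rough sketch of what "mixing" can look like in practice, the snippet below defines Swish as $x \cdot \text{sigmoid}(x)$ and pushes one scalar input through three layers that each use a different activation. The `forward` helper and the weights and biases are hypothetical, chosen only for illustration; real networks use vectors and learned parameters.

```python
import math


def sigmoid(x: float) -> float:
    """Numerically stable logistic sigmoid."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def swish(x: float) -> float:
    """Swish activation: x * sigmoid(x)."""
    return x * sigmoid(x)


def relu(x: float) -> float:
    """Rectified linear unit: max(0, x)."""
    return max(0.0, x)


def forward(x: float, layers) -> float:
    """Apply scalar layers in order; each layer is (weight, bias, activation)."""
    for weight, bias, activation in layers:
        x = activation(weight * x + bias)
    return x


# Hypothetical parameters: a different activation in each layer.
layers = [(0.5, 0.1, relu), (-1.2, 0.0, swish), (2.0, -0.3, math.tanh)]
output = forward(1.0, layers)
print(output)
```

Nothing in the forward pass requires every layer to use the same activation; whether a particular mix helps is an empirical question.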

asked Jan 04 '19 at 13:39

JSChang

93
1
6

9

votes

2 answers

Can someone help me to understand the alpha-beta pruning algorithm?

I understand the minimax algorithm, but I am unable to fully understand minimax with alpha-beta pruning, even after looking up several sources on the web and trying to read the algorithm and understand how it works. Do…

asked Jan 03 '19 at 14:20

Sunshine

91
1
2

9

votes

2 answers

What is the concept of channels in CNNs?

I am trying to understand what channels mean in convolutional neural networks. When working with grayscale and colored images, I understand that the number of channels is set to 1 and 3 (in the first conv layer), respectively, where 3 corresponds to…

asked Dec 30 '18 at 06:36

anonuser01

321
1
3
13

9

votes

3 answers

Why does the "reward to go" trick in policy gradient methods work?

In the policy gradient method, there's a trick to reduce the variance of the policy gradient. We use causality, and remove part of the sum over rewards so that only the rewards received after an action are taken into account (See here…
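
To make the trick concrete, here is a small sketch that computes the reward-to-go for each time step of one episode, i.e. the sum of rewards collected from that step onward. The function name `rewards_to_go` and the optional discount factor `gamma` are assumptions for this example, not something taken from the question.

```python
def rewards_to_go(rewards, gamma=1.0):
    """For each step t, return sum over k >= t of gamma**(k - t) * rewards[k]."""
    result = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the total already computed for t + 1.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        result[t] = running
    return result


print(rewards_to_go([1.0, 2.0, 3.0]))  # [6.0, 5.0, 3.0]
```

In the gradient estimate, the log-probability term for the action at step t is then weighted by `rewards_to_go(rewards)[t]` instead of the full episode return, so rewards that came before the action no longer contribute.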

asked Dec 20 '18 at 01:00

Konstantin Solomatov

298
2
10

9

votes

1 answer

Is it possible to clean up an audio recording of a lecture using some type of AI system?

Is it possible to clean up an audio recording of a lecture from a smartphone (i.e. remove the background noise) using some type of AI system?

asked Dec 12 '18 at 15:15

Thibault Molleman

99
1
1
3

9

votes

4 answers

Why is the variational autoencoder's output blurred, while a GAN's output is crisp and has sharp edges?

I observed in several papers that the variational autoencoder's output is blurred, while a GAN's output is crisp and has sharp edges. Can someone please give some intuition for why that is the case? I have thought about it a lot but couldn't find an explanation.

asked Nov 09 '18 at 18:08

Trect

269
1
4
7

9

votes

2 answers

What is the difference between goal-based and utility-based agents?

What is the difference between goal-based and utility-based agents? Please provide a real-world example.

asked Oct 28 '18 at 13:30

RashkRizwan

135
1
2
8
