Most Popular

1500 questions
10
votes
1 answer

Can Q-learning be used in a POMDP?

Can Q-learning (and SARSA) be directly used in a Partially Observable Markov Decision Process (POMDP)? If not, why not? My intuition is that the policies learned will be terrible because of partial observability. Are there ways to transform these…
drerD
10
votes
1 answer

Are decision tree learning algorithms deterministic?

Are decision tree learning algorithms deterministic? Given a fixed dataset, do they always produce a tree with the same structure? What about the random forest?
mallea
10
votes
2 answers

What's the difference between hyperbolic tangent and sigmoid neurons?

Two common activation functions used in deep learning are the hyperbolic tangent function and the sigmoid activation function. I understand that the hyperbolic tangent is just a rescaling and translation of the sigmoid function: $\tanh(z) =…
bpachev
10
votes
4 answers

How is it that AI can become biased, and what are the proposals to mitigate this?

This is not meant to be negative or a joke; rather, I am looking for a productive solution regarding AI development, engineering, and its impact on human life: Lately with my Google searches, the AI model keeps autofilling the ending of my searches…
P.S.
10
votes
3 answers

How to use CNN for making predictions on non-image data?

I have a dataset which I have loaded as a data frame in Python. It consists of 21392 rows (the data instances, each row is one sample) and 1972 columns (the features). The last column i.e. column 1972 has string type labels (14 different categories…
JChat
10
votes
3 answers

Should I choose a model with the smallest loss or highest accuracy?

I have two Machine Learning models (I use LSTM) that have a different result on the validation set (~100 samples data): Model A: Accuracy: ~91%, Loss: ~0.01 Model B: Accuracy: ~83%, Loss: ~0.003 The size and the speed of both models are almost the…
malioboro
10
votes
2 answers

Can artificial intelligence be thought of as optimization?

In this video an expert says, "One way of thinking about what intelligence is [specifically with regard to artificial intelligence], is as an optimization process." Can intelligence always be thought of as an optimization process, and can artificial…
dynrepsys
10
votes
3 answers

Is the expert system still in use today?

In my country, the Expert System class is mandatory, if you want to take the AI specialization in most universities. In class, I learned how to make a rule-based system, forward chaining, backward chaining, Prolog, etc. However, I have read…
malioboro
9
votes
1 answer

What happens when I mix activation functions?

There are several activation functions, such as ReLU, sigmoid or $\tanh$. What happens when I mix activation functions? I recently found that Google has developed the Swish activation function, which is $x \cdot \text{sigmoid}(x)$. By altering the activation function, can it…
JSChang
9
votes
2 answers

Can someone help me to understand the alpha-beta pruning algorithm?

I understand the minimax algorithm, but I have not been able to fully understand minimax with alpha-beta pruning, even after looking up several sources on the web and trying to read through the algorithm to see how it works. Do…
Sunshine
9
votes
2 answers

What is the concept of channels in CNNs?

I am trying to understand what channels mean in convolutional neural networks. When working with grayscale and colored images, I understand that the number of channels is set to 1 and 3 (in the first conv layer), respectively, where 3 corresponds to…
anonuser01
9
votes
3 answers

Why does the "reward to go" trick in policy gradient methods work?

In policy gradient methods, there is a trick to reduce the variance of the policy gradient. We use causality and drop part of the sum over rewards, so that each action is credited only with the rewards that come after it (See here…
9
votes
1 answer

Is it possible to clean up an audio recording of a lecture using some type of AI system?

Is it possible to clean up an audio recording of a lecture from a smartphone (i.e. remove the background noise) using some type of AI system?
9
votes
4 answers

Why is the variational autoencoder's output blurred, while a GAN's output is crisp and has sharp edges?

I observed in several papers that the variational autoencoder's output is blurred, while a GAN's output is crisp and has sharp edges. Can someone please give some intuition for why that is the case? I have thought about it a lot but couldn't find an explanation.
Trect
9
votes
2 answers

What is the difference between goal-based and utility-based agents?

What is the difference between goal-based and utility-based agents? Please, provide a real-world example.
RashkRizwan