Most Popular
1500 questions
6
votes
1 answer
How does the uniform-cost search algorithm work?
What is the uniform-cost search (UCS) algorithm? How does it work? I would appreciate seeing a graphical execution of the algorithm. How does the frontier evolve in the case of UCS?
Iram Shah
- 315
- 2
- 5
- 14
6
votes
2 answers
Are there any pretrained models for human recognition from all angles?
I need to be able to detect and track humans from all angles, especially above.
There are, obviously, quite a few well-studied models for human detection and tracking, usually as part of general-purpose object detection, but I haven't been able to…
T3db0t
- 161
- 1
- 4
6
votes
5 answers
How do compute the table for $p(s',r|s,a)$ (exercise 3.5 in Sutton & Barto's book)?
I am trying to study the book Reinforcement Learning: An Introduction (Sutton & Barto, 2018). In chapter 3.1 the authors state the following exercise
Exercise 3.5 Give a table analogous to that in Example 3.3, but for $p(s',r|s,a)$. It should have…
MrYouMath
- 255
- 1
- 7
6
votes
2 answers
When is the Markov decision process not adequate for goal-directed learning tasks?
In the book Reinforcement Learning: An Introduction (Sutton and Barto, 2018). The authors ask
Exercise 3.2: Is the MDP framework adequate to usefully represent all goal-directed learning tasks? Can you think of any clear
exceptions?
I thought…
MrYouMath
- 255
- 1
- 7
6
votes
1 answer
Does it make sense to apply softmax on top of relu?
While working through some example from Github I've found this network (it's for FashionMNIST but it doesn't really matter).
Pytorch forward method (my query in upper case comments with regards to applying Softmax on top of Relu?):
def…
Jed
- 61
- 2
6
votes
0 answers
What is meant by "model discriminability for local patches within the receptive field"?
In the abstract of the paper Network In Network, the authors write
We propose a novel deep network structure called "Network In Network"(NIN) to enhance model discriminability for local patches within the receptive field
What does the part in bold…
harsh kumar Chourasia
- 161
- 3
6
votes
4 answers
Is it suitable to find inverse of last layer's activation function and apply it on the target output?
I have a neural network with the following structure:
I am expecting specific outputs from the neural network which are the target values for my training. Let's say the target values are 0.8 for the upper output node and -0.3 for the lower output…
Vikhyat Agarwal
- 165
- 1
- 8
6
votes
1 answer
Should I be decaying the learning rate and the exploration rate in the same manner?
Should I be decaying the learning rate and the exploration rate in the same manner? What's too slow and too fast of an exploration and learning rate decay? Or is it specific from model to model?
rtz
- 91
- 6
6
votes
1 answer
Which nodes are expanded in the expansion phase of MCTS?
I'm confused regarding a specific detail of MCTS.
To illustrate my question, let's take the simple example of tic-tac-toe.
After the selection phase, when a leaf node is reached, the tree is expanded in the so-called expansion phase. Let's say a…
chessprogrammer
- 2,890
- 2
- 15
- 26
6
votes
1 answer
What does "hard for AI" look like?
In theoretical computer science, there is a massive categorization of the difficulty of various computational problems in terms of their asymptotic worst-time computational complexity. There doesn't seem to be any analogous analysis of what problems…
Stella Biderman
- 301
- 1
- 13
6
votes
2 answers
How to model categorical variables / enums?
I am new to the field and I am trying to understand how is possible to use categorical variables / enums?
Lets say we have a data set and 2 of its features are home_team and away_team, the possible values of these 2 features are all the NBA…
Avraam Mavridis
- 161
- 3
6
votes
2 answers
Can one use an Artificial Neural Network to determine the size of an object in a photograph?
My question relates to but doesn't duplicate a question that has been asked here.
I've Googled a lot for an answer to the question: Can you find the dimensions of an object in a photo if you don't know the distance between the lens and the object,…
ding
- 161
- 1
- 3
6
votes
2 answers
What is the definition of rationality?
I'm having a little trouble with the definition of rationality, which goes something like:
An agent is rational if it maximizes its performance measure given its current knowledge.
I've read that a simple reflex agent will not act rationally in a…
Mr. Eivind
- 578
- 4
- 27
6
votes
2 answers
How can I create an artificially intelligent aimbot for a game like CS:GO?
How can I create an artificially intelligent aimbot for a game like Counter-Strike Global Offensive (CS:GO)?
I have an initial solution (or approach) in mind. We can train an image recognition model that will recognize the head of the enemy (in the…
Mrunal Sonawane
- 63
- 1
- 4
6
votes
2 answers
Why does the fitness of my neural network to play tic-tac-toe keep oscillating?
I wrote a simple feed-forward neural network that plays tic-tac-toe:
9 neurons in input layers: 1 - my sign, -1 - opponent's sign, 0 - empty;
9 neurons in hidden layer: value calculated using ReLU;
9 neurons in output layer: value calculated using…
Perpetuum
- 61
- 2