Most Popular
1500 questions
6
votes
1 answer
How can a language model keep track of the provenance of the main knowledge/sources used to generate a given output?
One of the main criticisms against the use of ChatGPT on Stack Exchange is that it doesn't attribute the main knowledge/sources used to generate a given output. How can a language model keep track of the provenance of the main knowledge/sources used…
Franck Dernoncourt
- 3,004
- 1
- 19
- 34
6
votes
0 answers
How can an Artificial Intelligence system be ethically trained to generate art?
There have been a lot of popular AI-generating image systems put out recently, with such systems as Midjourney and Dall-E catching attention with how well put-together many of the automatically generated images are.
However, there has been a lot of…
Mithical
- 2,905
- 5
- 27
- 39
6
votes
1 answer
How to create and train (with mutation and selection) a neural network to predict the next state of a board?
I'm aiming to create a neural network that can learn to predict the next state of a board using the rules of Conway's Game of Life.
Technically, I have three questions, but I felt that they needed to be together to get the full picture.
My network…
Aric
- 275
- 1
- 6
6
votes
2 answers
Can a neural network with linear activation functions produce a connection of linear functions?
From a lecture in machine learning, I know that a linear activation function can only produce a linear function, but I don't know if it can produce a connected linear function, like the one in the image below. This function consists of multiple…
HansPeterLoft
- 237
- 2
- 2
6
votes
2 answers
How to shorten the development time of a neural network?
I am developing an LSTM for sequence tagging. During the development, I do various changes in the system, for example, add new features, change the number of nodes in the hidden layers, etc. After each change, I check the accuracy using…
Erel Segal-Halevi
- 285
- 1
- 5
6
votes
1 answer
Can reinforcement learning algorithms be applied to computer vision problems?
Can reinforcement learning algorithms be applied to computer vision problems? If yes, what are some examples of these applications?
user3751
6
votes
1 answer
Is large language model and foundation model the same thing?
I read a lot about foundation model and large language model.
However, I dont find a clear definition what exactly is a foundation model. Is large language model and foundation model the same thing?
Exploring
- 343
- 6
- 16
6
votes
2 answers
What are some intermediate or advanced books on neural networks?
Is anyone able to recommend some resources (preferably books) on the topic of neural networks that goes beyond that of introductory reading?
I'm still relatively new to the subject, however, I have successfully created my own neural network, so I…
Astronought
- 179
- 3
6
votes
1 answer
How to add more features to the input of a machine learning algorithm?
I am trying to perform a binary classification of tweets using machine learning.
The usual way of doing this seems to be putting a hand-classified tweet's words into a big vector, then use that vector as input to an algorithm, which then predicts…
schoon
- 245
- 2
- 7
6
votes
2 answers
What is the exact role of model $p_\theta$ in diffusion models for the reverse process?
I'm reading this interesting blog post explaining diffusion probabilistic models and trying to understand the following.
In order to compute the reverse process, we need to consider the posterior distribution $q(\textbf{x}_{t-1} | \textbf{x}_t)$…
James Arten
- 307
- 1
- 11
6
votes
1 answer
When exactly does the split into different heads in Multi-Head-Attention occur?
I am confused by the Multi-Head part of the Multi-Head-Attention used in Transformers. My question concerns the implementations in Pytorch of nn.MultiheadAttention and its forward method multi_head_attention_forward and whether these are actually…
Aushilfsgott
- 63
- 1
- 5
6
votes
2 answers
Is algorithmic bias due to the training dataset used?
I recently read about algorithmic bias in facial recognition.
Is algorithmic bias due to the training dataset used, or is it due to something else?
Karan Nadagoudar
- 285
- 1
- 5
6
votes
1 answer
Is there ever a need to combine deep learning frameworks? (Eg. TensorFlow & Torch)?
Imagine a simple scenario of having a large repository using one framework and integrated with data/robots, etc., then having a new feature requested and the framework missing some vital functionality that is available in another framework (say a…
Jaden Travnik
- 3,817
- 1
- 17
- 35
6
votes
1 answer
Is it feasible to train a Machine Learning Model (with image inputs) in an average personal computer?
There are lots of examples of machine learning systems that can recognize objects and extract other information from images with very high precision. To train the models of such systems is necessary (I guess) a computer with a lot of computational…
user7369
6
votes
1 answer
How do neural networks manage to do regression?
I'm trying to learn about neural networks, and I'm interested in gaining a better conceptual understanding of how they work to solve certain problems. I'm having trouble in conceptually understanding how they succeed in doing regression (i.e.…
PeterW
- 241
- 1
- 4