Highest Voted Questions - Artificial Intelligence Stack Exchange

6

votes

1 answer

How can a language model keep track of the provenance of the main knowledge/sources used to generate a given output?

One of the main criticisms against the use of ChatGPT on Stack Exchange is that it doesn't attribute the main knowledge/sources used to generate a given output. How can a language model keep track of the provenance of the main knowledge/sources used…

asked Dec 16 '22 at 18:12

Franck Dernoncourt

3,004
1
19
34

6

votes

0 answers

How can an Artificial Intelligence system be ethically trained to generate art?

There have been a lot of popular AI-generating image systems put out recently, with such systems as Midjourney and Dall-E catching attention with how well put-together many of the automatically generated images are. However, there has been a lot of…

asked Dec 05 '22 at 15:04

Mithical

2,905
5
27
39

6

votes

1 answer

How to create and train (with mutation and selection) a neural network to predict the next state of a board?

I'm aiming to create a neural network that can learn to predict the next state of a board using the rules of Conway's Game of Life. Technically, I have three questions, but I felt that they needed to be together to get the full picture. My network…

asked Aug 04 '17 at 15:15

Aric

275
1
6

6

votes

2 answers

Can a neural network with linear activation functions produce a connection of linear functions?

From a lecture in machine learning, I know that a linear activation function can only produce a linear function, but I don't know if it can produce a connected linear function, like the one in the image below. This function consists of multiple…

asked Aug 03 '17 at 07:57

HansPeterLoft

237
2
2

6

votes

2 answers

How to shorten the development time of a neural network?

I am developing an LSTM for sequence tagging. During the development, I do various changes in the system, for example, add new features, change the number of nodes in the hidden layers, etc. After each change, I check the accuracy using…

asked Aug 02 '17 at 08:51

Erel Segal-Halevi

285
1
5

6

votes

1 answer

Can reinforcement learning algorithms be applied to computer vision problems?

Can reinforcement learning algorithms be applied to computer vision problems? If yes, what are some examples of these applications?

asked Jul 30 '17 at 10:33

user3751

6

votes

1 answer

Is large language model and foundation model the same thing?

I read a lot about foundation model and large language model. However, I dont find a clear definition what exactly is a foundation model. Is large language model and foundation model the same thing?

asked Aug 15 '22 at 19:49

Exploring

343
6
16

6

votes

2 answers

What are some intermediate or advanced books on neural networks?

Is anyone able to recommend some resources (preferably books) on the topic of neural networks that goes beyond that of introductory reading? I'm still relatively new to the subject, however, I have successfully created my own neural network, so I…

asked Jul 14 '17 at 16:08

Astronought

179
3

6

votes

1 answer

How to add more features to the input of a machine learning algorithm?

I am trying to perform a binary classification of tweets using machine learning. The usual way of doing this seems to be putting a hand-classified tweet's words into a big vector, then use that vector as input to an algorithm, which then predicts…

asked Jul 11 '17 at 13:08

schoon

245
2
7

6

votes

2 answers

What is the exact role of model $p_\theta$ in diffusion models for the reverse process?

I'm reading this interesting blog post explaining diffusion probabilistic models and trying to understand the following. In order to compute the reverse process, we need to consider the posterior distribution $q(\textbf{x}_{t-1} | \textbf{x}_t)$…

asked Jun 24 '22 at 11:19

James Arten

307
1
11

6

votes

1 answer

When exactly does the split into different heads in Multi-Head-Attention occur?

I am confused by the Multi-Head part of the Multi-Head-Attention used in Transformers. My question concerns the implementations in Pytorch of nn.MultiheadAttention and its forward method multi_head_attention_forward and whether these are actually…

asked May 17 '22 at 12:42

Aushilfsgott

63
1
5

6

votes

2 answers

Is algorithmic bias due to the training dataset used?

I recently read about algorithmic bias in facial recognition. Is algorithmic bias due to the training dataset used, or is it due to something else?

asked Jun 05 '17 at 09:00

Karan Nadagoudar

285
1
5

6

votes

1 answer

Is there ever a need to combine deep learning frameworks? (Eg. TensorFlow & Torch)?

Imagine a simple scenario of having a large repository using one framework and integrated with data/robots, etc., then having a new feature requested and the framework missing some vital functionality that is available in another framework (say a…

asked May 29 '17 at 02:22

Jaden Travnik

3,817
1
17
35

6

votes

1 answer

Is it feasible to train a Machine Learning Model (with image inputs) in an average personal computer?

There are lots of examples of machine learning systems that can recognize objects and extract other information from images with very high precision. To train the models of such systems is necessary (I guess) a computer with a lot of computational…

asked May 22 '17 at 15:52

user7369

6

votes

1 answer

How do neural networks manage to do regression?

I'm trying to learn about neural networks, and I'm interested in gaining a better conceptual understanding of how they work to solve certain problems. I'm having trouble in conceptually understanding how they succeed in doing regression (i.e.…

asked May 09 '17 at 13:08

PeterW

241
1
4

Most Popular