Most Popular
1500 questions
17
votes
1 answer
Visualization of multiple Markov models
I am working on a project where we compare over 10 different Markov models, each representing a different treatment plan.
Most often single models are visualized with a decision tree or transition state diagram. However, with multiple different…
Andrew Brown
- 279
- 1
- 3
17
votes
3 answers
How do vector databases work (for the lay-coder)?
A vector database is a type of database that stores and manages unstructured data, such as text, images, or audio, in vector embeddings (high-dimensional vectors) to make it easy to find and retrieve similar objects quickly.
How do you make that…
Lance
- 273
- 2
- 7
17
votes
4 answers
What does "linear in parameters" mean?
The model of linear regression is linear in parameters.
What does this actually mean?
Albert Gao
- 273
- 1
- 2
- 5
17
votes
3 answers
When should we consider a dataset as imbalanced?
I'm facing a situation where the numbers of positive and negative examples in a dataset are imbalanced.
My question is, are there any rules of thumb that tell us when we should subsample the large category in order to force some kind of balancing in…
Rami
- 594
- 1
- 5
- 16
16
votes
5 answers
R: machine learning on GPU
Are there any machine learning packages for R that can make use of the GPU to improve training speed (something like theano from the python world)?
I see that there is a package called gputools which allows execution of code on the gpu, but I'm…
Simon
- 1,071
- 2
- 10
- 28
16
votes
3 answers
How to self-learn data science?
I am a self-taught web developer and am interested in teaching myself data science, but I'm unsure of how to begin. In particular, I'm wondering:
What fields are there within data science? (e.g., Artificial Intelligence, machine learning, data…
xyhhx
- 263
- 2
- 6
16
votes
2 answers
When to choose linear regression or Decision Tree or Random Forest regression?
I am working on a project and I am having difficulty in deciding which algorithm to choose for regression. I want to know under what conditions should one choose a linear regression or Decision Tree regression or Random Forest regression? Are there…
Jason Donnald
- 321
- 2
- 4
- 6
16
votes
7 answers
Data Science Podcasts?
What are some podcasts which are related to data science?
This is a similar question to the reference request question on CrossValidated.
Details/rules:
The podcasts (the theme and the episodes) should be related to data science. (For example: A…
Dawny33
- 8,296
- 12
- 48
- 104
16
votes
3 answers
Parallel and distributed computing
What is(are) the difference(s) between parallel and distributed computing? When it comes to scalability and efficiency, it is very common to see solutions dealing with computations in clusters of machines, and sometimes it is referred to as a…
Rubens
- 4,107
- 5
- 23
- 42
16
votes
3 answers
How does attention mechanism learn?
I know how to build an attention in neural networks. But I don’t understand how attention layers learn the weights that pay attention to some specific embedding.
I have this question because I’m tackling a NLP task using attention layer. I believe…
user2790103
- 263
- 2
- 5
16
votes
4 answers
How to scale an array of signed integers to range from 0 to 1?
I'm using Brain to train a neural network on a feature set that includes both positive and negative values. But Brain requires input values between 0 and 1. What's the best way to normalize my data?
Jonathan Shobrook
- 313
- 1
- 3
- 8
16
votes
4 answers
seaborn heatmap not displaying correctly
For some reason, my heatmap is not displaying correctly anymore. It was working just fine even with 6 classes. Since the last time I used it, I've installed many packages (including plotly). I don't know what exactly has caused this. How can I make…
user2804064
- 263
- 1
- 2
- 4
16
votes
1 answer
what is the main difference between GAN and autoencoder?
what is the main difference between GAN and other older generative models? what were the characteristics of GAN that made it more successful than other generative models?
Muhammad Hanif Sarwari
- 325
- 1
- 3
- 8
16
votes
4 answers
Should input images be normalized to -1 to 1 or 0 to 1
Many ML tutorials are normalizing input images to value of -1 to 1 before feeding them to ML model. The ML model is most likely a few conv 2d layers followed by a fully connected layers. Assuming activation function is ReLu.
My question is, would…
vuamitom
- 163
- 1
- 1
- 5
16
votes
2 answers
What is the difference between an autoencoder and an encoder-decoder?
I want to know if there is a difference between an autoencoder and an encoder-decoder.
Kahina
- 624
- 1
- 9
- 23