Highest Voted Questions - Data Science Stack Exchange

12

votes

1 answer

Finding the inverse of a matrix with neural networks

I have a generic matrix A which is symmetric, positive definite and sparsely populated (it's also quite big, say composed of tens to hundreds thousands rows). I would like to have a neural network learn how to find a matrix B that resembles the…

asked Oct 19 '16 at 17:10

Sovos

231
2
4

12

votes

3 answers

How to get the probability of belonging to clusters for k-means?

I need to get the probability for each point in my data set. The idea is to compute distance matrix (first column contsins distances to first cluster, second column conteins distances to second cluster and etc). The closest point has probability =…

asked Oct 10 '16 at 06:17

Толкачёв Иван

317
1
3
8

12

votes

1 answer

Tensorflow neural network TypeError: Fetch argument has invalid type

I am making a simple neural network using tensorflow, with data I collected myself, however, it's not cooperating :P I have run into an error that I can't fix or find the fix for and I would love your help. The errormessage: TypeError: Fetch…

asked Oct 09 '16 at 13:23

Hasse Iona

453
1
3
13

12

votes

3 answers

Mass convert categorical columns in Pandas (not one-hot encoding)

I have pandas dataframe with tons of categorical columns, which I am planning to use in decision tree with scikit-learn. I need to convert them to numerical values (not one hot vectors). I can do it with LabelEncoder from scikit-learn. The problem…

asked Sep 18 '16 at 16:45

user1700890

345
1
3
13

12

votes

2 answers

Features of word vectors in Word2Vec

I am trying to do sentiment analysis. In order to convert the words to word vectors, I am using Word2Vec model. Suppose I have all the sentences in a list named 'sentences' and I am passing these sentences to word2vec as follows: model =…

asked Sep 09 '16 at 19:32

enterML

3,031
9
27
38

12

votes

2 answers

Machine Learning Best Practices for Big Dataset

I am about to graduate from my Master and had learnt about machine learning as well as performed research projects with it. I wonder about the best practices in the industry when performing machine learning tasks with Big Datasets (like 100s GB or…

asked Sep 07 '16 at 22:40

iLoeng

243
3
5

12

votes

2 answers

Neural Networks - Loss and Accuracy correlation

I'm a bit confused by the coexistence of Loss and Accuracy metrics in Neural Networks. Both are supposed to render the "exactness" of the comparison of $y$ and $\hat{y}$, aren't they? So isn't the application of the two redundant in the training…

asked Aug 25 '16 at 13:20

Hendrik

8,587
17
42
55

12

votes

5 answers

How to overcome training example's different lengths when working with Word Embeddings (word2vec)

I'm working on Sentiment Analysis over tweets using word2vec as word representation. I have trained my word2vec model. But when I'm going to train my classifier, I'm facing the issue that every tweet has different length and the classifier…

asked Aug 01 '16 at 10:03

antorqs

221
2
5

12

votes

8 answers

Definition of a model in machine learning

This definition does not quite apply since we are not always assuming an underlying distribution. So what is a model really? Can a Gradient Boosted Model (GBM) with specified hyperparameters be considered a model? Is a model a collection of rules?

machine-learning

asked Jul 21 '16 at 14:12

organic agave

221
1
2
4

12

votes

1 answer

Hashing Trick - what actually happens

When ML algorithms, e.g. Vowpal Wabbit or some of the factorization machines winning click through rate competitions (Kaggle), mention that features are 'hashed', what does that actually mean for the model? Lets say there is a variable that…

asked Oct 10 '14 at 03:48

B_Miner

702
1
7
20

12

votes

3 answers

Which supervised learning algorithms are available for matching?

I'm working on a non-profit where we try to help potential university applicants by matching them with alumni that want to share their experience/wisdom and, at the moment, it is happening manually. So I'll have two tables, one with students and one…

asked Jun 21 '16 at 15:43

k1nd3rm4x1

123
1
1
5

12

votes

3 answers

Why use cosine similarity instead of scaling the vectors when calculating the similarity of vectors?

I'm watching a NLP video on Coursera. It's discussing how to calculate the similarity of two vectors. First it discusses calculating the Euclidean distance, then it discusses the cosine similarity. It says that cosine similarity makes more sense…

asked Sep 13 '22 at 09:31

Allure

275
2
7

12

votes

2 answers

Document classification using convolutional neural network

I'm trying to use CNN (convolutional neural network) to classify documents. CNN for short text/sentences has been studied in many papers. However, it seems that no papers have used CNN for long text or document. My problem is that there are too many…

asked Apr 11 '16 at 09:10

lenhhoxung

231
2
8

12

votes

1 answer

Which of the 180 algorithms in R's caret package are feasible?

R's caret package works with 180 models. The author warns that some of the package may be intractably slow or less accurate than top-choice models. The author is not wrong about this. I've tried to train Boruta and evtree models and had to give up…

asked Mar 16 '16 at 18:47

Hack-R

1,919
1
21
34

12

votes

2 answers

Opensource tools for help in mining stream of leader board scores

Consider a stream containing tuples (user, new_score) representing users' scores in an online game. The stream could have 100-1,000 new elements per second. The game has 200K to 300K unique players. I would like to have some standing queries like:…

asked May 16 '14 at 20:07

Tahir Akhtar

315
2
9

Most Popular