Highest Voted Questions - Data Science Stack Exchange

24

votes

3 answers

How to add a new category to a deep learning model?

Say I have done transfer learning on a pre-trained network to recognize 10 objects. How can I add a $11^{th}$ item that the network can classify without losing all the 10 categories I already trained and the information from the original pre-trained…

asked Dec 10 '16 at 01:43

nnrales

343
1
2
6

24

votes

3 answers

Keyword/phrase extraction from Text using Deep Learning libraries

Perhaps this is too broad, but I am looking for references on how to use deep learning in a text summarization task. I have already implemented text summarization using standard word-frequency approaches and sentence-ranking, but I'd like to explore…

asked Feb 03 '16 at 10:56

shanky_thebearer

373
1
3
11

23

votes

3 answers

Nearest neighbors search for very high dimensional data

I have a big sparse matrix of users and items they like (in the order of 1M users and 100K items, with a very low level of sparsity). I'm exploring ways in which I could perform kNN search on it. Given the size of my dataset and some initial tests I…

asked Aug 14 '14 at 00:50

cjauvin

451
3
7

23

votes

2 answers

Why do activation functions have to be monotonic?

I am currently preparing for an exam on neural networks. In several protocols from former exams I read that the activation functions of neurons (in multilayer perceptrons) have to be monotonic. I understand that activation functions should be…

asked Dec 06 '15 at 11:41

Martin Thoma

18,880
35
95
169

23

votes

14 answers

Data Science conferences?

This is a similar question like the Statistics Conferences question at CrossValidated What are the most significant annual Data Science conferences? Rules: Include a link to the conference Please include links for the talks (be it youtube, the…

community

asked Nov 20 '15 at 11:21

Dawny33

8,296
12
48
104

23

votes

4 answers

How to perform one hot encoding on multiple categorical columns

I am trying to perform one-hot encoding on some categorical columns. From the tutorial I am following, I am supposed to do LabelEncoding before One hot encoding. I have successfully performed the labelencoding as shown below #categorical…

asked Apr 05 '20 at 20:21

radioactive

395
2
3
7

23

votes

4 answers

Incremental Learning with sklearn: warm_start, partial_fit(), fit()

I have built an ML model with the goal of making predictions for targets of the following week. In general, new data will come in and be processed at the end of each week and be in the same data structure as before. In other words, the same number…

asked Feb 24 '20 at 10:15

Adam

896
1
6
13

23

votes

4 answers

In SVM Algorithm, why vector w is orthogonal to the separating hyperplane?

I am a beginner on Machine Learning. In SVM, the separating hyperplane is defined as $y = w^T x + b$. Why we say vector $w$ orthogonal to the separating hyperplane?

asked Jun 09 '15 at 14:39

Chong Zheng

339
1
2
4

23

votes

4 answers

Dimensionality and Manifold

A commonly heard sentence in unsupervised Machine learning is High dimensional inputs typically live on or near a low dimensional manifold What is a dimension? What is a manifold? What is the difference? Can you give an example to describe…

asked May 05 '15 at 17:48

alvas

2,410
7
25
40

23

votes

2 answers

Formal proof of vanilla policy gradient convergence

So I stumbled upon this question, where the author asks for a proof of vanilla policy gradient procedures. The answer provided points to some literature, but the formal proof is nowhere to be included. Looking at Sutton,Barto- Reinforcement…

asked Jun 15 '19 at 16:58

Markus Peschl

280
1
7

23

votes

8 answers

Rationale behind most published works in medical imaging trying to reduce false positives

In medical image processing, most of the published works try to reduce false positive rate (FPR) while in reality, false negatives are more dangerous than false positives. What is the rationale behind it?

asked May 23 '19 at 07:48

Hunar

1,147
2
11
33

23

votes

4 answers

What statistical model should I use to analyze the likelihood that a single event influenced longitudinal data

I am trying to find a formula, method, or model to use to analyze the likelihood that a specific event influenced some longitudinal data. I am having difficultly figuring out what to search for on Google. Here is an example scenario: Image you own a…

asked Jun 20 '14 at 03:18

Peter Kirby

333
1
4

23

votes

4 answers

Is logistic regression actually a regression algorithm?

The usual definition of regression (as far as I am aware) is predicting a continuous output variable from a given set of input variables. Logistic regression is a binary classification algorithm, so it produces a categorical output. Is it really a…

asked Jun 19 '14 at 08:56

joews

526
1
4
10

23

votes

1 answer

How does Gradient Descent and Backpropagation work together?

Please forgive me as I am new to this. I have attached a diagram trying to model my understanding of neural network and Back-propagation? From videos on Coursera and resources online I formed the following understanding of how neural network…

asked Jan 28 '19 at 13:34

Mohamed Mahyoub

345
1
2
5

23

votes

3 answers

How to grow a list of related words based on initial keywords?

I recently saw a cool feature that was once available in Google Sheets: you start by writing a few related keywords in consecutive cells, say: "blue", "green", "yellow", and it automatically generates similar keywords (in this case, other colors).…

asked Jun 17 '14 at 06:05

nassimhddd

587
4
12

Most Popular