Most Popular
1500 questions
9
votes
5 answers
Decision tree with final decision being a linear regression
Question:
I want to implement a decision tree with each leaf being a linear regression, does such a model exist (preferable in sklearn)?
Example case 1:
Mockup data is generated using the formula:
y = int(x) + x * 1.5
Which looks like:
I want to…
Nathan
- 193
- 1
- 6
9
votes
1 answer
How can you build a model that extracts data out from receipts?
I'm trying to build a model that is capable of identifying information on receipts and invoices.
I have used google cloud vision api for text extraction from the receipt but the problem is it just returns all the text from a receipt. I am looking to…
user_12
- 347
- 3
- 10
9
votes
4 answers
Validation loss much higher than training loss
I am training a CNN on some text data. The sentences are padded and embedded and fed to a CNN. The model architecture is:
model = Sequential()
model.add(Embedding(max_features, embedding_dims, input_length=maxlen))
model.add(Conv1D(128, 5,…
NoLand'sMan
- 215
- 1
- 3
- 6
9
votes
4 answers
How to deal with spelling errors NLP
I have some data where the main column is the description of one product. The main task is to extract the name of some product from this column, where it sometimes is spelled wrong and amended in other words. I have more than a thousand possible…
Roland
- 221
- 2
- 4
9
votes
1 answer
Proper masking in the transformer model
Concerning the transformer model, a mask is used to mask out attention scores (replace with 1e-9) prior to the matrix multiplication with the value tensor. Regarding the masking, I have 3 short questions and would appreciate if you could clarify…
beginneR
- 193
- 1
- 4
9
votes
1 answer
R - Interpreting neural networks plot
I know there are similar question on stats.SE, but I didn't find one that fulfills my request; please, before mark the question as a duplicate, ping me in the comment.
I run a neural network based on neuralnet to forecast SP500 index time series and…
Quantopik
- 279
- 3
- 14
9
votes
2 answers
Are there any graph embedding algorithms like this already?
I wrote an algorithm for generating node embeddings based on the graph's topology. Most of the explanation is done in the readme file and the examples.
The question is:
Am I reinventing the wheel?
Does this approach have any practical advantages…
monomonedula
- 201
- 1
- 2
9
votes
2 answers
How prevalent is `C/C++` in machine learning development?
I am currently a data scientist mostly doing NLP, and I do most of my work inPython. Since I didn't get a CS degree in undergrad, I've been limited to very high level languages; Java, Python, and R. I somehow even took Data Structures and Algorithms…
gust
- 237
- 1
- 7
9
votes
3 answers
How to run a pyspark application in windows 8 command prompt
I have a python script written with Spark Context and I want to run it. I tried to integrate IPython with Spark, but I could not do that. So, I tried to set the spark path [ Installation folder/bin ] as an environment variable and called…
SRS
- 1,065
- 5
- 11
- 22
9
votes
3 answers
Are validation sets necessary for Random Forest Classifier?
Is it necessary to have train, test and validation sets when using random forest classifier?
I understand it is important with Neural Networks but I am not understanding the importance of it with RF. I understand the idea of having a third unseen…
codiearcher
- 277
- 1
- 3
- 7
9
votes
2 answers
Relationship between VC dimension and degrees of freedom
I'm studying machine learning and I feel there is a strong relationship between the concept of VC dimension and the more classical (statistical) concept of degrees of freedom.
Can anyone explain such a connection?
stochazesthai
- 543
- 4
- 5
9
votes
2 answers
Any differences in regularisation in MLP between batch and individual updates?
I have just learned about regularisation as an approach to control over-fitting, and I would like to incorporate the idea into a simple implementation of backpropagation and Multilayer perceptron (MLP) that I put together.
Currently to avoid…
Neil Slater
- 28,918
- 4
- 80
- 100
9
votes
1 answer
Using Vowpal Wabbit for NER
The Vowpal Wabbit (VW) apparently supports sequence tagging functionality via SEARN. The problem is that I cannot find anywhere detailed parameter list with explanations and with some examples. The best I could find is Zinkov's blog entry with a…
Vladislavs Dovgalecs
- 481
- 3
- 8
9
votes
1 answer
Theano in deep learning research
How widely is Theano used in deep learning research?
Is Theano a good start to learn the implementation of machine learning algorithms?
Will learning the implementation of something like a feed forward network really help? Do graduate students…
Karthik Thiagarajan
9
votes
1 answer
Are there any unsupervised learning algorithms for time sequenced data?
Each observation in my data was collected with a difference of 0.1 seconds. I don't call it a time series because it don't have a date and time stamp. In the examples of clustering algorithms (I found online) and PCA the sample data have 1…
umair durrani
- 344
- 2
- 8