Most Popular

1500 questions
9
votes
1 answer

How does a convolutional ply differ from an ordinary convolutional network?

I am currently working on recreating the results of this paper. In the paper they describe a method for using CNN for features extraction, and have a acoustic model that is Dnn-hmm and pretrained using RBM. Section III subsection A states…
Carlton Banks
  • 619
  • 1
  • 6
  • 26
9
votes
5 answers

Perceptron learning rate

Today I've seen many Perceptron implementations with learning rates. According to Wikipedia: there is no need for a learning rate in the perceptron algorithm. This is because multiplying the update by any constant simply rescales the weights…
bkoodaa
  • 323
  • 3
  • 5
  • 8
9
votes
2 answers

Is there a C library for machine learning algorithms?

Are there any machine learning libraries for C. Specifically interested in unsupervised learning.
9
votes
1 answer

feature extraction for a pretrained model in keras

Keras has a way to extract the features of a pretrained model, described here https://keras.io/applications/ from keras.applications.vgg16 import VGG16 from keras.preprocessing import image from keras.applications.vgg16 import…
spore234
  • 603
  • 8
  • 14
9
votes
1 answer

What is the memory cost of a CNN?

I was recently thinking about the memory cost of (a) training a CNN and (b) inference with a CNN. Please note, that I am not talking about the storage (which is simply the number of parameters). How much memory does a given CNN (e.g. VGG-16 D) need…
Martin Thoma
  • 18,880
  • 35
  • 95
  • 169
9
votes
3 answers

What is the correct way to compute Mean F1 score?

I have a set of 10 experiments that compute precision, recall and f1-score for each experiment. Now, average precision & average recall is easy to compute. I have some confusion regarding average f1-score. There are 2 ways on how i can compute mean…
Pinkesh Badjatiya
  • 249
  • 1
  • 2
  • 7
9
votes
4 answers

Algorithms and techniques for spell checking

Can anyone suggest any algorithm and technique for spell checking? After some googling, I found some interesting ones such as this one from Peter Norvig, http://norvig.com/spell-correct.html and few others. However, most of them were written many…
edwin
  • 343
  • 1
  • 3
  • 10
9
votes
4 answers

Which book is a standard for introduction to genetic algorithms?

I have heard of genetic algorithms, but I have never seen practical examples and I've never got a systematic introduction to them. I am now looking for a textbook which introduces genetic algorithms in detail and gives practical examples how they…
Martin Thoma
  • 18,880
  • 35
  • 95
  • 169
9
votes
3 answers

What knowledge do I need in order to write a simple AI program to play a game?

I'm a B.Sc graduate. One of my courses was 'Introduction to Machine Learning', and I always wanted to do a personal project in this subject. I recently heard about different AI training to play games such as Mario, Go, etc. What knowledge do I need…
9
votes
1 answer

Machine Learning: Writing Poems

I'm a student of machine learning, and these days I was trying to learn how to use the TensorFlow library. I've gone through various tutorials and trial&errors with tensorflow, and I thought the best way to learn it for real would be to make use of…
Daniel
  • 181
  • 1
  • 11
9
votes
3 answers

How do you evaluate ML model already deployed in production?

so to be more clear lets consider the problem of loan default prediction. Let's say I have trained and tested off-line multiple classifiers and ensembled them. Then I gave this model to production. But because people change, data and many other…
tomtom
  • 247
  • 3
  • 5
9
votes
1 answer

Is time series multi-step ahead forecasting a sequence to sequence problem?

I'm using the keras package in order to train an LSTM for a univariate time series of type numeric (float). Performing a 1-step ahead forecast is trivial, but I'm not sure how to perform a, let's say, 10-step ahead forecast. Two questions: 1) I read…
sevelf
  • 91
  • 1
  • 3
9
votes
1 answer

How do i pass data into keras?

I am currently struggling to understand how i should train my regression network using keras. I am not sure how I should pass my input data to the network. Both the input data and the output data is stored as a list of numpy arrays. Each input numpy…
Loser
  • 165
  • 1
  • 2
  • 7
9
votes
1 answer

How to extract paragraphs from text document?

I have extracted text data from pdf files of annual reports of companies using pdftotext. The extracted file content looks like: Sample pdf file is here FORWARD-LOOKING STATEMENTS In this Annual Report, we have disclosed forward-looking…
Sanjeev
  • 191
  • 1
  • 1
  • 4
9
votes
1 answer

After the training phase, is it better to run neural networks on a GPU or CPU?

My understanding is that GPUs are more efficient for running neural nets, but someone recently suggested to me that GPUs are only needed for the training phase. Once trained, it's actually more efficient to run them on CPUs. Is this true?
Crashalot
  • 223
  • 2
  • 5