Why does this model fail?

219 views Asked by At

Here is the data set
https://gist.github.com/kirkstrobeck/d8b768867890807f9dc9

When using Google Prediction API it will go from RUNNING for about 30 minutes, then ERROR: INTERNAL ERROR.

Why does it fail? It seems to be a standard consumable regression model data set.

1

There are 1 answers

4
Vera Gonzalez On

When attempting to answer this question, I looked at the API you speak of as well as its requirements. These requirements lie in the file format and how the text in said file is formatted. The first thing I will point out is that the Google Prediction API that "is uploaded to Google Cloud Storage as a CSV (comma-separated value) file." Your file is a TXT(at least on GitHub), but appears to have the correct structure of a CSV. However, when you take a look at the standards for this filetype, almost everyone has a different way they want it done. In the case of Google, they have very strict requirements on the file format(they also have some good examples here: cloud.google.com/prediction/docs/developer-guide#examples). Long story short, you shouldn't have spaces between your columns, it might cause an error in the processing seeing how it doesn't match the Wikipedia standards or Google's requirements.

EDIT: Sorry about the weird link stuff, I don't have enough rep to do more than two yet.