NAME PRICE SALES VIEWS AVG_RATING VOTES COMMENTS
Module 1 $12.00 69 12048 5 3 26
Module 2 $24.99 12 52858 5 1 14
Module 3 $10.00 1 1381 -1 0 0
Module 4 $22.99 46 57841 5 8 24
.................
So, Let's say I have statistics of sales. I would like to find out:
- How
Price
/etc impact onSales
? - Detect which are features the most impactable?
- Which should be optimized price to achieve the most sales?
Please advise which Python libraries can help here? Any example would be great here!
The python machine learning library scikit-learn is most appropriate in your case. There is a sub-module called feature_selection fits exactly your needs. Here is an example.
Now, we see only 38 out of 500 features are selected.
To further build a predictive model, we could consider the popular GradientBoostRegression.
From the graph, we see the model did pretty a good job: Most variation in 'Sales' has been captured by our model.