statistics

Use Azure: Clean Missing Data, SQL Transformation and Compute Median

This article solves the following challenge: 

Replace missing value with median of attribute for the corresponding class

First take the dataset and apply several (for each class label) SQL Transformations:

select * from t1 where label = "ATTRIBUTE";

Then Compute the median for the resulting dataset for each class label seperatly and use this computed median as the paramater for "clean missing data". Then add all rows back together, as they were all split by the SQL Transformation.
Finally run your cross validation on the dataset and review the result.
Please review the attached file on how to structure your model in azure for this strategy.

Attachments: 

solution using azure

Evaluate complexity of present statement:

Select ratingCancelGuessingPassing knowledgeKnowledgeableExpert

Your rating: 3 Average: 2.7 (3 votes)

Taggings:

Tracking the source of visit of our website

As a company or the owner of a website, it is necessary to understand the characteristics of our visitors. The information could be useful to enhance the website itself, or even to create a targeted advertisement. These information including the location and the source of the user visit. In the other hand, the statistics of each page in our website are also important. For example: how often user visit the particular page, which page is the most visited page, where the user usually navigate from one page to another. Then the question remain: how can we gain these information in a nice, simple and easy way?
Subscribe to statistics