Null Imputation for DateUnitCircleVectorizer #555
base: master
Thanks for the contribution! It looks like @mweilsalesforce is an internal user so signing the CLA is not required. However, we need to confirm this.
Codecov Report
@@            Coverage Diff             @@
##           master     #555      +/-   ##
==========================================
- Coverage   86.78%   86.78%   -0.01%
==========================================
  Files         347      347
  Lines       12026    12024       -2
  Branches      403      387      -16
==========================================
- Hits        10437    10435       -2
  Misses       1589     1589
Looks pretty clean to me!
lgtm!
How do I get rid of this CLA stuff?
@michaelweilsalesforce did you fill out the form?
Yes.
Kicking the CLA bot.
@michaelweilsalesforce do you have your Salesforce email associated with @mweilsalesforce as well? Or did you accidentally switch accounts without realizing it? The CLA bot is seeing two authors, and it needs both authors to sign the CLA.
Both emails are [email protected] and [email protected]
Somehow I couldn't attach my Salesforce account because it was linked to GitHub Enterprise. That's why I created this other account.
Related issues
Issue
Describe the proposed solution
Use (1, 0) instead of (0, 0) as the default value for nulls.
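To make the proposal concrete, here is a minimal Python sketch of a unit-circle encoding for the hour of day with a configurable null default. It is not the actual `DateUnitCircleVectorizer` code: the function name `hour_to_unit_circle`, the `null_default` parameter, and the `(cos, sin)` component order are assumptions made for illustration.

```python
import math
from typing import Optional, Tuple

HOURS_PER_DAY = 24


def hour_to_unit_circle(
    hour: Optional[float],
    null_default: Tuple[float, float] = (1.0, 0.0),
) -> Tuple[float, float]:
    """Map an hour of day to a point on the unit circle.

    Hypothetical helper: a null (None) hour returns `null_default`
    instead of being projected onto the circle.
    """
    if hour is None:
        return null_default
    # One full day corresponds to one full turn around the circle.
    angle = 2.0 * math.pi * (hour % HOURS_PER_DAY) / HOURS_PER_DAY
    return (math.cos(angle), math.sin(angle))


midnight = hour_to_unit_circle(0)      # hour 0 is angle 0 on the circle
missing = hour_to_unit_circle(None)    # null falls back to the default
```

Under these assumptions, a null value and a midnight timestamp produce the same point, whereas a (0, 0) default lies off the circle at its center.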
Describe alternatives you've considered
The alternatives concern not only this transformer but also the other vectorizers that can use the mode as an imputation technique. Instead of taking the mode, randomly select an existing non-null value so that the distribution of the feature is not changed.
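As a rough illustration of the difference between the two imputation strategies, the following Python sketch fills nulls either with the mode or with a randomly drawn non-null value. The helper names `impute_mode` and `impute_random` and the `seed` parameter are hypothetical and do not correspond to any existing vectorizer API.

```python
import random
from collections import Counter
from typing import List, Optional


def impute_mode(values: List[Optional[int]]) -> List[int]:
    """Replace each None with the most frequent non-null value."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        raise ValueError("cannot impute: no non-null values")
    mode = Counter(non_null).most_common(1)[0][0]
    return [mode if v is None else v for v in values]


def impute_random(values: List[Optional[int]], seed: int = 0) -> List[int]:
    """Replace each None with a randomly chosen existing non-null value."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        raise ValueError("cannot impute: no non-null values")
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    return [rng.choice(non_null) if v is None else v for v in values]
```

Mode imputation piles all nulls onto a single value, while random sampling draws replacements in proportion to the observed values, which tends to preserve the empirical distribution.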
However, this remains difficult :
Additional context
The context is a date in the format MM-DD-YYYY 00h00m00s: its HourOfDay circular representation is not thrown out by SanityChecker because its variance is not 0.
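The effect on variance can be sketched in Python as follows. This does not reproduce SanityChecker's actual check; it only computes the population variance of each coordinate for a column of midnight timestamps with one null, under both null defaults. The helper names and the `(cos, sin)` component order are assumptions.

```python
import math
import statistics
from typing import List, Optional, Tuple


def encode_hour(
    hour: Optional[float], null_default: Tuple[float, float]
) -> Tuple[float, float]:
    """Hypothetical unit-circle encoding of an hour of day."""
    if hour is None:
        return null_default
    angle = 2.0 * math.pi * hour / 24.0
    return (math.cos(angle), math.sin(angle))


def column_variances(
    hours: List[Optional[float]], null_default: Tuple[float, float]
) -> Tuple[float, float]:
    """Population variance of the x and y coordinates of the encoded column."""
    points = [encode_hour(h, null_default) for h in hours]
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return statistics.pvariance(xs), statistics.pvariance(ys)


# Every non-null date is at midnight, so the hour carries no information.
midnight_hours = [0, 0, None, 0]

with_origin_default = column_variances(midnight_hours, (0.0, 0.0))
with_circle_default = column_variances(midnight_hours, (1.0, 0.0))
```

With the (0, 0) default, the null row differs from the midnight rows in the x coordinate, so that coordinate has non-zero variance. With the (1, 0) default, every row encodes to the same point and both variances are zero, which is the condition a zero-variance filter would look for.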