add double ml notebook and fix a minor typo in dml doc #24

heimengqi · 2019-03-28T22:51:02Z

please take a look at my notebook, especially the last part (multi treatment and multi output). Let me know if I am doing anything wrong or anything I need to add.

vsyrgkanis · 2019-03-29T12:51:05Z

I would change how we plot the cross price elasticities in the final cells and write some more comments in the way that you process the data to fit cross price. For how to depict: can we do something like: we have 3x3 subplots and each subplot contains the cross price elasticity as a function income. This way we can visualize it as a matrix and each matrix contains a plot. Also I would like to have an example that uses the FistaRegressor from here:
http://contrib.scikit-learn.org/lightning/generated/lightning.regression.FistaRegressor.html
and here
https://github.com/scikit-learn-contrib/lightning/blob/master/lightning/impl/fista.py
as our final model and we use 'trace' as the penalty. Similar to what we had in the demo presentation. This would show how to perform nuclear norm penalization for multiple treatments and multiple outcomes. This is important for latent factor models (i.e. the products interact in some low rank latent space).

kbattocchi · 2019-03-29T13:33:52Z

@vasilismsr From an economist's point of view, is the nuclear norm the right regularization? Low rank is nice, but it seems like they'd be more likely to try to apply a mixed effects model of some sort (e.g. shrink all individual own-price elasticities towards one value and all cross-price elasticities towards another).

vsyrgkanis · 2019-03-29T13:37:58Z

Yeap. Low rank is the right way for high dimensional product spaces. Such latent factor models have been studied in pricing (see athey and blei, or the paper we wrote with taddy).

kbattocchi · 2019-03-29T13:48:28Z

@vasilismsr But here we aren't in a high-dimensional regime; we're regressing the three log quantites on the three prices interacted with income (plus an intercept), so there are only 18 coefficients total.

Maybe we should see if we can find (or generate) a high-dimensional example as well, so that we can demonstrate using the trace norm in a more appropriate setting.

vsyrgkanis · 2019-03-29T13:51:07Z

I know. But the notebook is more for expository purposes, i.e. you could even do this. And we can say in the cell above that this would be more appropriate when you have many products and you believe they interact in a latent space. I think its an important special case and we could show we can handle it. We could in principle have a simulated data example too where we have many products.

vasilismsr

looks good

kbattocchi

As Vasilis says this looks good, but I've made a few minor suggestions.

notebooks/Double Machine Learning Examples.ipynb

kbattocchi · 2019-04-05T16:54:56Z

As we discussed over lunch, see what happens if you add shuffle=True to the KFold constructor in the DML fit method and then use many fewer bootstrap samples - does that solve the confidence interval issue?

notebooks/Double Machine Learning Examples.ipynb

add double ml notebook and fix a minor typo in dml doc

2c28c9f

heimengqi requested review from kbattocchi, moprescu and vasilismsr March 28, 2019 22:51

heimengqi added 2 commits April 1, 2019 16:54

change the plot for cross price elasticities

5febd71

add a bootstrap CI for OJ data

9c679c0

vasilismsr approved these changes Apr 2, 2019

View reviewed changes

kbattocchi requested changes Apr 5, 2019

View reviewed changes

fix plot legend typo

60c87b9

moprescu suggested changes Apr 5, 2019

View reviewed changes

notebooks/Double Machine Learning Examples.ipynb Outdated Show resolved Hide resolved

notebooks/Double Machine Learning Examples.ipynb Outdated Show resolved Hide resolved

notebooks/Double Machine Learning Examples.ipynb Outdated Show resolved Hide resolved

heimengqi and others added 4 commits April 5, 2019 17:27

change dml shuffle True and update notebook based on all feedbacks

9010511

Merge branch 'master' into mehei/dmlnb

b0b6974

Merge branch 'master' into mehei/dmlnb

7dde8e1

Added random state to metalearner tests.

3403209

kbattocchi mentioned this pull request Apr 8, 2019

Add missing notebooks for double ML, Deep IV #20

Closed

kbattocchi approved these changes Apr 9, 2019

View reviewed changes

heimengqi requested a review from moprescu April 9, 2019 19:09

moprescu approved these changes Apr 9, 2019

View reviewed changes

heimengqi merged commit d2118d5 into master Apr 9, 2019

heimengqi deleted the mehei/dmlnb branch April 11, 2019 14:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add double ml notebook and fix a minor typo in dml doc #24

add double ml notebook and fix a minor typo in dml doc #24

heimengqi commented Mar 28, 2019

vsyrgkanis commented Mar 29, 2019 •

edited

Loading

kbattocchi commented Mar 29, 2019

vsyrgkanis commented Mar 29, 2019

kbattocchi commented Mar 29, 2019

vsyrgkanis commented Mar 29, 2019

vasilismsr left a comment

kbattocchi left a comment

kbattocchi commented Apr 5, 2019

add double ml notebook and fix a minor typo in dml doc #24

add double ml notebook and fix a minor typo in dml doc #24

Conversation

heimengqi commented Mar 28, 2019

vsyrgkanis commented Mar 29, 2019 • edited Loading

kbattocchi commented Mar 29, 2019

vsyrgkanis commented Mar 29, 2019

kbattocchi commented Mar 29, 2019

vsyrgkanis commented Mar 29, 2019

vasilismsr left a comment

Choose a reason for hiding this comment

kbattocchi left a comment

Choose a reason for hiding this comment

kbattocchi commented Apr 5, 2019

vsyrgkanis commented Mar 29, 2019 •

edited

Loading