You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now, in order to ensure that all submodules have input gradients / output gradients, a dummy tensor is added to the inputs of the watch'd module. This adds new computational overhead since now gradients are computed to the input. This should be optional.
The text was updated successfully, but these errors were encountered:
Right now, in order to ensure that all submodules have input gradients / output gradients, a dummy tensor is added to the inputs of the
watch
'd module. This adds new computational overhead since now gradients are computed to the input. This should be optional.The text was updated successfully, but these errors were encountered: