A few fixes #22

dacorvo · 2023-10-25T14:03:00Z

While debugging the OPT model, I found out that a few things were not working as expected when batching inputs.

For a start, is_contiguous was never called (and dispatched) because it must be overloaded directly in subclasses (would need to check how many methods are actually needed in Tensor subclasses).

Then, I realized that the chain of aten operations when torch Function is disabled in the Tensor subclass was not necessarily equivalent to their "functional" counterparts.

A good example is torch.matmul: if I disable torch Function for QTensor and invoke torch.matmul, I end up with a sequence of operations that does not check if inputs are contiguous, which leads to failures when the modeling code is not bullet-proof.

Finally, the calibration code was not strictly correct, as during calibration we were always using the "optimal" scale for inputs and outputs where it should be the "current" scale evaluated using a momentum.

The torch.Tensor implementation seems to deal with non-contiguous inputs just fine, but if we don't overload it for QTensor and let instead the chain of aten operations flow, it starts with a sequence of expand/view calls that are not compatible with non-contiguous inputs. It is better to overload matmul anyway, as we might be able to call directly an optimized kernel here.

dacorvo added 9 commits October 25, 2023 15:53

refactor(qtensor): add helper to check if variable is a scalar

44fe36a

feat(qtensor): support multiplication by a scalar

d1b39c9

refactor(qtensor): introduce absmax_scale

ca49ef5

fix(calibration): also calibrate custom quantized modules

b996df0

fix(calibration): requantize input/output with the module scales

6d92477

fix(cqtensor): return self if the tensor is contiguous

0ad1afb

feat(qtensor): support unsqueeze

6509995

feat(examples): evaluate by batch in SST2 example

608fabe

dacorvo merged commit 5935f29 into main Oct 25, 2023

dacorvo deleted the a_few_fixes branch October 25, 2023 14:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A few fixes #22

A few fixes #22

dacorvo commented Oct 25, 2023 •

edited

Loading

A few fixes #22

A few fixes #22

Conversation

dacorvo commented Oct 25, 2023 • edited Loading

dacorvo commented Oct 25, 2023 •

edited

Loading