Skip to content
This repository was archived by the owner on Aug 15, 2019. It is now read-only.

For matMul gradient broadcasting logic must account for temporary reshaping of inputs. #1598

Merged
merged 5 commits into from
Mar 1, 2019

Conversation

annxingyuan
Copy link
Collaborator

@annxingyuan annxingyuan commented Mar 1, 2019

Fixes broadcasting logic in gradient computation for fused matMul.

This PR will fix test failures on tensorflow/tfjs-layers#397


This change is Reviewable

@annxingyuan annxingyuan requested review from dsmilkov and nsthorat March 1, 2019 15:34
@annxingyuan annxingyuan self-assigned this Mar 1, 2019
@annxingyuan annxingyuan added the BUG label Mar 1, 2019
Copy link
Contributor

@nsthorat nsthorat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reviewed 2 of 2 files at r1.
Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @annxingyuan, @dsmilkov, and @nsthorat)


src/ops/fused_test.ts, line 177 at r1 (raw file):

    const grads = tf.grads((a, b, c) => {
      const prod = tf.matMul(a, b, false, false);

can you make these booleans named variables for readability? thx

@annxingyuan annxingyuan merged commit 5cdd7f5 into master Mar 1, 2019
@annxingyuan annxingyuan deleted the fix_broadcasting_for_fuseds_op branch March 1, 2019 18:16
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants