`CompositeLossMetrics` now performs a weighted sum of losses. #1251

ds-hwang · 2025-06-10T16:22:57Z

Currently, CompositeLossMetrics sums the losses without considering their weights (i.e., the number of live targets). To make this a weighted sum, downstream code has been implementing CompositeLossWeights to inject the number of live targets into loss_weights. This is essentially patching a surprising logic (initail loss sum) with complex logic (CompositeLossWeights) into a straightforward one (weighted sum).

Therefore, we’re changing the default loss aggregation logic to be straightforward from the beginning.

From now on, our standarized loss aggregation logic is

loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(each_loss_weight * num_each_samples)

Historically, the complex logic was introduced because the weights of losses returned by child metrics were unknown. But now that child metrics return losses as WeightedScalar, we can adopt a simpler, cleaner aggregation logic.

Note: alternative formulation could be

loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(num_each_samples)

However, when num_each_samples is large and each_loss_weight is small, the denominator can become disproportionately large. So we discard this option.

Currently, `CompositeLossMetrics` sums the losses without considering their weights (i.e., the number of live targets). To make this a weighted sum, downstream code has been implementing `CompositeLossWeights` to inject the number of live targets into `loss_weights`. This is essentially patching a surprising logic (initail loss sum) with complex logic (CompositeLossWeights) into a straightforward one (weighted sum). Therefore, we’re changing the default loss aggregation logic to be straightforward from the beginning. From now on, our standarized loss aggregation logic is ``` loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(each_loss_weight * num_each_samples) ``` Historically, the complex logic was introduced because the weights of losses returned by child metrics were unknown. But now that child metrics return losses as `WeightedScalar`, we can adopt a simpler, cleaner aggregation logic. Note: alternative formulation could be ``` loss = sum(each_loss_weight * each_loss * num_each_samples) / sum(num_each_samples) ``` However, when num_each_samples is large and each_loss_weight is small, the denominator can become disproportionately large. So we discard this option.

ds-hwang · 2025-06-10T16:24:03Z

@markblee Could you take a look? From 1399

markblee

(Will approve after the internal review completes.)

ds-hwang requested review from ruomingp, markblee and a team as code owners June 10, 2025 16:22

markblee reviewed Jun 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`CompositeLossMetrics` now performs a weighted sum of losses. #1251

`CompositeLossMetrics` now performs a weighted sum of losses. #1251

Uh oh!

ds-hwang commented Jun 10, 2025

Uh oh!

ds-hwang commented Jun 10, 2025

Uh oh!

markblee left a comment

Uh oh!

Uh oh!

CompositeLossMetrics now performs a weighted sum of losses. #1251

Are you sure you want to change the base?

CompositeLossMetrics now performs a weighted sum of losses. #1251

Uh oh!

Conversation

ds-hwang commented Jun 10, 2025

Uh oh!

ds-hwang commented Jun 10, 2025

Uh oh!

markblee left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

`CompositeLossMetrics` now performs a weighted sum of losses. #1251

`CompositeLossMetrics` now performs a weighted sum of losses. #1251