Use TensorIndexer for the view tests #4237

naoyam · 2025-04-11T04:07:17Z

Enabled TensorIndexer for the reshape tests.

I temporarily added a codegen diff result to this PR. This one is more concise as I disabled index hoisting. As far as I can see, there's no concerning change. I haven't verified everything, but I believe most of them are because TensorIndexer can detect more divisible splits, which helps generate simplified indices through more aggressive contig indexing.

Once approved, I'll remove the html file.

Context

Part of #4175.

I'm planning to enable the new indexer globally by default once we are sufficiently confident with it. I'm going to enable it for some of the C++ tests for now. Just manually checking the diff results seems to be the only way to gain some confidence.

All the tests are passing in my local branch, but just having green test results don't necessarily mean everything is properly ported to the new indexer. I'll also check perf changes with the benchmarks, but they may not give clear signals as indexing is just one piece of performance bottlenecks.

naoyam · 2025-04-11T04:07:34Z

!test --diff

github-actions · 2025-04-11T04:08:02Z

Review updated until commit 26ad85b

Description

Enabled TensorIndexer for view and reshape tests
Added setup methods to enable TensorIndexer options
Temporarily included codegen diff results for review

Changes walkthrough 📝

Relevant files

Enhancement

test_gpu_view.cpp `Enable TensorIndexer in view and reshape tests` tests/cpp/test_gpu_view.cpp Modified GpuViewTest to inherit from NVFuserTest and added SetUp method to enable TensorIndexer Modified ReshapeReduction to inherit from NVFuserFixtureParamTest and added SetUp method to enable TensorIndexer	+15/-2

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

🧪 PR contains tests
⚡ Recommended focus areas for review Code Duplication The `SetUp` method is duplicated in both `GpuViewTest` and `ReshapeReduction` classes. Consider creating a base class to avoid code duplication. class GpuViewTest : public NVFuserTest { protected: void SetUp() override { NVFuserTest::SetUp(); EnableOptionsGuard::getCurOptions().set(EnableOption::IdModel, {"all"}); } }; Test Coverage Ensure that the new tests cover all edge cases and scenarios, especially since the new indexer is being enabled. Verify that the existing tests are still relevant and effective. TEST_F(GpuViewTest, FusionViewDtypeSameSizeOutput) { Performance Impact Evaluate the performance impact of enabling TensorIndexer globally. Conduct thorough benchmarking to ensure that the benefits outweigh any potential regressions. class GpuViewTest : public NVFuserTest { protected: void SetUp() override { NVFuserTest::SetUp(); EnableOptionsGuard::getCurOptions().set(EnableOption::IdModel, {"all"}); } };

naoyam · 2025-04-11T19:45:53Z

!test --diff

This reverts commit 03a1b69.

jjsjann123

github cannot show the diff 😢

Not totally sure how I should interpret the diff.

Searching by ^-
generated cuda indexing does look simpler. I'm a bit surprised to see the first code diff like these (line 11925 in the diff):

- if (threadIdx.x + (128 * blockIdx.x)) < 120)
+ if (threadIdx < 120)

I'm not holding anything against this PR, since it's only turning it on in the test. But let's remove the temporary file first before stamping it. I don't want to accidentally add that in our history.

naoyam · 2025-04-14T21:54:04Z

github cannot show the diff 😢

Not totally sure how I should interpret the diff.

Searching by ^- generated cuda indexing does look simpler. I'm a bit surprised to see the first code diff like these (line 11925 in the diff):
- if (threadIdx.x + (128 * blockIdx.x)) < 120)
+ if (threadIdx < 120)
I'm not holding anything against this PR, since it's only turning it on in the test. But let's remove the temporary file first before stamping it. I don't want to accidentally add that in our history.

IIRC, that's because the loop ID parallelized by BIDx is actually just a broadcast ID and that the new indexer is able to simplify the index.

Do you have any questions with other changes? Any concern?

I'm planning to enable the new indexer globally by default once we are sufficiently confident with it. I'm going to enable it for some of the C++ tests for now. Just manually checking the diff results seems to be the only way to gain some confidence.

All the tests are passing in my local branch, but just having green test results don't necessarily mean everything is properly ported to the new indexer. I'll also check perf changes with the benchmarks, but they may not give clear signals as indexing is just one piece of performance bottlenecks.

naoyam · 2025-04-14T21:55:55Z

github cannot show the diff 😢

No, it doesn't. Please download it and open it locally.

jjsjann123 · 2025-04-15T07:41:56Z

IIRC, that's because the loop ID parallelized by BIDx is actually just a broadcast ID and that the new indexer is able to simplify the index.
Do you have any questions with other changes? Any concern?

Thanks. my earlier quick scanning does seem to see indexing code getting at least shorter. So that's a positive thing.

I'll also check perf changes with the benchmarks, but they may not give clear signals as indexing is just one piece of performance bottlenecks.

Are we seeing mixed performance impact? I don't think we necessarily have to answer all those questions, but is there any significant regression that's worth investigation?
Since this PR switches the indexing mode on only on cpp tests, not benchmark. Does this mean perf impact would still be later conducted on python benchmark, when we switch to use TensorIndexer by default?

I don't have much concern on this PR other than that question above.

But most importantly, let's remove the diff code so I can stamp it.

…view

naoyam · 2025-05-09T21:54:45Z

!test

jjsjann123

thanks for getting me to double check.
Looks like tmp files are cleaned up.

🚢

Enabled TensorIndexer for the reshape tests. I temporarily added a codegen diff result to this PR. This one is more concise as I disabled index hoisting. As far as I can see, there's no concerning change. I haven't verified everything, but I believe most of them are because TensorIndexer can detect more divisible splits, which helps generate simplified indices through more aggressive contig indexing. Once approved, I'll remove the html file. ### Context Part of #4175. I'm planning to enable the new indexer globally by default once we are sufficiently confident with it. I'm going to enable it for some of the C++ tests for now. Just manually checking the diff results seems to be the only way to gain some confidence. All the tests are passing in my local branch, but just having green test results don't necessarily mean everything is properly ported to the new indexer. I'll also check perf changes with the benchmarks, but they may not give clear signals as indexing is just one piece of performance bottlenecks.

Use TensorIndexer with the view tests

34bb2ba

temp

03a1b69

naoyam added 2 commits April 11, 2025 15:30

Revert "temp"

c06cf91

This reverts commit 03a1b69.

tmp

0644258

naoyam marked this pull request as ready for review April 11, 2025 22:38

naoyam requested a review from jjsjann123 April 11, 2025 22:38

jjsjann123 reviewed Apr 14, 2025

View reviewed changes

naoyam added 2 commits May 9, 2025 14:50

Merge remote-tracking branch 'origin/main' into tensorindexer_enable_…

d7b48f9

…view

remove tmp file

26ad85b

naoyam requested a review from jjsjann123 May 9, 2025 21:54

jjsjann123 approved these changes May 12, 2025

View reviewed changes

naoyam merged commit 7c1fa59 into main May 12, 2025
53 checks passed

naoyam deleted the tensorindexer_enable_view branch May 12, 2025 16:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use TensorIndexer for the view tests #4237

Use TensorIndexer for the view tests #4237

Uh oh!

naoyam commented Apr 11, 2025 •

edited

Loading

Uh oh!

naoyam commented Apr 11, 2025

Uh oh!

github-actions bot commented Apr 11, 2025 •

edited

Loading

Uh oh!

naoyam commented Apr 11, 2025

Uh oh!

jjsjann123 left a comment

Uh oh!

naoyam commented Apr 14, 2025

Uh oh!

naoyam commented Apr 14, 2025

Uh oh!

jjsjann123 commented Apr 15, 2025

Uh oh!

naoyam commented May 9, 2025

Uh oh!

jjsjann123 left a comment

Uh oh!

Uh oh!

Uh oh!

Use TensorIndexer for the view tests #4237

Use TensorIndexer for the view tests #4237

Uh oh!

Conversation

naoyam commented Apr 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Uh oh!

naoyam commented Apr 11, 2025

Uh oh!

github-actions bot commented Apr 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Changes walkthrough 📝

PR Reviewer Guide 🔍

Uh oh!

naoyam commented Apr 11, 2025

Uh oh!

jjsjann123 left a comment

Choose a reason for hiding this comment

Uh oh!

naoyam commented Apr 14, 2025

Uh oh!

naoyam commented Apr 14, 2025

Uh oh!

jjsjann123 commented Apr 15, 2025

Uh oh!

naoyam commented May 9, 2025

Uh oh!

jjsjann123 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

naoyam commented Apr 11, 2025 •

edited

Loading

github-actions bot commented Apr 11, 2025 •

edited

Loading