Add Two Additional Slurm Array Examples #906

Premas · 2025-02-18T16:43:21Z

Pull Request

Overview

This pull request introduces two additional Slurm array examples.

Proposed Changes

Add an example to count the number of words in a file line by line using a Slurm array job.
Add an example to count the number of words from multiple files in a directory in parallel using a Slurm array job.
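A minimal sketch of the first proposed example, not the tutorial's actual script. The file name `input.txt`, its contents, the job name, and the array range are assumptions made for illustration. `SLURM_ARRAY_TASK_ID` is set by Slurm inside each array task; the sketch falls back to 0 so it can also be tried outside of Slurm.

```shell
#!/bin/bash
#SBATCH --job-name=wordcount-lines   # hypothetical job name
#SBATCH --array=0-2                  # one task per line of the input file

# Hypothetical input, created here only so the sketch is self-contained.
workdir="$(mktemp -d)"
printf '%s\n' "one two three" "four five" "six" > "$workdir/input.txt"

# Slurm sets SLURM_ARRAY_TASK_ID per task; default to 0 outside of Slurm.
task_id="${SLURM_ARRAY_TASK_ID:-0}"

# sed numbers lines from 1, so a zero-indexed task ID is shifted by one.
line="$(sed -n "$((task_id + 1))p" "$workdir/input.txt")"

# wc -w counts whitespace-separated words; tr removes any padding in its output.
word_count="$(printf '%s\n' "$line" | wc -w | tr -d ' ')"

echo "Task ${task_id}: ${word_count} words"
```

Each task handles exactly one line, so the array range should match the number of lines in the input file.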

Related Issues

Fixes #693

wwarriner

This is a great start!

As we discussed in our meeting, let's have example 4.1 read data from lines in a file. To do this, you can put the contents of each file you generate on its own line in a single file.

Then we can have a 4.2 where you first generate a collection of paths using a glob based on file extension and write them to a file, one path per line. Each job then reads one path from that file and does something with it. That makes 4.2 like 4.1, with one extra preprocessing step. With that said, please still generate a complete example for 4.2.

These two jobs can do the same thing (count words) using two different approaches. One reads data from a single file; the other reads data from separate files using a list.
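The 4.2 workflow described above might look roughly like the following sketch. The directory layout, file names, and array range are hypothetical, and in practice the preprocessing step would run once before the array job is submitted rather than inside every task.

```shell
#!/bin/bash
#SBATCH --array=0-2   # assumes three input files; adjust to the real count

# Hypothetical sample data so the sketch is self-contained.
workdir="$(mktemp -d)"
mkdir "$workdir/data"
printf 'alpha beta\n' > "$workdir/data/a.txt"
printf 'gamma\n' > "$workdir/data/b.txt"
printf 'delta epsilon zeta\n' > "$workdir/data/c.txt"

# Preprocessing: collect paths by extension, one path per line.
# Sorting keeps the mapping from task ID to file reproducible.
file_list="$workdir/file_list.txt"
find "$workdir/data" -type f -name '*.txt' | sort > "$file_list"

# Slurm sets SLURM_ARRAY_TASK_ID per task; default to 0 outside of Slurm.
task_id="${SLURM_ARRAY_TASK_ID:-0}"

# Read the one path that belongs to this task (sed lines are one-indexed).
path="$(sed -n "$((task_id + 1))p" "$file_list")"

# Count the words in that file only.
word_count="$(wc -w < "$path" | tr -d ' ')"

echo "Task ${task_id}: ${path} has ${word_count} words"
```

Keeping the list outside the data directory avoids the list file matching its own glob.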

Premas · 2025-02-28T17:52:12Z

Thanks for reviewing. I have incorporated the following changes:

Example 1: Each array task independently reads one line of a file and computes the word count for that line.
Example 2: Multiple files are processed independently by array tasks, with each task calculating the word count for a specific file. Example 2 also uses find and globbing.

wwarriner

This is great! I have a few thoughts on how to help engage readers.

And I think we'll want to be sure to include what we talked about with changing 4.1 and 4.2 to do the same thing in two different ways. One is reading files from a list in a text file. The other is reading files directly from the filesystem using find.
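For the second approach, reading directly from the filesystem, one possible sketch follows. It is an illustration under assumptions, not the tutorial's code: the sample files and the script name `job.sh` are hypothetical. Each task re-runs `find`, so the mapping from task ID to file is only stable if the directory contents do not change while the array job runs.

```shell
#!/bin/bash
# Hypothetical sample data so the sketch is self-contained.
data_dir="$(mktemp -d)"
printf 'one two\n' > "$data_dir/x.txt"
printf 'three four five\n' > "$data_dir/y.txt"

# At submission time: count matching files to size a zero-indexed array range.
n_files="$(find "$data_dir" -type f -name '*.txt' | wc -l | tr -d ' ')"
array_range="0-$((n_files - 1))"
echo "Submit with: sbatch --array=${array_range} job.sh"   # job.sh is hypothetical

# Inside each task: Slurm sets SLURM_ARRAY_TASK_ID; default to 0 outside of Slurm.
task_id="${SLURM_ARRAY_TASK_ID:-0}"

# Pick this task's file straight from the sorted find output, with no list file.
path="$(find "$data_dir" -type f -name '*.txt' | sort | sed -n "$((task_id + 1))p")"

word_count="$(wc -w < "$path" | tr -d ' ')"
echo "Task ${task_id}: ${path} has ${word_count} words"
```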

docs/cheaha/slurm/slurm_tutorial.md

wwarriner

Make sure to test the code if you haven't.

This all looks great. A bit of heading reorganization will have this ready to merge. Good work!

docs/cheaha/slurm/slurm_tutorial.md

wwarriner

Great work Prema! This is looking good.

I think there is some room to clean up and make it look more professional. There are a few key points worth addressing for clarity as well. Good work, if you have questions we can discuss.

docs/cheaha/slurm/slurm_tutorial.md

wwarriner · 2025-04-15T21:32:13Z

docs/cheaha/slurm/slurm_tutorial.md

@@ -239,7 +239,58 @@ $ sacct -j 27099591

 Array jobs are more effective when you have a larger number of similar tasks to be executed simultaneously with varied input data, unlike `srun` parallel jobs which are suitable for running a smaller number of tasks concurrently (e.g. less than 5). Array jobs are easier to manage and monitor multiple tasks through unique identifiers.


When I got to the last example I noticed the arrays start at "1". Please start them at "0" instead, to make them more readily compatible with constructs like bash arrays and most programming languages.

You might add a paragraph in the overall introduction to your examples about when to choose zero-indexed or one-indexed --array ranges.

In my mind...

Researchers should prefer starting with zero. If their software or programming language is one-indexed (like MATLAB, R, Julia, and Fortran), then it is reasonable to start with one, unless they are also using bash arrays, in which case they need to consider their options. If needed, researchers can convert between the two using bash arithmetic expansion, $(($SLURM_ARRAY_TASK_ID + 1)) or $(($SLURM_ARRAY_TASK_ID - 1)), instead of just $SLURM_ARRAY_TASK_ID.
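A short sketch of those conversions, assuming bash as the job shell. The array contents and the variable names other than `SLURM_ARRAY_TASK_ID` are made up for illustration, and the task ID defaults to 0 so the snippet can be tried outside of Slurm.

```shell
#!/bin/bash
# Slurm sets SLURM_ARRAY_TASK_ID per task; default to 0 outside of Slurm.
task_id="${SLURM_ARRAY_TASK_ID:-0}"

# Bash arrays are zero-indexed, so a zero-indexed task ID indexes them directly.
samples=("alpha" "beta" "gamma")   # hypothetical values
sample="${samples[$task_id]}"

# One-indexed consumers (sed line numbers, MATLAB, R, Julia, Fortran)
# need the zero-indexed task ID plus one.
one_based=$((task_id + 1))

# Conversely, under a one-indexed range such as --array=1-3,
# subtract one before indexing a bash array.
one_indexed_id=1   # stands in for SLURM_ARRAY_TASK_ID under --array=1-3
first="${samples[$((one_indexed_id - 1))]}"

echo "sample=${sample} one_based=${one_based} first=${first}"
```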

You might also have a subsection "key terminology" that introduces a few things, and provide a reference point. Besides $SLURM_ARRAY_TASK_ID, can you think of anything else that might benefit from being in the "key terminology" section?

All of this is my opinion. Another alternative would be to introduce these concepts in each example, keeping them independent, but that requires more writing and maintenance.

I will leave it to you to decide how to handle this.

wwarriner

Looks great, thank you Prema! Hard work paying off well!

wwarriner

Oops, I missed that the change to start --array at zero was still outstanding, my apologies! Once that's in we can call it finished.

added array example1

e869309

Premas self-assigned this Feb 18, 2025

Premas added the pr: review PR is ready for review label Feb 18, 2025

wwarriner requested changes Feb 18, 2025

wwarriner added pr: changes requested Review complete, needs changes and removed pr: review PR is ready for review labels Feb 18, 2025

Premas added 5 commits February 25, 2025 10:28

add example 2

10ae95f

added script

3ac534d

typos

988e932

add output to example1

3f280fc

add output to example 2

98be6cf

Premas marked this pull request as ready for review February 28, 2025 17:45

Premas added pr: review PR is ready for review and removed pr: changes requested Review complete, needs changes labels Feb 28, 2025

added usage of find command in example 4.2

d2f7a12

wwarriner requested changes Mar 6, 2025

wwarriner added pr: changes requested Review complete, needs changes and removed pr: review PR is ready for review labels Mar 6, 2025

Premas added pr: merge PR is ready to merge pr: changes requested Review complete, needs changes and removed pr: changes requested Review complete, needs changes pr: merge PR is ready to merge labels Mar 7, 2025

Premas added 7 commits March 11, 2025 09:58

revised structurin

028e3e2

organizing data

1c3ec0d

alter example4.2

d778ad8

lint verified

17637cc

add ex-4.2 description

493e25f

added example 4.3

839430b

example4.1 typos

ca0bd33

Premas added 2 commits March 25, 2025 10:51

fix typos example4.2

9ea93f0

typos example4.3

5ad36bc

Premas added pr: review PR is ready for review and removed pr: changes requested Review complete, needs changes labels Mar 25, 2025

wwarriner requested changes Mar 28, 2025

docs/cheaha/slurm/slurm_tutorial.md (outdated, resolved)

wwarriner added pr: changes requested Review complete, needs changes and removed pr: review PR is ready for review labels Mar 28, 2025

Premas added 3 commits March 28, 2025 12:20

retested code and updated

b5a0b74

headings changed

f0d6adc

updated to four examples

fa9f399

Premas added pr: review PR is ready for review and removed pr: changes requested Review complete, needs changes labels Mar 31, 2025

wwarriner requested changes Apr 15, 2025

wwarriner added pr: changes requested Review complete, needs changes and removed pr: review PR is ready for review labels Apr 15, 2025

Premas added 2 commits April 16, 2025 11:10

rewording

9311eab

fix typos

521465e

wwarriner mentioned this pull request May 1, 2025

Add information on --ntasks-per-socket for multiple GPU jobs. #977

Merged

Premas added 2 commits May 16, 2025 11:57

links and comments

a00bcfa

minor changes

a6517e8

wwarriner approved these changes May 19, 2025

wwarriner added pr: merge PR is ready to merge pr: changes requested Review complete, needs changes and removed pr: changes requested Review complete, needs changes pr: merge PR is ready to merge labels May 19, 2025

wwarriner requested changes May 19, 2025

array indexing

048f658
