How to optimize TorchIO's read operation? #568
-
Hi, how does TorchIO read the underlying data? Does it read all the data only once, save it in memory, and apply the requisite transforms per epoch, or does it read the data from the supplied files every epoch and then apply the transforms (if any)? I am asking because I am running on systems that share a network filesystem, so any I/O is a premium operation and takes a long time. For some cases, an epoch takes ~10 hours on this cluster but <1 hour on my local machine(s). Cheers,
Replies: 2 comments 9 replies
-
Normally, only once.
-
Hi, @sarthakpati. Good question!

It depends. When you instantiate an image, the data is not loaded. If you put it into a subject and you put that subject into a dataset, nothing is loaded until you actually need the data (e.g., for a transform). In the `SubjectsDataset`, the loaded data belongs to a deep-copied version of the subject, so the original instance is untouched. If you do load the data from the original subject instances, you won't need to read from disk every single time. I suppose you could use that approach if you have slow I/O and a lot of RAM.

```python
import torch
import psutil  # third-party; installed with pip
from tqdm import tqdm
import torchio as tio


def print_used_memory():
    gib = psutil.virtual_memory().used / 2**30
    print(f'RAM used: {gib:.1f} GiB')


colin = tio.datasets.Colin27()
subjects = [
    tio.Subject(
        t1=tio.ScalarImage(colin.t1.path),
        label=tio.LabelMap(colin.brain.path),
    )
    for _ in range(100)
]
dataset = tio.SubjectsDataset(subjects)
loader = torch.utils.data.DataLoader(dataset, batch_size=2, num_workers=4)

print_used_memory()
for batch in tqdm(loader):  # each subject is deep-copied; images are read from disk
    pass
print_used_memory()

print('Loading data...')
for subject in tqdm(subjects):
    subject.load()  # load images, caching the voxel data in RAM
print_used_memory()

for batch in tqdm(loader):  # images were already loaded before; no disk reads
    pass
print_used_memory()
```

Output:
As you can see, when the images were already preloaded, each iteration was about 6x faster, at the cost of some more RAM usage. Does that make sense?
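To see the caching mechanism in isolation, here is a minimal, TorchIO-free sketch of the same idea: a dataset that deep-copies each item on access, so loading the originals up front turns every subsequent access into a cheap in-memory copy. All class and function names here are illustrative stand-ins, not part of TorchIO's API.

```python
import copy


class LazyImage:
    """Illustrative stand-in for an image that defers disk reads."""

    def __init__(self, path):
        self.path = path
        self.data = None  # nothing read from disk yet

    def load(self):
        if self.data is None:
            # Pretend this is an expensive disk read
            self.data = f'voxels from {self.path}'
        return self.data


class CopyingDataset:
    """Returns a deep copy of each item, mimicking SubjectsDataset."""

    def __init__(self, images):
        self.images = images

    def __getitem__(self, index):
        image = copy.deepcopy(self.images[index])
        image.load()  # only "reads" if the copied image carries no data
        return image


images = [LazyImage(f'scan_{i}.nii') for i in range(3)]
dataset = CopyingDataset(images)

first = dataset[0]             # copy of an unloaded image: triggers a "read"
assert images[0].data is None  # the original instance is untouched

images[0].load()               # preload the original once, caching it in RAM
cached = dataset[0]            # the copy already carries the data: no "read"
assert cached.data == 'voxels from scan_0.nii'
```

Because `__getitem__` copies before loading, the expensive read repeats on every access until the original is preloaded, which is exactly the trade-off between I/O time and RAM described above.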