Open
Description
I've found that the calculations for constructing graphs can be a cpu bottleneck in training (especially when not many cpus are available). It would be very useful if the graphs could be constructed ahead of time/before training to streamline dataloading. I suspect this will be even more of an issue for complicated graph encodings.
This could be a step after the data processing step (e.g. i3 files to parquet) and will be dependent on the graph definition.