
Why are there better results when using images in range [0, 255] instead of [0, 1]? #1

@Nick-Morgan

I was running into issues trying to re-create the results of the original paper, and stumbled upon this repository.

I was able to re-create the results when using the Caffe pretrained model (which expects images in the range [0, 255]), but got drastically different results when using PyTorch's pretrained model (which expects images in the range [0, 1]). I noticed this tidbit of code in your repository:

    # normalize using ImageNet's mean
    # [0, 255] range worked much better for me than [0, 1] range (even though PyTorch models were trained on latter)
    transform = transforms.Compose([
        transforms.ToTensor(),
        transforms.Lambda(lambda x: x.mul(255)),
        transforms.Normalize(mean=IMAGENET_MEAN_255, std=IMAGENET_STD_NEUTRAL)
    ])

I applied that same transformation and got results comparable to the original paper. I am somewhat confused about why this works, though. If PyTorch's VGG19 was trained on millions of images in the range [0, 1], wouldn't it just interpret anything above 1 as pure white?
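
For concreteness, here is how I understand the two pipelines side by side. This is just a sketch: the mean/std values are the standard ImageNet statistics from the torchvision docs, the 255-scaled means are simply those values multiplied by 255, and I'm assuming IMAGENET_STD_NEUTRAL in your repository is all ones (so the std division becomes a no-op).

    from torchvision import transforms

    # Standard ImageNet statistics used to train torchvision's pretrained models
    IMAGENET_MEAN = [0.485, 0.456, 0.406]
    IMAGENET_STD = [0.229, 0.224, 0.225]

    # Assumed values for the repository's constants: the same means scaled
    # to the [0, 255] range, and a "neutral" std of ones
    IMAGENET_MEAN_255 = [m * 255 for m in IMAGENET_MEAN]  # [123.675, 116.28, 103.53]
    IMAGENET_STD_NEUTRAL = [1.0, 1.0, 1.0]

    # The pipeline torchvision's VGG19 was actually trained with:
    transform_01 = transforms.Compose([
        transforms.ToTensor(),  # converts to a float tensor in [0, 1]
        transforms.Normalize(mean=IMAGENET_MEAN, std=IMAGENET_STD)
    ])

    # The [0, 255] pipeline from the repository: rescale first, then subtract
    # the mean; with a std of ones, no per-channel scaling is applied
    transform_255 = transforms.Compose([
        transforms.ToTensor(),
        transforms.Lambda(lambda x: x.mul(255)),
        transforms.Normalize(mean=IMAGENET_MEAN_255, std=IMAGENET_STD_NEUTRAL)
    ])

As far as I can tell, Normalize only subtracts the mean and divides by the std, with no clipping, so nothing is literally saturated to white; I still don't understand why the [0, 255] scale produces better stylization, though.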
