To train the CNN, RGB images (in some cases a color ROI) are preprocessed by resizing them so that the smallest dimension is 256, and then cropping the center 256x256 region. After this, the per-pixel mean (across all images) is subtracted. I don't have a proper sense of this concept. Can anyone explain?
Kshitij Singh answered.
2025-11-20
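The per-pixel mean is taken separately at each pixel location, across all of the images. For example, the mean for the pixel at row 17, column 38 is
mean(AllImages(17, 38, :))
The mean for every pixel location at once can be computed with
mean(AllImages, 3)
where the 3 indicates the third dimension, provided that the different images are stored as having a different third-dimensional coordinate in the array.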
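For RGB images the third dimension is normally taken up by the color channels, so the stack of images would more likely be four-dimensional. Here is a minimal sketch of the mean subtraction under that assumption. The rows x columns x channels x images layout, the random stand-in data, and the variable names `meanImage`, `onePixel`, and `centered` are all illustrative and not from the original question.

```matlab
% Hypothetical data: 5 random RGB "images", each 256x256x3, stacked along
% the fourth dimension (rows x columns x channels x images).
rng(0);                                   % fixed seed, only for repeatability
AllImages = rand(256, 256, 3, 5);

% Per-pixel mean: average over the image dimension only.
% The result is a single 256x256x3 "mean image".
meanImage = mean(AllImages, 4);

% The value at one location and channel is the average of that same
% location and channel over every image in the stack.
onePixel = mean(AllImages(17, 38, 1, :));

% Subtract the mean image from every image. Implicit expansion
% (MATLAB R2016b or later) repeats meanImage along the fourth dimension.
centered = AllImages - meanImage;
```

After the subtraction, the average of `centered` over the image dimension is zero (up to rounding) at every pixel location. If the real images are stored as uint8, it is probably safer to convert them to double first, since unsigned integer arithmetic would saturate negative results to 0.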