We have used very recent computer vision models based on Visual Transformers to predict for each pixel of an outdoor photo, which class of object it is, e.g. "street", "vegetation", "water, etc. This will be used in further research for investigating the effect of water bodies and vegetation on the human mind.