It depends entirely on the polycon count of the models and the resolutions of the textures and videos used on the models.
Not all models are created equal.
One efficiently built scene that includes several virtual sets will perform better than another scene with a single badly built element that uses resources wastefully.
#XPression