Generating media files from video comprises dividing the video (801) into spatial regions and generating video tracks (802) to store the divided video, each track associated with metadata (804). The metadata comprises spatial information (e.g. position & size information in sphere coordinates or 2D space) indicating that the video is divided into spatial regions, and an identifier indicating that it is from the same source. The files are generated including the generated tracks along with the metadata. Also disclosed is the metadata comprising composition information (e.g. composition type and/or picture size) for organizing generated tracks to display a full picture. Media files including the generated tracks are generated / parsed (e.g. according to ISO BMFF, Base Media File Format). Also disclosed is encoding media data in a first track and a second track being generated which references the first and comprises transform descriptions to be applied to the first track samples, e.g. in the metadata header or sample table box. Alternative media versions can be provided using the transforms described in the second track(s) on the first track samples. Also described is the second track describing samples representing the result of applying the transformation operator(s) to first track samples.
展开▼