机译:Transformer-based Cross Reference Network for video salient object detection
College of Information Engineering Shanghai Maritime University;
School of Software Northwestern Polytechnical University;
School of Computer Science and Technology Harbin Institute of Technology ShenzhenDepartment of Computer Science Electrical Engineering and Mathematical Sciences Western Norway University of Applied Sciences;
Cross-modal integration; Transformer; Video salient: Object detection;