School of Artificial Intelligence and Computer Science;
Jiangnan University;
Wuxi 214122;
China;
Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence;
Wuxi 214122;
China;
Centre for Vision, Speech and Signal Processing;
University of Surrey;
Guildford;
GU2 7XH;
UK;
Action recognition; spatiotemporal relation; multi-branch fusion; long-term representation; video classification;