Some embodiments described herein cover a machine learning architecture with a separated perception subsystem and application subsystem. These subsystems can be co-trained. In one example embodiment, a data item is received and information from the data item is processed by a first node to generate a sparse feature vector. A second node processes the sparse feature vector to determine an output. A relevancy rating associated with the output is determined. A determination is made as to whether to update the first node based on update criteria associated with the first node, wherein the update criteria comprise a relevancy criterion and a novelty criterion. The second node is updated based on the relevancy rating.
展开▼