Google has revealed a new multi-task neural network system called MultiModel. A single model can learn to detect objects in images, generate captions, recognize speech, translate between four pairs of languages, and parse grammar and syntax, all at once.