The Krauthammer group publishes their work on deep learning-based multimodal fusion techniques to reduce annotation burden

Fig. 4: Lopez et al. Front. Big Data, 02 June 2020 © 2020 The Author(s)

Figure 4. Fusion model architecture, showing the individual unimodal CNN feature extractors for images and text, the concatenation fusion mechanism, and a terminal network consisting of three dense layers.
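To make the caption concrete, here is a minimal sketch of concatenation fusion followed by a three-layer dense network. It is an illustration only, not the authors' implementation: the feature sizes, the random weights, the ReLU activations, and the names `dense`, `init_layer`, and `fuse_and_classify` are all assumptions, and the two CNN feature extractors are replaced by random stand-in vectors.

```python
import random


def dense(x, weights, bias, relu=True):
    """One fully connected layer: y = W x + b, optionally followed by ReLU."""
    out = [sum(w * xi for w, xi in zip(row, x)) + b
           for row, b in zip(weights, bias)]
    return [max(0.0, v) for v in out] if relu else out


def init_layer(n_in, n_out, rng):
    """Random weights for illustration only; a trained model would learn these."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def fuse_and_classify(image_features, text_features, layers):
    """Concatenate the unimodal features, then apply the terminal dense layers."""
    x = list(image_features) + list(text_features)  # concatenation fusion
    for i, (weights, bias) in enumerate(layers):
        is_last = i == len(layers) - 1
        x = dense(x, weights, bias, relu=not is_last)  # no ReLU on the output
    return x


rng = random.Random(0)

# Hypothetical sizes, chosen only to keep the example small.
IMAGE_DIM, TEXT_DIM, HIDDEN, N_CLASSES = 8, 4, 6, 2

# Terminal network: three dense layers, as described in the caption.
layers = [
    init_layer(IMAGE_DIM + TEXT_DIM, HIDDEN, rng),
    init_layer(HIDDEN, HIDDEN, rng),
    init_layer(HIDDEN, N_CLASSES, rng),
]

# Stand-ins for the outputs of the image CNN and the text CNN.
image_features = [rng.random() for _ in range(IMAGE_DIM)]
text_features = [rng.random() for _ in range(TEXT_DIM)]

logits = fuse_and_classify(image_features, text_features, layers)
print(logits)
```

The fusion step itself is just the list concatenation: each modality keeps its own feature extractor, and only the terminal network sees both feature vectors together.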