15:30 〜 15:50
[DES2/AIS2-2(Invited)] Challenges of Integrating Vision and Language
Deep Learning, Vision and Language, Computer Vision, Natural Language Processing, Encoder-Decoder
The benefits of deep learning are not limited to advanced recognition and generation of data in different modalities, such as images, acoustic signals. As a result of the fact that they are now implemented using commoditized tools based on deep learning, it has become possible to import approaches to understanding other modal data quickly. As a result of the fact that they are now implemented...