Multimodal attention networks for low-level vision-and-language navigation
Computer Vision and Image Understanding, Elsevier, Volume 210, page 1-10 - 2021
BibTex references
@Article\{LCBCC21, author = "Landi, Federico and Cornia, Marcella and Baraldi, Lorenzo and Corsini, Massimiliano and Cucchiara, Rita", title = "Multimodal attention networks for low-level vision-and-language navigation", journal = "Computer Vision and Image Understanding, Elsevier", volume = "210", pages = "1-10", year = "2021", keywords = "Vision-and-language navigation; Embodied AI; Multi-modal attention", url = "http://vcg-legacy.isti.cnr.it/Publications/2021/LCBCC21" }