Publication | Closed Access
Multimodal architecture for video captioning with memory networks and an attention mechanism
40
Citations
35
References
2017
Year
Artificial IntelligenceNatural Language ProcessingMultimodal LlmEngineeringMachine LearningMemory NetworksVision Language ModelVideo SummarizationVisual Question AnsweringAttention MechanismDeep LearningMultimodal ArchitectureComputer VisionMachine TranslationMulti-modal Summarization
| Year | Citations | |
|---|---|---|
Page 1
Page 1