Publication | Closed Access
SAMT-generator: A second-attention for image captioning based on multi-stage transformer network
Citations: 46
References: 26
Year: 2024
Natural Language Processing, Multimodal LLM, Image Analysis, Engineering, Text-to-image Retrieval, Visual Grounding, Vision Language Model, Visual Question Answering, Deep Learning, Multi-stage Transformer Network, Computer Vision, Machine Translation