Publication | Open Access
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing
Citations: 163
References: 0
Year: 2020
Keywords: Convolutional Neural Network, Engineering, Machine Learning, Deep Learning Models, Language Processing, Natural Language Processing, Multimodal LLM, Apache MXNet, Image Analysis, Visual Grounding, Data Science, Video Transformer, Machine Translation, Machine Vision, Feature Learning, Vision Language Model, Pre-trained Models, Computer Science, Deep Learning, Deep Learning Toolkits, Computer Vision, Linguistics
We present GluonCV and GluonNLP, deep learning toolkits for computer vision and natural language processing built on Apache MXNet (incubating). The toolkits provide state-of-the-art pre-trained models, training scripts, and training logs to facilitate rapid prototyping and promote reproducible research. They also offer modular APIs with flexible building blocks that enable efficient customization. By leveraging the MXNet ecosystem, models in GluonCV and GluonNLP can be deployed to a variety of platforms and used from different programming languages. Both toolkits are released under the Apache 2.0 license, which permits distribution, modification, and use of the software.
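As a rough illustration of the pre-trained model access described in the abstract, the sketch below builds a network from the GluonCV model zoo. It assumes the `gluoncv` and `mxnet` packages are installed; the helper name `build_model` and the choice of `resnet18_v1` are illustrative assumptions, not taken from the paper.

```python
def build_model(name="resnet18_v1", pretrained=False):
    """Return a GluonCV model-zoo network, or None if GluonCV is unavailable.

    `build_model` is a hypothetical helper for this sketch. With
    pretrained=True, GluonCV downloads the published weights on first use.
    """
    try:
        # Imported lazily so the sketch degrades gracefully without GluonCV.
        from gluoncv import model_zoo
    except ImportError:
        return None
    return model_zoo.get_model(name, pretrained=pretrained)


if __name__ == "__main__":
    net = build_model()
    if net is None:
        print("gluoncv is not installed; skipping model construction")
    else:
        print(type(net).__name__)
```

Passing `pretrained=True` would fetch trained weights instead of leaving the parameters uninitialized, which is the quickest route to the rapid prototyping the abstract mentions.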