This dataset contains 2 million image-description pairs. The images cover a range of categories, including landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, along with an aesthetic subset. Each description covers the overall scene of the image, the details within the scene, and the emotions the image conveys. Descriptions are provided in both English and Chinese.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1437?source=Github
The dataset contains 2 million image-description pairs.
The images cover landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, as well as an aesthetic subset.
Images are in .jpg format; descriptions are in .txt format.
In principle, each description contains at least 250 Chinese characters.
Each description covers the overall scene of the picture, the elements within the scene in detail, and the emotions the picture conveys.
At least 95% of the images are correctly labeled.
The dataset is distributed under a commercial license.
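
The format and length requirements above can be checked with a short script once the data is on disk. The following is a minimal sketch, not part of the dataset's tooling. It assumes that each .jpg sits next to a .txt file with the same base name, that the text files are UTF-8 encoded, and that "Chinese characters" means code points in the CJK Unified Ideographs block. None of these assumptions is stated in the specification, and all function names are hypothetical.

```python
from pathlib import Path

# From the specification: descriptions should be no less than 250 Chinese characters.
MIN_CHINESE_CHARS = 250


def count_chinese_chars(text: str) -> int:
    """Count characters in the CJK Unified Ideographs block (U+4E00 to U+9FFF)."""
    return sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")


def pair_images_with_descriptions(root: Path) -> list[tuple[Path, Path]]:
    """Pair every .jpg under root with a same-named .txt file.

    Assumed layout: image and description share a folder and a base name.
    Images without a matching description are skipped.
    """
    pairs = []
    for image_path in sorted(root.rglob("*.jpg")):
        text_path = image_path.with_suffix(".txt")
        if text_path.is_file():
            pairs.append((image_path, text_path))
    return pairs


def short_descriptions(
    pairs: list[tuple[Path, Path]], minimum: int = MIN_CHINESE_CHARS
) -> list[Path]:
    """Return the description files with fewer Chinese characters than minimum."""
    too_short = []
    for _, text_path in pairs:
        # UTF-8 is an assumption; adjust if the files use another encoding.
        text = text_path.read_text(encoding="utf-8")
        if count_chinese_chars(text) < minimum:
            too_short.append(text_path)
    return too_short
```

Calling `short_descriptions(pair_images_with_descriptions(Path("data")))`, where `data` is a placeholder for the local dataset folder, lists the descriptions that fall below the stated minimum. Because the specification says "in principle", a small number of shorter descriptions may be expected.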