Use this Dataset

Discuss on Discord

Date

2 years ago

Size

23.21 GB

Organization

Publish URL

Paper URL

Tags

This dataset is a multimodal image and text dataset launched by Tsinghua University and BAAI in 2024. "CapsFusion: Rethinking Image-Text Data at Scale"It has been accepted by CVPR 2024. This dataset is a high-quality resource for large-scale multimodal pre-training. This version contains corresponding captions from the LAION-2B and LAION-COCO datasets, which facilitates comparative analysis and further in-depth research on the quality of image-text data. Each data entry has four fields:

Image URL
LAION-2B Title (original alternative text from the web)
LAION-COCO subtitles (synthesized by BLIP)
CapsFusion Title (Research Team)

CapsFusion-120M.torrent

Seeding 1Downloading 0Completed 183Total Downloads 361

CapsFusion-120M/
- README.md
  1.34 KB
- README.txt
  2.69 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

Use this Dataset

Discuss on Discord

Date

2 years ago

Size

23.21 GB

Organization

Publish URL

Paper URL

arxiv.org

Tags

This dataset is a multimodal image and text dataset launched by Tsinghua University and BAAI in 2024. "CapsFusion: Rethinking Image-Text Data at Scale"It has been accepted by CVPR 2024. This dataset is a high-quality resource for large-scale multimodal pre-training. This version contains corresponding captions from the LAION-2B and LAION-COCO datasets, which facilitates comparative analysis and further in-depth research on the quality of image-text data. Each data entry has four fields:

Image URL
LAION-2B Title (original alternative text from the web)
LAION-COCO subtitles (synthesized by BLIP)
CapsFusion Title (Research Team)

CapsFusion-120M.torrent

Seeding 1Downloading 0Completed 183Total Downloads 361

CapsFusion-120M/
- README.md
  1.34 KB
- README.txt
  2.69 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp