UC Berkeley Researchers Introduce the Touch-Vision-Language (TVL) Dataset for Multimodal Alignment

UC Berkeley Researchers Introduce the Touch-Vision-Language (TVL) Dataset for Multimodal Alignment