How Multimodality Makes LLM Alignment More Challenging

How Multimodality Makes LLM Alignment More Challenging