Towards Open-World Referring Expression Comprehension: A Benchmark with Training-free Multi-task Consistency Checker