We propose RegionCLIP that significantly extends CLIP to learn region-level visual representations. RegionCLIP enables fine-grained alignment between image regions and textual concepts, and thus ...
Linux or macOS with Python ≥ 3.7 PyTorch ≥ 1.10 and torchvision that matches the PyTorch installation. Detectron2 ≥ 0.6 (other versions are not verified) python ...
Translating satellite imagery into maps requires intensive effort and time, especially leading to inaccurate maps of the affected regions during disaster and conflict. The combination of availability ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results