Abstract: Most existing superpixel-segmentation-based synthetic aperture radar (SAR) target detection algorithms cannot preserve the independence of small targets against complex backgrounds, ...
Abstract: Pixel-level adaptive convolution, which overcomes the spatial invariance of standard convolution, is always limited to extracting features from local patches and ...
We present Follow-Your-Emoji, a diffusion-based framework for portrait animation, which animates a reference portrait with target landmark sequences. [FollowYourEmoji ...
Ollama is an app for easily running large language models (LLMs) in a local environment. In July 2025, a GUI app was released that allowed users to download and run various open ...
DEEM is an exploration of using diffusion models as the eyes of multi-modal large language models, with the goal of eliminating potential biases in different visual encoders from a vision-centric ...
Ollama, the popular software for running AI models locally, now supports image generation on macOS. The feature is still experimental, with Windows and Linux support coming later. Two models are ...