资讯

An illustration of a magnifying glass. An illustration of a magnifying glass.
The official implementation for "PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation", accepted to AAAI-25. The official implementation for "PixelMan: ...
We are releasing a Foundational FSOD challenge as part of the Workshop on Visual Perception and Learning in an Open World at CVPR 2024. We are accepting submissions till 7th June 2024!
Abstract: Page object detection is crucial for document understanding. Different granularities for objects can result in different performances. In this study, block level region object detection is ...
Abstract: In this article, a lateral feature enhancement (LFE) backbone network is proposed to enrich feature representation effectively for page object detection (POD) across various scales. Our LFE ...