Camera Topics
The camera topics are managed by the maivin-camera
service and handles interfacing with a local camera to produce camera frames which can be shared to multiple consumers with low overhead by leveraging zero-copy through DMA buffers. The service can also produce encoded frames in H.264 or JPEG suitable for recording or streaming to a remote device. The following is a list of key features provided by the camera service.
- Video4Linux2 Cameras
- i.MX 8M Plus ISP
- Using the VSI ISP driver
- Publishes camera intrinsic parameters from ISP configuration
- Zero-Copy frame publishing using Linux dma-buf
- Hardware H.264 Encoder
- VSI Hantro (i.MX 8M Plus)
- 1080 @ 60FPS
- 4K @ 30FPS (special extension)
- Software JPEG Encoder
The camera topic is published under the /camera
namespace and offers the following sub-topics. Some topics are optional and might not be available on the current system, refer to the camera service configuration documentation for details.
/camera/info
The /camera/info
topic publishes information about the camera using the CameraInfo schema. The schema provides the specifications for the camera resolution and optical parameters.
/camera/dma
The /camera/dma
topic uses the custom DmaBuffer EdgeFirst schema for transmitting Linux dma-buf descriptors. This enables high-performance zero-copy of camera buffers between applications with trivial overhead. A DmaBuffer provides the frame file descriptor and parent process descriptor along with the buffer resolution and format described using a FOURCC code.
The mechanism for sharing buffers is for the camera service to publish its own pid
along with the file descriptor (fd
) of the buffer along with the buffer parameters (width, height, stride, fourcc). When a subscriber receives the message it must first duplicate the file descriptor into its own process space, this is done using the pidfd_getfd system call. Once the file descriptor has been duplicated it can be used normally, either used as-is with an API which can consume a dma-buf
or by using mmap to map the contents of the buffer into user-space.
sequenceDiagram
loop
autonumber
Camera Service->>Client Application: DmaBuffer(pid, fd, ...)
Client Application-->>Linux Kernel: pidfd_getfd(pid, fd)
Linux Kernel->>Client Application: fd duplicate
Client Application-->>Linux Kernel: mmap(fd)
Linux Kernel->>Client Application: ptr to camera pixels
Client Application-->Linux Kernel: Client Application Processing
Client Application-->>Linux Kernel: munmap(ptr)
Client Application-->>Linux Kernel: close(fd)
end
- Camera service publishes a DmaBuffer for each frame received from the camera.
- Client application calls
pidfd_getfd(pid, fd, 0)
to acquire a local duplicate of the file descriptor. - A new file descriptor is returned, to release the
dma-buf
object we will need to callclose(fd)
later. - Client application calls
mmap(fd, ...)
to acquire a local pointer to the camera buffer's pixel data. - A read-only pointer is returned to the client application, it will need to be released using
munmap(ptr)
later. - Client application processes the pixel data.
- Client application calls
munmap(ptr)
to free the mapped buffer. - Client application calls
close(fd)
to release thedma-buf
object.
Mapping DMA Buffers
Mapping DMA buffers into user-space requires additional synchronization primitives around accesses. We cover these details in our camera sample application. Further details are documented in the Linux Kernel Manual under CPU Access to DMA Buffer Objects.
/camera/h264
The /camera/h264
topic uses Foxglove's CompressedVideo schema to publish h.264 encoded video frames. The h.264 encoder uses key-frames, I-Frames, at a typical rate of 1Hz with the frames in-between encoded as P/B-Frames. The decoder requires an initial I-Frame before it can decode additional frames. This is typically handled transparently but means when sending CompressedVideo
data to the h.264 decoder it could take up to a second until valid output is produced.
/camera/jpeg
The /camera/jpeg
topic uses ROS2's CompressedImage schema to publish JPEG encoded camera frames. Each frame is a complete JPEG image and can be decoded using any standard JPEG decoder.