Skip to content

Camera Topics

The camera topics are managed by the maivin-camera service and handles interfacing with a local camera to produce camera frames which can be shared to multiple consumers with low overhead by leveraging zero-copy through DMA buffers. The service can also produce encoded frames in H.264 or JPEG suitable for recording or streaming to a remote device. The following is a list of key features provided by the camera service.

  • Video4Linux2 Cameras
  • i.MX 8M Plus ISP
    • Using the VSI ISP driver
    • Publishes camera intrinsic parameters from ISP configuration
  • Zero-Copy frame publishing using Linux dma-buf
  • Hardware H.264 Encoder
    • VSI Hantro (i.MX 8M Plus)
    • 1080 @ 60FPS
    • 4K @ 30FPS (special extension)
  • Software JPEG Encoder

The camera topic is published under the /camera namespace and offers the following sub-topics. Some topics are optional and might not be available on the current system, refer to the camera service configuration documentation for details.

/camera/info

The /camera/info topic publishes information about the camera using the CameraInfo schema. The schema provides the specifications for the camera resolution and optical parameters.

/camera/dma

The /camera/dma topic uses the custom DmaBuffer EdgeFirst schema for transmitting Linux dma-buf descriptors. This enables high-performance zero-copy of camera buffers between applications with trivial overhead. A DmaBuffer provides the frame file descriptor and parent process descriptor along with the buffer resolution and format described using a FOURCC code.

The mechanism for sharing buffers is for the camera service to publish its own pid along with the file descriptor (fd) of the buffer along with the buffer parameters (width, height, stride, fourcc). When a subscriber receives the message it must first duplicate the file descriptor into its own process space, this is done using the pidfd_getfd system call. Once the file descriptor has been duplicated it can be used normally, either used as-is with an API which can consume a dma-buf or by using mmap to map the contents of the buffer into user-space.

sequenceDiagram
    loop
    autonumber
    Camera Service->>Client Application: DmaBuffer(pid, fd, ...)
    Client Application-->>Linux Kernel: pidfd_getfd(pid, fd)
    Linux Kernel->>Client Application: fd duplicate
    Client Application-->>Linux Kernel: mmap(fd)
    Linux Kernel->>Client Application: ptr to camera pixels
    Client Application-->Linux Kernel: Client Application Processing
    Client Application-->>Linux Kernel: munmap(ptr)
    Client Application-->>Linux Kernel: close(fd)
    end
  1. Camera service publishes a DmaBuffer for each frame received from the camera.
  2. Client application calls pidfd_getfd(pid, fd, 0) to acquire a local duplicate of the file descriptor.
  3. A new file descriptor is returned, to release the dma-buf object we will need to call close(fd) later.
  4. Client application calls mmap(fd, ...) to acquire a local pointer to the camera buffer's pixel data.
  5. A read-only pointer is returned to the client application, it will need to be released using munmap(ptr) later.
  6. Client application processes the pixel data.
  7. Client application calls munmap(ptr) to free the mapped buffer.
  8. Client application calls close(fd) to release the dma-buf object.

Mapping DMA Buffers

Mapping DMA buffers into user-space requires additional synchronization primitives around accesses. We cover these details in our camera sample application. Further details are documented in the Linux Kernel Manual under CPU Access to DMA Buffer Objects.

/camera/h264

The /camera/h264 topic uses Foxglove's CompressedVideo schema to publish h.264 encoded video frames. The h.264 encoder uses key-frames, I-Frames, at a typical rate of 1Hz with the frames in-between encoded as P/B-Frames. The decoder requires an initial I-Frame before it can decode additional frames. This is typically handled transparently but means when sending CompressedVideo data to the h.264 decoder it could take up to a second until valid output is produced.

/camera/jpeg

The /camera/jpeg topic uses ROS2's CompressedImage schema to publish JPEG encoded camera frames. Each frame is a complete JPEG image and can be decoded using any standard JPEG decoder.