roadrover

A ROS 2 Humble project for data collection, offline perception, lane detection, object detection, and ego state estimation, intended to run on a Raspberry Pi.

Guides


🍓 Raspberry Pi 4 Setup	Full bring-up from flashing Ubuntu to first recording — start here if you are setting up new hardware
📖 README (this page)	ROS 2 package usage, launch files, offline pipeline, and API reference

Hardware

Device	Default path
USB camera (monocular)	`/dev/video0`
Stereo USB camera (OV9715 dual-lens)	`/dev/video4`
GPS receiver (NMEA serial, USB)	`/dev/ttyUSB0` at 4800 baud
BerryGPS-IMU-4 GPS (NMEA serial, UART)	`/dev/ttyAMA0` at 9600 baud
BerryGPS-IMU-4 IMU + barometer	I2C bus 1

Prerequisites

Ubuntu 22.04 with ROS 2 Humble installed
The following ROS 2 packages available in your workspace or installed via apt:

sudo apt install ros-humble-usb-cam \
                 ros-humble-nmea-navsat-driver \
                 ros-humble-foxglove-bridge

Raspberry Pi note: run the same apt command after installing ROS 2 Humble on the Pi.

Setup

1. Clone the repository

mkdir -p ~/ros2_ws/src
cd ~/ros2_ws/src
git clone git@github.com:prasadshingne/roadrover.git

2. Build

cd ~/ros2_ws
source /opt/ros/humble/setup.bash
colcon build --symlink-install

3. Source the workspace

source ~/ros2_ws/install/setup.bash

Add this line to ~/.bashrc to source automatically on every terminal:

echo "source ~/ros2_ws/install/setup.bash" >> ~/.bashrc

Usage

Launch everything

ros2 launch roadrover_bringup bringup.launch.py

This starts three nodes:

Node	Package	Topic(s)
`usb_cam`	`usb_cam`	`/usb_cam/image_raw/compressed`, `/usb_cam/camera_info`
`nmea_navsat_driver`	`nmea_navsat_driver`	`/fix`, `/vel`, `/time_reference`
`nmea_navsat_driver_berry`	`nmea_navsat_driver`	`/fix_berry` (BerryGPS UART)
`berry_imu_node`	`roadrover_imu`	`/imu/raw` (100 Hz), `/imu/baro` (10 Hz)
`foxglove_bridge`	`foxglove_bridge`	WebSocket on port 8765

Visualize with Foxglove Studio

Open Foxglove Studio in a browser or the desktop app.
Click Open connection → Foxglove WebSocket.
Enter ws://<device-ip>:8765 (use ws://localhost:8765 if running locally).
Subscribe to /usb_cam/image_raw for camera feed and /fix for GPS.

Stereo camera

For stereo-based depth estimation, connect the OV9715 synchronized dual-lens USB camera. It outputs a single side-by-side 1280×480 MJPEG frame which the stereo_splitter node splits into separate left and right topics.

Implementation note: stereo_splitter is a C++ ROS 2 node (roadrover_stereo package). It was written in C++ rather than Python because the two sequential JPEG encodes per frame at 30 fps exceed what Python can sustain — Python achieved only ~15 fps even with threading, while C++ delivers the full 30 fps.

ros2 launch roadrover_bringup stereo.launch.py

This starts stereo_splitter and foxglove_bridge on port 8765. Published topics:

Topic	Type	Content
`/stereo/left/image_raw/compressed`	`CompressedImage`	Left eye JPEG
`/stereo/right/image_raw/compressed`	`CompressedImage`	Right eye JPEG
`/stereo/left/camera_info`	`CameraInfo`	Left camera metadata (uncalibrated)
`/stereo/right/camera_info`	`CameraInfo`	Right camera metadata (uncalibrated)

Why no raw image topics? Uncompressed 640×480 BGR8 at 30 fps is ~900 KB per eye per frame (~54 MB/s total). Publishing compressed-only reduces bandwidth to ~3 MB/s with no loss to the perception pipeline.

Node parameters (set in stereo.launch.py):

Parameter	Default	Description
`device`	`/dev/video4`	V4L2 device path
`width`	`1280`	Combined frame width (both eyes)
`height`	`480`	Frame height per eye
`fps`	`30`	Camera frame rate (hardware supports 30 or 60 fps)
`jpeg_quality`	`80`	JPEG encode quality; lower values reduce CPU if needed

Verify the camera device before launching:

v4l2-ctl --device=/dev/video4 --list-formats-ext

The stereo camera outputs MJPG at 1280x480 (30 fps) or 2560x960 (30/60 fps for the full sensor). If it appears on a different device node, update device in stereo.launch.py.

Foxglove visualization: open ws://localhost:8765 and add two Image panels — subscribe one to /stereo/left/image_raw/compressed and the other to /stereo/right/image_raw/compressed.

Calibration: camera_info messages are published with zeroed intrinsics. Proper stereo calibration (checkerboard + camera_calibration ROS package) is required before running any stereo depth algorithms.

Recording and replay

Record a session

ros2 launch roadrover_bringup record.launch.py

Starts all sensors and records the following topics to ~/roadrover_bags/session_<timestamp>/:

Topic	Content
`/usb_cam/image_raw/compressed`	Compressed camera frames (MJPEG)
`/usb_cam/camera_info`	Camera calibration
`/fix`	GPS position — USB receiver (NavSatFix, ~1 Hz)
`/vel`	GPS velocity (TwistStamped)
`/time_reference`	GPS time
`/fix_berry`	GPS position — BerryGPS-IMU-4 (NavSatFix, ~1 Hz)
`/imu/raw`	IMU — BerryGPS-IMU-4 angular velocity + linear acceleration (100 Hz)
`/imu/baro`	Barometric pressure — BerryGPS-IMU-4 (10 Hz)

The bag auto-splits into a new file every 30 seconds (--max-bag-duration 30), keeping each file manageable for offline processing. Typical bag size is ~1 GB per 10 minutes of driving.

Why no raw image topic? /usb_cam/image_raw at 640×480 RGB 30 fps requires ~27 MB/s sustained write speed — at the limit of a Pi 4 SD card — and produces 20+ GB bags. Dropping it in favour of the compressed topic reduces write load to ~2 MB/s with no loss to the perception pipeline, which operates entirely on /usb_cam/image_raw/compressed.

Stop recording with Ctrl-C. The bag is fully written before the process exits.

Replay a session

ros2 launch roadrover_bringup replay.launch.py bag_path:=$HOME/roadrover_bags/session_<timestamp>

This starts Foxglove bridge and plays the bag together. Then open Foxglove Studio at ws://localhost:8765.

For playback control, play the bag manually instead:

# Start bridge only
ros2 launch roadrover_bringup replay.launch.py bag_path:=$HOME/roadrover_bags/session_<timestamp>

# Useful flags
ros2 bag play <path> --rate 0.5          # half speed
ros2 bag play <path> --start-offset 30  # skip first 30 s
ros2 bag play <path> --loop             # repeat continuously

Session processing pipeline

Process an entire recorded session — map download, chunk splitting, perception, and scenario extraction — in one command:

python3 src/roadrover_perception/scripts/pipeline_session.py ~/roadrover_bags/session_<timestamp>

The script prints a per-chunk dry-run table showing duration and peak speed, then waits for confirmation before writing any output. Pass --yes to skip the prompt in automated workflows.

What it does, in order:

Step	Detail
Velocity pre-check	Scans `/vel` messages; skips the entire session if the vehicle never exceeded 0.5 m/s
Map download	Runs `make_map.py` once on the full session; result cached in `session_<ts>_map/` and reused on re-runs
Chunk splitting	Splits the bag into 30 s chunks using `rosbag2_py`; raw chunks kept in `session_<ts>_chunks/chunk_NNN/`
Skip stationary chunks	Any chunk whose peak speed < 0.5 m/s is skipped (e.g. waiting at traffic lights at session start)
`process_bag.py`	Rotate 180° + lane detection + YOLO + map matching → `chunk_NNN_processed/`
`make_scenario.py`	Ego + actor trajectory extraction → `chunk_NNN_scenario/scenario.xosc` + `map.xodr`

Output layout:

session_<ts>_map/          map_graph.pkl, map.geojson, lanes.geojson
session_<ts>_chunks/
  chunk_000/               raw 30 s bag (kept)
  chunk_000_processed/     processed bag
  chunk_000_scenario/      scenario.xosc + map.xodr
  chunk_001/ ...

Tests

Unit tests cover the perception components that have historically caused hard-to-debug failures. Run with the system Python (not Anaconda/conda) so the ROS pytest plugins load correctly:

python3 -m pytest tests/ -v

Test file	What it covers
`tests/test_lane_tracker.py`	`LaneTracker._fit()` rejection gates (the staircase-bug fix), `bev_lateral()` self-calibrating scale, EMA smoothing, stale-frame counter
`tests/test_lane_offset.py`	Ego ENU position formula — regression guard against reverting to the old snap=left-boundary convention that placed the ego one full lane width off-road
`tests/test_ego_state.py`	Speed from velocity components, heading derivation, yaw-rate angle unwrapping at ±π, longitudinal acceleration EMA, MIN_SPEED gate

These tests have no dependency on ROS at runtime — ROS types are stubbed out in tests/conftest.py.

Offline perception pipeline

src/roadrover_perception/scripts/process_bag.py reads an original recorded bag, runs the full perception stack, and writes a new bag:

# From the repo root (YOLOv8s weights must be present as yolov8s.pt)
python3 src/roadrover_perception/scripts/process_bag.py ~/roadrover_bags/session_<timestamp>

# Optionally specify the output path
python3 src/roadrover_perception/scripts/process_bag.py <bag> --output <out_bag>

What it does

Step	Detail
Image rotation	All camera frames rotated 180° in-place
Object detection	YOLOv8s on GPU; detections in the car hood region are filtered out
Lane detection	Bird's-eye view (BEV) perspective warp + sliding window search; EMA-smoothed degree-2 polynomial per lane
Ego state estimation	Heading, yaw rate, longitudinal and lateral acceleration derived from GPS velocity
Map matching	GPS fix snapped to nearest OSM road edge; lane number estimated from GPS lateral offset + BEV measurement
Actor tracking	IoU tracker on YOLO vehicle detections; 3D box positions projected to ENU via bounding-box-height model

Output topics

Topic	Type	Content
`/perception/image_annotated`	`CompressedImage`	YOLO boxes + lane overlay + speed
`/perception/actors`	`MarkerArray`	Tracked vehicle boxes (orange CUBEs) in ENU `map` frame; requires `--map-graph`
`/ego/odometry`	`nav_msgs/Odometry`	Heading (quaternion), speed, yaw rate
`/ego/imu`	`sensor_msgs/Imu`	Yaw rate, longitudinal accel, lateral accel
`/ego/matched_fix`	`NavSatFix`	GPS snapped to estimated lane centre
`/ego/lane_info`	`String`	Road name, lane number, method, GPS offset, BEV offset
`/ego/marker`	`Marker` (CUBE)	Car-box in ENU `map` frame, yaws with heading
`/ego/pose`	`PoseStamped`	Ego pose in `map` frame
`/map/lanes`	`MarkerArray`	Lane boundary LINE_STRIP markers in ENU `map` frame (cyan = interior, white = edge, yellow = centreline)
`/tf`	`TFMessage`	`map → base_link` transform per GPS fix

Ego state signals

All signals are currently derived from the GPS /vel topic. The BerryGPS-IMU-4 now provides /imu/raw at 100 Hz, which will be fused with GPS via an EKF (e.g. robot_localization) to give smooth high-frequency pose estimates between 1 Hz GPS fixes.

Signal	Source	Method
Heading	`/vel` east/north components	`atan2(vy_north, vx_east)`
Yaw rate	Heading	Finite diff + EMA smoothing
Longitudinal accel	Speed	`d(speed)/dt` + EMA
Lateral accel	Speed + yaw rate	`speed × yaw_rate` (centripetal)

Map matching and lane localization

Pass --map-graph to enable map matching. The graph is produced by make_map.py (see OSM map pipeline below).

python3 src/roadrover_perception/scripts/process_bag.py ~/roadrover_bags/session_<timestamp> \
    --map-graph ~/roadrover_bags/map_graph.pkl

The localization runs in two stages per GPS fix:

GPS snap — the raw fix is projected onto the nearest OSM road edge to get the road position along the route. On divided highways ox.nearest_edges() can return the opposing carriageway; if the nearest edge's tangent strongly opposes ego heading (dot product < −0.3), the matcher searches all edges within 30 m for the closest heading-consistent alternative (dot > 0.3) and uses that instead.
Lane-centre placement — the snap point is EMA-smoothed (α = 0.6) to absorb GPS-fix jitter, then offset laterally to the detected lane centre using:
```
ego_x = snap_x + (N − lane_num + 0.5) × LANE_WIDTH × sin(heading)
ego_y = snap_y − (N − lane_num + 0.5) × LANE_WIDTH × cos(heading)
```
lane_num (1 = rightmost) is estimated from the BEV camera's bev_d_left measurement (distance in metres to the ego's left lane boundary), EMA-smoothed with α = 0.10 and a 4-fix hysteresis counter to prevent transient jumps.

The /ego/lane_info string topic reports road | lane k/N [method] | gps_offset ±X m | bev_d_left Y m for every GPS fix.

Why not use raw GPS for the lateral position? GPS lateral accuracy on a motorway is typically 5–15 m — far worse than a lane width — due to multipath reflections from trees, bridges, and nearby structures. In recorded sessions the GPS antenna reported a position ~10 m west of the ego's actual lane, placing it in the median when converted directly to ENU. The OSM snap point reliably tracks the road regardless of this lateral error and is the correct base for lane-level positioning. Do not replace the snap + offset formula with raw GPS coordinates.

Viewing in Foxglove

Open the processed bag in Foxglove Studio (File → Open local file). Useful panel configurations:

Image panel → /perception/image_annotated — annotated video with lanes and YOLO boxes
Map panel → /fix — GPS track on a satellite map
3D panel → add topics /map/lanes (lane boundaries), /ego/marker (ego car box), and /perception/actors (detected vehicle boxes); set Display frame to base_link to follow the ego
Raw messages panel → /ego/lane_info — live road/lane diagnostics
Plot panel — add series for time-series signals:

Signal	Topic	Field path
Speed (m/s)	`/ego/odometry`	`twist.twist.linear.x`
Yaw rate (rad/s)	`/ego/odometry`	`twist.twist.angular.z`
Longitudinal accel	`/ego/imu`	`linear_acceleration.x`
Lateral accel	`/ego/imu`	`linear_acceleration.y`

Foxglove 3D tip: the TF visualizer draws a connection line between the map origin and base_link. To hide it, open the panel settings → Transforms → uncheck Show connection lines.

Known limitation: 3D actor position noise

The orange actor boxes in the 3D view will visibly jitter. This is expected and has two root causes:

1. Monocular depth estimation is noisy. Distance is inferred from bounding-box height using a pinhole model (d = f × H / h_px). YOLO's detection boundaries shift 5–15% frame-to-frame even for a stationary vehicle. At 40 m range, a 10% height variation translates to ±4 m of distance error per frame — with no depth sensor to correct it, this noise is irreducible.

2. GPS anchor discontinuity. Ego position is updated at 1 Hz from GPS and dead-reckoned between fixes using NMEA velocity. Each GPS fix carries a new lateral error (motorway GPS accuracy is ±5–15 m). When the map-matched ego position shifts at a fix boundary, all actor ENU positions shift by the same amount simultaneously, appearing as a periodic snap in the 3D view.

Why not fix it? Eliminating this properly requires hardware this rover doesn't have: a stereo camera or LiDAR for metric depth, or a radar for accurate radial distance. The tracking identity (which box belongs to which vehicle across frames) is correct — only the absolute 3D position estimate is noisy. Sensor fusion with any of those modalities would replace the bounding-box-height heuristic and solve both issues.

Lane detection debug tool

Inspect the BEV pipeline on a single frame without running the full bag:

python3 src/roadrover_perception/scripts/debug_lanes.py <bag_path> --frame 50 --out-dir /tmp/lane_debug

Output images in /tmp/lane_debug/:

File	Content
`0_original.jpg`	Raw rotated frame
`1_clahe.jpg`	CLAHE-enhanced grayscale
`2_edges.jpg`	Canny edges
`3_roi.jpg`	Trapezoid ROI boundary
`4_masked_edges.jpg`	Edges inside ROI
`5_bev_edges.jpg`	Edges warped to bird's-eye view
`6_bev_windows.jpg`	Sliding windows in BEV
`7_bev_fit.jpg`	Polynomial fit in BEV
`8_lane_overlay.jpg`	Lanes warped back to image space

OSM map pipeline

Before running map matching, generate the road network for a recorded session:

# 1. Download OSM road network and build lane geometry
python3 src/roadrover_perception/scripts/make_map.py ~/roadrover_bags/session_<timestamp>

This reads the /fix GPS track from the bag, downloads the matching OSM road graph, and writes three files to the bag's parent directory:

File	Content
`map.geojson`	Road centrelines — drag onto Foxglove Map panel
`lanes.geojson`	Lane boundary lines — consumed by `process_bag.py` for `/map/lanes` markers
`map_graph.pkl`	Pickled osmnx graph — passed to `process_bag.py` via `--map-graph`

Lane boundaries are computed per OSM edge using shapely.offset_curve. For one-way roads the lanes are intended to be centred on the OSM edge, but for tightly-curved geometries the left-side offset curves can collapse; in practice all lane markers extend to the right of the OSM edge. Bidirectional roads split lanes left and right of the centreline. The ego positioning formula in process_bag.py is calibrated to this "OSM edge = left road boundary" convention.

# 2. (Optional) Generate OpenDRIVE for simulation
python3 src/roadrover_perception/scripts/make_xodr.py ~/roadrover_bags/map_graph.pkl
# → ~/roadrover_bags/map.xodr
# Verify: esmini --odr map.xodr --window 60 60 1200 800

Each OSM edge is split into per-polyline-segment roads so curves are preserved. For one-way roads the reference line is shifted to the carriageway's left edge so that the outermost (rightmost) lane centre aligns with the ego position written by process_bag.py. Bidirectional roads split lanes symmetrically around the OSM centreline.

Scenario extraction

Extract an OpenSCENARIO 1.x scenario from a processed bag. The ego trajectory comes from /ego/pose; actors are detected with YOLOv8s and tracked across frames using IoU matching.

python3 src/roadrover_perception/scripts/make_scenario.py \
    ~/roadrover_bags/session_<timestamp>_processed \
    --map-graph ~/roadrover_bags/map_graph.pkl \
    --out-dir ~/roadrover_bags/scenario_out

This writes two files to --out-dir:

File	Content
`map.xodr`	OpenDRIVE road network (auto-generated via `make_xodr.py`)
`scenario.xosc`	OpenSCENARIO 1.x file — ego + detected actor trajectories

The processed bag must have been produced by process_bag.py --map-graph (requires /ego/pose topic).

Actor detection pipeline:

Re-run YOLOv8s on /perception/image_annotated frames (vehicle classes: car, motorcycle, bus, truck)
Track detections across frames with greedy IoU matching (IOU_THRESHOLD=0.30)
Project each bounding-box bottom-centre to ENU via a pinhole road-plane model
Export tracks with ≥ 10 qualifying frames (configurable via --min-track-frames)

Scenario behaviour:

20-second cap — only the first 20 seconds of the drive are exported. This keeps the scenario focused and avoids long static segments when actors leave the camera field of view.
Actors disappear naturally — each actor is given a DeleteEntityAction at the time its track ends (i.e. when it left the camera FOV), so actors vanish from the simulation rather than freezing in place.
No trajectory extrapolation — actor waypoints come directly from the YOLO detections. Extrapolating noisy monocular-camera positions produces physically implausible off-road paths, so actors simply end where they were last detected.

Verify in esmini:

cd ~/roadrover_bags/scenario_out
esmini --osc scenario.xosc --window 60 60 1200 800

Extracted scenario replayed in esmini (ego + detected actors, 20 s):

Real recording in Foxglove (annotated camera feed, 3D lane markers, actor boxes):

Temporal alignment between the two views is approximate — GPS updates at 1 Hz and actor distances are estimated from bounding-box height on a monocular camera, so spatial correspondence is limited by those sensor constraints.

Changing device paths

If your camera or GPS receiver is on a different device node, override the defaults as launch arguments or edit src/roadrover_bringup/launch/bringup.launch.py:

# Override at launch time
ros2 launch roadrover_bringup bringup.launch.py \
    video_device:=/dev/video0 \
    gps_port:=/dev/ttyUSB1 \
    berry_gps_port:=/dev/ttyAMA1 \
    berry_gps_baud:=9600

Argument	Default	Description
`video_device`	`/dev/video0`	USB camera device
`gps_port`	`/dev/ttyUSB0`	USB GPS serial port
`gps_baud`	`4800`	USB GPS baud rate
`berry_gps_port`	`/dev/ttyAMA0`	BerryGPS UART serial port
`berry_gps_baud`	`9600`	BerryGPS baud rate

Rebuild after any changes:

colcon build --symlink-install --packages-select roadrover_bringup

Troubleshooting

Camera not found

v4l2-ctl --list-devices

Find your camera and update video_device accordingly.

GPS not found

ls /dev/ttyUSB*

Update port accordingly. You may also need to add yourself to the dialout group:

sudo usermod -aG dialout $USER   # log out and back in after this

Recording stops unexpectedly / usb_cam node crashes

This is caused by disk write pressure. Check that /usb_cam/image_raw is not in the recorded topics — the uncompressed image stream requires ~27 MB/s sustained, which exceeds typical SD card write throughput on a Pi 4 and causes the rosbag2 write queue to overflow. The current record.launch.py records only the compressed topic (~2 MB/s), which should not trigger this issue.

Cannot open Foxglove connection

Confirm the bridge is running: ros2 node list | grep foxglove
Check that port 8765 is not blocked by a firewall.

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
docs		docs
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
pytest.ini		pytest.ini

Folders and files

Latest commit

History

Repository files navigation

roadrover

Guides

Hardware

Prerequisites

Setup

1. Clone the repository

2. Build

3. Source the workspace

Usage

Launch everything

Visualize with Foxglove Studio

Stereo camera

Recording and replay

Record a session

Replay a session

Session processing pipeline

Tests

Offline perception pipeline

What it does

Output topics

Ego state signals

Map matching and lane localization

Viewing in Foxglove

Known limitation: 3D actor position noise

Lane detection debug tool

OSM map pipeline

Scenario extraction

Changing device paths

Troubleshooting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages