The CARLA open-source simulator team and Alpha Drive have partnered to give researchers an open platform for fair and reproducible evaluations, simplifying the comparison between different approaches.
The CARLA team created the baselines for evaluation, while the Alpha Drive platform provides cloud orchestration and benchmarking capabilities that let developers compare their results with one another and with the latest research in the field.
The CARLA Autonomous Driving Leaderboard is provided free of charge for public participation with the additional help of sponsors:
For the CARLA Leaderboard, the CARLA team developed various traffic situations based on the NHTSA pre-crash typology.
Agents experience multiple instances of 10 traffic scenarios when evaluated. Some of the scenarios include lane merging, lane changing, negotiations at traffic intersections, coping with pedestrians, and other elements.
A full list of the developed scenarios can be found here.
Participants are given access to a base set of environments and scenarios created by the CARLA team.
When teams are ready, they may submit their agents for evaluation using the Alpha Drive platform.
To eliminate bias, teams submitting to the leaderboard do not have access to the final set of environments and scenarios; they see only their final scores.
Historically, human behavior determined risk in all driving scenarios. Today, algorithm behavior plays a more pivotal role in on-road risk, and it will continue to do so as we move towards a more autonomous future.
Current standards for measuring autonomous vehicle safety rely on driver disengagements. This is not an accurate measure, and it is not even possible once the safety driver is taken out of the vehicle in a fully autonomous system.
Data has always informed risk.
The data we collect and how we collect it need to shift.
The RAND Corporation has outlined ways to drive to safety and what would be necessary to create a framework for measuring automated vehicle safety. We aim to follow their framework.
The metrics and measurements for evaluating automated and autonomous systems still need to be defined and refined. Multiple stakeholders have differing vantage points and data that can help inform the evaluation.
Alpha Drive provides the platform that enables key stakeholders (e.g., regulators, insurers, and others) to define metrics and measurements for evaluation. The platform gives developers of automated and autonomous systems access to this data during development. It creates a data feedback loop for refining the right metrics and measurements as the technology matures, and it provides transparency as the technology is deployed to the market.
This is just a start. By publishing this data publicly, we hope to engage further with other stakeholders and use their data to develop more advanced metrics and measurements.
The driving proficiency of an agent can be characterized by multiple metrics. For this leaderboard, the CARLA team selected a set of metrics that help understand different aspects of driving.
Infractions
The CARLA leaderboard offers individual metrics for a series of infractions. Each of these has a penalty coefficient that will be applied every time it happens. Ordered by severity, the infractions are the following.
Besides these, there is one additional infraction which has no coefficient, and instead affects the computation of route completion $(R_{i})$.
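As a rough illustration of how per-infraction coefficients might combine, the sketch below assumes each coefficient is applied multiplicatively once per occurrence, and that the resulting penalty scales the route completion $(R_{i})$. The coefficient values, the infraction names, and the `infraction_penalty` and `driving_score` helpers are hypothetical and chosen only for the example; they are not taken from the leaderboard itself.

```python
from math import prod

# Hypothetical coefficients, for illustration only (not the official values).
PENALTY_COEFFICIENTS = {
    "collision_pedestrian": 0.50,
    "collision_vehicle": 0.60,
    "running_red_light": 0.70,
}


def infraction_penalty(counts, coefficients=PENALTY_COEFFICIENTS):
    """Apply each coefficient once per recorded occurrence (assumed multiplicative)."""
    return prod(coefficients[name] ** n for name, n in counts.items())


def driving_score(route_completion, counts):
    """Scale route completion R_i (in percent) by the infraction penalty (assumed)."""
    return route_completion * infraction_penalty(counts)


if __name__ == "__main__":
    # An agent that completed 80% of a route and ran one red light.
    print(driving_score(80.0, {"running_red_light": 1}))
```

Under this assumption, an agent with no infractions keeps a penalty of 1, and every additional infraction can only lower the score.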
Additional Events
Some events will interrupt the simulation, preventing the agent from continuing.